Introduction to CuPy
0.1.0
  • Prerequisite
  • Learning Outcomes
  • Modules
  • Python Virtual Environment
  • CuPy
    • GPU Parallelism
    • CuPy Function Overview: Key Operations and Utilities
    • User defined Kernels
    • CUDA Events
    • CUDA Streams
  • Reference
  • Contributers
Introduction to CuPy
  • CuPy
  • View page source

CuPy

In this tutorial, we’ll be using the Gadi HPC machine at NCI. A Python virtual environment will be provided for you during the session.

  • GPU Parallelism
    • GPU Execution Model
      • Thread Indexing
      • How do we assign a unique thread id to each thread?
      • Warps and Warp Schedulers
    • Data Movement in GPUs
    • Streams
    • Exercise
  • CuPy Function Overview: Key Operations and Utilities
    • Device Count
    • Device Properties
    • Current Device
    • CuPy ndarray
      • Comparison with NumPy ndarray
      • Allocating ndarray on a Specific GPU
      • Finding the GPU where the ndarray is located
    • Data Movement in CuPy
    • Transferring Data from GPU to Host
    • Building Device Agnostic Code
    • Explicit data transferDifferences
    • Automatic Data Transfer
  • User defined Kernels
    • Elementwise Kernel
    • Reduction Kernel
      • Raw Kernel
  • CUDA Events
  • CUDA Streams

GitHub Repo: https://github.com/NCI900-Training-Organisation/intro-to-cupy.git

Previous Next

© Copyright 2025, National Computational Infrastructure.

Built with Sphinx using a theme provided by Read the Docs.