Skip to main content
Version: main

profile_cuda

The profile_cuda gadget profiles CUDA memory allocations in libcuda.so (Driver API). It traces the following CUDA Driver API functions:

  • cuMemAlloc_v2 - Allocate device memory
  • cuMemAllocHost_v2 - Allocate page-locked host memory
  • cuMemAllocManaged - Allocate managed memory
  • cuMemAllocPitch_v2 - Allocate pitched device memory

The gadget aggregates memory allocations by user stack trace, allowing you to identify which code paths are responsible for CUDA memory usage.

Getting started

Running the gadget:

$ kubectl gadget run ghcr.io/inspektor-gadget/gadget/profile_cuda:latest [flags]

Guide

TODO