-
|
i use cudaGraphKernelNodeGetParams get the node params
struct cudaKernelNodeParams
{
void* func;
dim3 gridDim;
dim3 blockDim;
unsigned int sharedMemBytes;
void **kernelParams;
void **extra;
};
i capture a big cuda graph, i wan't to get all params of each kernel, but it seems that has no api can do that |
Beta Was this translation helpful? Give feedback.
Answered by
galv
Apr 6, 2025
Replies: 1 comment 1 reply
-
|
Hi @pipul Indeed it seems while |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hello pipul, there is not a direct API, but you can do this https://github.com/pytorch/pytorch/pull/137318/files#diff-d7302d133bb5e0890fc94de9aeea4d9d442555a3b40772c9db10edb5cf36a35cR424-R429
In other words:
I'm sure you can translate that to python.