I'd like to implement this after all the existing issues are implemented.
Current obstacle is that I don't have access to multiple GPUs I can develop on.
The GPUs at my work are on CUDA 12, which requires NVHPC 22.3, which requires a newer OS than what we have.
I'd like to implement this after all the existing issues are implemented.
Current obstacle is that I don't have access to multiple GPUs I can develop on.
The GPUs at my work are on CUDA 12, which requires NVHPC 22.3, which requires a newer OS than what we have.