-
Notifications
You must be signed in to change notification settings - Fork 53
Open
Description
How to run SpMM with half precision? I see that Sputnik only provides an interface for half2. If I have a CSR matrix with val in half precision, do I need to preprocess it? Also, why is the type of column idx is short2? If the matrix is very large, won't it be impossible to store the column indices?
cudaError_t CudaSpmm(int m, int k, int n, int nonzeros,
const int* __restrict__ row_indices,
const half2* __restrict__ values,
const int* __restrict__ row_offsets,
const short2* __restrict__ column_indices,
const half2* __restrict__ dense_matrix,
half2* __restrict__ output_matrix, cudaStream_t stream)Any response would be helpful, thank you.
Metadata
Metadata
Assignees
Labels
No labels