Open
Description
Hi @wangmaolin,
I find your work very interesting.
I saw your implementation of int4_conv in the cutlass extension and noticed that the weight is unsigned:
    cutlass::TensorRef<ElementInputB, LayoutInputB> weight_ref(reinterpret_cast<ElementInputB *>(weight.data<uint8_t>()), LayoutInputB::packed(filter_size));
Did you test this function in real CNNs? Since weights are normally zero-centered, does a conversion need to be applied to the weights before calling this int4_conv function in order to get the right results?
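To make the question concrete, here is a minimal sketch of the kind of conversion I have in mind. It assumes the signed 4-bit weights lie in [-8, 7], shifts them by 8 into the unsigned range [0, 15], and subtracts 8 times the sum of the activations afterwards to recover the signed result. The function names and the nibble order (low nibble first) are hypothetical and are not taken from this repository or from CUTLASS.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: shift signed 4-bit weights in [-8, 7] to unsigned
// values in [0, 15] and pack two per byte (low nibble first; assumed layout).
std::vector<uint8_t> pack_offset_int4(const std::vector<int>& w) {
    std::vector<uint8_t> packed((w.size() + 1) / 2, 0);
    for (std::size_t i = 0; i < w.size(); ++i) {
        const uint8_t u = static_cast<uint8_t>(w[i] + 8) & 0x0F;
        const uint8_t nibble = (i % 2 == 0) ? u : static_cast<uint8_t>(u << 4);
        packed[i / 2] = static_cast<uint8_t>(packed[i / 2] | nibble);
    }
    return packed;
}

// Reference dot product with the original signed weights.
int dot_signed(const std::vector<int>& x, const std::vector<int>& w) {
    int acc = 0;
    for (std::size_t i = 0; i < x.size(); ++i) acc += x[i] * w[i];
    return acc;
}

// Dot product computed on the unsigned, packed weights, followed by the
// offset correction: sum(x * w) = sum(x * u) - 8 * sum(x), where u = w + 8.
int dot_unsigned_with_correction(const std::vector<int>& x,
                                 const std::vector<uint8_t>& packed) {
    int acc = 0;
    int xsum = 0;
    for (std::size_t i = 0; i < x.size(); ++i) {
        const int u = (i % 2 == 0) ? (packed[i / 2] & 0x0F)
                                   : (packed[i / 2] >> 4);
        acc += x[i] * u;
        xsum += x[i];
    }
    return acc - 8 * xsum;
}
```

If the kernel really interprets the weights as unsigned, I would expect something like this offset and correction to be needed around int4_conv; if the element type is actually signed, no such step should be required.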
Thanks in advance.