Hello, I noticed that in the quantization process you use the operation w_bar = tf.round(tf.stop_gradient(w_hard - w_soft) + w_soft). However, tf.round is non-differentiable, so the outer round blocks gradients from backpropagating to the encoder, and the encoder parameters are never updated during training. I believe the correct straight-through formulation is w_bar = tf.stop_gradient(w_hard - w_soft) + w_soft: its forward value is still w_hard, but its gradient flows through w_soft.
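To make the difference concrete, here is a minimal sketch of the straight-through estimator's gradient behavior. It uses a tiny hand-rolled forward-mode dual number instead of TensorFlow's autodiff, so the Dual class and the stop_gradient/round_op helpers are illustrative stand-ins for tf.stop_gradient and tf.round, not real TF APIs:

```python
class Dual:
    """A value together with its derivative w.r.t. the input variable."""
    def __init__(self, val, grad):
        self.val, self.grad = val, grad
    def __add__(self, other):
        return Dual(self.val + other.val, self.grad + other.grad)
    def __sub__(self, other):
        return Dual(self.val - other.val, self.grad - other.grad)

def stop_gradient(x):
    # Like tf.stop_gradient: same forward value, zero derivative.
    return Dual(x.val, 0.0)

def round_op(x):
    # Like tf.round: piecewise constant, so its derivative is 0 almost everywhere.
    return Dual(float(round(x.val)), 0.0)

w_soft = Dual(0.75, 1.0)      # d(w_soft)/d(w_soft) = 1
w_hard = round_op(w_soft)     # quantized value 1.0, zero gradient

# Straight-through version: forward value is w_hard, gradient passes through w_soft.
w_bar_ok = stop_gradient(w_hard - w_soft) + w_soft
# Version with the outer round: same forward value, but the gradient is killed.
w_bar_bug = round_op(stop_gradient(w_hard - w_soft) + w_soft)

print(w_bar_ok.val, w_bar_ok.grad)    # 1.0 1.0 -> gradient reaches the encoder
print(w_bar_bug.val, w_bar_bug.grad)  # 1.0 0.0 -> encoder receives no gradient
```

Both expressions produce the same quantized forward value, so the bug is invisible in the outputs; it only shows up as a zero gradient upstream of the quantizer, which matches the encoder never being updated.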