Difference with weight normalization #7

Open
blueardour opened this issue Apr 27, 2022 · 0 comments

blueardour commented Apr 27, 2022

Hi,

Thanks for sharing this interesting work. It gives me new insight into quantization.

In the quantization literature I have read, weight normalization is a common trick for improving performance. By weight normalization I mean that the latent weights have their mean subtracted and are divided by their standard deviation before being quantized. Based on my understanding of RobustQuantization, weight normalization aims to drive Kt to zero rather than 1.8. In practice, it improves quantization performance on many tasks.
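
To make the trick concrete, here is a minimal PyTorch sketch of what I mean (the quantizer and the function names are my own placeholders, not this repo's API):

```python
import torch

def quantize_uniform(w, num_bits=4):
    # Simple symmetric uniform quantizer, used only for illustration.
    qmax = 2 ** (num_bits - 1) - 1
    scale = w.abs().max() / qmax
    return torch.round(w / scale).clamp(-qmax - 1, qmax) * scale

def quantize_with_weight_norm(latent_w, num_bits=4, eps=1e-5):
    # Weight normalization as described above: standardize the latent
    # weight (zero mean, unit std) before sending it to the quantizer.
    w = (latent_w - latent_w.mean()) / (latent_w.std() + eps)
    return quantize_uniform(w, num_bits)
```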

From Figure 3(b) in this paper, accuracy goes up as Kt decreases. I wonder if you have tried any Kt lower than 1.8. I am just very curious whether it shares some common benefit with weight normalization.

Thanks

Peng
