nan problem of Qwen2-72B quantization #519
Merged
Change the weight-scaling formulation; fix the NaN problem when quantizing the Qwen2-72B model
For #498, casper-hansen (#516) and the Qwen team (yangyo@32bf03c?diff=split&w=1) fixed this problem by setting NaN or inf values to 1 as a workaround, but I think this is unreasonable.
I found that the NaNs originate in the weight-scaling step: parts of some weights, such as mlp.gate_proj and mlp.up_proj, fall outside the representable range of float16, so they become 0 when the model is loaded. The NaNs then appear when 0/0 is computed. Adding a small value to the denominator solves the problem.
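
A minimal sketch of where the epsilon goes, assuming an AWQ-style per-channel scale search; the function name `compute_awq_scales`, the tensor names, and the default `eps` value are illustrative, not the actual AutoAWQ code:

```python
import torch

def compute_awq_scales(x_abs_mean: torch.Tensor,
                       w_abs_max: torch.Tensor,
                       ratio: float,
                       eps: float = 1e-4) -> torch.Tensor:
    """Illustrative AWQ-style scale search (hypothetical sketch).

    x_abs_mean: mean absolute activation per input channel
    w_abs_max:  max absolute weight per input channel; can be exactly 0
                when small float16 weights underflow to 0 at load time
    """
    # Without eps, a channel with w_abs_max == 0 makes the denominator 0;
    # if the numerator is also 0 there, 0 / 0 produces NaN, which then
    # poisons the entire tensor through the normalization below.
    scales = x_abs_mean.pow(ratio) / (w_abs_max.pow(1 - ratio) + eps)
    # Normalize so the geometric mean of the min and max scale is 1.
    scales = scales / (scales.max() * scales.min()).sqrt()
    return scales
```

For channels with normal-sized weights the added `eps` is negligible, so it only shifts the degenerate zero-denominator channels away from 0/0, rather than changing the scaling formulation for healthy channels.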