
INT8 Quantization #298

Open
sriram487 opened this issue Feb 4, 2025 · 2 comments

Comments

@sriram487

I'm working on quantizing the model. Accuracy remains stable at FP16 precision, but it degrades significantly when quantizing to INT8 or FP8. The issue could be related to the calibration data: for INT8 explicit quantization, at least 500 images are typically recommended for calibrating CNNs. However, I'm not sure about the minimum number of samples required or the appropriate batch size for the calibration data, especially since the batch size changes between registration and tracking, which could affect the calibration parameters.

What data should be used for calibration? Since the model is trained on synthetic data, is it acceptable to use a portion of that synthetic training data for calibration?
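For reference, the usual post-training calibration idea can be sketched in a few lines. This is a generic, library-independent illustration of symmetric per-tensor INT8 calibration with percentile clipping (conceptually similar to what TensorRT's calibrators do), not code from this repository; the activation data below is synthetic and hypothetical.

```python
import random

def int8_scale(samples, percentile=99.9):
    # Symmetric per-tensor calibration: choose an absolute-value threshold
    # from the calibration activations, then map that threshold to 127
    # (the maximum int8 magnitude) to obtain the quantization scale.
    mags = sorted(abs(x) for batch in samples for x in batch)
    idx = min(len(mags) - 1, int(len(mags) * percentile / 100))
    threshold = mags[idx]
    return threshold / 127.0

def quantize(x, scale):
    # Quantize a single activation value to int8, clamping to [-128, 127].
    q = round(x / scale)
    return max(-128, min(127, q))

# Hypothetical calibration set: a few "batches" of float activations,
# standing in for activations collected from representative inputs.
random.seed(0)
calib = [[random.gauss(0.0, 1.0) for _ in range(256)] for _ in range(8)]
scale = int8_scale(calib)
```

The practical takeaway is that the scale is only as good as the activation distribution it is computed from, which is why calibration inputs should match the inference-time input distribution; clipping outliers via a percentile (rather than a plain abs-max) often recovers some of the accuracy lost to INT8.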

@sriram487 sriram487 changed the title INT8 Calibration. INT8 Quantization Feb 5, 2025
@wenbowen123
Collaborator

I haven't tried quantizing, but I'm also curious. You're welcome to report back your findings :)

@A7eNg

A7eNg commented Feb 5, 2025

+1, waiting for the quantization results
