I used pytorch-quantization to perform PTQ int8 quantization on ResNet50 and exported it to onnx, followed by exporting it to engine. trt. When reasoning, I found that the speed did not increase, but instead slowed down. What went wrong. #6716
Triggered via issue
January 13, 2025 05:44
Status
Skipped
Total duration
5s
Artifacts
–