Fix subgraph quantization regression in onnxruntime 1.17 #19421
Conversation
Is it possible to add a unit test to make sure it does not break again?
Sure will do!
@xadupre I added a test.
/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline
/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed, ONNX Runtime React Native CI Pipeline, Windows x64 QNN CI Pipeline
/azp run Linux MIGraphX CI Pipeline, orttraining-amd-gpu-ci-pipeline
Azure Pipelines successfully started running 2 pipeline(s).
Azure Pipelines successfully started running 9 pipeline(s).
1 similar comment
Azure Pipelines successfully started running 9 pipeline(s).
@fxmarty, could you set up lintrunner and run
for attr in node.attribute:
    if attr.type == onnx.AttributeProto.GRAPH:
        for initializer in attr.g.initializer:
            self.assertTrue("shared.weight" not in initializer.name)
Check notice (Code scanning / CodeQL): Imprecise assert (Note, test)
/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline
/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed, ONNX Runtime React Native CI Pipeline, Windows x64 QNN CI Pipeline
/azp run Linux MIGraphX CI Pipeline, orttraining-amd-gpu-ci-pipeline, Big Models
Azure Pipelines successfully started running 9 pipeline(s).
Azure Pipelines successfully started running 3 pipeline(s).
Azure Pipelines successfully started running 9 pipeline(s).
As per title, fixes #19418
ONNX Runtime 1.17 broke quantization of ONNX models containing subgraphs when initializers are placed on the top-level graph and the same initializer is used by several subgraphs.
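A minimal sketch of the kind of model the regression affects (not the PR's actual test; the graph structure and the name "shared.weight" are illustrative): a top-level initializer referenced from both branches of an If node via outer-scope name resolution, then passed through dynamic quantization.

```python
import numpy as np
import onnx
from onnx import TensorProto, helper, numpy_helper
from onnxruntime.quantization import quantize_dynamic


def build_model() -> onnx.ModelProto:
    # Initializer lives on the top-level graph but is only consumed
    # inside the If subgraphs (outer-scope name resolution).
    shared = numpy_helper.from_array(
        np.random.rand(4, 4).astype(np.float32), name="shared.weight")

    def branch(prefix: str) -> onnx.GraphProto:
        # Each branch MatMuls the outer input "x" against the shared weight.
        matmul = helper.make_node(
            "MatMul", ["x", "shared.weight"], [prefix + "_y"])
        out = helper.make_tensor_value_info(
            prefix + "_y", TensorProto.FLOAT, [1, 4])
        return helper.make_graph([matmul], prefix, [], [out])

    cond = helper.make_tensor_value_info("cond", TensorProto.BOOL, [])
    x = helper.make_tensor_value_info("x", TensorProto.FLOAT, [1, 4])
    y = helper.make_tensor_value_info("y", TensorProto.FLOAT, [1, 4])
    if_node = helper.make_node(
        "If", ["cond"], ["y"],
        then_branch=branch("then_g"), else_branch=branch("else_g"))
    graph = helper.make_graph(
        [if_node], "main", [cond, x], [y], initializer=[shared])
    return helper.make_model(
        graph, opset_imports=[helper.make_opsetid("", 17)])


onnx.save(build_model(), "model.onnx")
# Quantizing a model of this shape is what regressed in 1.17;
# after the fix it should again produce a valid quantized model.
quantize_dynamic("model.onnx", "model_quant.onnx")
```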