Fixed issue for T5 base model quantization issue with IPEX smoothquant #946

PenghuiCheng · 2023-12-15T08:11:49Z

Type of Change

bug fix
No API changed

Description

Fixed quantization failure when T5 base model be quantized with IPEX smoothquant.

Expected Behavior & Potential Risk

Quantize successfully

How has this PR been tested?

Local tested

VincyZhang · 2023-12-15T08:26:54Z

https://inteltf-jenk.sh.intel.com/view/ITREX-release/job/ITREX-1.3-release-test/35/

VincyZhang · 2023-12-16T08:21:03Z

https://inteltf-jenk.sh.intel.com/view/ITREX-release/job/ITREX-1.3-release-test/37

VincyZhang · 2023-12-17T10:51:58Z

@PenghuiCheng please check!
Traceback (most recent call last):
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/quantization.py", line 234, in fit
strategy.traverse()
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/strategy/auto.py", line 140, in traverse
super().traverse()
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/strategy/strategy.py", line 482, in traverse
self._setup_pre_tuning_algo_scheduler()
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/strategy/strategy.py", line 360, in _setup_pre_tuning_algo_scheduler
self.model = self._pre_tuning_algo_scheduler("pre_quantization")
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/algorithm/algorithm.py", line 127, in call
self._q_model = algo(self._origin_model, self._q_model, self._adaptor, self._dataloader, self._calib_iter)
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/algorithm/smooth_quant.py", line 89, in call
q_model = adaptor.smooth_quant(
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/adaptor/pytorch.py", line 1796, in smooth_quant
model._model = self.sq.transform(
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/adaptor/torch_utils/smooth_quant.py", line 1286, in transform
out_pre_sq = model_forward_per_sample(self.model, example_inputs, self.device)
File "/home/sdp/miniconda3/envs/pt-ipex-2.1.0+cpu-3.10-spr/lib/python3.10/site-packages/neural_compressor/adaptor/torch_utils/smooth_quant.py", line 110, in model_forward_per_sample
output = forward_wrapper(model, sample[0], device)
KeyError: 0

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

VincyZhang · 2023-12-19T06:34:37Z

https://inteltf-jenk.sh.intel.com/view/ITREX-release/job/ITREX-1.3-release-test/40/

VincyZhang · 2023-12-19T06:51:05Z

https://inteltf-jenk.sh.intel.com/view/ITREX-release/job/ITREX-1.3-release-test/50/

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

VincyZhang · 2023-12-19T09:05:22Z

https://inteltf-jenk.sh.intel.com/job/ITREX-1.3-release-test/52/

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

VincyZhang · 2023-12-20T01:42:54Z

https://inteltf-jenk.sh.intel.com/job/nlp_toolkit_optimize_validation_localtest/3482/

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

VincyZhang · 2023-12-20T08:16:07Z

https://inteltf-jenk.sh.intel.com/job/nlp_toolkit_optimize_validation_localtest/3500/

hshen14 · 2023-12-20T11:59:27Z

requirements.txt

@@ -3,6 +3,7 @@ cmake>=3.16
 py-cpuinfo
 setuptools>=65
 setuptools_scm[toml]>=6.2
+wheel


is it a real package?

VincyZhang · 2023-12-20T14:59:50Z

https://inteltf-jenk.sh.intel.com/job/nlp_toolkit_optimize_validation_localtest/3500/

@PenghuiCheng please check the error, your json format is not correct

Signed-off-by: VincyZhang <wenxin.zhang@intel.com>

VincyZhang · 2023-12-21T03:33:46Z

https://inteltf-jenk.sh.intel.com/job/nlp_toolkit_optimize_validation_localtest/3547/

Signed-off-by: VincyZhang <wenxin.zhang@intel.com>

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

VincyZhang · 2023-12-21T07:49:48Z

https://inteltf-jenk.sh.intel.com/job/nlp_toolkit_optimize_validation_localtest/3556/

Signed-off-by: VincyZhang <wenxin.zhang@intel.com>

VincyZhang · 2023-12-21T09:03:44Z

https://inteltf-jenk.sh.intel.com/job/nlp_toolkit_optimize_validation_localtest/3557/

VincyZhang · 2023-12-21T10:36:30Z

@PenghuiCheng Tuning and benchmark are ok, but acc can not run. Please fix in next PR.

PenghuiCheng requested review from changwangss and VincyZhang December 15, 2023 08:12

VincyZhang added bug Something isn't working ITREX-1.3 labels Dec 15, 2023

delock pushed a commit to delock/intel-extension-for-transformers that referenced this pull request Dec 16, 2023

update CLM readme (intel#946)

7b1d8b7

Fixed issue for T5 base model quantization issue with IPEX smoothquant

43b1a20

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

Fixed example code typo

4b2e58d

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

PenghuiCheng force-pushed the penghuic/Fixed_t5_model_issue branch from 0a79242 to 4b2e58d Compare December 19, 2023 08:44

Fixed import error

1f085a5

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

Fixed pre-CI test fail error

c124720

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

hshen14 reviewed Dec 20, 2023

View reviewed changes

Update requirements.txt

d67c20f

Signed-off-by: VincyZhang <wenxin.zhang@intel.com>

VincyZhang and others added 2 commits December 21, 2023 11:36

Update requirements.txt

d0ead21

Signed-off-by: VincyZhang <wenxin.zhang@intel.com>

Update requirements.txt

f7dfc27

Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com>

Update requirements.txt

1afad31

Signed-off-by: VincyZhang <wenxin.zhang@intel.com>

VincyZhang merged commit 5caf330 into main Dec 21, 2023
2 checks passed

VincyZhang deleted the penghuic/Fixed_t5_model_issue branch December 21, 2023 10:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed issue for T5 base model quantization issue with IPEX smoothquant #946

Fixed issue for T5 base model quantization issue with IPEX smoothquant #946

PenghuiCheng commented Dec 15, 2023

VincyZhang commented Dec 15, 2023

VincyZhang commented Dec 16, 2023

VincyZhang commented Dec 17, 2023

VincyZhang commented Dec 19, 2023

VincyZhang commented Dec 19, 2023

VincyZhang commented Dec 19, 2023 •

edited

Loading

VincyZhang commented Dec 20, 2023

VincyZhang commented Dec 20, 2023

hshen14 Dec 20, 2023

VincyZhang commented Dec 20, 2023

VincyZhang commented Dec 21, 2023

VincyZhang commented Dec 21, 2023

VincyZhang commented Dec 21, 2023

VincyZhang commented Dec 21, 2023

Fixed issue for T5 base model quantization issue with IPEX smoothquant #946

Fixed issue for T5 base model quantization issue with IPEX smoothquant #946

Conversation

PenghuiCheng commented Dec 15, 2023

Type of Change

Description

Expected Behavior & Potential Risk

How has this PR been tested?

VincyZhang commented Dec 15, 2023

VincyZhang commented Dec 16, 2023

VincyZhang commented Dec 17, 2023

VincyZhang commented Dec 19, 2023

VincyZhang commented Dec 19, 2023

VincyZhang commented Dec 19, 2023 • edited Loading

VincyZhang commented Dec 20, 2023

VincyZhang commented Dec 20, 2023

hshen14 Dec 20, 2023

Choose a reason for hiding this comment

VincyZhang commented Dec 20, 2023

VincyZhang commented Dec 21, 2023

VincyZhang commented Dec 21, 2023

VincyZhang commented Dec 21, 2023

VincyZhang commented Dec 21, 2023

VincyZhang commented Dec 19, 2023 •

edited

Loading