Move autoround from generate.py to eval.py #868
Conversation
Signed-off-by: yiliu30 <yi4.liu@intel.com>
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/868
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 77889a1 with merge base b4d0768.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Hi @HDCharles @jerryzh168, please have a look, thanks!
Otherwise looks good.
Ready to merge.
* move autoround from generate to eval
* add llama3 back
* update the scripts
* update the scripts
* rename eval_acc.sh -> evals.sh
* update
* update

Signed-off-by: yiliu30 <yi4.liu@intel.com>
As Auto-Round is an algorithm focused on improving accuracy rather than performance, this PR moves it from generate.py to eval.py.

cc @thuang6
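For context, here is a minimal sketch of the accuracy-oriented workflow this change targets: quantize a model with Auto-Round and then measure perplexity, rather than benchmarking generation latency. This is illustrative only and does not reproduce torchao's eval.py; the `AutoRound` call pattern follows Intel's upstream auto-round package, and the checkpoint name, bit width, and evaluation text are assumptions.

```python
# Illustrative sketch (not torchao's eval.py): quantize with Auto-Round, then
# measure accuracy via perplexity. Checkpoint, bits/group_size, and the eval
# text below are assumptions for demonstration only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from auto_round import AutoRound  # Intel's auto-round package

model_name = "meta-llama/Llama-2-7b-hf"  # hypothetical checkpoint
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Auto-Round tunes weight rounding to preserve accuracy; it does not speed up
# generation by itself, which is why it fits an eval flow rather than generate.
# Assumes quantize() updates `model` in place, as in auto-round's README usage.
autoround = AutoRound(model, tokenizer, bits=4, group_size=128)
autoround.quantize()

@torch.no_grad()
def perplexity(model, tokenizer, text: str) -> float:
    """Token-level perplexity of `text` under `model` (an accuracy metric)."""
    enc = tokenizer(text, return_tensors="pt").to(model.device)
    out = model(**enc, labels=enc["input_ids"])
    return torch.exp(out.loss).item()

print("ppl:", perplexity(model, tokenizer, "The quick brown fox jumps over the lazy dog."))
```

A generation benchmark, by contrast, would time tokens/sec and leave accuracy unmeasured, which is why keeping Auto-Round next to the perplexity/eval path is the cleaner split.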