eval script for llama #374
Summary: previously we were only doing this in the tests, but now we have an eval script to go along with generate.py
Test Plan:
python eval.py -q "int4wo-64-gptq"
Reviewers:
Subscribers:
Tasks:
Tags:
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/374
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit efb18ce with merge base bc2f8b7.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@@ -0,0 +1,134 @@
# Copyright (c) Meta Platforms, Inc. and affiliates.
Thanks! Since this is not tested in CI, would you mind printing the expected output and putting it in the GitHub issue description?
I added the values to the PR description.
* eval script for llama
  Summary: previously we were only doing this in the tests, but now we have an eval script to go along with generate.py
  Test Plan: python eval.py -q "int4wo-64-gptq"
  Reviewers: Subscribers: Tasks: Tags:
* new api
  Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:
Summary: previously we were only doing this in the tests, but now we have an eval script to go along with generate.py
Test Plan:
python eval.py -q "int4wo-64-gptq"
Expected results (using meta-llama/Llama-2-7b-chat-hf weights):

| Quantization | Result |
| --- | --- |
| Base (bfloat16) | 12.212 |
| int8dq | 12.262 |
| int8wo | 12.204 |
| int4wo-64 | 12.843 |
| int4wo-64-GPTQ | 12.489 |
Reviewers:
Subscribers:
Tasks:
Tags:
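For anyone comparing a local run against the expected results above, here is a rough sketch that expresses each value as a percentage change from the bfloat16 baseline. The numbers are copied from the table; the `relative_change` helper and the `BASELINE` name are made up for illustration and are not part of eval.py. The PR does not name the metric, so whether a higher or lower number is better is left to the reader.

```python
# Values copied from the expected-results table in the PR description.
# The helper below is a hypothetical convenience, not part of eval.py.
BASELINE = "Base (bfloat16)"

results = {
    "Base (bfloat16)": 12.212,
    "int8dq": 12.262,
    "int8wo": 12.204,
    "int4wo-64": 12.843,
    "int4wo-64-GPTQ": 12.489,
}


def relative_change(results, baseline):
    """Return each result's percentage change relative to the baseline entry."""
    base = results[baseline]
    return {name: (value - base) / base * 100.0 for name, value in results.items()}


changes = relative_change(results, BASELINE)

# Print one row per configuration: name, raw value, signed percent change.
for name, pct in changes.items():
    print(f"{name:<16} {results[name]:>7.3f} {pct:+6.2f}%")
```

A local run that lands far from these percentages for the same weights is probably worth a second look before blaming the quantization technique itself.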