forked from pytorch/ao
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* eval and GPTQ work Summary: fleshing out the eval code so it works reliably, adding ci, adding gptq. fixed defaults for eval/gptq so they generally working meaningfully without being specified. note, we need a better way to save/load gptq models since they take so long to quantize. I tried using .so but it doesn't seem to work reliably. also added eval and gptq to ci. Test Plan: python eval.py --checkpoint-path checkpoints/$MODEL_REPO/model.pth \ --device cuda --dtype bfloat16 python eval.py --checkpoint-path checkpoints/$MODEL_REPO/model.pth \ --dtype bfloat16 --device cuda \ --quant '{"linear:int4" : {"groupsize" : 32} }' \ --compile python eval.py --checkpoint-path checkpoints/$MODEL_REPO/model.pth \ --dtype bfloat16 --device cuda \ --quant '{"linear:int4" : {"groupsize" : 32} }' python eval.py --checkpoint-path checkpoints/$MODEL_REPO/model.pth \ --dtype bfloat16 --device cuda \ --quant '{"linear:int4-gptq" : {"groupsize" : 32} }' ...running... Reviewers: Subscribers: Tasks: Tags: * fix language in help doc Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: * declare scales_and_zeros --------- Co-authored-by: HDCharles <charlesdavidhernandez@gmail.com>
- Loading branch information
Showing
10 changed files
with
576 additions
and
51 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.