Skip to content

Commit

Permalink
adding quantizing notes for pytorch files
Browse files Browse the repository at this point in the history
  • Loading branch information
Shaun Prince committed Mar 26, 2024
1 parent 7cd9a24 commit 7d11054
Show file tree
Hide file tree
Showing 3 changed files with 8 additions and 5 deletions.
5 changes: 5 additions & 0 deletions awq/NOTES.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
# QuantKit

```bash
quantkit safetensor cognitivecomputations/dolphin-2.8-gemma-2b --delete-original
```
4 changes: 1 addition & 3 deletions awq/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,8 +1,6 @@
#autoawq==0.1.8
#autoawq-kernels==0.0.1
autoawq
autoawq-kernels
transformers
torch
pandas
#exllamav2
llm-quantkit
4 changes: 2 additions & 2 deletions awq/run-quant-awq.py
Original file line number Diff line number Diff line change
Expand Up @@ -8,8 +8,8 @@

access_token = os.getenv('HUGGING_FACE_HUB_TOKEN')

model_path = "ResplendentAI/DaturaCookie_7B"
quant_path = 'temp/DaturaCookie_7B-AWQ'
model_path = "temp/dolphin-2.8-gemma-2b"
quant_path = 'temp/dolphin-2.8-gemma-2b-AWQ'

quant_config = {
"zero_point": True,
Expand Down

0 comments on commit 7d11054

Please sign in to comment.