
Update README for new CLI #178

Merged — 3 commits merged into main from hanq_sampler on Sep 10, 2024
Conversation

qihqi (Collaborator) commented Aug 30, 2024

Update README to refer to new CLI instead of old scripts.

This README is not complete yet; more details on flags will be added before the next release.

wang2yn84 (Collaborator) left a comment


Looks good in general; a few nits and small issues. Thank you!

README.md (two outdated review threads, resolved)

README.md (outdated):
```
python run_interactive.py --model_name=$model_name --batch_size=128 --max_cache_length=2048 --quantize_weights=$quantize_weights --quantize_type=$quantize_type --quantize_kv_cache=$quantize_weights --checkpoint_path=$output_ckpt_dir --tokenizer_path=$tokenizer_path --sharding_config=default_shardings/$model_name.yaml
```

To pass a HuggingFace token, add the `--hf_token` flag:

```
jpt serve --model_id meta-llama/Meta-Llama-3-8B-Instruct --hf_token=...
```
Collaborator:

Ditto


* `--sharding_config=<path>` Uses an alternative sharding config instead of
  the ones in the `default_shardings` directory.

Weights downloaded from HuggingFace will be stored by default in the `checkpoints` folder.
Collaborator:

Do we have an option to store the weights somewhere else? We have trouble storing the weights directly on the GCP VM as it is.

qihqi (Collaborator, author):

For a GCS bucket, the weights need to be copied locally or mounted using FUSE.

The working dir can be edited. Added a paragraph to describe that.
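The FUSE mount mentioned above is typically done with `gcsfuse`. A minimal sketch, assuming a bucket and mount point of your choosing (the bucket name `my-weights-bucket` and path `/mnt/weights` here are hypothetical, not from this PR):

```shell
# Create a mount point and mount the GCS bucket with gcsfuse
# (bucket name and mount path are placeholders)
mkdir -p /mnt/weights
gcsfuse my-weights-bucket /mnt/weights

# The mounted directory can then be used as the checkpoint location,
# e.g. via --checkpoint_path as shown earlier in the README.
```

Note that a FUSE mount reads objects over the network on access, so copying the weights to a local disk first is usually faster for repeated loads.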

Collaborator:

It would be great if you could add how to change the working dir, because we also need to point it at an external SSD. I will approve the PR to unblock you for now.

jetstream_pt/fetch_models.py (outdated review thread, resolved)
qihqi changed the title from "Update Jetstream, add optional sampler args." to "Update README for new CLI" on Sep 10, 2024
qihqi merged commit ec4ac8f into main on Sep 10, 2024
4 checks passed
qihqi deleted the hanq_sampler branch on September 10, 2024 at 02:57
4 participants