Commit

[Misc] Update benchmark_prefix_caching.py fixed example usage (vllm-project#11920)

Signed-off-by: Ren MinMin <renmm6@chinaunicom.cn>
Co-authored-by: Ren MinMin <renmm6@chinaunicom.cn>
Signed-off-by: Bowen Wang <abmfy@icloud.com>
2 people authored and abmfy committed Jan 24, 2025
1 parent f075824 commit 8a16c31
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion benchmarks/benchmark_prefix_caching.py
@@ -10,7 +10,8 @@
     --model meta-llama/Llama-2-7b-chat-hf \
     --enable-prefix-caching \
     --num-prompts 1 \
-    --repeat-count 100
+    --repeat-count 100 \
+    --input-length-range 128:256
 ShareGPT example usage:
 # This command samples 20 prompts with input lengths
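
The `--input-length-range` value in the example usage is written as `min:max` (here `128:256`). A minimal sketch of how such a value could be parsed, assuming the two bounds are integers separated by a single colon; the helper name `parse_length_range` is hypothetical and is not taken from benchmark_prefix_caching.py:

```python
def parse_length_range(value: str) -> tuple[int, int]:
    """Parse a 'min:max' string into a (min, max) tuple of ints.

    Hypothetical helper for illustration only; the benchmark script
    may parse this flag differently.
    """
    parts = value.split(":")
    if len(parts) != 2:
        raise ValueError(f"expected 'min:max', got {value!r}")
    low, high = int(parts[0]), int(parts[1])
    if low < 0 or low > high:
        raise ValueError(f"invalid range: {low}:{high}")
    return low, high


if __name__ == "__main__":
    print(parse_length_range("128:256"))
```

Rejecting malformed values early keeps a mistyped range from silently producing an empty or inverted sampling window.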
