-
-
Notifications
You must be signed in to change notification settings - Fork 5.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Turn shareGPT data into a standard benchmark #45
Comments
tianyil1
pushed a commit
to tianyil1/vllm
that referenced
this issue
Jun 5, 2024
fxmarty
pushed a commit
to fxmarty/vllm-public
that referenced
this issue
Jun 12, 2024
yukavio
pushed a commit
to yukavio/vllm
that referenced
this issue
Jul 3, 2024
…t#45) Tested by checking the help message in openai server: ``` python -m vllm.entrypoints.openai.api_server --help ``` Before: ``` --sparsity {sparse_w16a16,None}, -s {sparse_w16a16,None} Method used to compress sparse weights. If None, we first check the `sparsity_config` attribute in the model config file. If that is None we assume the model weights are dense ``` After: ``` --sparsity {None,sparse_w16a16,semi_structured_sparse_w16a16}, -s {None,sparse_w16a16,semi_structured_sparse_w16a16} Method used to compress sparse weights. If None, we first check the `sparsity_config` attribute in the model config file. If that is None we assume the model weights are dense ```
3 tasks
Closed
jikunshang
pushed a commit
to jikunshang/vllm
that referenced
this issue
Aug 19, 2024
jikunshang
pushed a commit
to jikunshang/vllm
that referenced
this issue
Sep 11, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
The text was updated successfully, but these errors were encountered: