Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support int8 KVCacheQuant and W8A8 inference in vllm #1112
Support int8 KVCacheQuant and W8A8 inference in vllm #1112
Changes from all commits
e08acaa
387c804
68cd1e0
ca088d6
6bde51e
931e51c
c0c2a4d
3bb6e31
bc9fada
976874d
2c0c311
27e3b4b
e6f45ff
96c10ca
4be7d83
be6f7b8
5ffc537
347397c
030a100
2805edc
06cfa3f
97b5c69
4d5c1a7
9176b1f
9f872d9
892c589
538947d
bf3eb58
a0be417
52af06e
627b766
dfc9572
e025b66
9eba3c3
1e60348
eab850d
d3735c7
3e7874c
4ee29a9
b746c0c
8893069
b3bdc50
219738f
074e86b
d69100d
3e81f3d
0ea256f
29939aa
d8f7d5a
74bd08f
e9b2fa4
6f88787
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing