Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Properly check if all fused layers are in the list of targets ready ONLY add when PR is ready to merge/full CI is needed
#12666 opened Feb 2, 2025 by eldarkurtic Loading…
Fix broken cmake on AMD platform ci/build
#12665 opened Feb 2, 2025 by kagurazakasanae Loading…
[AMD][ROCm] Enable DeepSeek model on ROCm
#12662 opened Feb 2, 2025 by hongxiayang Loading…
[Frontend] support AWS SageMaker inference id documentation Improvements or additions to documentation frontend
#12652 opened Feb 1, 2025 by bmuskalla Loading…
[V1][Metrics] Add several request timing histograms ready ONLY add when PR is ready to merge/full CI is needed v1
#12644 opened Feb 1, 2025 by markmc Draft
[Core] choice-based structured output with xgrammar ci/build ready ONLY add when PR is ready to merge/full CI is needed structured-output
#12632 opened Jan 31, 2025 by russellb Loading…
[Core] Add Additional Metrics to vLLM Server
#12627 opened Jan 31, 2025 by sahelib25 Loading…
[ROCm] Using a more precise memory profiling
#12624 opened Jan 31, 2025 by gshtras Loading…
[Core] Improve hash collision avoidance in prefix caching needs-rebase ready ONLY add when PR is ready to merge/full CI is needed v1
#12621 opened Jan 31, 2025 by russellb Loading…
Fix quark fp8 format loading
#12612 opened Jan 31, 2025 by fxmarty-amd Loading…
ProTip! What’s not been updated in a month: updated:<2025-01-02.