Skip to content

[serve.llm][bugfix] Fix the wrong device_capability issue in vllm on quantized models.#51007

Merged
kouroshHakha merged 5 commits intoray-project:masterfrom kouroshHakha:kh/fix-serve-quantizedMar 3, 2025

Commits

Commits on Mar 1, 2025