This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[NeuralChat] Support Neuralchat-vLLM serving with Docker #1187

Merged — 7 commits merged into main from vllm_docker on Feb 5, 2024

Conversation

@Spycsh (Contributor) commented on Jan 24, 2024

Type of Change

feature
API not changed

Description

Support Neuralchat-vLLM serving with Docker

Expected Behavior & Potential Risk

Support Neuralchat-vLLM serving with Docker

How has this PR been tested?

Tested on an NVIDIA GPU.

Dependency Change?

None

@Spycsh Spycsh requested a review from lvliang-intel as a code owner January 24, 2024 08:19
@hshen14 (Contributor) commented on Jan 26, 2024

Can we evaluate how to wrap this in a Python API?

@Spycsh (Contributor, Author) commented on Jan 29, 2024

@hshen14 As of #1120 we already offer a Python API: pass PipelineConfig(serving_config=ServingConfig(...)).

@hshen14 hshen14 merged commit 1988ddc into main Feb 5, 2024
12 checks passed
@hshen14 hshen14 deleted the vllm_docker branch February 5, 2024 05:41

3 participants