LangChain and LlamaIndex support #233

Closed

ktolias opened this issue Jun 25, 2023 · 4 comments

ktolias commented Jun 25, 2023

Excellent job; it made my LLM blazing fast. I tried it on a T4 (16 GB VRAM) and it seems to lower inference time from about 36 seconds to just 9.

I then tried to use it with LangChain and LlamaIndex, but I got the following error:

ValidationError: 1 validation error for LLMChain
llm
value is not a valid dict (type=type_error.dict)

Can you please provide some guidance?

beratcmn commented:

LangChain integration is easier than it looks. You can wrap vllm.LLM as a custom LLM (see LangChain's custom LLM documentation).
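
For reference, a minimal sketch of such a wrapper, assuming the 2023-era langchain.llms.base.LLM interface; the class name VLLMWrapper and the model/sampling values below are illustrative, not from this thread:

```python
# A minimal sketch, assuming the 2023-era LangChain custom-LLM interface;
# VLLMWrapper and all model/sampling values here are illustrative.
from typing import Any, List, Optional

from langchain.llms.base import LLM
from vllm import LLM as VLLMEngine, SamplingParams


class VLLMWrapper(LLM):
    """Expose a vllm.LLM engine through LangChain's custom LLM interface."""

    engine: Any = None            # the vllm.LLM instance
    sampling_params: Any = None   # a vllm.SamplingParams instance

    @property
    def _llm_type(self) -> str:
        return "vllm_wrapper"

    def _call(self, prompt: str, stop: Optional[List[str]] = None, **kwargs: Any) -> str:
        # vllm.LLM.generate returns a list of RequestOutput objects;
        # take the text of the first completion for the single prompt.
        outputs = self.engine.generate([prompt], self.sampling_params)
        return outputs[0].outputs[0].text


# Hypothetical usage:
# engine = VLLMEngine(model="facebook/opt-125m")
# params = SamplingParams(temperature=0.8, max_tokens=256)
# llm = VLLMWrapper(engine=engine, sampling_params=params)
# print(llm("What is vLLM?"))
```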

ktolias (Author) commented Jun 25, 2023

Even though my custom LLM responds quickly when using vLLM directly, when I hook it into LangChain or LlamaIndex it just hangs.

Any ideas?

CustomLLM (screenshot)

Inference with CustomLLM using vLLM (screenshot)

Inference with LangChain (keeps hanging) (screenshot)

Inference with LlamaIndex (keeps hanging) (screenshots)

ktolias (Author) commented Jun 26, 2023

It seems there is a problem with that specific chain (RetrievalQA).
When I revert to the simple LLMChain, everything works fine with LangChain.
The problem with LlamaIndex still remains.

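For context, a minimal sketch of the plain LLMChain path reported to work here, assuming a LangChain-compatible llm such as the hypothetical VLLMWrapper sketched earlier; the prompt text and variable names are illustrative:

```python
# A minimal sketch of the "simple LLMChain" path; `llm` is assumed to be a
# LangChain-compatible LLM (e.g. the hypothetical VLLMWrapper above).
from langchain import LLMChain, PromptTemplate

# llm = VLLMWrapper(engine=..., sampling_params=...)  # see the earlier sketch

prompt = PromptTemplate(
    input_variables=["question"],
    template="Question: {question}\nAnswer:",
)

chain = LLMChain(llm=llm, prompt=prompt)
print(chain.run(question="What does vLLM optimize?"))
```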

zhuohan123 (Member) commented:

Please refer to https://python.langchain.com/docs/integrations/llms/vllm for the latest LangChain integration!
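
For readers landing here later, a minimal sketch in the style of that integration page; the model name and sampling settings are placeholders, and newer LangChain releases expose this class from langchain_community instead:

```python
# A minimal sketch based on the linked integration page; the model and
# sampling values are illustrative placeholders.
from langchain.llms import VLLM

llm = VLLM(
    model="mosaicml/mpt-7b",   # placeholder model
    trust_remote_code=True,    # needed by some Hugging Face models
    max_new_tokens=128,
    temperature=0.8,
)

print(llm("What is the capital of France?"))
```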
