
Unable to use ChatVertexAI as provider due to default chat's ChatPromptTemplate #264

Open
michaelchia opened this issue Jul 13, 2023 · 5 comments
Labels
bug Something isn't working @jupyter-ai/chatui

Comments

@michaelchia
Collaborator

Problem

Proposed Solution

  • I am not sure what the purpose of the empty AIMessage is. Perhaps review if it is absolutely necessary.
@michaelchia michaelchia added the enhancement New feature or request label Jul 13, 2023
@JasonWeill JasonWeill added bug Something isn't working @jupyter-ai/chatui and removed enhancement New feature or request labels Jul 13, 2023
@dlqqq
Member

dlqqq commented Jul 24, 2023

@michaelchia Hey Michael, I had sent a reply but it appears to have been lost by GitHub 😭. Sorry for the late response.

The reason we add the empty AI message is to indicate to the LLM that it should generate a response to the prompt instead of generating a continuation of the prompt. From our extensive testing with the providers we offer, we determined this to be necessary for certain providers like AI21 and Cohere. The empty AI message is also part of LangChain's default prompt template for conversation chains: https://github.com/langchain-ai/langchain/blob/master/libs/langchain/langchain/chains/conversation/prompt.py
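To illustrate, here is a minimal sketch of the prompt shape being described. This is not actual Jupyter AI or LangChain code; the function and message representation are hypothetical, using simple `(role, content)` tuples.

```python
# Hypothetical sketch: the chat prompt ends with an empty AI message so the
# model generates a reply rather than a continuation of the human turn.
def build_chat_prompt(history, human_input):
    messages = [("system", "You are Jupyternaut, a conversational assistant.")]
    messages += history
    messages.append(("human", human_input))
    # The trailing empty AI message cues providers such as AI21 and Cohere
    # to respond to the prompt instead of extending it.
    messages.append(("ai", ""))
    return messages
```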

Hence, I'm very surprised that the ChatVertexAI provider explicitly raises an error when the last message is an AI message, as this goes against the self-consistency of LangChain itself. Though I haven't used it, a glance at the code suggests that ChatVertexAI would fail with the default conversation chain. Since VertexAI seems to be the exception, I'm inclined to argue that the solution to this issue would be to add a custom Pydantic attribute on the ChatVertexAI provider indicating that it should not have an empty AI message appended. Then our backend will make sure to check that attribute before building the prompt template.
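A rough sketch of what that proposal could look like. The attribute name and class layout here are assumptions for illustration, not Jupyter AI's actual API.

```python
# Hypothetical sketch of the proposed fix; attribute name is an assumption.
class BaseProvider:
    # Most providers tolerate (or require) a trailing empty AI message.
    allows_empty_ai_suffix: bool = True

class VertexAIProvider(BaseProvider):
    # ChatVertexAI raises if the last message is an AI message, so opt out.
    allows_empty_ai_suffix = False

def finalize_prompt(provider, messages):
    # The backend checks the attribute before building the prompt template.
    out = list(messages)
    if provider.allows_empty_ai_suffix:
        out.append(("ai", ""))
    return out
```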

@michaelchia
Collaborator Author

Yep, makes sense. I'd argue against adding this extra attribute on your end, since it adds unnecessary complexity. It seems like a VertexAI issue that should be solved within their LangChain object. On my end, I have a workaround that isn't too hacky (overriding the `_generate` method to remove that extra AIMessage). Thanks for the consideration.
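The workaround described above could be sketched like this. `ChatVertexAIStub` is a hypothetical stand-in for LangChain's `ChatVertexAI` (which would be subclassed the same way); the message format and return value are also illustrative assumptions.

```python
# Stand-in for langchain's ChatVertexAI, which rejects prompts that end
# with an AI message.
class ChatVertexAIStub:
    def _generate(self, messages, **kwargs):
        if messages and messages[-1][0] == "ai":
            raise ValueError("last message must not be an AI message")
        return "response"

class PatchedChatVertexAI(ChatVertexAIStub):
    def _generate(self, messages, **kwargs):
        # Strip the trailing empty AIMessage that Jupyter AI appends
        # before delegating to the original implementation.
        if messages and messages[-1] == ("ai", ""):
            messages = messages[:-1]
        return super()._generate(messages, **kwargs)
```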

@dlqqq
Member

dlqqq commented Jul 24, 2023

Haha, part of the reason we subclass all of the LangChain providers we offer is precisely to work around upstream issues until they're patched. We are inclined to add VertexAI to Jupyter AI, so for other users, the additional attribute would be necessary.

@JasonWeill
Collaborator

See also #226 for customizing prompts for models/providers.

@JasonWeill JasonWeill added this to the 2.2.0 Release milestone Jul 28, 2023
@hinthornw

Hi @dlqqq - Will from the LangChain team here - love what you all are doing with Jupyter AI! We'd love to set up a slack channel with your team to make sure we can prioritize fixes like this and that the modules you are using stably support the project. If you send an email to support@langchain.dev we'll open that line of communication. Thank you!

I know Piyush has made a lot of contributions to the project as well :)
