Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Initial commit of Amazon Q Business runnable for langchain #301

Open
wants to merge 10 commits into
base: main
Choose a base branch
from

Conversation

tomron-aws
Copy link

No description provided.

Copy link
Collaborator

@michaelnchin michaelnchin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the contribution @tomron-aws ! Added a few comments.

Also, can you provide some additional context on what use-cases you're targeting with this ChatSync API integration?

Depending on your goals, it may be better to write this as a base Runnable interface, which can be simpler and more flexible to use than the specialized LLM interface that implements it.

Code changes should be minimal if you decide to switch (i.e. change _call/_acall to invoke/ainvoke, remove _llm_type and _identifying_params).

libs/aws/langchain_aws/llms/q_business.py Outdated Show resolved Hide resolved
libs/aws/langchain_aws/llms/q_business.py Outdated Show resolved Hide resolved
Comment on lines 83 to 84
print("Prompt Length (Amazon Q ChatSync API takes a maximum of 7000 chars)")
print(len(prompt))
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These look like debug print statements. Should they be removed or replaced with logger statements?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've been debating that. Currently Amazon Q ChatSync API only supports a max of 7000 chars. Its helpful to debug in the case that you are passing along additional context with the prompt.

I think a good place for that log would be in the error handling block, which I added. Let me know what you think

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks - let's remove L83/84 in favor of the new changes.

libs/aws/langchain_aws/llms/q_business.py Outdated Show resolved Hide resolved
libs/aws/langchain_aws/llms/q_business.py Outdated Show resolved Hide resolved
}

# Call Amazon Q
response = self.client.chat_sync(**request)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Before attempting to call chat_sync API, can we run some validation code to check if the QBusiness boto client has already been defined externally? If it doesn't exist, we should attempt to create it.

Refer here for an example: https://github.com/langchain-ai/langchain-aws/blob/main/libs/aws/langchain_aws/llms/sagemaker_endpoint.py#L271

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I implemented this as well. Let me know if that satisfies the requirements.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, thanks!

@3coins 3coins added the enhancement New feature or request label Dec 16, 2024
logging.info(f"Prompt Length: {len(prompt)}")
print(f"""Prompt:
{prompt}""")
raise ValueError(f"Error raised by Amazon Q service: {e}")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove this line, as it's redundant with L123. Or change the ValueError message here to specifically reference the prompt length constraint

…te client using credentials sine anonymous invocation of chatsync with userId is no longer suppported
@tomron-aws
Copy link
Author

Changed to runnable. Also finished implementing and testing the client validator

Comment on lines 53 to 56
@property
def _llm_type(self) -> str:
"""Return type of llm."""
return "amazon_q"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove this method for non-LLM class

Comment on lines 61 to 67
def _call(
self,
prompt: str,
stop: Optional[List[str]] = None,
run_manager: Optional[CallbackManagerForLLMRun] = None,
**kwargs: Any,
) -> str:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"""Method to access the full response from the last call"""
return self._last_response

async def _acall(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

libs/aws/langchain_aws/runnables/q_business.py Outdated Show resolved Hide resolved
libs/aws/langchain_aws/runnables/q_business.py Outdated Show resolved Hide resolved
libs/aws/langchain_aws/runnables/q_business.py Outdated Show resolved Hide resolved
libs/aws/langchain_aws/runnables/q_business.py Outdated Show resolved Hide resolved
libs/aws/langchain_aws/runnables/q_business.py Outdated Show resolved Hide resolved
libs/aws/langchain_aws/runnables/q_business.py Outdated Show resolved Hide resolved
Comment on lines 119 to 131
async def ainvoke(
self,
input: str,
) -> str:
"""Async call to Amazon Q service."""

def _execute_call():
return self.invoke(input)

# Run the synchronous call in a thread pool
return await asyncio.get_running_loop().run_in_executor(
None, _execute_call
)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe Runnable defaults to this behavior, so this might be redundant. Can we test after removing this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants