Issue : Groq API Rate Limit Error Handling #601

jiveshkalra · 2024-06-13T18:05:16Z

Describe your issue

Currently the Groq client in the src/llm/groq_client.py doesnt have any error handling , while such code might be okay for other APIs which have bigger token limit but Groq Llama3 70b has a rate limit of 6k tokens PER MINUTE, which get reached very easily by devika
and as soon as that limit is reached , error pops up in the devika backend console and the current task gets stopped.

How To Reproduce

Steps to reproduce the behavior (example):

Setup Devika with GROQ API
Select Llama3 70B as a model
Give a big problem that requires longer token limit (tbh any complex task will do)

Expected behavior

If the rate limit is reached for that minute , the agent should take a pause and then resume the task instead of abruptly stopping everything

Screenshots and logs

Configuration

- OS: Windows
- Python version: 3.11.5
- Node version:  20.9.0 
- search engine: DuckDuckGo
- Model: Groq Llama 3 70B

The text was updated successfully, but these errors were encountered:

jiveshkalra changed the title ~~GROQ API RATE LIMIT ERROR HANDLING~~ Issue : Groq API Rate Limit Error Handling Jun 13, 2024

jiveshkalra mentioned this issue Jun 14, 2024

Fix : Groq TPM Limit Handling #602

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue : Groq API Rate Limit Error Handling #601

Issue : Groq API Rate Limit Error Handling #601

jiveshkalra commented Jun 13, 2024

Issue : Groq API Rate Limit Error Handling #601

Issue : Groq API Rate Limit Error Handling #601

Comments

jiveshkalra commented Jun 13, 2024

Describe your issue

How To Reproduce

Expected behavior

Screenshots and logs

Configuration