Add timeout param to all chat_async methods, add backup model param #36

rishsriv · 2024-12-13T08:57:53Z

Context

We sometimes have scenarios where the LLM we are using goes down sporadically, or just has an insanely long response time. For example, Sonnet will, in about 1 in 200 requests, take 60+ seconds to generate a response that usually takes 5 seconds.

Solution

We want to:

Add a timeout parameter to our LLM calls (the OpenAI and Anthropic SDKs support this; Google's python-genai does not yet, from what I can tell)
If the original model either fails or does not respond within the timeout, fall back to the backup model

This PR implements this solution.
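
For readers without the diff at hand, the timeout-then-fallback behavior described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `chat_with_fallback`, `fake_chat`, and the model names are hypothetical, and the timeout is enforced with `asyncio.wait_for` rather than an SDK-level timeout option.

```python
import asyncio
from typing import Awaitable, Callable, Optional

# Hypothetical shape of a chat call: (model, prompt) -> response text.
ChatFn = Callable[[str, str], Awaitable[str]]


async def chat_with_fallback(
    chat_fn: ChatFn,
    model: str,
    prompt: str,
    timeout: float = 30.0,
    backup_model: Optional[str] = None,
) -> str:
    """Call `model`; on error or timeout, retry once with `backup_model`."""
    try:
        return await asyncio.wait_for(chat_fn(model, prompt), timeout=timeout)
    except Exception:
        # No backup configured: re-raise instead of returning None.
        if backup_model is None:
            raise
        return await asyncio.wait_for(
            chat_fn(backup_model, prompt), timeout=timeout
        )


async def fake_chat(model: str, prompt: str) -> str:
    # Stand-in for a real SDK call; "slow-model" simulates a hung request.
    if model == "slow-model":
        await asyncio.sleep(1.0)
    return f"{model}: {prompt}"


def demo() -> str:
    # The primary model exceeds the 50 ms timeout, so the backup answers.
    return asyncio.run(
        chat_with_fallback(
            fake_chat, "slow-model", "hi", timeout=0.05, backup_model="fast-model"
        )
    )
```

In this sketch the backup call gets the same timeout as the primary call, and a failure of the backup propagates to the caller. Whether the actual implementation makes the same choices is not stated in the description.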

rishsriv added 4 commits December 13, 2024 16:44

add timeout param to all chat_async methods

02bb1ed

add backup model parameter

1500286

raise exception in case of error, instead of returning None

c88ee19

fix backup model logic

519959f

rishsriv merged commit 2aa9110 into main Dec 13, 2024
1 check passed

rishsriv deleted the rishabh/add-timeout branch December 13, 2024 08:59
