automatically handle rate limit #33

aa755 · 2024-06-05T14:58:31Z

Do any one have a variant of openai-chat that would automatically handle rate limit: retry automatically after the time openai suggests in the error?

jcs090218 · 2024-06-05T18:46:19Z

Not sure what to handle. The rate limit means your account is running out of quota. 🤔 Retry wouldn't help in these circumstances.

aa755 · 2024-06-07T13:12:18Z

Here is an example response I was getting from the openai-chat function in this repo:

((error (message . Rate limit reached for gpt-4o in organization xxxxxxxxxx on tokens per min (TPM): Limit 30000, Used 25317, Requested 8450. Please try again in 7.534s. Visit https://platform.openai.com/account/rate-limits to learn more.) (type . tokens) (param) (code . rate_limit_exceeded)))

It was a long prompt (27KB text).
For now, I added a 60s sleep before all calls to openai-chat and I never get this error anymore. But this is too conservative. Ideally, openai-chat should have an option to automatically try after waiting for the time mentioned in the error message (7.534s in this case)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

automatically handle rate limit #33

automatically handle rate limit #33

aa755 commented Jun 5, 2024

jcs090218 commented Jun 5, 2024

aa755 commented Jun 7, 2024 •

edited

Loading

automatically handle rate limit #33

automatically handle rate limit #33

Comments

aa755 commented Jun 5, 2024

jcs090218 commented Jun 5, 2024

aa755 commented Jun 7, 2024 • edited Loading

aa755 commented Jun 7, 2024 •

edited

Loading