Use async/multi-threaded requests #258
Replies: 2 comments 1 reply
-
Hi @krrishdholakia! Let me convert this into a discussion.
-
Thanks for offering to submit a PR! We've considered threading here. The issue is that some LLM providers rate-limit or block requests sent with the same token if too many occur in parallel. I know for sure that OpenAI does this, and I suspect that Anthropic and Cohere aren't much different in this regard. We are aware that executing prompts one after the other is a very unsatisfactory solution. OpenAI supports batching in their deprecated Completions API.
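One way to reconcile the two concerns above is to parallelize with a small, bounded worker pool rather than firing all requests at once. This is just a sketch of that idea, not spacy-llm's actual code; `call_api` is a hypothetical stand-in for the per-prompt HTTP request:

```python
# Sketch: parallelize REST calls with a bounded thread pool so the number
# of in-flight requests per token stays small enough to avoid provider-side
# rate limiting. `call_api` is a hypothetical placeholder, not spacy-llm's
# actual request function.
from concurrent.futures import ThreadPoolExecutor
from typing import Iterable, List


def call_api(prompt: str) -> str:
    # Placeholder for a blocking HTTP POST to the provider's endpoint.
    return f"response:{prompt}"


def query_parallel(prompts: Iterable[str], max_workers: int = 4) -> List[str]:
    # max_workers caps concurrent requests with the same token, which is
    # what tends to trigger rate limits when it grows unbounded.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves prompt order in the returned responses.
        return list(pool.map(call_api, prompts))


print(query_parallel(["a", "b", "c"]))  # ['response:a', 'response:b', 'response:c']
```

The `max_workers` knob could even be exposed as a model config option, so users hitting 429s can dial concurrency back down to 1.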
-
Hey @rmitsch / @kabirkhan,
have y'all considered using async/threading here to make this call faster?
https://github.com/explosion/spacy-llm/blob/f03da9094ee49626ae3aaccd3129e7c3237454ee/spacy_llm/models/rest/anthropic/model.py#L96C1-L99C10
Happy to make a PR to help out here. I'm working on a library to simplify LLM API calling and noticed y'all call the REST endpoints directly, which is awesome!