429 Too Many Requests #388
Replies: 7 comments
-
Hi @arnomoonens, it looks like you've hit OpenAI's API rate limit. There's not much we can do about that directly, but, as for all models, you can increase the …
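As general background (not specific to this project), a common client-side way to soften 429 responses is retrying with exponential backoff. A minimal, library-agnostic sketch; the error type, the stubbed API call, and the tiny delays are placeholders (real code would catch the client's own rate-limit exception and use delays of a second or more):

```python
import random
import time

def call_with_backoff(fn, max_retries=5, base_delay=0.01):
    """Retry fn() with exponential backoff when it signals a rate limit."""
    for attempt in range(max_retries):
        try:
            return fn()
        except RuntimeError as err:  # stand-in for your client's 429 error type
            if "429" not in str(err) or attempt == max_retries - 1:
                raise
            # Wait base_delay * 2^attempt plus a little jitter, then retry.
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))

# Stub API call that returns 429 twice before succeeding.
calls = {"n": 0}
def flaky_api_call():
    calls["n"] += 1
    if calls["n"] < 3:
        raise RuntimeError("429 Too Many Requests")
    return "ok"

result = call_with_backoff(flaky_api_call)
```

This doesn't raise your rate limit, but it keeps a long pipeline run from dying on a transient 429.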
-
Thanks for the information.
-
Yeah, that's confusing. Can you let me know the exact error message you're getting with the …
-
Thanks! We've added this to our backlog and will improve the error message for this case.
-
Hi, I'm the maintainer of LiteLLM. We provide an open-source proxy for load balancing across Azure OpenAI, OpenAI, and any LiteLLM-supported LLM. From this thread it looks like you're running into 429 rate-limit errors. Our proxy lets you maximize throughput by load balancing between Azure OpenAI instances. I hope our solution makes this easier for you (I'd love feedback if you try it).

Here's the quick start. Docs: https://docs.litellm.ai/docs/simple_proxy#load-balancing---multiple-instances-of-1-model

Step 1: Create a config.yaml:

model_list:
  - model_name: gpt-4
    litellm_params:
      model: azure/chatgpt-v-2
      api_base: https://openai-gpt-4-test-v-1.openai.azure.com/
      api_version: "2023-05-15"
      api_key:
  - model_name: gpt-4
    litellm_params:
      model: azure/gpt-4
      api_key:
      api_base: https://openai-gpt-4-test-v-2.openai.azure.com/
  - model_name: gpt-4
    litellm_params:
      model: azure/gpt-4
      api_key:
      api_base: https://openai-gpt-4-test-v-2.openai.azure.com/

Step 2: Start the LiteLLM proxy.

Step 3: Make a request to the LiteLLM proxy.
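The commands for steps 2 and 3 didn't survive in this thread. As a hedged sketch (the proxy address and port are assumptions; check the LiteLLM docs linked above): the proxy is typically started with `litellm --config config.yaml`, after which it exposes an OpenAI-compatible HTTP API you can call with any client. The snippet below only builds such a request rather than sending it, since it assumes a locally running proxy:

```python
import json

# Step 2 (assumption): start the proxy with the config above, e.g.
#   litellm --config config.yaml
# which exposes an OpenAI-compatible HTTP API locally.

PROXY_BASE = "http://0.0.0.0:4000"  # assumed local address; verify in LiteLLM docs

def build_chat_request(model, user_message):
    """Build an OpenAI-style /chat/completions request for the proxy."""
    return {
        "url": f"{PROXY_BASE}/chat/completions",
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({
            "model": model,
            "messages": [{"role": "user", "content": user_message}],
        }),
    }

request = build_chat_request("gpt-4", "Hello")
# Send with any HTTP client, e.g. requests.post(request["url"], ...)
```

The proxy then routes each "gpt-4" request to one of the three Azure deployments in the config.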
-
Thanks for letting us know! We'll take it into consideration. We'll also convert this into a discussion, as it's not a bug/feature request per se.
-
I am getting a 429 error from the OpenAI API when iterating over a model.pipe result. I am using @llm_tasks = "spacy.NER.v2" and …

This is the traceback: …

Could you please help with this? Is there some kind of throttling mechanism that could resolve this?
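On the throttling question: one generic client-side option, independent of any particular library, is to enforce a minimum interval between successive requests so you stay under the provider's requests-per-minute limit. A minimal sketch (the 0.05 s interval is just for illustration; in practice you would derive it from your actual rate limit):

```python
import time

class MinIntervalThrottle:
    """Enforce a minimum interval between successive calls."""

    def __init__(self, interval):
        self.interval = interval
        self._last = 0.0

    def wait(self):
        # Sleep just long enough so calls are at least `interval` apart.
        remaining = self.interval - (time.monotonic() - self._last)
        if remaining > 0:
            time.sleep(remaining)
        self._last = time.monotonic()

throttle = MinIntervalThrottle(interval=0.05)
start = time.monotonic()
for _ in range(3):
    throttle.wait()  # in real use, call this before each API request
elapsed = time.monotonic() - start
```

For a 60-requests-per-minute limit, for example, an interval of 1.0 second would keep the loop under the cap.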