GPT-4 1106-Preview often gives error ".. exceeded token rate limit of your current OpenAI S0 pricing tier". #406

Open
marwic-norlys opened this issue Nov 21, 2023 · 2 comments
@marwic-norlys

I have been using this for some time and everything ran fine. It is just a clean chat setup, with no history etc.

After updating my application to use GPT-4 Turbo, users often get this error:

Requests to the Creates a completion for the chat message Operation under Azure OpenAI API version 2023-03-15-preview have exceeded token rate limit of your current OpenAI S0 pricing tier. Please retry after 9 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit.

In my application settings I have set the API version correctly:

AZURE_OPENAI_PREVIEW_API_VERSION=2023-07-01-preview

So I am wondering whether the API version "2023-03-15-preview" referenced in the error message is hardcoded, or whether there is some other issue I don't see?
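One way a setting like this could be silently ignored is a fallback default that takes over when the variable is missing or blank in the running environment. The sketch below only illustrates that pattern; whether the app contains such a fallback is not confirmed in this thread. `ASSUMED_DEFAULT` and `effective_api_version` are hypothetical names, and the default value is chosen only to mirror the version in the error message.

```python
import os

ENV_VAR = "AZURE_OPENAI_PREVIEW_API_VERSION"
# Hypothetical fallback, chosen only to mirror the version in the error message.
ASSUMED_DEFAULT = "2023-03-15-preview"


def effective_api_version(environ=os.environ):
    """Return the configured API version, or the assumed fallback if unset or blank."""
    value = environ.get(ENV_VAR, "").strip()
    return value or ASSUMED_DEFAULT


if __name__ == "__main__":
    # Run inside the deployed environment to see which value is actually picked up.
    print(effective_api_version())
```

Running a check like this in the deployed web app (rather than locally) would show whether the setting reaches the process at all.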

Btw, I'm pretty sure the token rate limit is actually NOT exceeded; this also happens during evenings, when employees are not using it.
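As a stopgap while the cause is unclear, the "Please retry after 9 seconds" hint in the error text can be honored on the client side. This is a minimal sketch, not the app's own behavior: `send` is a hypothetical zero-argument callable wrapping the chat request, and the broad `except Exception` should be narrowed to whatever rate-limit error the client library actually raises.

```python
import re
import time

# Matches the hint Azure OpenAI includes in the error text quoted above.
_RETRY_HINT = re.compile(r"retry after (\d+) seconds?", re.IGNORECASE)


def retry_delay_seconds(message, default=1.0):
    """Return the wait time suggested in a rate-limit message, else `default`."""
    match = _RETRY_HINT.search(message)
    return float(match.group(1)) if match else default


def call_with_retry(send, max_attempts=3, sleep=time.sleep):
    """Call `send()` and retry when it raises an error carrying a retry hint.

    `send` is a hypothetical zero-argument callable wrapping the chat request.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return send()
        except Exception as exc:  # assumption: narrow this to the client's rate-limit error
            message = str(exc)
            # Give up on the last attempt, or when the error carries no retry hint.
            if attempt == max_attempts or not _RETRY_HINT.search(message):
                raise
            sleep(retry_delay_seconds(message))
```

This does not explain why the limit is hit, but it keeps users from seeing the raw error on short waits.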

@dmbuk

dmbuk commented Jan 4, 2024

Just wondering, @marwic-norlys: which model and model name have you configured for the web app? I deployed GPT-4 Turbo as gpt-4-turbo-1106 and put the same value into both AZURE_OPENAI_MODEL and AZURE_OPENAI_MODEL_NAME. It works, and it is faster than the previous GPT-4 model, but it insists it does not know anything beyond Sep 2021.


This issue is stale because it has been open for 30 days with no activity.

@github-actions github-actions bot added the stale label Jan 29, 2025