You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Been using this for some time, all runs good. Just clean chat, no history etc.
After updating my application to use GPT-4 Turbo, users often get this error:
Requests to the Creates a completion for the chat message Operation under Azure OpenAI API version 2023-03-15-preview have exceeded token rate limit of your current OpenAI S0 pricing tier. Please retry after 9 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit.
Just wonder @marwic-norlys , what model and model name you've configured for the web app? I've deployed GPT turbo as gpt-4-turbo-1106 and put the same into both AZURE_OPENAI_MODEL and AZURE_OPENAI_MODEL_NAME. It works, and faster than on previous GPT4 model, but insists it does not know anything beyond Sep 2021.
Been using this for some time, all runs good. Just clean chat, no history etc.
After updating my application to use GPT-4 Turbo, users often get this error:
Requests to the Creates a completion for the chat message Operation under Azure OpenAI API version 2023-03-15-preview have exceeded token rate limit of your current OpenAI S0 pricing tier. Please retry after 9 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit.
In my application settings I correctly put
AZURE_OPENAI_PREVIEW_API_VERSION=2023-07-01-preview
So wondering if this API version "2023-03-15-preview" you reference in the error message is hardcoded, or there is some other issue I dont see?
Btw im pretty sure the token rate limit is actually NOT exceeded, this happens also during evenings where employees are not using it.
The text was updated successfully, but these errors were encountered: