This page displays rate limits for OpenAI API models, including token limits (TPM), request limits (RPM), and batch queue limits.
Model | Token Limits | Request Limits | Batch Queue Limits |
---|---|---|---|
gpt-3.5-turbo | 200,000 TPM |
500 RPM 10,000 RPD |
2,000,000 TPD |
gpt-3.5-turbo-0125 | 200,000 TPM |
500 RPM 10,000 RPD |
2,000,000 TPD |
gpt-3.5-turbo-1106 | 200,000 TPM |
500 RPM 10,000 RPD |
2,000,000 TPD |
gpt-3.5-turbo-16k | 200,000 TPM |
500 RPM 10,000 RPD |
2,000,000 TPD |
gpt-3.5-turbo-instruct | 90,000 TPM |
3,500 RPM |
200,000 TPD |
gpt-3.5-turbo-instruct-0914 | 90,000 TPM |
3,500 RPM |
200,000 TPD |
gpt-4 | 10,000 TPM |
500 RPM 10,000 RPD |
100,000 TPD |
gpt-4-0613 | 10,000 TPM |
500 RPM 10,000 RPD |
100,000 TPD |
gpt-4-turbo | 30,000 TPM |
500 RPM |
90,000 TPD |
gpt-4.5-preview | 125,000 TPM |
1,000 RPM |
50,000 TPD |
gpt-4o | 30,000 TPM |
500 RPM |
90,000 TPD |
gpt-4o-mini | 200,000 TPM |
500 RPM 10,000 RPD |
2,000,000 TPD |
gpt-4o-mini-search-preview | 6,000 TPM |
100 RPM |
|
gpt-4o-mini-search-preview-2025-03-11 | 6,000 TPM |
100 RPM |
|
gpt-4o-search-preview | 6,000 TPM |
100 RPM |
|
gpt-4o-search-preview-2025-03-11 | 6,000 TPM |
100 RPM |
Model | Token Limits | Request Limits | Batch Queue Limits |
---|---|---|---|
dall-e-2 | 500 RPM 5 images per minute |
||
dall-e-3 | 500 RPM 5 images per minute |
Model | Token Limits | Request Limits | Batch Queue Limits |
---|---|---|---|
whisper-1 | 500 RPM |
Model | Token Limits | Request Limits | Batch Queue Limits |
---|---|---|---|
Default limits for all other models | 250,000 TPM |
3,000 RPM |
Model | Token Limits | Request Limits | Batch Queue Limits |
---|---|---|---|
babbage-002 | 250,000 TPM |
3,000 RPM |
|
chatgpt-4o-latest | 500,000 TPM |
200 RPM |
|
davinci-002 | 250,000 TPM |
3,000 RPM |
|
o1 | 30,000 TPM |
500 RPM |
90,000 TPD |
o1-2024-12-17 | 30,000 TPM |
500 RPM |
90,000 TPD |
o1-mini | 200,000 TPM |
500 RPM |
2,000,000 TPD |
o1-mini-2024-09-12 | 200,000 TPM |
500 RPM |
2,000,000 TPD |
o1-preview | 30,000 TPM |
500 RPM |
90,000 TPD |
o1-preview-2024-09-12 | 30,000 TPM |
500 RPM |
90,000 TPD |
o3-mini | 200,000 TPM |
500 RPM |
2,000,000 TPD |
text-embedding-3-large | 1,000,000 TPM |
3,000 RPM |
3,000,000 TPD |
text-embedding-3-small | 1,000,000 TPM |
3,000 RPM |
3,000,000 TPD |
text-embedding-ada-002 | 1,000,000 TPM |
3,000 RPM |
3,000,000 TPD |
tts-1 | 500 RPM |
||
tts-1-1106 | 500 RPM |
||
tts-1-hd | 500 RPM |
||
tts-1-hd-1106 | 500 RPM |
Model | Token Limits | Request Limits | Batch Queue Limits |
---|---|---|---|
gpt-4o-mini-realtime-preview | 40,000 TPM |
200 RPM 1,000 RPD |
|
gpt-4o-realtime-preview | 40,000 TPM |
200 RPM 1,000 RPD |
Model | Token Limits | Request Limits | Batch Queue Limits |
---|---|---|---|
omni-moderation-2024-09-26 | 10,000 TPM |
500 RPM 10,000 RPD |
|
omni-moderation-latest | 10,000 TPM |
500 RPM 10,000 RPD |
|
text-moderation-latest | 150,000 TPM |
1,000 RPM |
|
text-moderation-stable | 150,000 TPM |
1,000 RPM |
Model | Token Limits | Request Limits | Batch Queue Limits |
---|---|---|---|
babbage-002 | 250,000 TPM |
3,000 RPM |
|
davinci-002 | 250,000 TPM |
3,000 RPM |
|
gpt-3.5-turbo-0125 | 200,000 TPM |
500 RPM |
|
gpt-3.5-turbo-0613 | 200,000 TPM |
500 RPM |
|
gpt-3.5-turbo-1106 | 200,000 TPM |
500 RPM |
|
gpt-4-0613 | 10,000 TPM |
500 RPM |
|
gpt-4o-2024-05-13 | 30,000 TPM |
500 RPM |
|
gpt-4o-mini-2024-07-18 | 200,000 TPM |
500 RPM |
Model | Active/Queued Jobs | Jobs Per Day | |
---|---|---|---|
babbage-002 | 3 |
48 |
|
davinci-002 | 3 |
48 |
|
gpt-3.5-turbo-0613 | 3 |
48 |