OpenAI API Rate Limits

This page displays rate limits for OpenAI API models, including token limits (TPM), request limits (RPM), and batch queue limits.

Search results: Showing all models
Chat Models
Model Token Limits Request Limits Batch Queue Limits
gpt-3.5-turbo
200,000 TPM
500 RPM
10,000 RPD
2,000,000 TPD
gpt-3.5-turbo-0125
200,000 TPM
500 RPM
10,000 RPD
2,000,000 TPD
gpt-3.5-turbo-1106
200,000 TPM
500 RPM
10,000 RPD
2,000,000 TPD
gpt-3.5-turbo-16k
200,000 TPM
500 RPM
10,000 RPD
2,000,000 TPD
gpt-3.5-turbo-instruct
90,000 TPM
3,500 RPM
200,000 TPD
gpt-3.5-turbo-instruct-0914
90,000 TPM
3,500 RPM
200,000 TPD
gpt-4
10,000 TPM
500 RPM
10,000 RPD
100,000 TPD
gpt-4-0613
10,000 TPM
500 RPM
10,000 RPD
100,000 TPD
gpt-4-turbo
Shared limits:
gpt-4-turbo
gpt-4-turbo-2024-04-09
gpt-4-turbo-preview
gpt-4-0125-preview
gpt-4-1106-preview
30,000 TPM
500 RPM
90,000 TPD
gpt-4.5-preview
Shared limits:
gpt-4.5-preview-2025-02-27
125,000 TPM
1,000 RPM
50,000 TPD
gpt-4o
Shared limits:
gpt-4o-2024-05-13
gpt-4o-2024-08-06
gpt-4o-2024-11-20
gpt-4o-audio-preview
gpt-4o-audio-preview-2024-10-01
gpt-4o-audio-preview-2024-12-17
30,000 TPM
500 RPM
90,000 TPD
gpt-4o-mini
Shared limits:
gpt-4o-mini-2024-07-18
gpt-4o-mini-audio-preview
gpt-4o-mini-audio-preview-2024-12-17
200,000 TPM
500 RPM
10,000 RPD
2,000,000 TPD
gpt-4o-mini-search-preview
6,000 TPM
100 RPM
gpt-4o-mini-search-preview-2025-03-11
6,000 TPM
100 RPM
gpt-4o-search-preview
6,000 TPM
100 RPM
gpt-4o-search-preview-2025-03-11
6,000 TPM
100 RPM
Image Models
Model Token Limits Request Limits Batch Queue Limits
dall-e-2
500 RPM
5 images per minute
dall-e-3
500 RPM
5 images per minute
Audio Models
Model Token Limits Request Limits Batch Queue Limits
whisper-1
500 RPM
Other Models
Model Token Limits Request Limits Batch Queue Limits
Default limits for all other models
250,000 TPM
3,000 RPM
Text Models
Model Token Limits Request Limits Batch Queue Limits
babbage-002
250,000 TPM
3,000 RPM
chatgpt-4o-latest
500,000 TPM
200 RPM
davinci-002
250,000 TPM
3,000 RPM
o1
30,000 TPM
500 RPM
90,000 TPD
o1-2024-12-17
30,000 TPM
500 RPM
90,000 TPD
o1-mini
200,000 TPM
500 RPM
2,000,000 TPD
o1-mini-2024-09-12
200,000 TPM
500 RPM
2,000,000 TPD
o1-preview
30,000 TPM
500 RPM
90,000 TPD
o1-preview-2024-09-12
30,000 TPM
500 RPM
90,000 TPD
o3-mini
Shared limits:
o3-mini-2025-01-31
200,000 TPM
500 RPM
2,000,000 TPD
text-embedding-3-large
1,000,000 TPM
3,000 RPM
3,000,000 TPD
text-embedding-3-small
1,000,000 TPM
3,000 RPM
3,000,000 TPD
text-embedding-ada-002
1,000,000 TPM
3,000 RPM
3,000,000 TPD
tts-1
500 RPM
tts-1-1106
500 RPM
tts-1-hd
500 RPM
tts-1-hd-1106
500 RPM
Realtime Models
Model Token Limits Request Limits Batch Queue Limits
gpt-4o-mini-realtime-preview
Shared limits:
gpt-4o-mini-realtime-preview-2024-12-17
40,000 TPM
200 RPM
1,000 RPD
gpt-4o-realtime-preview
Shared limits:
gpt-4o-realtime-preview-2024-10-01
gpt-4o-realtime-preview-2024-12-17
40,000 TPM
200 RPM
1,000 RPD
Moderation Models
Model Token Limits Request Limits Batch Queue Limits
omni-moderation-2024-09-26
10,000 TPM
500 RPM
10,000 RPD
omni-moderation-latest
10,000 TPM
500 RPM
10,000 RPD
text-moderation-latest
150,000 TPM
1,000 RPM
text-moderation-stable
150,000 TPM
1,000 RPM
Fine-Tuning Inference
Model Token Limits Request Limits Batch Queue Limits
babbage-002
250,000 TPM
3,000 RPM
davinci-002
250,000 TPM
3,000 RPM
gpt-3.5-turbo-0125
200,000 TPM
500 RPM
gpt-3.5-turbo-0613
200,000 TPM
500 RPM
gpt-3.5-turbo-1106
200,000 TPM
500 RPM
gpt-4-0613
10,000 TPM
500 RPM
gpt-4o-2024-05-13
30,000 TPM
500 RPM
gpt-4o-mini-2024-07-18
200,000 TPM
500 RPM
Fine-tuning Training
Model Active/Queued Jobs Jobs Per Day
babbage-002
3
48
davinci-002
3
48
gpt-3.5-turbo-0613
3
48