GPT-4o Mini Pricing: The Cheapest Way to Use OpenAI (April 2026)

GPT-4o Mini is OpenAI's cheapest model in the GPT-4 family at $0.15/$0.60 per million tokens. Launched July 2024, it handles classification, extraction, routing, and simple generation at 94% less than GPT-4o. Now competes with GPT-4.1 Nano ($0.10/$0.40) for the budget tier.

Most Popular

GPT-4o Mini

$0.15 / $0.60 per 1M input / output tokens
  • Cheapest GPT-4 family model from OpenAI
  • 128K context window
  • Function calling and structured outputs
  • Vision support for image inputs

GPT-4o Mini (Cached)

$0.075 / $0.60 per 1M input / output tokens
  • 50% discount on input tokens with prompt caching
  • Output tokens remain at standard rate
  • Best for apps with repeated system prompts
  • Cache hits automatic — no code changes needed

GPT-4o Mini (Batch)

$0.075 / $0.30 per 1M input / output tokens
  • 50% off both input and output tokens
  • 24-hour turnaround SLA
  • Same quality as synchronous API
  • Best for bulk classification and labeling

GPT-4o Mini (Fine-tuned)

$0.30 / $1.20 per 1M input / output tokens
  • Double base price for custom fine-tuned models
  • Training costs $0.30 per 1M tokens on top of inference
  • Best for domain-specific classification tasks
  • Requires at least 10 training examples

GPT-4o Mini Pricing Table (April 2026)

E

v

e

r

y

G

P

T

-

4

o

M

i

n

i

p

r

i

c

i

n

g

v

a

r

i

a

n

t

i

n

o

n

e

t

a

b

l

e

.

A

l

l

p

r

i

c

e

s

a

r

e

p

e

r

1

m

i

l

l

i

o

n

t

o

k

e

n

s

.

VariantInputCached InputOutputContext
GPT-4o Mini$0.15$0.075$0.60128K
GPT-4o Mini (Batch)$0.075-$0.30128K
GPT-4o Mini (Fine-tuned)$0.30$0.15$1.20128K

GPT-4o Mini vs GPT-4.1 Nano: Which Budget Model Wins?

G

P

T

-

4

.

1

N

a

n

o

l

a

u

n

c

h

e

d

a

s

O

p

e

n

A

I

'

s

n

e

w

u

l

t

r

a

-

c

h

e

a

p

o

p

t

i

o

n

a

n

d

d

i

r

e

c

t

l

y

c

o

m

p

e

t

e

s

w

i

t

h

G

P

T

-

4

o

M

i

n

i

.

N

a

n

o

i

s

3

3

%

c

h

e

a

p

e

r

o

n

b

o

t

h

i

n

p

u

t

a

n

d

o

u

t

p

u

t

t

o

k

e

n

s

.

B

o

t

h

m

o

d

e

l

s

t

a

r

g

e

t

t

h

e

s

a

m

e

u

s

e

c

a

s

e

s

:

c

l

a

s

s

i

f

i

c

a

t

i

o

n

,

e

x

t

r

a

c

t

i

o

n

,

r

o

u

t

i

n

g

,

a

n

d

s

i

m

p

l

e

g

e

n

e

r

a

t

i

o

n

.

T

h

e

k

e

y

d

i

f

f

e

r

e

n

c

e

s

a

r

e

c

o

n

t

e

x

t

w

i

n

d

o

w

s

i

z

e

(

N

a

n

o

s

u

p

p

o

r

t

s

1

M

t

o

k

e

n

s

v

e

r

s

u

s

M

i

n

i

'

s

1

2

8

K

)

a

n

d

r

e

c

e

n

c

y

(

N

a

n

o

b

e

n

e

f

i

t

s

f

r

o

m

n

e

w

e

r

t

r

a

i

n

i

n

g

d

a

t

a

a

n

d

a

r

c

h

i

t

e

c

t

u

r

e

i

m

p

r

o

v

e

m

e

n

t

s

)

.

F

o

r

m

o

s

t

s

i

m

p

l

e

t

a

s

k

s

,

N

a

n

o

i

s

t

h

e

b

e

t

t

e

r

d

e

f

a

u

l

t

.

M

i

n

i

s

t

i

l

l

w

i

n

s

i

f

y

o

u

h

a

v

e

f

i

n

e

-

t

u

n

e

d

m

o

d

e

l

s

y

o

u

d

o

n

o

t

w

a

n

t

t

o

r

e

t

r

a

i

n

,

o

r

i

f

y

o

u

r

s

p

e

c

i

f

i

c

t

a

s

k

b

e

n

c

h

m

a

r

k

s

h

i

g

h

e

r

o

n

M

i

n

i

.

FeatureGPT-4o MiniGPT-4.1 Nano
Input price$0.15/1M$0.10/1M
Output price$0.60/1M$0.40/1M
Context window128K tokens1M tokens
Batch input$0.075/1M$0.05/1M
Batch output$0.30/1M$0.20/1M
Fine-tuningAvailableAvailable
VisionYesNo
Best forLegacy fine-tunes, visionNew projects, budget default

GPT-4o Mini vs GPT-4o: When to Upgrade

G

P

T

-

4

o

c

o

s

t

s

$

2

.

5

0

/

$

1

0

.

0

0

p

e

r

m

i

l

l

i

o

n

t

o

k

e

n

s

r

o

u

g

h

l

y

1

7

x

m

o

r

e

t

h

a

n

M

i

n

i

o

n

i

n

p

u

t

a

n

d

o

u

t

p

u

t

.

T

h

e

q

u

a

l

i

t

y

g

a

p

m

a

t

t

e

r

s

m

o

s

t

o

n

c

o

m

p

l

e

x

r

e

a

s

o

n

i

n

g

,

n

u

a

n

c

e

d

w

r

i

t

i

n

g

,

m

u

l

t

i

-

s

t

e

p

t

o

o

l

u

s

e

,

a

n

d

l

o

n

g

-

f

o

r

m

c

o

n

t

e

n

t

g

e

n

e

r

a

t

i

o

n

.

M

i

n

i

m

a

t

c

h

e

s

G

P

T

-

4

o

c

l

o

s

e

l

y

o

n

s

i

m

p

l

e

c

l

a

s

s

i

f

i

c

a

t

i

o

n

(

9

0

%

+

a

g

r

e

e

m

e

n

t

o

n

b

i

n

a

r

y

t

a

s

k

s

)

b

u

t

f

a

l

l

s

b

e

h

i

n

d

o

n

a

n

y

t

h

i

n

g

r

e

q

u

i

r

i

n

g

c

h

a

i

n

-

o

f

-

t

h

o

u

g

h

t

r

e

a

s

o

n

i

n

g

o

r

s

u

b

t

l

e

d

i

s

t

i

n

c

t

i

o

n

s

.

A

c

o

m

m

o

n

a

r

c

h

i

t

e

c

t

u

r

e

u

s

e

s

M

i

n

i

f

o

r

i

n

i

t

i

a

l

t

r

i

a

g

e

a

n

d

r

o

u

t

e

s

o

n

l

y

c

o

m

p

l

e

x

r

e

q

u

e

s

t

s

t

o

G

P

T

-

4

o

o

r

G

P

T

-

4

.

1

,

c

u

t

t

i

n

g

c

o

s

t

s

b

y

7

0

-

8

0

%

.

FeatureGPT-4o MiniGPT-4o
Input price$0.15/1M$2.50/1M
Output price$0.60/1M$10.00/1M
Context window128K128K
Complex reasoningWeakStrong
ClassificationStrongStrong
VisionYesYes
AudioNoYes
Function callingYesYes

Batch API Pricing for GPT-4o Mini

T

h

e

B

a

t

c

h

A

P

I

p

r

o

c

e

s

s

e

s

r

e

q

u

e

s

t

s

a

s

y

n

c

h

r

o

n

o

u

s

l

y

a

t

5

0

%

o

f

f

b

o

t

h

i

n

p

u

t

a

n

d

o

u

t

p

u

t

t

o

k

e

n

s

.

G

P

T

-

4

o

M

i

n

i

B

a

t

c

h

p

r

i

c

i

n

g

i

s

$

0

.

0

7

5

i

n

p

u

t

a

n

d

$

0

.

3

0

o

u

t

p

u

t

p

e

r

m

i

l

l

i

o

n

t

o

k

e

n

s

.

J

o

b

s

c

o

m

p

l

e

t

e

w

i

t

h

i

n

2

4

h

o

u

r

s

.

S

u

b

m

i

t

a

J

S

O

N

L

f

i

l

e

o

f

C

h

a

t

C

o

m

p

l

e

t

i

o

n

s

r

e

q

u

e

s

t

s

a

n

d

p

o

l

l

f

o

r

r

e

s

u

l

t

s

.

B

e

s

t

f

o

r

:

d

a

t

a

s

e

t

l

a

b

e

l

i

n

g

,

c

o

n

t

e

n

t

c

l

a

s

s

i

f

i

c

a

t

i

o

n

,

b

u

l

k

e

x

t

r

a

c

t

i

o

n

,

a

n

d

e

v

a

l

u

a

t

i

o

n

p

i

p

e

l

i

n

e

s

w

h

e

r

e

l

a

t

e

n

c

y

i

s

n

o

t

c

r

i

t

i

c

a

l

.

Pricing ModeInput/1MOutput/1MLatency
Synchronous$0.15$0.60Real-time
Batch$0.075$0.30Up to 24 hours
Cached + Batch$0.075$0.30Up to 24 hours

Fine-Tuning Costs for GPT-4o Mini

F

i

n

e

-

t

u

n

i

n

g

G

P

T

-

4

o

M

i

n

i

c

o

s

t

s

$

0

.

3

0

p

e

r

1

M

t

r

a

i

n

i

n

g

t

o

k

e

n

s

.

O

n

c

e

t

r

a

i

n

e

d

,

i

n

f

e

r

e

n

c

e

c

o

s

t

s

d

o

u

b

l

e

:

$

0

.

3

0

i

n

p

u

t

a

n

d

$

1

.

2

0

o

u

t

p

u

t

p

e

r

m

i

l

l

i

o

n

t

o

k

e

n

s

.

A

t

y

p

i

c

a

l

f

i

n

e

-

t

u

n

i

n

g

j

o

b

o

n

1

0

0

K

t

r

a

i

n

i

n

g

t

o

k

e

n

s

c

o

s

t

s

a

b

o

u

t

$

0

.

0

3

f

o

r

t

r

a

i

n

i

n

g

a

l

o

n

e

.

T

h

e

r

e

a

l

c

o

s

t

i

s

i

n

f

e

r

e

n

c

e

i

f

y

o

u

r

u

n

1

M

r

e

q

u

e

s

t

s

p

e

r

m

o

n

t

h

,

t

h

e

2

x

i

n

f

e

r

e

n

c

e

p

r

e

m

i

u

m

a

d

d

s

u

p

f

a

s

t

.

C

o

n

s

i

d

e

r

w

h

e

t

h

e

r

p

r

o

m

p

t

e

n

g

i

n

e

e

r

i

n

g

o

r

f

e

w

-

s

h

o

t

e

x

a

m

p

l

e

s

c

a

n

a

c

h

i

e

v

e

t

h

e

s

a

m

e

q

u

a

l

i

t

y

b

e

f

o

r

e

c

o

m

m

i

t

t

i

n

g

t

o

f

i

n

e

-

t

u

n

i

n

g

.

F

i

n

e

-

t

u

n

e

d

M

i

n

i

m

o

d

e

l

s

s

t

i

l

l

c

a

n

n

o

t

m

a

t

c

h

t

h

e

r

e

a

s

o

n

i

n

g

a

b

i

l

i

t

y

o

f

b

a

s

e

G

P

T

-

4

.

1

o

r

G

P

T

-

4

o

.

Cost ComponentBase MiniFine-tuned Mini
Training-$0.30/1M tokens
Input inference$0.15/1M$0.30/1M
Output inference$0.60/1M$1.20/1M
Cached input$0.075/1M$0.15/1M

Real-World Cost Examples

W

h

a

t

G

P

T

-

4

o

M

i

n

i

a

c

t

u

a

l

l

y

c

o

s

t

s

f

o

r

c

o

m

m

o

n

w

o

r

k

l

o

a

d

s

a

t

d

i

f

f

e

r

e

n

t

s

c

a

l

e

s

.

A

l

l

e

s

t

i

m

a

t

e

s

a

s

s

u

m

e

a

v

e

r

a

g

e

t

o

k

e

n

c

o

u

n

t

s

p

e

r

r

e

q

u

e

s

t

.

Use CaseInput TokensOutput TokensCost per RequestCost per 1K RequestsCost per 100K Requests
Chatbot reply~500~200$0.0002$0.20$19.50
Text classification~300~20$0.0001$0.06$5.70
Entity extraction~800~100$0.0002$0.18$18.00
Content routing~200~10$0.00004$0.04$3.60
Summarization (short doc)~2,000~300$0.0005$0.48$48.00

Rate Limits for GPT-4o Mini

O

p

e

n

A

I

s

e

t

s

r

a

t

e

l

i

m

i

t

s

b

a

s

e

d

o

n

y

o

u

r

c

u

m

u

l

a

t

i

v

e

A

P

I

s

p

e

n

d

i

n

g

.

G

P

T

-

4

o

M

i

n

i

s

h

a

r

e

s

t

h

e

s

a

m

e

t

i

e

r

s

y

s

t

e

m

a

s

o

t

h

e

r

m

o

d

e

l

s

b

u

t

t

y

p

i

c

a

l

l

y

h

a

s

h

i

g

h

e

r

t

o

k

e

n

-

p

e

r

-

m

i

n

u

t

e

a

l

l

o

w

a

n

c

e

s

d

u

e

t

o

i

t

s

l

o

w

e

r

c

o

s

t

.

N

e

w

a

c

c

o

u

n

t

s

s

t

a

r

t

a

t

t

h

e

f

r

e

e

t

i

e

r

.

TierRequirementRPMTPMRPD
FreeVerified account500200K500
Tier 1$5+ spent500200K10K
Tier 2$50+ spent5K2M-
Tier 3$100+ spent5K4M-
Tier 4$250+ spent10K10M-
Tier 5$1,000+ spent30K150M-

When GPT-4o Mini Is NOT Enough

G

P

T

-

4

o

M

i

n

i

f

a

i

l

s

o

r

u

n

d

e

r

p

e

r

f

o

r

m

s

o

n

s

e

v

e

r

a

l

t

a

s

k

c

a

t

e

g

o

r

i

e

s

.

C

o

m

p

l

e

x

m

u

l

t

i

-

s

t

e

p

r

e

a

s

o

n

i

n

g

c

o

n

s

i

s

t

e

n

t

l

y

p

r

o

d

u

c

e

s

e

r

r

o

r

s

t

h

a

t

G

P

T

-

4

.

1

o

r

G

P

T

-

4

o

h

a

n

d

l

e

c

o

r

r

e

c

t

l

y

.

N

u

a

n

c

e

d

w

r

i

t

i

n

g

t

a

s

k

s

(

t

o

n

e

m

a

t

c

h

i

n

g

,

p

e

r

s

u

a

s

i

v

e

c

o

p

y

,

c

r

e

a

t

i

v

e

f

i

c

t

i

o

n

)

s

h

o

w

a

c

l

e

a

r

q

u

a

l

i

t

y

g

a

p

.

L

o

n

g

-

c

o

n

t

e

x

t

r

e

t

r

i

e

v

a

l

d

e

g

r

a

d

e

s

p

a

s

t

3

2

K

t

o

k

e

n

s

e

v

e

n

t

h

o

u

g

h

t

h

e

1

2

8

K

w

i

n

d

o

w

t

e

c

h

n

i

c

a

l

l

y

a

c

c

e

p

t

s

m

o

r

e

.

M

u

l

t

i

-

t

u

r

n

a

g

e

n

t

w

o

r

k

f

l

o

w

s

w

i

t

h

t

o

o

l

u

s

e

p

r

o

d

u

c

e

m

o

r

e

h

a

l

l

u

c

i

n

a

t

e

d

f

u

n

c

t

i

o

n

c

a

l

l

s

.

C

o

d

e

g

e

n

e

r

a

t

i

o

n

f

o

r

n

o

n

-

t

r

i

v

i

a

l

t

a

s

k

s

(

r

e

f

a

c

t

o

r

i

n

g

,

a

r

c

h

i

t

e

c

t

u

r

e

c

h

a

n

g

e

s

,

m

u

l

t

i

-

f

i

l

e

e

d

i

t

s

)

m

i

s

s

e

s

e

d

g

e

c

a

s

e

s

.

I

f

y

o

u

r

a

p

p

l

i

c

a

t

i

o

n

f

a

l

l

s

i

n

t

o

a

n

y

o

f

t

h

e

s

e

c

a

t

e

g

o

r

i

e

s

,

b

u

d

g

e

t

f

o

r

a

b

i

g

g

e

r

m

o

d

e

l

a

n

d

u

s

e

M

i

n

i

o

n

l

y

f

o

r

t

h

e

s

i

m

p

l

e

s

u

b

t

a

s

k

s

.

Hidden Costs & Gotchas

  • {'title': 'GPT-4.1 Nano is now cheaper', 'detail': "GPT-4.1 Nano costs $0.10/$0.40 per million tokens versus Mini's $0.15/$0.60. That is 33% cheaper on input and output. For simple classification and routing, Nano may be the better budget pick — benchmark your specific task before choosing."}
  • {'title': 'Output tokens cost 4x input tokens', 'detail': 'GPT-4o Mini charges $0.15 per 1M input tokens but $0.60 per 1M output tokens. Generation-heavy tasks like summarization, content writing, or code generation cost more than you expect if you budget only on input. Always estimate your output-to-input ratio.'}
  • {'title': 'Fine-tuning training costs add up', 'detail': 'Fine-tuning training costs $0.30 per 1M tokens, and inference on fine-tuned models is 2x the base rate ($0.30/$1.20). A training run on 10M tokens costs $3.00 before you make a single inference call. Multiple training iterations multiply that cost.'}
  • {'title': '128K context does not mean 128K quality', 'detail': 'GPT-4o Mini accepts 128K tokens of context, but quality degrades noticeably past 32K tokens. Long-context tasks like document Q&A over large files may produce worse results than splitting into smaller chunks. Test with your actual data before relying on full context length.'}
  • {'title': 'Vision adds image token costs', 'detail': 'Sending images to GPT-4o Mini converts them to tokens based on resolution. A low-res image costs roughly $0.002 and a high-res image costs $0.004-0.006 depending on dimensions. If you process thousands of images, these costs add up fast even at Mini pricing.'}
  • {'title': 'Rate limits start low on free tier', 'detail': 'Free tier accounts get 500 RPM for GPT-4o Mini. Tier 1 ($5+ spent) stays at 500 RPM with 30K TPM. You need to reach Tier 2 ($50+ spent) for 5K RPM. Production applications often hit rate limits before they hit cost ceilings.'}

Which Plan Do You Need?

Prototyping and hobby projects

GPT-4o Mini at standard rates ($0.15/$0.60). Cheap enough to iterate without worrying about cost. 1,000 requests with 500 input and 200 output tokens each costs about $0.20.

Classification and routing at scale

GPT-4o Mini Batch at $0.075/$0.30. If latency does not matter, the Batch API cuts costs in half. Ideal for labeling datasets, content moderation, and intent classification on bulk data.

Production with quality requirements

Upgrade to GPT-4.1 ($2/$8) or GPT-4o ($2.50/$10). Mini struggles with complex reasoning, nuanced writing, and multi-step tool use. If accuracy matters more than cost, step up to a full-size model.

Budget production workloads

Compare GPT-4.1 Nano ($0.10/$0.40) versus GPT-4o Mini ($0.15/$0.60). Nano is 33% cheaper. Run both on your actual prompts and pick whichever scores higher — the cost difference is small but compounds at scale.

The Bottom Line

GPT-4o Mini at $0.15/$0.60 is excellent for simple tasks but now faces competition from GPT-4.1 Nano ($0.10/$0.40). For classification and routing, both are fine — pick based on benchmark performance for your specific task. For anything requiring reasoning or nuance, step up to GPT-4.1 or GPT-4o.

Disclosure: Pricing information is sourced from official websites and may change. We update this page regularly but always verify current pricing on the vendor's site before purchasing.

Related Resources

Full OpenAI API pricing for all models → GPT-4o pricing and comparison → OpenAI API review and features → Anthropic API pricing (Claude models) → Best LLM frameworks for building apps →

Frequently Asked Questions

How much does GPT-4o Mini cost?

GPT-4o Mini costs $0.15 per 1 million input tokens and $0.60 per 1 million output tokens. With prompt caching, input drops to $0.075/1M. Batch API pricing is $0.075 input and $0.30 output per million tokens.

Is GPT-4o Mini free?

No. GPT-4o Mini is not free through the API. New OpenAI accounts get $5 in credits which can be used for Mini. ChatGPT Free includes limited Mini access in the consumer product, but API usage always costs money.

GPT-4o Mini vs GPT-4.1 Nano — which is cheaper?

GPT-4.1 Nano is cheaper at $0.10/$0.40 per million tokens versus Mini's $0.15/$0.60. Nano is 33% less on both input and output. For new projects, benchmark both on your task — Nano is the better default unless Mini specifically outperforms.

Can I fine-tune GPT-4o Mini?

Yes. Fine-tuning GPT-4o Mini costs $0.30 per 1M training tokens. Inference on fine-tuned models costs $0.30 input and $1.20 output per million tokens — double the base rate. You need at least 10 training examples, though 50-100 examples typically produce better results.

What is GPT-4o Mini good for?

GPT-4o Mini excels at classification, entity extraction, content routing, simple Q&A, and any task where the output is short and structured. It matches GPT-4o closely on binary classification tasks while costing 94% less.

What is the GPT-4o Mini context window?

GPT-4o Mini supports 128K tokens of context (roughly 96K words). However, quality degrades past 32K tokens for most tasks. If you need reliable long-context performance, GPT-4.1 Nano offers 1M tokens.

Is GPT-4o Mini good for coding?

GPT-4o Mini handles simple code tasks like boilerplate generation, syntax fixes, and code explanation. It struggles with complex refactoring, multi-file changes, and architectural decisions. For serious coding, use GPT-4.1 or Claude Sonnet 4.6.

How does GPT-4o Mini compare to Claude Haiku?

Claude Haiku 4.5 costs $0.80/$4.00 per million tokens — significantly more expensive than GPT-4o Mini at $0.15/$0.60. Mini is roughly 5x cheaper on input and 7x cheaper on output. Haiku offers a 200K context window and is generally stronger on nuanced text tasks, but Mini wins on price for simple workloads.