Question 1

Which GPT-4o API rates does this calculator use?

Accepted Answer

Published per-million-token rates as of 2026-05 from OpenAI's pricing page. GPT-4o: $2.50 per million input tokens / $10 per million output tokens. GPT-4o-mini: $0.15 input / $0.60 output. GPT-4o-realtime (audio-in, audio-out): $5 input / $20 output for text, with separate audio token rates that are roughly 8x higher than text. Always verify at openai.com/pricing before committing to a contract.

Question 2

How do I find my GPT-4o token volume?

Accepted Answer

Open platform.openai.com/usage and export the last 30 days of usage as CSV. The export breaks down input vs output tokens per model. If you do not have a billing relationship yet, a rough heuristic: 1,000 tokens is about 750 English words. A typical chat turn is 200-500 input tokens and 100-300 output tokens.

Question 3

Does this include prompt caching or Batch API discounts?

Accepted Answer

No. The calculator shows list-price math only. OpenAI prompt caching cuts cached input tokens by 50%. The Batch API gives 50% off both input and output for async workloads (24-hour SLA). Production workloads with stable system prompts routinely run 30-50% under list price after caching.

Question 4

Which GPT-4o variant should I pick?

Accepted Answer

GPT-4o-mini for classification, extraction, routing, short Q&A, and high-volume back-end work — 16x cheaper input than GPT-4o and matches on most narrow tasks. GPT-4o for general agents, code generation, multi-step reasoning, and where output quality matters more than cost. GPT-4o-realtime only when you specifically need bidirectional voice — its text rates are 2x GPT-4o and audio tokens are 8x more expensive than text tokens.

Question 5

Is my data sent anywhere?

Accepted Answer

No. Token volumes and rate math run locally in your browser. The page fires an anonymous pageview beacon and CTA-click events so we can measure whether the calculator is useful — no inputs, no email (unless you submit one to the cheat-sheet form), no raw IP stored.

Model	Input ($/MTok)	Output ($/MTok)	Best for
GPT-4o	$2.50	$10.00	General agents, code, multi-step reasoning
GPT-4o-mini	$0.15	$0.60	Classification, extraction, high-volume back-end
GPT-4o-realtime	$5.00	$20.00	Bidirectional voice (text rates shown; audio is ~8x)

GPT-4o API Cost Calculator

Your monthly GPT-4o usage

Published OpenAI GPT-4o rates (as of 2026-05)

Get the 2-page GPT-4o API cost cheat-sheet

How the math works

What this calculator doesn't model

Frequently Asked Questions

Which GPT-4o API rates does this calculator use?

How do I find my GPT-4o token volume?

Does this include prompt caching or Batch API discounts?

Which GPT-4o variant should I pick?

Is my data sent anywhere?

GPT-4o API Cost Calculator

Your monthly GPT-4o usage

Published OpenAI GPT-4o rates (as of 2026-05)

Get the 2-page GPT-4o API cost cheat-sheet

How the math works

What this calculator doesn't model

Frequently Asked Questions

Which GPT-4o API rates does this calculator use?

How do I find my GPT-4o token volume?

Does this include prompt caching or Batch API discounts?

Which GPT-4o variant should I pick?

Is my data sent anywhere?

The full AI API cost calculator suite