AI Reasoning Token Cost Calculator
Find the real cost of a reasoning-model API call. This tool adds the hidden reasoning (“thinking”) tokens that o-series, Claude, and Gemini bill at the output rate — so you see the true per-call and monthly bill, and how much more it is than a naive token estimate. No signup, sources cited below.
How it works
Reasoning models think before they answer. That thinking is a stream of tokens the model generates internally, called reasoning tokens by OpenAI and thinking tokens by Anthropic and Google. You usually never see them, because the API discards or summarises the trace, but each of these providers counts and bills them. Crucially, they are billed at the model's output rate, not the cheaper input rate.
A naive cost estimate prices only what you can point at: the prompt you sent and the answer you received. The real bill adds a third, invisible line. The calculator above uses this model:
- Convert the published per-million rates to per-token prices: Pin = input$/1M ÷ 1,000,000 and Pout = output$/1M ÷ 1,000,000.
- inputCost = inputTokens × Pin
- reasonCost = reasoningTokens × Pout (the hidden cost)
- outputCost = visibleTokens × Pout
- costPerCall = inputCost + reasonCost + outputCost, while naivePerCall = inputCost + outputCost.
- The multiplier is their ratio, costPerCall ÷ naivePerCall, and the monthly figure is costPerCall × callsPerMonth.
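The model above can be sketched in a few lines of Python. This is a minimal illustration, not the calculator's actual source: the names `CallCost` and `estimate_call_cost` are hypothetical, and the rates in the example ($3 and $15 per million tokens) are made-up placeholders rather than any provider's published prices.

```python
import math
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class CallCost:
    """Itemised cost of one call, in dollars (hypothetical helper type)."""
    input_cost: float   # inputTokens x Pin
    reason_cost: float  # reasoningTokens x Pout (the hidden line)
    output_cost: float  # visibleTokens x Pout

    @property
    def cost_per_call(self) -> float:
        return self.input_cost + self.reason_cost + self.output_cost

    @property
    def naive_per_call(self) -> float:
        # What you get if you price only the prompt and the visible answer.
        return self.input_cost + self.output_cost

    @property
    def multiplier(self) -> float:
        # Ratio of real cost to naive cost; guard the all-free edge case.
        if self.naive_per_call == 0:
            return math.inf if self.cost_per_call > 0 else 1.0
        return self.cost_per_call / self.naive_per_call

    def monthly(self, calls_per_month: int) -> float:
        return self.cost_per_call * calls_per_month


def estimate_call_cost(input_tokens: int, reasoning_tokens: int,
                       visible_tokens: int, input_rate_per_m: float,
                       output_rate_per_m: float) -> CallCost:
    """Apply the per-token prices to each of the three token counts."""
    counts = (input_tokens, reasoning_tokens, visible_tokens)
    if min(counts) < 0 or input_rate_per_m < 0 or output_rate_per_m < 0:
        raise ValueError("token counts and rates must be non-negative")
    p_in = input_rate_per_m / TOKENS_PER_MILLION
    p_out = output_rate_per_m / TOKENS_PER_MILLION
    return CallCost(
        input_cost=input_tokens * p_in,
        reason_cost=reasoning_tokens * p_out,
        output_cost=visible_tokens * p_out,
    )


# Placeholder numbers: 2,000 prompt tokens, 4,000 reasoning tokens,
# 500 visible tokens, at assumed rates of $3 / $15 per million.
cost = estimate_call_cost(2_000, 4_000, 500, 3.0, 15.0)
print(f"per call: ${cost.cost_per_call:.4f}")      # per call: $0.0735
print(f"naive:    ${cost.naive_per_call:.4f}")     # naive:    $0.0135
print(f"multiplier: {cost.multiplier:.2f}x")       # multiplier: 5.44x
print(f"monthly:  ${cost.monthly(10_000):.2f}")    # monthly:  $735.00
```

With these placeholder inputs the reasoning line alone is $0.06 of the $0.0735 total, which is why the naive estimate understates the call by more than a factor of five.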
Because reasoning tokens and visible tokens are both billed at Pout, the per-call cost can also be written inputCost + (reasoningTokens + visibleTokens) × Pout. The tool computes the cost both ways and checks that the two results agree to the cent, which serves as a built-in cross-check on the math.
Effort levels (Low, Medium, High, Max) seed a typical reasoning-token estimate. Reasoning counts are non-deterministic, so the field is editable and the result is a planning aid, not a guarantee. For an exact figure, read your real reasoning_tokens value from the API response.
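The cross-check and the "read your real reasoning_tokens" advice can be sketched as follows. This is an illustration under stated assumptions, not the tool's code: the function names are hypothetical, the rates are placeholders, and the `usage` dictionary assumes the shape of OpenAI's Chat Completions usage object, where `completion_tokens` includes the reasoning tokens reported under `completion_tokens_details`. Other providers report thinking tokens differently, so check the linked documentation for the exact field names.

```python
from decimal import Decimal, ROUND_HALF_UP

PER_MILLION = Decimal(1_000_000)
CENT = Decimal("0.01")


def per_call_itemised(input_tokens, reasoning_tokens, visible_tokens,
                      input_rate_per_m, output_rate_per_m) -> Decimal:
    """inputCost + reasonCost + outputCost, computed line by line."""
    p_in = Decimal(str(input_rate_per_m)) / PER_MILLION
    p_out = Decimal(str(output_rate_per_m)) / PER_MILLION
    return (input_tokens * p_in
            + reasoning_tokens * p_out
            + visible_tokens * p_out)


def per_call_combined(input_tokens, reasoning_tokens, visible_tokens,
                      input_rate_per_m, output_rate_per_m) -> Decimal:
    """inputCost + (reasoning + visible) x Pout, the equivalent form."""
    p_in = Decimal(str(input_rate_per_m)) / PER_MILLION
    p_out = Decimal(str(output_rate_per_m)) / PER_MILLION
    return input_tokens * p_in + (reasoning_tokens + visible_tokens) * p_out


def cross_checked_cost(*args) -> Decimal:
    """Compute both forms, round to the cent, and fail loudly on a mismatch."""
    a = per_call_itemised(*args).quantize(CENT, rounding=ROUND_HALF_UP)
    b = per_call_combined(*args).quantize(CENT, rounding=ROUND_HALF_UP)
    if a != b:
        raise ArithmeticError(f"cost forms disagree: {a} vs {b}")
    return a


def split_completion_tokens(usage: dict) -> tuple[int, int]:
    """Return (reasoning, visible) from an assumed OpenAI-style usage dict."""
    details = usage.get("completion_tokens_details") or {}
    reasoning = int(details.get("reasoning_tokens", 0))
    visible = int(usage["completion_tokens"]) - reasoning
    return reasoning, visible


# Placeholder usage payload; real values come from your own API response.
sample_usage = {
    "prompt_tokens": 2_000,
    "completion_tokens": 4_500,
    "completion_tokens_details": {"reasoning_tokens": 4_000},
}
reasoning, visible = split_completion_tokens(sample_usage)
print(reasoning, visible)  # 4000 500
print(cross_checked_cost(sample_usage["prompt_tokens"], reasoning, visible,
                         "3", "15"))  # 0.07
```

`Decimal` is used here so that the agreement check is exact rather than subject to floating-point rounding. Whether the calculator itself does this is not stated in the text above.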
Worked examples
Frequently asked questions
Sources & references
- Anthropic — Pricing (base input/output rates)
- Anthropic — Extended thinking (thinking tokens billed as output)
- OpenAI — API pricing
- OpenAI — Reasoning guide (reasoning_tokens billed as output)
- Google — Gemini API pricing
- Google — Gemini thinking (thinking output billing)
Anthropic rates were cross-checked against the Anthropic pricing page on 2026-06-06. OpenAI and Gemini rows are published list prices that change without notice — re-check them against the linked source before relying on a figure. No live API calls are made; pricing is a static, dated table.