Realtime Voice API Cost Calculator — OpenAI & Gemini Live
Price a speech-to-speech voice agent on the OpenAI Realtime API or Gemini Live API. Models the four billed token classes — audio in, cached audio, audio out, text — and projects per-session, monthly and annual cost in USD and LKR. No signup, rates cited.
How it works
A normal LLM API bill has two lines: input tokens and output tokens. A speech-to-speech agent on the Realtime API is different — the provider meters four token classes, each on its own price tier:
- Audio input — the user's speech, tokenised.
- Cached audio input — re-sent context served from cache at a steep discount.
- Audio output — the model's spoken reply, the most expensive class.
- Text input — system prompt and instruction tokens.
The total per session is the sum of each class priced per 1,000,000 tokens:
cost = audioIn/1e6·rateIn + cachedIn/1e6·rateCached + audioOut/1e6·rateOut + textIn/1e6·rateText
When you enter minutes, audio tokens are derived first using each provider's documented tokenisation rate. OpenAI bills 1 token per 100 ms of input audio (600 tokens/minute) and 1 token per 50 ms of output audio (1,200 tokens/minute); Gemini bills 25 tokens per second (1,500 tokens/minute). Cached audio is carved out of audio input by the cached-share slider: cachedIn = audioIn × share, and the remainder is billed at the full input rate.
Monthly and annual figures are linear: monthly = perSession × sessions and annual = monthly × 12. LKR amounts multiply the USD result by your editable USD→LKR rate. Every per-token rate is pinned from the official OpenAI and Google pricing pages and carries a last-verified date (2026-06-07); nothing is fetched at runtime. To check the math, the page derives audio cost a second way — straight from the per-minute unit rate — and confirms it matches the token pipeline to the cent.
Worked examples
Frequently asked questions
Sources & references
- OpenAI — Realtime API cost guide (token classes & audio tokenisation)
- OpenAI — API pricing (gpt-realtime & gpt-4o-mini-realtime rates)
- Google — Gemini API pricing (Gemini 2.5 Flash native-audio Live API)
- Central Bank of Sri Lanka — daily indicative USD→LKR rate
Per-token rates were last cross-checked against the official OpenAI and Google pricing pages on 2026-06-07. AI prices change often; the rates are reviewed quarterly and after any provider pricing update.
Related tools
Comments & feedback
Spotted a bug or want an improvement? Tell us — our team reviews every comment, and good ideas get built. Comments are public and anonymous.
Found a bug, edge case, or want to suggest an improvement?
Email me at [email protected] — most fixes ship within 24 hours.