induwara.lk
induwara.lkAI · Cost calculator

AI Embedding Cost Calculator

Enter your corpus size and query volume; see the first-year USD and LKR cost of generating embeddings side-by-side across OpenAI, Cohere, Voyage AI, Google Gemini, and Mistral. Every per-million-token price is cited from the vendor's pricing page.

By Induwara AshinsanaUpdated May 12, 2026
Embedding cost comparison

Whole number, zero or more.

≈ 750 words per 1,000 tokens. PDF page ≈ 500–600 tokens.

0 = never. 3 = quarterly. 12 = yearly.

Each query embedding is also priced.

Typical short user query: 20–80 tokens.

Rs

CBSL daily indicative rate. Edit to match your bank's rate.

Workload presets

4 of 15 models selected · Vendors: OpenAI, Cohere, Voyage AI, Google, Mistral.

Cheapest first year
$0.58
text-embedding-3-small (OpenAI) · Rs 174
Index tokens
20,000,000
No reindex scheduled in year 1
Monthly query tokens
750,000
× 12 = 9,000,000 per year
ModelDim$/M tokensIndexQueries (yr)First year USDFirst year LKR
text-embedding-3-small Cheapest
OpenAI
1,536$0.0200$0.40$0.18$0.58Rs 174
voyage-3-lite
Voyage AI
512$0.0200$0.40$0.18$0.58Rs 174
embed-english-v3.0
Cohere
1,024$0.1000$2.00$0.90$2.90Rs 870
gemini-embedding-001
Google
3,072$0.1500$3.00$1.35$4.35Rs 1,305

First-year USD per model

text-embedding-3-small
$0.58
voyage-3-lite
$0.58
embed-english-v3.0
$2.90
gemini-embedding-001
$4.35
All math runs in your browser. No corpus, query, or API key leaves the page.

How it works

The calculator treats every embedding model the same way: take the input text in tokens, divide by 1,000,000, and multiply by the vendor's published $/M-token price. That price comes from the vendor pricing pages cited at the bottom of this page, hand-verified on 2026-05-12and stored alongside its source URL in the calculator's code.

Three line items add up to a first-year cost. The math is intentionally boring:

index_tokens         = documents × avg_tokens_per_doc
index_cost_usd       = index_tokens / 1,000,000 × price_per_million
monthly_query_tokens = queries_per_day × avg_tokens_per_query × 30
monthly_query_usd    = monthly_query_tokens / 1,000,000 × price_per_million
reindexes_year_1     = reindex_months > 0 ? floor(12 / reindex_months) - 1 : 0
first_year_usd       = index_cost_usd
                     + reindexes_year_1 × index_cost_usd
                     + 12 × monthly_query_usd
first_year_lkr       = first_year_usd × usd_to_lkr_rate

The reindex term models the realistic case where a knowledge base gets re-embedded on a schedule — content drifts, chunking strategies change, vendors ship a better model — and the initial corpus pass has to be paid again. floor(12 / reindex_months) - 1 counts the additionalreindexes in the first 12 months; quarterly is 3 extra passes, monthly is 11, yearly is 0, and 0 months means "never".

Google's Vertex AI embedding models text-embedding-005 and text-multilingual-embedding-002 publish prices per character rather than per token. The calculator normalises using Google's documented 4-chars-per-token approximation so every row compares like-for-like. The page exposes a cross-check that re-derives the per-million-token price from the published per-character figure and confirms the stored value: char→token cross-check passes ($0.10/M tokens). Where a tier doesn't publish a public price (enterprise-only contracts, fine-tuned variants) the row is omitted, not estimated.

Worked-example self-test (computed live on this page) — each line reconciles the formula above with the hand-derived numbers in the code header:

  • 5k×4k corpus, 500q/day×50t, yearly reindex — OpenAI 3-small → expected $0.58, got $0.58
  • Same workload — Cohere English v3 → expected $2.90, got $2.90
  • 50k×2k corpus, 5kq/day×80t, quarterly reindex — OpenAI 3-large → expected $70.72, got $70.72
  • Zero documents, zero queries — Voyage 3-lite → expected $0.00, got $0.00
  • Monthly reindex of 5k×4k corpus — OpenAI 3-small → expected $4.98, got $4.98

Out of scope for v1: vector-database storage (Pinecone, Weaviate, Qdrant, pgvector are a separate cost line); chat-completion / RAG generation cost (use the upcoming AI API cost calculator); token counting (use the AI Token Counter); reranker pricing; fine-tuned embedding tiers; and self-hosted GPU embeddings, which are billed by hardware-hour, not per token.

Worked examples

Product knowledge base · 5,000 docs × 4,000 tokens · 500 q/day

Indie SaaS chatbot over product manuals. Yearly reindex. Conversion at Rs 300/USD.

  1. Index tokens: 5,000 × 4,000 = 20,000,000
  2. Monthly query tokens: 500 × 50 × 30 = 750,000
  3. Reindexes in year 1: floor(12/12) − 1 = 0
  4. OpenAI 3-small @ $0.02/M: index 20 × 0.02 = $0.40, queries 12 × 0.015 = $0.18 → $0.58 (≈ Rs 174)
  5. Voyage 3-lite @ $0.02/M: tied with OpenAI 3-small at $0.58
  6. Cohere English v3 @ $0.10/M: $2.00 + 0.90 = $2.90 (≈ Rs 870)
  7. Gemini embedding 001 @ $0.15/M: $3.00 + 1.35 = $4.35 (≈ Rs 1,305)
  8. Cheapest: OpenAI 3-small / Voyage 3-lite tie at this workload.

Support knowledge base · 50,000 docs × 2,000 tokens · 5,000 q/day

Enterprise support assistant. Re-embed every 3 months. Heavy query load.

  1. Index tokens: 50,000 × 2,000 = 100,000,000
  2. Monthly query tokens: 5,000 × 80 × 30 = 12,000,000
  3. Reindexes in year 1: floor(12/3) − 1 = 3 (so 4 corpus passes total)
  4. OpenAI 3-large @ $0.13/M: index $13 + 3 × $13 + 12 × $1.56 = $70.72 (≈ Rs 21,216)
  5. OpenAI 3-small @ $0.02/M: $2 + $6 + $2.88 = $10.88 (≈ Rs 3,264)
  6. Voyage 3 @ $0.06/M: $6 + $18 + $8.64 = $32.64 (≈ Rs 9,792)
  7. Cheapest: OpenAI 3-small — 7× cheaper than 3-large at this volume.

Edge case · monthly reindex on the product KB

Same 5,000 × 4,000 corpus, but indexed fresh every month — driven by daily content updates.

  1. Reindexes in year 1: floor(12/1) − 1 = 11 (so 12 passes total)
  2. OpenAI 3-small @ $0.02/M: index $0.40 + 11 × $0.40 + 12 × $0.015 = $4.98
  3. First year cost jumps 9× versus yearly reindex ($0.58 → $4.98).
  4. Lesson: pick the slowest reindex cadence your content-change rate allows.

Frequently asked questions

Sources & references

Related tools

Rate this tool
Be the first to rate

Comments & feedback

Spotted a bug or want an improvement? Tell us — our team reviews every comment, and good ideas get built. Comments are public and anonymous.

Found a price that has moved, or an edge case the calculator doesn't cover?

Email me at [email protected] — most fixes ship within 24 hours.