Question 1

How much does it cost to embed a million tokens with OpenAI?

Accepted Answer

OpenAI charges $0.02 per 1M tokens for text-embedding-3-small and $0.13 per 1M tokens for text-embedding-3-large. So embedding one million tokens costs about $0.02 (≈ Rs 6) on the small model and $0.13 (≈ Rs 39) on the large model at Rs 300/USD. Query embeddings are billed the same way.
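The arithmetic behind those figures can be sketched in a few lines. This is an illustrative sketch, not the calculator's actual code: the function names are hypothetical, the prices are the ones quoted in this answer, and the Rs 300/USD rate is the same assumed rate.

```python
# Prices per 1M tokens in USD, as quoted in the answer above.
PRICE_PER_MILLION_USD = {
    "text-embedding-3-small": 0.02,
    "text-embedding-3-large": 0.13,
}


def embedding_cost_usd(tokens: int, model: str) -> float:
    """Cost in USD to embed `tokens` tokens with `model`."""
    return tokens / 1_000_000 * PRICE_PER_MILLION_USD[model]


def to_lkr(usd: float, rate: float = 300.0) -> float:
    """Convert USD to LKR at an assumed rate (Rs 300/USD by default)."""
    return usd * rate


if __name__ == "__main__":
    for model in PRICE_PER_MILLION_USD:
        usd = embedding_cost_usd(1_000_000, model)
        print(f"{model}: ${usd:.2f} (about Rs {to_lkr(usd):.0f})")
```

Because query embeddings are billed the same way, the same function applies to query traffic: pass the total query tokens instead of the document tokens.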

Question 2

Is Cohere embed cheaper than OpenAI text-embedding-3-small?

Accepted Answer

No. Cohere embed-english-v3.0 lists at $0.10 per 1M tokens, which is five times the price of OpenAI text-embedding-3-small at $0.02 per 1M tokens. Cohere is competitive only against OpenAI's larger text-embedding-3-large or ada-002 tiers, or when you need multilingual coverage that includes Sinhala or Tamil.

Question 3

How do I estimate the cost of building a RAG system?

Accepted Answer

Multiply (documents × average tokens per document) by the model's $/1M token price for the one-time indexing cost. Add (queries/day × tokens/query × 30) ÷ 1M × price for monthly query cost. Multiply monthly by 12 and add any planned re-embeds (driven by content churn, model upgrades, or chunking changes). The calculator above does all of this in your browser.
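The formula above can be written out as a small function. This is a minimal sketch under stated assumptions: the function name and the input values are hypothetical, and each planned re-embed is assumed to cost the same as one full indexing pass.

```python
def estimate_first_year_cost(
    documents: int,
    avg_tokens_per_document: int,
    queries_per_day: int,
    tokens_per_query: int,
    price_per_million_usd: float,
    planned_reembeds: int = 0,
) -> dict:
    """Estimate first-year embedding spend in USD."""
    # One-time indexing cost: total corpus tokens times the per-token price.
    index_cost = (
        documents * avg_tokens_per_document / 1_000_000 * price_per_million_usd
    )
    # Monthly query cost uses a 30-day month, as in the formula above.
    monthly_query_cost = (
        queries_per_day * tokens_per_query * 30 / 1_000_000 * price_per_million_usd
    )
    yearly_query_cost = monthly_query_cost * 12
    # Assumption: every re-embed re-processes the whole corpus.
    reembed_cost = planned_reembeds * index_cost
    return {
        "index": index_cost,
        "queries_year": yearly_query_cost,
        "reembeds": reembed_cost,
        "first_year": index_cost + yearly_query_cost + reembed_cost,
    }


if __name__ == "__main__":
    # Hypothetical workload: 10,000 documents of 2,000 tokens each,
    # 500 queries per day at 50 tokens per query, priced at $0.02 per 1M tokens.
    estimate = estimate_first_year_cost(10_000, 2_000, 500, 50, 0.02)
    for label, usd in estimate.items():
        print(f"{label}: ${usd:.2f}")
```

With these illustrative inputs the corpus is 20M tokens, so indexing comes to $0.40 and a year of queries to $0.18, for a first-year total of $0.58.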

Question 4

What's the difference between text-embedding-3-small and text-embedding-3-large pricing?

Accepted Answer

text-embedding-3-small costs $0.02 per 1M tokens with 1,536 default dimensions; text-embedding-3-large costs $0.13 per 1M tokens (6.5× more) with 3,072 default dimensions. Both let you request smaller output dimensions to save on vector-storage cost. Large is worth the premium for high-recall semantic search; small is plenty for most product-style RAG.

Question 5

How many tokens are in an average PDF page?

Accepted Answer

Roughly 500–600 tokens for a typical letter-size page of body text, or about 250–300 words. Tables, code, and dense academic prose push this up; image-heavy pages can come in below 100. For budgeting purposes, 600 tokens per page is a reasonable conservative figure for ordinary body text. Use induwara.lk/tools/ai-token-counter to count exactly before you commit to a model.
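For a quick budget before counting exactly, the per-page figure can be turned into a token and cost estimate. This is a rough sketch: the function names are hypothetical, and the 600 tokens per page default is only the budgeting figure from this answer, not a measured value.

```python
TOKENS_PER_PAGE_BUDGET = 600  # budgeting figure from the answer above


def estimate_pdf_tokens(pages: int, tokens_per_page: int = TOKENS_PER_PAGE_BUDGET) -> int:
    """Rough token count for a PDF of `pages` pages of body text."""
    return pages * tokens_per_page


def estimate_pdf_cost_usd(pages: int, price_per_million_usd: float) -> float:
    """Rough embedding cost in USD for a PDF, using the budgeting figure."""
    return estimate_pdf_tokens(pages) / 1_000_000 * price_per_million_usd


if __name__ == "__main__":
    pages = 200  # hypothetical document length
    tokens = estimate_pdf_tokens(pages)
    cost = estimate_pdf_cost_usd(pages, 0.02)
    print(f"{pages} pages is about {tokens:,} tokens, or ${cost:.4f} at $0.02 per 1M")
```

A 200-page document budgets to 120,000 tokens, well under a cent at $0.02 per 1M tokens, so the estimate mainly matters once a corpus reaches thousands of documents.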

Question 6

Why is Voyage 3-lite the same $0.02/M as OpenAI 3-small?

Accepted Answer

Voyage launched the 3-lite tier to price-match OpenAI's small model and undercut the rest of the field. The trade-off is output dimensionality: Voyage 3-lite emits 512-dim vectors versus OpenAI 3-small's 1,536. Smaller vectors mean cheaper Pinecone or Qdrant storage and faster ANN queries — sometimes the more important saving for high-scale RAG.
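The storage effect of dimensionality can be estimated with simple arithmetic. The sketch below assumes vectors are stored as 4-byte float32 values and ignores index overhead, metadata, and any vendor-specific compression, so real Pinecone or Qdrant footprints will differ; the function name is hypothetical.

```python
BYTES_PER_FLOAT32 = 4  # assumption: vectors stored as float32


def raw_vector_bytes(num_vectors: int, dimensions: int) -> int:
    """Raw payload size in bytes, excluding index overhead and metadata."""
    return num_vectors * dimensions * BYTES_PER_FLOAT32


if __name__ == "__main__":
    num_vectors = 1_000_000  # hypothetical corpus of one million chunks
    for name, dims in [("voyage-3-lite", 512), ("text-embedding-3-small", 1536)]:
        gb = raw_vector_bytes(num_vectors, dims) / 1e9
        print(f"{name}: {dims} dims, about {gb:.2f} GB raw")
```

Under these assumptions a 1,536-dimension index needs three times the raw storage of a 512-dimension one for the same number of vectors.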

Question 7

Does this include vector-database storage cost?

Accepted Answer

No — this calculator prices the embedding API call only. Vector-database storage (Pinecone, Weaviate, Qdrant, pgvector) is a separate line item, billed per stored vector or per index size. The output-dimensions column on the comparison table is what you'd plug into a Pinecone calculator next.

Question 8

How is the LKR figure calculated?

Accepted Answer

LKR = USD × your chosen USD→LKR rate. The default is the latest Central Bank of Sri Lanka (CBSL) daily indicative rate (about Rs 300/USD on the verification date). Edit it to match your bank's actual conversion rate or the rate Wise or Payoneer quoted you. Banks typically settle 1–3% weaker than the CBSL indicative rate.
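The conversion, with an optional allowance for a bank's settlement rate, might look like this. It is a sketch only: the function name and the `bank_spread` parameter are hypothetical, and "weaker" is interpreted here as paying more rupees per dollar than the indicative rate.

```python
def usd_to_lkr(usd: float, rate: float = 300.0, bank_spread: float = 0.0) -> float:
    """Convert USD to LKR.

    rate: USD to LKR rate (assumed Rs 300/USD by default).
    bank_spread: fraction by which the bank's rate is weaker than the
        indicative rate, e.g. 0.03 for 3%. Assumption: a weaker rupee
        means more rupees are paid per dollar.
    """
    return usd * rate * (1 + bank_spread)


if __name__ == "__main__":
    first_year_usd = 0.58  # example USD total
    print(f"Indicative: Rs {usd_to_lkr(first_year_usd):.2f}")
    print(f"With 3% spread: Rs {usd_to_lkr(first_year_usd, bank_spread=0.03):.2f}")
```

At the indicative rate, $0.58 converts to Rs 174; a 3% weaker settlement rate raises that to about Rs 179.22.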

Question 9

Are these prices live?

Accepted Answer

No. Every per-million-token price is hand-verified against the vendor's pricing page and stored alongside the source URL in the calculator's source code. Last full verification was on 2026-05-12. Prices are reviewed quarterly and after any major vendor announcement.

Question 10

Why don't I see Anthropic, Meta, or Hugging Face in the list?

Accepted Answer

Anthropic does not yet sell first-party embedding API access; Meta's Llama embeddings are intended for self-hosting; and Hugging Face Inference Endpoints price by hardware-hour rather than per token. Different unit economics, so no apples-to-apples comparison row would be honest. If any of those vendors publish a per-token embedding tier, it will be added here within a quarter.

Model	Vendor	Dimensions	$/M tokens	Index cost	Query cost (yr)	First year USD	First year LKR
text-embedding-3-small	OpenAI	1,536	$0.0200	$0.40	$0.18	$0.58	Rs 174
voyage-3-lite	Voyage AI	512	$0.0200	$0.40	$0.18	$0.58	Rs 174
embed-english-v3.0	Cohere	1,024	$0.1000	$2.00	$0.90	$2.90	Rs 870
gemini-embedding-001	Google	3,072	$0.1500	$3.00	$1.35	$4.35	Rs 1,305

AI Embedding Cost Calculator
