induwara.lk
induwara.lkDevelopers · AI

Best Embedding Model: Compare Price, Dimensions & MTEB Quality

Compare 15 text-embedding models from OpenAI, Cohere, Voyage, Google, Mistral, and the open-source world on price, dimensions, max tokens, and MTEB score — then project the exact cost to embed your corpus. No signup, no ads, sources cited below.

By Induwara AshinsanaUpdated Jun 20, 2026
Compare embedding models

Whole number, zero or more.

≈ 750 words per 1,000 tokens. PDF page ≈ 500–600 tokens.

Rs

For the LKR cost column. Edit to match your bank's rate.

Corpus presets
Sort by
Pick: Best value
gte-large-en-v1.5
Free API(self-host)

Top quality-per-dollar — free to self-host (GPU not included).

Cheapest
gte-large-en-v1.5
Free API(self-host)

Free API cost — self-host (GPU not included).

Best quality (MTEB)
gte-large-en-v1.5
Free API(self-host)

Highest MTEB average (65.4).

All 15 of 15 models

ModelDimMax tokens$/1MMTEBLicenseCost (one-time)Cost (LKR)
gte-large-en-v1.5 Cheapest Top MTEB
Alibaba
1,0248,19265.4Apache-2.0Free API
bge-large-en-v1.5
BAAI
1,02451264.2MITFree API
bge-base-en-v1.5
BAAI
76851263.5MITFree API
nomic-embed-text-v1.5
Nomic · Matryoshka → 64d
7688,19262.3Apache-2.0Free API
e5-large-v2
intfloat
1,02451262.3MITFree API
multilingual-e5-large
intfloat
1,02451461.5MITFree API
text-embedding-3-small
OpenAI · Matryoshka → 512d
1,5368,191$0.0262.3Proprietary$1.00Rs 300
voyage-3-lite
Voyage AI
51232,000$0.02n/aProprietary$1.00Rs 300
text-embedding-ada-002
OpenAI
1,5368,191$0.1061.0Proprietary$5.00Rs 1,500
embed-english-v3.0
Cohere
1,024512$0.1064.5Proprietary$5.00Rs 1,500
embed-multilingual-v3.0
Cohere
1,024512$0.10n/aProprietary$5.00Rs 1,500
mistral-embed
Mistral
1,0248,192$0.10n/aProprietary$5.00Rs 1,500
text-embedding-3-large
OpenAI · Matryoshka → 256d
3,0728,191$0.1364.6Proprietary$6.50Rs 1,950
gemini-embedding-001
Google · Matryoshka → 768d
3,0722,048$0.15n/aProprietary$7.50Rs 2,250
voyage-3-large
Voyage AI · Matryoshka → 256d
1,02432,000$0.18n/aProprietary$9.00Rs 2,700

Open-weight models cost $0 in API fees — you pay for GPU/compute instead, which this tool does not estimate (the value score floors the divisor at $0.01so free models rank by raw MTEB). See “How it works” below.

All math runs in your browser. Prices & scores are a static 2026-06-20 snapshot — see Sources below.

How it works

Choosing a text-embedding model comes down to four numbers: how much it costs per token, how good its vectors are at retrieval, how many tokens it accepts per call, and how large each output vector is. This tool puts all four side by side for 15 of the most-used models and then projects the cost to embed your own corpus.

The cost projection uses one formula:

  1. Total tokens = number of documents × average tokens per document.
  2. Embedding cost = (total tokens ÷ 1,000,000) × price per 1M tokens. Each model's price comes from its provider's public pricing page, cited below.
  3. Open-weight models(bge, gte, e5, nomic) show $0 API cost with a “self-host” label. We deliberately do not invent a GPU/compute price — that depends on your hardware — and link to the self-hosting cost calculator instead.
  4. Monthly re-embedding multiplies the one-time cost by 12 to show annualised spend; one-time shows the single figure.

Quality is the MTEB (Massive Text Embedding Benchmark) average, displayed verbatim from the leaderboard snapshot — never computed by this page. The MTEB(eng, v1) — snapshot 2026-06-20is used so every score is comparable; the Retrieval sub-score is the most relevant one for search and RAG. Where a provider only publishes private or multilingual benchmarks, the MTEB cell shows “n/a” and that model is excluded from the quality ranking rather than guessed at.

The three recommendation cards are deterministic. Cheapest is the lowest projected cost. Best quality is the highest MTEB average. Best value maximises MTEB ÷ max(cost, $0.01) — quality per dollar — with the one-cent floor letting free self-hosted models rank by their raw MTEB instead of dividing by zero. Because all rates and scores are constants, identical inputs always produce identical output. The per-document and bulk cost formulas are cross-checked against each other in the data module to guard against arithmetic drift.

Worked examples

Freelancer RAG index (one-time)

200,000 chunks × 400 tokens = 80,000,000 tokens

  1. Total tokens: 200,000 × 400 = 80,000,000 (80M)
  2. text-embedding-3-small @ $0.02/M: 80 × 0.02 = $1.60
  3. text-embedding-3-large @ $0.13/M: 80 × 0.13 = $10.40
  4. Cohere embed-english-v3.0 @ $0.10/M: 80 × 0.10 = $8.00
  5. voyage-3-lite @ $0.02/M: 80 × 0.02 = $1.60
  6. bge-large-en-v1.5 (open): $0 API — self-host on a GPU

Startup knowledge base (re-embedded monthly)

1,000,000 docs × 600 tokens = 600,000,000 tokens, annualised ×12

  1. Total tokens: 1,000,000 × 600 = 600,000,000 (600M)
  2. text-embedding-3-small: 600 × $0.02 = $12 → ×12 = $144/yr
  3. text-embedding-3-large: 600 × $0.13 = $78 → ×12 = $936/yr
  4. mistral-embed @ $0.10/M: 600 × 0.10 = $60 → ×12 = $720/yr

Boundary check (one-time)

Exactly 1,000,000 tokens — 1,000 docs × 1,000 tokens

  1. Total tokens: 1,000 × 1,000 = 1,000,000 (1M)
  2. text-embedding-3-small @ $0.02/M: 1 × 0.02 = $0.02
  3. Zero documents on any model: $0.00 — no error, no negative cost

Frequently asked questions

Sources & references

Related tools

Rate this tool
Be the first to rate

Comments & feedback

Spotted a bug or want an improvement? Tell us — our team reviews every comment, and good ideas get built. Comments are public and anonymous.

Found a stale price, a missing model, or a better MTEB source?

Email me at [email protected] — most fixes ship within 24 hours.