How much storage does 1 million embeddings need?

For OpenAI text-embedding-3-large (3072 dims) at float32, raw vectors are 1,000,000 × 3072 × 4 bytes = 12.29 GB, or 11.44 GiB. On an HNSW index the graph adds about 50%, so plan for roughly 17.2 GiB of RAM. A smaller 1536-dim model halves that, and int8 quantization quarters the raw size.

How much RAM does an HNSW vector index use?

Qdrant's capacity rule is memory ≈ 1.5 × num_vectors × dimensions × 4 bytes for float32. The 1.5 factor is the raw vectors plus roughly a 0.5× HNSW graph overhead. This calculator uses that identity, so 100,000 vectors at 1536 dims need about 0.86 GiB of RAM on HNSW versus 0.57 GiB on a Flat index.

How much does Pinecone cost per month?

Pinecone Serverless bills stored data (vectors plus metadata) at about $0.33 per GB-month. So 100,000 float32 vectors at 1536 dims — 0.61 GB stored — costs roughly $0.20 per month for storage. Read, write, and query units are billed separately, so your real bill depends on traffic, not just stored size.

Does int8 quantization reduce vector database size?

Yes. int8 scalar quantization stores 1 byte per dimension instead of 4, so raw vectors shrink about 4× (a 75% saving). Binary quantization uses 1 bit per dimension, around 32× smaller. The trade-off is recall: most setups keep full-precision vectors on disk and rescore the top candidates to recover accuracy.

How big is a 1536-dimension embedding in bytes?

At float32, one 1536-dim vector is 1536 × 4 = 6,144 bytes (6 KiB). At float16 it is 3,072 bytes, at int8 it is 1,536 bytes, and at binary it is 192 bytes. Multiply by your vector count to get the raw payload before index overhead and metadata.

What is the difference between RAM and disk in the results?

Disk is the raw vectors plus metadata you must persist. RAM is what the engine holds in memory to serve fast queries — vectors plus the HNSW graph. RAM drives both your instance size and managed pricing on RAM-tier providers, which is why it is the headline figure here.

Are the provider prices live?

No. Rates are baked-in list prices verified on 2026-06-06 and cover the storage or RAM tier only — they exclude query, read, and write units and per-cluster compute minimums. Treat the monthly figure as a sizing estimate and confirm against each vendor's current pricing page before committing.

Does this tool generate or store my embeddings?

No. It only sizes what a set of embeddings will occupy. Nothing is uploaded — the math runs in your browser. To estimate the cost of generating embeddings in the first place, use the AI Embedding Cost Calculator instead; this tool is the infrastructure-sizing step that comes after.

AI · Infrastructure

Vector Database Storage & RAM Calculator

Find out how much disk, RAM, and monthly money your embeddings will cost to store and serve. Enter a vector count, model, and precision to see raw size, HNSW overhead, total RAM, and an estimated Pinecone, Qdrant, Weaviate, or self-hosted bill — no signup.

By Induwara Ashinsana— Executive Director, Ryzera TechnologiesUpdated Jun 6, 2026

Size your vector index

Verified · Qdrant 1.5× rule

Common setups

Number of vectors

One per embedded chunk / row.

Embedding model

Precision

Index type

Metadata bytes / vector

Stored payload per vector (IDs, text, tags). 0 if none.

Managed provider

USD → LKR rate

CBSL indicative rate — edit for today's value.

Total RAM needed

878.9 MiB

Fits a 1 GiB instance

Total disk

585.9 MiB

Vectors + metadata, no graph

Raw vectors

585.9 MiB

100,000 × 1,536 × 4 B

Index overhead

293.0 MiB

HNSW graph (≈0.5× raw)

Estimated monthly cost — Self-host (pgvector)

No managed fee

Self-provisioned — a 1 GiB RAM instance covers this index.

No managed fee — provision your own instance from the RAM figure.. Storage / RAM tier only — excludes read, write, and query units.

Same vectors at every precision

Precision	Bytes/dim	Raw vectors	Saved vs float32
float32 (full precision)	4	585.9 MiB	—
float16 / half (halfvec)	2	293.0 MiB	50%
int8 (scalar quantization)	1	146.5 MiB	75%
binary (1-bit quantization)	0.125	18.3 MiB	96.88%

Byte sizes follow Qdrant's capacity-planning and quantization guides. Provider rates are published list prices verified on 2026-06-06 and cover storage / RAM tiers only — confirm against each vendor's live pricing before you commit.

How it works

Vector search costs are mostly storage costs, and storage is just byte arithmetic. Every embedding is an array of numbers; this tool multiplies that out and adds the overhead each index type carries. All sizes use binary units — 1 GiB = 2³⁰ = 1,073,741,824 bytes.

Bytes per dimensionfollow the precision, from Qdrant's quantization guide: float32 = 4 bytes, float16 = 2, int8 = 1, and binary = 0.125 (one bit).
Raw vector storage = num_vectors × dimensions × bytes_per_dim. A 1536-dim float32 vector is 6,144 bytes; a million of them is 5.72 GiB.
Metadata = num_vectors × metadata_bytes_per_vector — the IDs, text snippets, and tags you store alongside each vector.
Index overhead. A Flat (exact) index adds nothing. An HNSW graph adds roughly 0.5 ×the raw vector footprint for its links, which is where Qdrant's capacity rule of thumb — memory ≈ 1.5 × num_vectors × dim × 4 for float32 — comes from.
Total RAM = raw + metadata + overhead (what the engine holds in memory to serve queries). Total disk = raw + metadata (the graph is rebuilt or memory-mapped as needed).
Managed cost. Each provider bills differently: Pinecone Serverless charges per GB of stored data ($0.33/GB-month), Qdrant Cloud is sized by RAM, and Weaviate Serverless bills per stored dimension ($0.05 per million dimensions-month). The tool applies the right basis per provider and converts to LKR using the editable CBSL indicative rate.

The float32 + HNSW path is cross-checked against Qdrant's published 1.5 × n × d × 4 identity to the byte, the same way a tax tool reconciles against the official table. Managed prices are storage / RAM tier only and exclude query, read, and write units — stated plainly so the estimate is never oversold.

Worked examples

100k vectors · text-embedding-3-small (1536) · float32 · HNSW

Raw: 100,000 × 1536 × 4 = 614,400,000 B = 0.57 GiB
HNSW overhead: 0.5 × 614,400,000 = 307,200,000 B = 0.29 GiB
Total RAM: 921,600,000 B = 0.86 GiB (= 1.5 × raw ✓ Qdrant rule)
Total disk: 0.57 GiB — fits a 1 GiB instance
Pinecone: 0.6144 GB × $0.33 = $0.20 / month

1M vectors · text-embedding-3-large (3072) · HNSW · float32 vs int8 vs binary

float32 raw: 1,000,000 × 3072 × 4 = 12.29 GB = 11.44 GiB; RAM ×1.5 = 17.17 GiB
int8 raw: 1,000,000 × 3072 × 1 = 3,072,000,000 B = 2.86 GiB; RAM ×1.5 = 4.29 GiB
int8 saving vs float32 raw: 1 − 1/4 = 75%
binary raw: 1,000,000 × 3072 × 0.125 = 384,000,000 B = 0.36 GiB
binary saving: 1 − 0.125/4 = 96.875% (4 / 0.125 = 32× smaller ✓)

Edge case — Flat index with metadata, 100k vectors · 1536 · float32 · 500 B/vector

Raw: 100,000 × 1536 × 4 = 614,400,000 B
Metadata: 100,000 × 500 = 50,000,000 B
Flat index overhead: 0 (no graph)
Total disk = total RAM: 664,400,000 B = 0.62 GiB
Switching to HNSW would add 0.29 GiB of graph on top of this.

Frequently asked questions

Sources & references

Dimension constants, byte sizes, and provider rates were last cross-checked against these sources on 2026-06-06. Provider pricing is reviewed quarterly and after any vendor pricing change.

Related tools

LiveAI

Vector DB Compare

Compare the major vector databases — Pinecone, Weaviate, Qdrant, Milvus/Zilliz, Chroma, pgvector, Redis and MongoDB Atlas — on free tier, license, hosting model, max dimensions, index types (HNSW/IVF/DiskANN) and hybrid search. Pick 2–6, sort any column, and read a cited 'which to choose' verdict. Free, no signup.

Open tool

LiveAI

RAG Chunk Size Calculator

Plan a RAG embedding pipeline before you spend a cent: enter a document length, chunk size and overlap, and instantly see the chunk count, tokens duplicated by overlap, total tokens embedded, embedding cost in USD across OpenAI, Cohere and Voyage models, the vector-store size for float32/float16/int8, and the top-k retrieval prompt budget. Pure client-side sliding-window math — no keys, no upload.

Open tool

LiveAI

Embedding Cost Calc

Estimate USD and LKR cost of generating vector embeddings across OpenAI, Cohere, Voyage AI, Google Gemini, and Mistral. One-time indexing, monthly queries, and first-year totals side-by-side with output dimensions — every price sourced.

Open tool

Rate this tool

Be the first to rate

Comments & feedback

Spotted a bug or want an improvement? Tell us — our team reviews every comment, and good ideas get built. Comments are public and anonymous.

Found a bug, edge case, or want a provider added?

Email me at [email protected] — most fixes ship within 24 hours.