induwara.lk
induwara.lkDevelopers · AI

AI Model ID Cheat Sheet — the exact API model strings

The single most boring, most-Googled developer question answered: what is the exact string I type to call this model? Copy the precise model ID, base endpoint and a minimal snippet for OpenAI, Anthropic, Google, Meta, Mistral, DeepSeek and xAI — filter, toggle aliased vs pinned, ship. No signup, no keys, sources cited.

By Induwara AshinsanaUpdated Jun 29, 2026
Find a model ID29 models · 7 providers
Verified Jun 29, 2026

Matches the display name, the model ID, and the provider — type part of any of them.

Provider
Modality
Snippet language
ID variant
Showing 29 of 29 models
AnthropicTextVision

Claude Opus 4.8

claude-opus-4-8

Pinned snapshot — reproducible.

https://api.anthropic.com/v1/messages

1,000,000 ctx · 128,000 max out

Source
AnthropicTextVision

Claude Opus 4.7

claude-opus-4-7

Pinned snapshot — reproducible.

https://api.anthropic.com/v1/messages

1,000,000 ctx · 128,000 max out

Source
AnthropicTextVision

Claude Sonnet 4.6

claude-sonnet-4-6

Pinned snapshot — reproducible.

https://api.anthropic.com/v1/messages

1,000,000 ctx · 64,000 max out

Source
AnthropicTextVision

Claude Haiku 4.5

claude-haiku-4-5-20251001

Pinned snapshot — reproducible.

https://api.anthropic.com/v1/messages

200,000 ctx · 64,000 max out

Source
AnthropicTextVision

Claude Opus 4.5

claude-opus-4-5-20251101

Pinned snapshot — reproducible.

https://api.anthropic.com/v1/messages
Source
AnthropicTextVision

Claude Opus 4.1

claude-opus-4-1-20250805

Pinned snapshot — reproducible.

https://api.anthropic.com/v1/messages

200,000 ctx

Deprecated — scheduled to retire 2026-08-05. Migrate to claude-opus-4-8.

Source
OpenAITextVision

GPT-4o

gpt-4o-2024-08-06

Pinned snapshot — reproducible.

https://api.openai.com/v1/chat/completions

128,000 ctx · 16,384 max out

Source
OpenAITextVision

GPT-4o mini

gpt-4o-mini-2024-07-18

Pinned snapshot — reproducible.

https://api.openai.com/v1/chat/completions

128,000 ctx · 16,384 max out

Source
OpenAITextVision

GPT-4.1

gpt-4.1-2025-04-14

Pinned snapshot — reproducible.

https://api.openai.com/v1/chat/completions

1,047,576 ctx · 32,768 max out

Source
OpenAITextVision

o3

o3-2025-04-16

Pinned snapshot — reproducible.

https://api.openai.com/v1/chat/completions

200,000 ctx · 100,000 max out

Reasoning model — set max_completion_tokens, not max_tokens.

Source
OpenAITextVision

o4-mini

o4-mini-2025-04-16

Pinned snapshot — reproducible.

https://api.openai.com/v1/chat/completions

200,000 ctx · 100,000 max out

Reasoning model — set max_completion_tokens, not max_tokens.

Source
OpenAIEmbedding

text-embedding-3-large

text-embedding-3-large

Pinned snapshot — reproducible.

https://api.openai.com/v1/embeddings

Embedding model — 3072 dimensions by default.

Source
OpenAIImage

GPT Image 1

gpt-image-1

Pinned snapshot — reproducible.

https://api.openai.com/v1/images/generations

Image generation — called on the images endpoint, not chat.

Source
GoogleTextVisionAudio

Gemini 2.5 Pro

gemini-2.5-pro

Pinned snapshot — reproducible.

https://generativelanguage.googleapis.com/v1beta

1,048,576 ctx · 65,536 max out

Source
GoogleTextVisionAudio

Gemini 2.5 Flash

gemini-2.5-flash

Pinned snapshot — reproducible.

https://generativelanguage.googleapis.com/v1beta

1,048,576 ctx · 65,536 max out

Source
GoogleTextVisionAudio

Gemini 2.0 Flash

gemini-2.0-flash

Pinned snapshot — reproducible.

https://generativelanguage.googleapis.com/v1beta

1,048,576 ctx · 8,192 max out

Source
GoogleEmbedding

Gemini Embedding

gemini-embedding-001

Pinned snapshot — reproducible.

https://generativelanguage.googleapis.com/v1beta

Embedding model — called via :embedContent, not :generateContent.

Source
MetaTextVision

Llama 4 Maverick

Llama-4-Maverick-17B-128E-Instruct-FP8

Pinned snapshot — reproducible.

https://api.llama.com/v1/chat/completions

Open-weight repo on Hugging Face: meta-llama/Llama-4-Maverick-17B-128E-Instruct.

Source
MetaText

Llama 3.3 70B Instruct

Llama-3.3-70B-Instruct

Pinned snapshot — reproducible.

https://api.llama.com/v1/chat/completions

Open-weight repo on Hugging Face: meta-llama/Llama-3.3-70B-Instruct.

Source
MistralText

Mistral Large

mistral-large-2411

Pinned snapshot — reproducible.

https://api.mistral.ai/v1/chat/completions

131,072 ctx

Source
MistralText

Mistral Small

mistral-small-latest

Pinned snapshot — reproducible.

https://api.mistral.ai/v1/chat/completions
Source
MistralText

Codestral

codestral-latest

Pinned snapshot — reproducible.

https://api.mistral.ai/v1/chat/completions

Code-specialised model.

Source
MistralTextVision

Pixtral Large

pixtral-large-latest

Pinned snapshot — reproducible.

https://api.mistral.ai/v1/chat/completions
Source
MistralEmbedding

Mistral Embed

mistral-embed

Pinned snapshot — reproducible.

https://api.mistral.ai/v1/embeddings
Source
DeepSeekText

DeepSeek Chat

deepseek-chat

Pinned snapshot — reproducible.

https://api.deepseek.com/v1/chat/completions

65,536 ctx

Source
DeepSeekText

DeepSeek Reasoner

deepseek-reasoner

Pinned snapshot — reproducible.

https://api.deepseek.com/v1/chat/completions

65,536 ctx

Reasoning model — returns chain-of-thought in reasoning_content.

Source
xAITextVision

Grok 4

grok-4

Pinned snapshot — reproducible.

https://api.x.ai/v1/chat/completions
Source
xAIText

Grok 3

grok-3

Pinned snapshot — reproducible.

https://api.x.ai/v1/chat/completions
Source
xAIText

Grok 3 Mini

grok-3-mini

Pinned snapshot — reproducible.

https://api.x.ai/v1/chat/completions
Source

Every model ID, alias, and endpoint above is transcribed verbatim from each provider's official documentation and last cross-checked on Jun 29, 2026. An automatic format check confirms all 29/29 identifiers conform to their provider's naming pattern.

Aliased IDs auto-track the newest snapshot; pinned IDs are reproducible. Provider IDs drift — if a string 404s, check the cited source link on its card for the current value.

How it works

This is a reference lookup, not a calculator — so “methodology” here means data provenance and the transform from your filters to the rows you see. The data lives in a single typed module, lib/data/ai-model-id-cheatsheet.ts, holding 29 models across seven providers.

  1. Transcription, not inference. Every modelId, alias and endpoint is copied verbatim from the provider's official documentation. Each row stores its own source URL, surfaced as the “Source” link on every card.
  2. Aliased vs pinned. Where a provider offers both an undated alias (gpt-4o, claude-haiku-4-5) and a dated snapshot (gpt-4o-2024-08-06, claude-haiku-4-5-20251001), both are stored. The ID-variant toggle picks which one the pill and snippet show. Pinned snapshots are reproducible; aliases track the newest version, which can shift behaviour — so production code should usually pin.
  3. Filtering. The displayed rows are simply models.filter(provider && modality && nameMatches). It is pure and synchronous: chips narrow by provider and modality, the search box does a trimmed, case-insensitive substring match over the display name, model ID, alias and provider. No chip selected means “show all”, not “show none”.
  4. Snippet generation. For a model, language and variant, the snippet is a static template with the resolved model ID and endpoint interpolated in. Anthropic rows use the anthropic SDK shape, Gemini rows the google-genai shape, and the OpenAI-compatible providers (OpenAI, Mistral, DeepSeek, xAI, Meta's Llama API) use the openai client with the right base URL. Keys are always YOUR_API_KEY placeholders — nothing executes, nothing leaves your browser.
  5. Integrity check.A cross-check function validates that every stored ID matches its provider's documented naming pattern (Anthropic IDs start with claude-, xAI with grok-, and so on). The card footer reports the pass count, so a future mistyped entry is caught rather than silently shipped.

There are no rates or formulas to get wrong — correctness reduces to faithful transcription, which is exactly why the per-row source link and the LAST_VERIFIED date (Jun 29, 2026) are mandatory. For context windows, pricing and knowledge-cutoff dates, this page deliberately links out to the dedicated tools rather than duplicating them.

Worked examples

Anthropic · pinned vs aliased · Python

  1. Filter provider = Anthropic, search 'haiku'.
  2. One row appears: Claude Haiku 4.5.
  3. Variant = Pinned → pill shows claude-haiku-4-5-20251001
  4. Variant = Aliased → pill shows claude-haiku-4-5
  5. Endpoint (both): https://api.anthropic.com/v1/messages
  6. Python snippet pre-fills model="<the pill string>".

OpenAI · Vision filter · cURL

  1. Filter provider = OpenAI, modality = Vision, language = cURL.
  2. GPT-4o appears: gpt-4o-2024-08-06 (pinned) / gpt-4o (alias).
  3. Endpoint: https://api.openai.com/v1/chat/completions
  4. cURL body POSTs {"model": "gpt-4o-2024-08-06", ...}.
  5. text-embedding-3-large is EXCLUDED — it is Embedding-only,
  6. so the Vision filter drops it.

Modality = Embedding · all providers

  1. Select only the Embedding modality chip; no provider chip.
  2. Exactly the embedding rows remain:
  3. text-embedding-3-large (OpenAI, /v1/embeddings),
  4. gemini-embedding-001 (Google, :embedContent),
  5. mistral-embed (Mistral, /v1/embeddings).
  6. No chat or vision model appears — the filter is exclusive.

Frequently asked questions

Sources & references

All model IDs, aliases and endpoints were last cross-checked against these provider docs on Jun 29, 2026. Provider model lists change often; the page is reviewed on a rolling basis and each card links to its source so you can confirm the live value.

Related tools

Rate this tool
Be the first to rate

Comments & feedback

Spotted a bug or want an improvement? Tell us — our team reviews every comment, and good ideas get built. Comments are public and anonymous.

Spotted a model ID that has drifted, or a provider we should add?

Email me at [email protected] — most fixes ship within 24 hours.