Are tool definitions sent on every API request?

Yes. The model is stateless, so the full tools array is re-sent with each call — it is not remembered between requests. That is why a small-looking schema multiplied by tens of thousands of calls becomes a real line item on your bill.

How much do function definitions add to my OpenAI or Claude bill?

Multiply the tool-definition tokens by your monthly calls and the model's input price. Example: 1,200 tool tokens × 100,000 calls = 120 million tokens; at Claude Sonnet's $3/1M that is $360/month, paid purely to re-send the definitions. The calculator does this for any model.

Does prompt caching reduce tool-definition cost?

Yes, substantially. A stable tool-definition prefix can be cached, so repeat reads bill at the cache-read rate — about 0.1× the input rate on Anthropic. In the $360/month example, caching the tools prefix drops it to roughly $36/month. Toggle caching in the calculator to see your saving.
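
As a rough illustration of what caching the tools prefix can look like in a request, the sketch below marks the last tool with a cache_control breakpoint, which is how Anthropic's Messages API denotes the end of a cacheable prefix. The helper name with_cached_tools is hypothetical, and the sketch only builds the payload; it does not call any API.

```python
import copy

def with_cached_tools(tools):
    """Return a copy of `tools` with a cache breakpoint on the last entry.

    Hypothetical helper: assumes Anthropic-shaped tool dicts and only
    prepares the payload; sending it is left to your client code.
    """
    cached = copy.deepcopy(tools)  # leave the caller's list untouched
    if cached:
        # A breakpoint on the final tool covers the whole tools array before it.
        cached[-1]["cache_control"] = {"type": "ephemeral"}
    return cached

tools = [
    {"name": "get_order_status", "description": "Look up an order.",
     "input_schema": {"type": "object", "properties": {}}},
    {"name": "issue_refund", "description": "Refund an order.",
     "input_schema": {"type": "object", "properties": {}}},
]
cached_tools = with_cached_tools(tools)
```

Cache reads generally depend on the prefix being identical between calls, so it helps to keep tool order and wording fixed once caching is on.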

How do I reduce the token cost of tool use?

Two levers: trim and cache. Shorten verbose descriptions, drop rarely-used optional parameters, and keep names terse — the trim simulator shows the saving. Then enable prompt caching for the stable tools prefix. Combining both can cut tool-definition cost by over 90%.

Why does the Claude token count look different from the official one?

This tool counts tokens in your browser using gpt-tokenizer (OpenAI's tiktoken / cl100k_base). Anthropic's own tokenizer typically produces 10–15% more tokens on the same JSON. For an exact Claude figure, send your tools array to Anthropic's count_tokens endpoint; the dollar arithmetic here is exact for whatever token count you use.
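
If you only have the in-browser count, one way to bracket the likely Claude figure is to apply the 10–15% uplift quoted above as a range. This is a minimal sketch under that assumption; the function name claude_token_range is illustrative, and the count_tokens endpoint remains the way to get an exact number.

```python
def claude_token_range(cl100k_tokens, low_pct=10, high_pct=15):
    """Scale a cl100k_base count by the 10-15% uplift quoted above.

    Uses integer arithmetic and rounds up, so the result is a whole
    number of tokens. Returns (low_estimate, high_estimate).
    """
    def scale(pct):
        # Ceiling division without floating-point error.
        return -(-cl100k_tokens * (100 + pct) // 100)
    return scale(low_pct), scale(high_pct)

print(claude_token_range(1_200))  # prints (1320, 1380)
```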

Does the per-model table use different token counts?

No — it deliberately prices the same token count across every model so you see the pure price spread. The same 1,200-token tools array costs $120/month on Haiku, $360 on Sonnet, $600 on Opus and $1,200 on Fable at 100,000 calls. Token counts vary slightly between real tokenizers; the table isolates price.

Function Calling / Tool-Use Token Cost Calculator

Paste your tool definitions and see the hidden cost of re-sending them on every LLM API call — per model, with and without prompt caching, plus the saving from trimming verbose schemas. Counts run in your browser; no API key, no signup.

By Induwara Ashinsana, Executive Director, Ryzera Technologies. Updated Jun 19, 2026

Tool-use token cost

Anthropic-verified rate

Tool / function definitions (JSON)

[
  {
    "name": "get_order_status",
    "description": "Look up the current status, carrier, and estimated delivery date of a customer's order by its order ID.",
    "input_schema": {
      "type": "object",
      "properties": {
        "order_id": { "type": "string", "description": "The order identifier, e.g. ORD-10294." },
        "include_tracking": { "type": "boolean", "description": "Whether to include carrier tracking events." }
      },
      "required": ["order_id"]
    }
  },
  {
    "name": "issue_refund",
    "description": "Issue a full or partial refund for an order to the customer's original payment method.",
    "input_schema": {
      "type": "object",
      "properties": {
        "order_id": { "type": "string", "description": "The order identifier to refund." },
        "amount": { "type": "number", "description": "Refund amount; omit for a full refund." },
        "reason": { "type": "string", "description": "Reason code for the refund." }
      },
      "required": ["order_id", "reason"]
    }
  }
]

Paste the exact tools array you send on every request (Anthropic or OpenAI shape). Counted in your browser with gpt-tokenizer — nothing is uploaded.
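
For readers unsure which shape they have, the sketch below shows one way to map an Anthropic-shaped tool (name, description, input_schema) to the nested function shape used by OpenAI's Chat Completions API. The helper name anthropic_to_openai is hypothetical, and other OpenAI endpoints may expect a different layout.

```python
def anthropic_to_openai(tool):
    """Map an Anthropic-shaped tool dict to the Chat Completions tool shape."""
    return {
        "type": "function",
        "function": {
            "name": tool["name"],
            "description": tool.get("description", ""),
            # The JSON Schema itself carries over unchanged.
            "parameters": tool["input_schema"],
        },
    }

anthropic_tool = {
    "name": "get_order_status",
    "description": "Look up the status of a customer's order by its order ID.",
    "input_schema": {
        "type": "object",
        "properties": {"order_id": {"type": "string"}},
        "required": ["order_id"],
    },
}
openai_tool = anthropic_to_openai(anthropic_tool)
```

The two shapes serialize to slightly different JSON, so their token counts can differ by a few tokens per tool.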

Provider & model

Currency

Calls per month

How many API requests you make in a month.

Conversation tokens / call

Average input tokens per call, excluding tool definitions.

Output tokens / call

Average visible output tokens per call.

Prompt caching

Claude Sonnet 4.6 cache-read rate: $0.30/1M.

Monthly tool-def cost (without caching)

Per-call overhead: tokens and $/call (no cache)

Share of input bill: tools vs conversation (no cache)

Caching could save: per month vs full input rate

Paste a tool definition above to see its per-call token overhead and monthly cost.

Same tools, every model

Model	Input $/1M	Monthly (no cache)	Monthly (cached)
Claude Opus 4.8	$5	$0.00	$0.00
Claude Sonnet 4.6	$3	$0.00	$0.00
Claude Haiku 4.5	$1	$0.00	$0.00
Fable 5	$10	$0.00	$0.00
GPT-5 (list price)	$1.25	$0.00	$0.00
GPT-4o (list price)	$2.5	$0.00	$0.00
GPT-4o mini (list price)	$0.15	$0.00	$0.00
Gemini 3 Pro (list price)	$1.25	$0.00	$0.00
Gemini 3 Flash (list price)	$0.3	$0.00	$0.00

Identical token count priced across tiers — this isolates the price spread. Cached column uses each model's cache-read rate.

Trim simulator

Trimmed tool tokens (from 0)

Target per-call tool-definition tokens after trimming.

Monthly saving

$0.00

0% less · new cost $0.00/mo

Tool/function definitions are serialized into the request and billed as input tokens on every call (Anthropic token-counting API; OpenAI cookbook). Token counts are in-browser estimates (gpt-tokenizer / cl100k_base); Anthropic's own tokenizer counts ~10–15% higher, so for an exact Claude figure use its count_tokens endpoint. Anthropic prices are cross-checked; OpenAI/Gemini are published list prices that change without notice. Rates last verified 2026-06-19. Full sources are listed below the calculator.

How it works

Large language models are stateless, so the full set of tool (function) definitions you give a model is serialized into the request and re-sent on every single call. Those definitions are billed as input tokens: Anthropic's token-counting API returns input counts that include the tools array, and OpenAI's token-counting cookbook documents that function definitions are injected into the prompt and counted as input. A schema that looks trivial can quietly dominate your bill once it is multiplied across high call volume.

The calculator works in four steps:

1. Count tool-definition tokens. Your pasted JSON is tokenized in the browser with gpt-tokenizer (the JavaScript port of OpenAI's tiktoken, cl100k_base) to get the per-call overhead, plus a per-tool breakdown so you can spot the bloated one.
2. Project monthly tokens. Per-call tool tokens × calls per month gives the monthly tool-definition input tokens.
3. Apply the price. Monthly cost = tool_tokens × calls ÷ 1,000,000 × input_$/1M. With the caching toggle on, the stable tools prefix bills at the cache-read rate (≈0.1× the input rate on Anthropic) instead of the full input rate.
4. Compare and trim. The same token count is priced across every model so you see the pure price spread, and the trim simulator shows the monthly saving from cutting your schemas to a target size.
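
The pricing step above can be sketched in a few lines. This is an illustration of the stated formula rather than the calculator's actual source; the function name and the cache_read_multiplier parameter are assumptions, and the sketch covers cache reads only.

```python
def monthly_tool_cost(tool_tokens, calls_per_month, input_price_per_mtok,
                      cached=False, cache_read_multiplier=0.1):
    """Monthly cost in dollars of re-sending tool definitions.

    cache_read_multiplier=0.1 mirrors the approximate Anthropic cache-read
    rate quoted above; treat it as an assumption for other providers.
    """
    monthly_mtok = tool_tokens * calls_per_month / 1_000_000  # millions of tokens
    rate = input_price_per_mtok * (cache_read_multiplier if cached else 1.0)
    return monthly_mtok * rate

no_cache = monthly_tool_cost(1_200, 100_000, 3.00)
with_cache = monthly_tool_cost(1_200, 100_000, 3.00, cached=True)
print(f"${no_cache:,.2f} without caching, ${with_cache:,.2f} with caching")
# prints: $360.00 without caching, $36.00 with caching
```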

The share of input bill figure divides the tool-definition cost by the total input cost (tool definitions + conversation), both at the full input rate, so you can see what fraction of your input spend is just re-sent definitions. Token counts are in-browser estimates: Anthropic's own tokenizer counts about 10–15% higher on the same JSON, so for a token-exact Claude figure use Anthropic's count_tokens endpoint. The dollar arithmetic is exact for whatever token count you supply, and every monthly figure is cross-checked by an independent per-call calculation.
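
Because both parts of the share are priced at the same input rate over the same number of calls, price and call volume cancel, and the share reduces to a ratio of per-call token counts. A minimal sketch, with a hypothetical function name, using the 1,200 tool tokens and 1,500 conversation tokens per call from the worked examples:

```python
def tool_share_of_input(tool_tokens, conversation_tokens):
    """Fraction of uncached input spend that goes to tool definitions.

    Price and calls per month cancel out, so only the per-call token
    counts are needed.
    """
    total = tool_tokens + conversation_tokens
    return tool_tokens / total if total else 0.0

share = tool_share_of_input(1_200, 1_500)
print(f"{share:.0%}")  # prints 44%
```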

Worked examples

Support agent, 8 tools, Claude Sonnet 4.6

Tool definitions: 1,200 tokens/call; 100,000 calls/month; input $3/1M.
Monthly tool tokens: 1,200 × 100,000 = 120,000,000 (120 MTok)
Cost (no caching): 120 × $3 = $360.00/month
With prompt caching (0.1× → $0.30/1M): 120 × $0.30 = $36.00/month
Caching saving: $360 − $36 = $324.00/month
Conversation 1,500 tok/call: 150 × $3 = $450.00; tool share = 360 ÷ 810 = 44%

Trim the schemas (same workload)

Cut verbose descriptions: 1,200 → 700 tokens/call; 100,000 calls; Sonnet $3/1M.
Cost: 700 × 100,000 ÷ 1e6 × $3 = 70 × $3 = $210.00/month
Saving vs un-trimmed: $360 − $210 = $150.00/month from trimming alone
Add caching on the 700-token prefix: 70 × $0.30 = $21.00/month
Trim + cache together: ~94% off the original $360.
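
The combined figure follows from multiplying the two reductions: trimming leaves 700/1,200 of the tokens, and caching bills what remains at about 0.1× the input rate. A short sketch of that arithmetic, with a hypothetical function name and the 0.1 multiplier taken as an assumption from the example above:

```python
def combined_reduction(original_tokens, trimmed_tokens, cache_read_multiplier=0.1):
    """Fractional saving from trimming and then caching the tools prefix."""
    remaining = (trimmed_tokens / original_tokens) * cache_read_multiplier
    return 1 - remaining

saving = combined_reduction(1_200, 700)
print(f"{saving:.1%}")  # prints 94.2%
```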

Model spread (1,200 tokens, 100k calls, no caching)

Same 1,200-token tools array, identical token count across models.
Haiku 4.5 ($1/1M): 120 × $1 = $120/month
Sonnet 4.6 ($3/1M): 120 × $3 = $360/month
Opus 4.8 ($5/1M): 120 × $5 = $600/month
Fable 5 ($10/1M): 120 × $10 = $1,200/month — a 10× spread for the same tools.
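
The spread above can be reproduced by pricing one token count against each input rate. The sketch below hard-codes the four rates from this example; the dictionary and function names are illustrative, and the rates should be re-checked against current pricing before reuse.

```python
# Input prices in $ per million tokens, copied from the example above.
INPUT_PRICE_PER_MTOK = {
    "Claude Haiku 4.5": 1.00,
    "Claude Sonnet 4.6": 3.00,
    "Claude Opus 4.8": 5.00,
    "Fable 5": 10.00,
}

def model_spread(tool_tokens, calls_per_month, prices=INPUT_PRICE_PER_MTOK):
    """Monthly uncached tool-definition cost per model for one token count."""
    monthly_mtok = tool_tokens * calls_per_month / 1_000_000
    return {model: monthly_mtok * price for model, price in prices.items()}

costs = model_spread(1_200, 100_000)
spread = max(costs.values()) / min(costs.values())
for model, cost in costs.items():
    print(f"{model}: ${cost:,.2f}/month")
```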

Frequently asked questions

Sources & references

Prices and the token-counting method were last cross-checked against the cited sources on 2026-06-19. Anthropic rates are verified against the Anthropic pricing page; OpenAI and Gemini rows are published list prices that change without notice. Token counts are in-browser estimates — for billing-exact Claude counts, use Anthropic's count_tokens endpoint.

Related tools

AI Vision Token Calculator

Calculate how many tokens an image costs on GPT-4o, GPT-4o mini, Claude, and Gemini from its pixel dimensions — plus the per-image and total cost in USD and LKR, side by side. Runs entirely in your browser; the image is never uploaded.

Reasoning Token Cost Calc

Estimate the true cost of reasoning-model API calls by accounting for the hidden reasoning/thinking tokens that o-series, Claude, and Gemini bill at output rates. See per-call and monthly cost in USD and LKR, plus how much more it is than a naive estimate.

AI Web Search Cost

Estimate the true monthly cost of LLM web search — the per-search/grounding tool fee plus the retrieved-content input tokens — across Claude, GPT, Gemini and Sonar, with a cheapest-provider comparison.

