Question 1

How accurate is the default token estimate?

Accepted Answer

Within ±10% for English text across all major models. The estimate uses empirical character-to-token ratios validated against official tokenizers. For exact counts, use the official vendor tokenizer APIs.

Question 2

Which models are supported?

Accepted Answer

OpenAI GPT-5 family (GPT-5.5, GPT-5.4, GPT-5 Mini, GPT-5 Nano), Anthropic Claude family (Opus 4.8, Sonnet 4.6, Haiku 4.5), and Google Gemini family (Gemini 3.5 Flash, Gemini 2.5 Pro, Gemini 2.5 Flash). Pricing is updated monthly.

Question 3

Why don't you use the exact tokenizer by default?

Accepted Answer

Because the OpenAI tokenizer alone is 600KB. We keep the page under 50KB on first load so it works instantly on mobile. The ±10% estimate is accurate enough for cost planning.

Question 4

Are my prompts sent anywhere?

Accepted Answer

No. All counting and cost math happens in your browser. No prompts are uploaded. Verify in DevTools Network tab.

Question 5

How do you handle prompt caching costs?

Accepted Answer

Set the cache-hit slider in the advanced section. The calculator applies the cached-input rate (typically ~10% of standard input) to that portion of your tokens.

Question 6

What's the agent workload preset?

Accepted Answer

It models multi-turn agent scenarios. The 10 tool calls preset adds 500 input + 200 output tokens per tool call to your base prompt, simulating a typical agent run.

Question 7

Can I compare multiple prompts at once?

Accepted Answer

Yes. Paste prompts separated by --- and toggle Batch mode in the advanced section.

Question 8

How do you decide which model is cheapest?

Accepted Answer

For your specific token count and output length, we compute cost across every model and highlight the lowest. Adjust the daily-calls slider to see how it scales monthly.

Question 9

Are Gemini token counts as accurate as OpenAI or Claude?

Accepted Answer

Gemini has no public official tokenizer for browser use, so we use the empirical chars/3.8 ratio. For workloads where ±10% matters, validate against the Gemini API's countTokens endpoint.

Question 10

How often is pricing updated?

Accepted Answer

Monthly, manually, with a diff in the public repo. The footer shows Prices as of YYYY-MM-DD. If you spot a stale rate, file an issue on GitHub.

AI Token Counter + Prompt Cost Calculator

Cost Breakdown

How to Use

Why We Built This

Frequently Asked Questions

Related Tools

About Token Counting