1 tile = 1 image
β 1,000 images
detailed images analyzed
Vision models spend roughly 800β1,600 tokens to read one detailed image, so a million tokens lets a model look at about a thousand photos or screenshots.
basis: ~1,000 tokens per image
A million tokens, measured in
Vision models pay per image in tokens too. At ~1,000 tokens per detailed image, a million tokens is a photo library of about a thousand pictures, read and understood.
1 tile = 1 image
β 1,000 images
detailed images analyzed
Vision models spend roughly 800β1,600 tokens to read one detailed image, so a million tokens lets a model look at about a thousand photos or screenshots.
basis: ~1,000 tokens per image
Everything on this page is exactly one million tokens. So the price of having a model read all of it is simply each model's per-million rate. Updated 2026-07-03.
| Model | Read all of it (input) |
|---|---|
| GPT-5.5 Pro | $30 |
| Claude Fable 5 | $10 |
| GPT-5.5 | $5 |
| Claude Opus 4.8 | $5 |
| Claude Sonnet 5 | $2 |
| Gemini 3.1 Pro | $2 |
| Gemini 3.5 Flash | $1.50 |
| Claude Haiku 4.5 | $1 |
| Gemini 3 Flash | $0.50 |
From $0.50 to $30 for the same million tokens. Try your own budget in the calculator β
Or start over with the full picture β scale, prices, and the budget calculator.