1 block = 1 PDF page
β 650 PDF pages
visually-processed PDF pages
When a model reads a PDF page as both text and image (preserving layout, tables and figures), each page costs ~1,500 tokens. Text-only extraction is cheaper: ~2,000 pages per million tokens.
basis: ~1,500 tokens per page with layout