Models & Cost Reference
Summary: what this page covers
A keeper page: the current model lineup with per-token pricing, the cost-reduction levers, and a clear map of what genuinely needs paid access versus where a free/local alternative works. Verify the rates against current pricing before the event.
Free-first: default to free/local where possible; pay only where it's genuinely required.
Model lineup & pricing
Per million tokens, input / output. (Verify against current pricing before the event.)
Cost-reduction levers
- Prompt caching — up to ~90% off the cached input portion (Section 1).
- Batch processing — ~50% cheaper for non-interactive workloads.
- Model choice — Haiku for CI reviews and mechanical tasks (Section 5).
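To see how the levers above combine, here is a rough estimator sketch. It makes several assumptions that are not stated on this page: the rates in the usage example are placeholders (not real prices), the caching and batch discounts are assumed to stack multiplicatively, and any surcharge for writing to the cache is ignored. `Rates` and `estimate_cost` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass

# Discounts taken from the levers listed above; both are approximate.
CACHE_DISCOUNT = 0.90  # "up to ~90% off" the cached input portion
BATCH_DISCOUNT = 0.50  # "~50% cheaper" for non-interactive workloads


@dataclass(frozen=True)
class Rates:
    """Per-million-token prices in USD, supplied by the caller."""
    input_per_mtok: float
    output_per_mtok: float


def estimate_cost(rates, input_tokens, output_tokens,
                  cached_fraction=0.0, batch=False):
    """Estimate the cost of a workload in USD.

    cached_fraction is the share of input tokens served from the prompt cache.
    Assumption: cache and batch discounts stack; cache-write costs are ignored.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")

    cached = input_tokens * cached_fraction
    uncached = input_tokens - cached

    # Cached input tokens are billed at (1 - CACHE_DISCOUNT) of the input rate.
    billable_input = uncached + cached * (1 - CACHE_DISCOUNT)
    total = (billable_input * rates.input_per_mtok
             + output_tokens * rates.output_per_mtok) / 1_000_000

    if batch:
        total *= 1 - BATCH_DISCOUNT
    return total


if __name__ == "__main__":
    # Placeholder rates for illustration only; substitute current pricing.
    placeholder = Rates(input_per_mtok=1.00, output_per_mtok=5.00)
    base = estimate_cost(placeholder, 1_000_000, 100_000)
    cached = estimate_cost(placeholder, 1_000_000, 100_000, cached_fraction=0.8)
    both = estimate_cost(placeholder, 1_000_000, 100_000,
                         cached_fraction=0.8, batch=True)
    print(f"baseline: ${base:.2f}")        # baseline: $1.50
    print(f"with caching: ${cached:.2f}")  # with caching: $0.78
    print(f"caching + batch: ${both:.2f}") # caching + batch: $0.39
```

Model choice enters through the `Rates` passed in: swapping in a cheaper model's rates changes the estimate without touching the other two levers.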
Paid vs free/local map
Remember: a Claude.ai Pro/Max subscription does NOT grant API access. Day 2 runs on the Anthropic API, which is billed separately (pay-as-you-go) — you need a Console key.
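A quick pre-flight check can catch a missing key before Day 2 starts. The sketch below assumes the Console key is exported as the `ANTHROPIC_API_KEY` environment variable; `has_api_key` is a hypothetical helper name. It only checks that a value is present and does not verify that the key is valid or funded.

```python
import os


def has_api_key(env=None):
    """Return True if a non-empty ANTHROPIC_API_KEY is present.

    env defaults to the process environment; pass a dict to test other values.
    Assumption: the key is exported under the name ANTHROPIC_API_KEY.
    """
    env = os.environ if env is None else env
    return bool(env.get("ANTHROPIC_API_KEY", "").strip())


if __name__ == "__main__":
    if has_api_key():
        print("API key found in environment.")
    else:
        print("No API key set: create one in the Console and export it.")
```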