Models & Cost Reference

Summary — what this page covers A keeper page: the current model lineup with per-token pricing, the cost-reduction levers, and a clear map of what genuinely needs paid access versus where a free/local alternative works. Verify the rates against current pricing before the event.

Free-first: default to free/local where possible; pay only where it's genuinely required.

Model lineup & pricing

Per million tokens, input / output. (Verify against current pricing before the event.)

Model	Input / Output (per 1M tokens)	Use for
Claude Haiku 4.5	$1 / $5	High-volume, mechanical work; CI code review
Claude Sonnet 4.6	$3 / $15	Default for app features
Claude Opus 4.8	$5 / $25	Hardest reasoning

Cost-reduction levers

Prompt caching — up to ~90% off the cached input portion (Section 1).
Batch processing — ~50% cheaper for non-interactive workloads.
Model choice — Haiku for CI reviews and mechanical tasks (Section 5).

Paid vs free/local map

Capability	Needs paid?	Free / local alternative
Claude API calls (all sections)	Yes — API key required	Trial credit covers the workshop
RAG embeddings (Section 3)	OpenAI key by default	Ollama + `nomic-embed-text` (768-dim) behind the same interface
Vector store (Section 3)	No	Qdrant in Docker (local)
Integration test infra (Section 5)	No	TestContainers (local Docker)
Claude Code (Day 1)	Pro min ($20/mo)	— (not on Free)

Remember: a Claude.ai Pro/Max subscription does NOT grant API access. Day 2 runs on the Anthropic API, which is billed separately (pay-as-you-go) — you need a Console key.

Models & Cost Reference

Model lineup & pricing

Cost-reduction levers

Paid vs free/local map

See also