Pricing · 01

Halton Meter bundled rates · 2026-05-01

Pricing matrix bundled with halton-meter v0.1.24. Anthropic, OpenAI, Google Gemini, xAI. Bundled 2026-05-01. Cost computation runs locally.

macOS 12+ · Python 3.11+ Reading time 2 min Updated May 11, 2026

This is the pricing matrix bundled with halton-meter v0.1.24. Cost computation runs locally on the user’s machine using these rates. Sourced from each provider’s public pricing page on the bundle date. Methodology →

All figures are USD per million tokens. Cache and thinking columns are shown where the provider exposes them; an em dash means the provider does not bill that lane separately for the model. Gemini’s tiered surcharge for prompts above 200k tokens is shown on the row directly beneath the standard rate.

Anthropic

27 models · per million tokens

Model Input Output Cache read Cache write Thinking
claude-3-5-haiku-20241022 $0.8 $4 $0.08 $1 $4
claude-3-5-haiku-latest $0.8 $4 $0.08 $1 $4
claude-3-7-sonnet-20250219 $3 $15 $0.3 $3.75 $15
claude-3-7-sonnet-latest $3 $15 $0.3 $3.75 $15
claude-3-haiku-20240307 $0.25 $1.25 $0.03 $0.3125 $1.25
claude-3-opus-20240229 $15 $75 $1.5 $18.75 $75
claude-3-opus-latest $15 $75 $1.5 $18.75 $75
claude-haiku-4-5 $1 $5 $0.1 $1.25 $5
claude-haiku-4-5-20251001 $1 $5 $0.1 $1.25 $5
claude-opus-4-0 $15 $75 $1.5 $18.75 $75
claude-opus-4-1 $15 $75 $1.5 $18.75 $75
claude-opus-4-1-20250805 $15 $75 $1.5 $18.75 $75
claude-opus-4-20250514 $15 $75 $1.5 $18.75 $75
claude-opus-4-5 $5 $25 $0.5 $6.25 $25
claude-opus-4-5-20251101 $5 $25 $0.5 $6.25 $25
claude-opus-4-6 $5 $25 $0.5 $6.25 $25
claude-opus-4-6-20251101 $5 $25 $0.5 $6.25 $25
claude-opus-4-7 $5 $25 $0.5 $6.25 $25
claude-opus-4-7-20260101 $5 $25 $0.5 $6.25 $25
claude-opus-4-8 $5 $25 $0.5 $6.25 $25
claude-opus-4-8-20260219 $5 $25 $0.5 $6.25 $25
claude-sonnet-4-0 $3 $15 $0.3 $3.75 $15
claude-sonnet-4-20250514 $3 $15 $0.3 $3.75 $15
claude-sonnet-4-5 $3 $15 $0.3 $3.75 $15
claude-sonnet-4-5-20250929 $3 $15 $0.3 $3.75 $15
claude-sonnet-4-6 $3 $15 $0.3 $3.75 $15
claude-sonnet-4-6-20251101 $3 $15 $0.3 $3.75 $15

OpenAI

21 models · per million tokens

Model Input Output Cache read Cache write Thinking
codex-auto-review $0.4 $1.6 $0.1 $0 $1.6
codex-unknown $2 $8 $0.5 $0 $8
gpt-3.5-turbo $0.5 $1.5 $0 $0 $1.5
gpt-4.1 $2 $8 $0.5 $0 $8
gpt-4.1-mini $0.4 $1.6 $0.1 $0 $1.6
gpt-4.1-nano $0.1 $0.4 $0.025 $0 $0.4
gpt-4o $2.5 $10 $1.25 $0 $10
gpt-4o-mini $0.15 $0.6 $0.075 $0 $0.6
gpt-5.2 $2 $8 $0.5 $0 $8
gpt-5.3-codex $2 $8 $0.5 $0 $8
gpt-5.4 $2.5 $15 $0.25 $0 $15
gpt-5.4-mini $0.75 $4.5 $0.075 $0 $4.5
gpt-5.5 $5 $30 $0.5 $0 $30
o1 $15 $60 $7.5 $0 $60
o1-pro $150 $600 $0 $0 $600
o3 $10 $40 $2.5 $0 $40
o3-mini $1.1 $4.4 $0.55 $0 $4.4
o4-mini $1.1 $4.4 $0.275 $0 $4.4
text-embedding-3-large $0.13 $0 $0 $0 $0
text-embedding-3-small $0.02 $0 $0 $0 $0
text-embedding-ada-002 $0.1 $0 $0 $0 $0

Google Gemini

11 models · per million tokens

Model Input Output Cache read Cache write Thinking
gemini-2.5-flash $0.3 $2.5 $0.03 $0 $2.5
gemini-2.5-flash-lite $0.1 $0.4 $0.01 $0 $0.4
gemini-2.5-pro $1.25 $10 $0.125 $0 $10
>200k tokens $2.5 $15 $0.63 $3.125 $15
gemini-3-flash-preview $0.5 $3 $0.05 $0 $3
gemini-3.1-flash-image-preview $0.5 $3 $0.05 $0.625 $3
modal: audio in $0 audio out $0 image gen $60
gemini-3.1-flash-lite $0.25 $1.5 $0.025 $0 $1.5
gemini-3.1-flash-lite-preview $0.25 $1.5 $0.025 $0.3125 $1.5
gemini-3.1-flash-live-preview $0.75 $4.5 $0 $0 $4.5
modal: audio in $3 audio out $12 image gen $0
gemini-3.1-pro-preview $2 $12 $0.2 $0 $12
>200k tokens $4 $18 $1 $5 $18
gemini-3.5-flash $1.5 $9 $0.15 $0 $9
gemini-code-assist-unknown $0.5 $3 $0.05 $0 $3

xAI

14 models · per million tokens

Model Input Output Cache read Cache write Thinking
grok-2-vision-1212 $1.25 $2.5 $0.2 $0 $2.5
grok-3 $1.25 $2.5 $0.2 $0 $2.5
grok-3-fast $1.25 $2.5 $0.2 $0 $2.5
grok-3-mini $0.3 $0.5 $0 $0 $0.5
grok-3-mini-fast $0.6 $4 $0 $0 $4
grok-4 $1.25 $2.5 $0.2 $0 $2.5
grok-4-mini $1.25 $2.5 $0.2 $0 $2.5
grok-4.20-0309-non-reasoning $1.25 $2.5 $0.2 $0 $2.5
grok-4.20-0309-reasoning $1.25 $2.5 $0.2 $0 $2.5
grok-4.20-multi-agent-0309 $1.25 $2.5 $0.2 $0 $2.5
grok-4.3 $1.25 $2.5 $0.2 $0 $2.5
grok-4.3-latest $1.25 $2.5 $0.2 $0 $2.5
grok-build $1 $2 $0.2 $0 $2
grok-build-0.1 $1 $2 $0.2 $0 $2

Last updated 2026-05-01 from daemon/halton_meter/pricing/matrix.py. JSON: /rates.json · Freshness manifest: /rates-manifest.json