7 providers50 models
Open-weight LLM leaderboard — nightly

Tell us your hardware. We'll rank the models you can run.

Pick your GPU or enter your VRAM, and modelbeat ranks the open-weight LLMs that fit — at the best quantization for your card, with a composite intelligence index, output speed, and context window. Can't run it locally? We show the cheapest provider to rent it instead. 50 models across 7 providers, scraped nightly.

Browse all models Cost calculatorlive · verified May 27, 2026

Category leaders

Full leaderboard →
DeepSeek V3.2$0.27in / 1M tokDeepInfra-12%
Llama 4 70B$0.35in / 1M tokHyperbolic-8%
Gemma 3 27B$0.18in / 1M tokDeepInfra-4%
Phi-4$0.08in / 1M tokDeepInfra— flat

DeepSeek V3.2 — 90-day output price

Median across providers · $ / 1M tokens
▼ 58%Detail →
$2.40$1.50$0.9990d ago60d30dtoday

What we crosswalk

8 dimensions
$ / 1M inputacross every provider
$ / 1M outputnormalized currency
Latency (p50, p95)observed via OpenRouter
Throughputtok/s · decode-only
Context windoweffective, not advertised
QuantizationFP8 / FP16 / Q4 etc.
RegionUS, EU, APAC
Licensederived per model

The toolkit

04 surfaces · all free to read
indexed nightly

Cheapest leaderboard

Live ranking of providers per model. Filter by region, context window, latency budget.

View ranking →
indexed nightly

Per-model detail

Every provider stacked, 90-day price history, observed latency, benchmarks side-by-side.

Browse models →
indexed nightly

Provider crosswalk

Compare hosts on price, capability, throughput and a workload of your choice.

Open providers →
indexed nightly

Workload calculator

Plug in monthly tokens — get the actual bill on every provider. No marketing fluff.

Run a workload →
50
open-weights models tracked
7
inference providers crawled
350
model × provider crosswalks
90d
price history per model

Tracked providers

View all →

Recent price drops

last sync May 27, 2026
Full changelog →
ModelProviderBeforeAfterΔTierTrend
DeepSeek V3.2DeepInfra$1.10$0.99-10.0%output
Llama 4 70BHyperbolic$0.42$0.40-4.8%output
Qwen 3 72BTogether$0.95$0.90-5.3%input
Mistral Large 3Bedrock$5.00$4.40-12.0%output
Gemma 3 27BGroq$0.30$0.27-10.0%input