What can your hardware run?

Pick your GPU. Get every open-weight model that fits at its best quantization — and the cheapest verified host for everything that doesn't.

scraped nightlyno estimatesfull price history

Your rig24 GB VRAM

Custom

1of 5 models fit your RTX 4090
at their best quantization

Llama 3.1 8B Instructunknown19.2 GB

params × bytes/quant × 1.2 overhead · verified May 27, 2026

The market today

all figures verified May 27, 2026

Price movers · 24hChangelog →

— new low — new low

— new low — new low

— new low — new low

— new low — new low

Featured · 90-day price$/1M output

Llama 3.1 8B Instruct$0.0200 · ▼ 60% / 90d

cheapest verified host per day · source: nightly scrape · full history →

Category leadersLeaderboard →

May 27DeepInfra listed DeepSeek V3 at $0.32/1M out

May 27DeepInfra listed Llama 3.1 70B Instruct at $0.40/1M out

May 27DeepInfra listed DeepSeek R1 Distill Llama 70B at $0.70/1M out

May 27DeepInfra listed Llama 3.1 8B Instruct at $0.0200/1M out

May 27DeepInfra listed Qwen 2.5 72B Instruct at $0.36/1M out