LLM pricing overview

High‑level pricing snapshot for a few popular models. For a more detailed table and filters, use the models page.

Budget‑friendly APIs

Lowest headline prices for prototyping and high‑volume workloads.

Input / output prices per 1M tokens.

Good trade‑off between quality, latency, and cost.

See individual model pages for more details and benchmarks; numbers here are just a starting point.