Pricing
| Rate | RODI | USD (ref.) | Unit |
|---|---|---|---|
| In | 59.7 | ~ 0.100 | USD/M · RODI/M |
| Out | 238.5 | ~ 0.400 | USD/M · RODI/M |
| Cached | 6.0 | ~ 0.0100 | USD/M · RODI/M |
| Audio in | 17.9 | ~ 0.0300 | USD/M · RODI/M |
| Image in | 59.7 | ~ 0.100 | USD/M · RODI/M |
RODI prices include Rodium markup and upstream fees. USD figures are wholesale reference rates.
Capabilities
Streaming
Tool calling
Vision
JSON mode
Reasoning
About this model
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off c
API usage
Use the canonical model slug in your chat completion requests.
…Shell / scripts:
…