Pricing
| Rate | RODI | USD (ref.) | Unit |
|---|---|---|---|
| In | 149.1 | ~ 0.250 | USD/M · RODI/M |
| Out | 894.2 | ~ 1.50 | USD/M · RODI/M |
| Cached | 15.0 | ~ 0.0250 | USD/M · RODI/M |
| Audio in | 29.9 | ~ 0.0500 | USD/M · RODI/M |
| Image in | 149.1 | ~ 0.250 | USD/M · RODI/M |
RODI prices include Rodium markup and upstream fees. USD figures are wholesale reference rates.
Capabilities
Streaming
Tool calling
Vision
JSON mode
Reasoning
About this model
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic workflows, simple data extraction, and applications where responsiveness and API cost are the primary constraints. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.
API usage
Use the canonical model slug in your chat completion requests.
…Shell / scripts:
…