Rodium AI
Google

Gemini 3.1 Flash Lite

google/gemini-3-1-flash-lite

visionchatlong-context

Input price

149.1 RODI/M

~ 0.25 USD/M

Output price

894.2 RODI/M

~ 1.5 USD/M

Context

1.0M

Max output

Input:textimagevideodocumentaudio
Output:text

Pricing

RateRODIUSD (ref.)Unit
In149.1~ 0.250USD/M · RODI/M
Out894.2~ 1.50USD/M · RODI/M
Cached15.0~ 0.0250USD/M · RODI/M
Audio in29.9~ 0.0500USD/M · RODI/M
Image in149.1~ 0.250USD/M · RODI/M

RODI prices include Rodium markup and upstream fees. USD figures are wholesale reference rates.

Capabilities

Streaming
Tool calling
Vision
JSON mode
Reasoning

About this model

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic workflows, simple data extraction, and applications where responsiveness and API cost are the primary constraints. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

API usage

Use the canonical model slug in your chat completion requests.

Shell / scripts:

Chat completions docs →