Gemini 3.1 Flash Lite

google/gemini-3-1-flash-lite

visionchatlong-context

Input price

149.1 RODI/M

~ 0.25 USD/M

Output price

894.2 RODI/M

~ 1.5 USD/M

Context

1.0M

Max output

—

Input:textimagevideodocumentaudio

Output:text

Pricing

Rate	RODI	USD (ref.)	Unit
In	149.1	~ 0.250	USD/M · RODI/M
Out	894.2	~ 1.50	USD/M · RODI/M
Cached	15.0	~ 0.0250	USD/M · RODI/M
Audio in	29.9	~ 0.0500	USD/M · RODI/M
Image in	149.1	~ 0.250	USD/M · RODI/M

RODI prices include Rodium markup and upstream fees. USD figures are wholesale reference rates.

Capabilities

Streaming

Tool calling

Vision

JSON mode

Reasoning

About this model

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic workflows, simple data extraction, and applications where responsiveness and API cost are the primary constraints. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.

API usage

Use the canonical model slug in your chat completion requests.

…

Shell / scripts:

…

Chat completions docs →

Back to all models All Google models