Rodium AI
Google

Veo 3.1

google/veo-3-1

visionvideo

From

238.5 RODI/s

~ 0.4 USD/s

Context

128K

Max output

Input:textimage
Output:video

Pricing

RateRODIUSD (ref.)Unit
Per second238.5~ 0.400USD/s · RODI/s
With audio / s238.5~ 0.400USD/s · RODI/s
No audio / s119.3~ 0.200USD/s · RODI/s
1080p with audio238.5~ 0.400USD/s · RODI/s
1080p no audio119.3~ 0.200USD/s · RODI/s
4K with audio357.7~ 0.600USD/s · RODI/s
4K no audio238.5~ 0.400USD/s · RODI/s

RODI prices include Rodium markup and upstream fees. USD figures are wholesale reference rates.

Capabilities

Streaming
Tool calling
Vision
JSON mode
Reasoning

About this model

Google's state-of-the-art video generation model, built for maximum visual fidelity in final production cuts. Veo 3.1 generates high-quality 1080p video from text or image prompts with native synchronized audio — including dialogue, ambient effects, and background sound. Supports scene extension (up to 20 chained clips for 140+ second narratives), frames-to-video transitions between two images, vertical video for Shorts, and 4K upscaling.

API usage

Use the canonical model slug in your chat completion requests.

Shell / scripts:

Chat completions docs →