Pricing
| Rate | RODI | USD (ref.) | Unit |
|---|---|---|---|
| Per second | 238.5 | ~ 0.400 | USD/s · RODI/s |
| With audio / s | 238.5 | ~ 0.400 | USD/s · RODI/s |
| No audio / s | 119.3 | ~ 0.200 | USD/s · RODI/s |
| 1080p with audio | 238.5 | ~ 0.400 | USD/s · RODI/s |
| 1080p no audio | 119.3 | ~ 0.200 | USD/s · RODI/s |
| 4K with audio | 357.7 | ~ 0.600 | USD/s · RODI/s |
| 4K no audio | 238.5 | ~ 0.400 | USD/s · RODI/s |
RODI prices include Rodium markup and upstream fees. USD figures are wholesale reference rates.
Capabilities
Streaming
Tool calling
Vision
JSON mode
Reasoning
About this model
Google's state-of-the-art video generation model, built for maximum visual fidelity in final production cuts. Veo 3.1 generates high-quality 1080p video from text or image prompts with native synchronized audio — including dialogue, ambient effects, and background sound. Supports scene extension (up to 20 chained clips for 140+ second narratives), frames-to-video transitions between two images, vertical video for Shorts, and 4K upscaling.
API usage
Use the canonical model slug in your chat completion requests.
…Shell / scripts:
…