Rodium AI
Google

Lyria 3 Clip Preview

google/lyria-3-clip-preview

visionaudiochatlong-contextpreview

Input price

Output price

Context

1.0M

Max output

Input:textimage
Output:textaudio

Pricing

Pricing unavailable

RODI prices include Rodium markup and upstream fees. USD figures are wholesale reference rates.

Capabilities

Streaming
Tool calling
Vision
JSON mode
Reasoning

About this model

30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz stereo audio from text prompts or from images. These models deliver structural coherence, including vocals, timed lyrics, and full instrumental arrangements. Lyria 3 Clip can generate short clips, loops, previews.

API usage

Use the canonical model slug in your chat completion requests.

Shell / scripts:

Chat completions docs →