Text to Video

Create AI videos from text prompts with model, resolution, aspect ratio, duration, and audio controls.

Checking credits
AI Model
Model Version
Prompt
0/5000
Aspect Ratio
Resolution
25 credits1080p · 8s · silent

AI Video Showcase

Explore cinematic motion, character consistency, and multi-modal video generation examples.

Veo 3.1 Text to Video

Create sound-aware videos from text with Veo 3.1

Veo 3.1 is a practical text-to-video model for short cinematic shots, product clips, story beats, and social video drafts. This Studio page exposes Lite, Fast, and Quality modes with 720p, 1080p, 4k, aspect-ratio, and audio controls.

Google's Gemini API documentation lists Veo 3.1 for text-to-video, image-to-video, and frame-based video generation workflows.
Google's launch notes highlight stronger audio, narrative control, realism, and prompt adherence in Veo 3.1.
This Studio route currently uses an 8-second default with auto, 16:9, and 9:16 aspect-ratio controls.

Where Veo 3.1 Fits

Ad concept drafts

Turn product positioning, camera movement, and mood into a watchable clip before committing to a shoot.

Social video openers

Generate horizontal or vertical opening shots, launch teasers, and visual hooks for short-form content.

Storyboarding

Test a character action, environment shift, or camera rhythm before building a longer sequence.

Product demos

Place a product in a realistic moving scene for pitches, internal reviews, and early creative direction.

What This Veo 3.1 Page Supports

Veo 3.1 works best when the prompt is specific about motion, framing, lighting, and sound. The controls on this page keep those choices visible before you spend credits.

Lite, Fast, and Quality versions for balancing cost, speed, and output fidelity.

720p, 1080p, and 4k output controls with the current credit estimate shown in the form.

Auto, 16:9, and 9:16 aspect ratios for cinematic and mobile-first videos.

Audio-enabled generation controls for prompts that describe ambience, music, speech, or scene sound.

Prompt Examples You Can Adapt

Product shot

A cinematic 8-second product video of a matte black wireless speaker on a rain-soaked rooftop at night, slow push-in camera, blue neon reflections, subtle bass vibration, realistic sound design.

Travel scene

A handheld travel video of a narrow Kyoto alley after sunset, warm lanterns, light rain, pedestrians passing naturally, slow forward camera movement, ambient street sound.

Story moment

A young astronaut discovers a glowing plant inside a quiet greenhouse on Mars, gentle camera orbit, dust floating in the light, soft emotional music, realistic cinematic lighting.

Veo 3.1 FAQ

Is Veo 3.1 best for long scripts?
It works best when you describe one clear shot: subject, action, setting, camera movement, lighting, and sound. For longer stories, split the scene into separate shots.
Should I choose Lite, Fast, or Quality?
Use Lite or Fast while exploring ideas. Move to Quality once the prompt and visual direction are stable enough to justify the higher cost.
Can this Veo 3.1 page generate audio?
Yes. The current Studio integration includes audio controls for Veo 3.1, so prompts can describe ambience, music, speech style, or other scene sounds.
https://geminiomni.app/studio/text-to-video/veo-3-1