A storm chaser stepping out of a truck as a supercell rotates on the horizon, slow push-in, wide anamorphic lens, golden-grey light, distant thunder rumbling in the audio.
Veo 3 Prompt Guide
Veo 3.1 is Google's flagship text-to-video model, known for cinematic realism, accurate physics, and native synced audio including dialogue and sound effects. The prompts below cover the seven use cases creators ship most on Kubricon — cinematic shots, product videos, short-form social, ads, dialogue with native audio, camera moves, and image-to-video. Copy any prompt into Kubricon, generate with Veo, and finish in the editor.
Subject + Action + Camera language + Setting + Lighting + Audio (dialogue/SFX) + Style
Use this skeleton as a starting point. Substitute your subject and setting, keep one idea per clause, and lead with the most important visual so the model anchors on it before adding camera and lighting detail.
Cinematic shots
5 promptsA lighthouse keeper climbing a spiral staircase at dawn, camera tracking upward beside him, soft window light, footsteps echoing on metal in native audio.
A vintage train pulling into a foggy mountain station, slow side dolly, 35mm film grain, brakes hissing and bell ringing in synced audio.
A chef plating a dish under warm restaurant light, overhead slow descent, shallow depth of field, the clink of cutlery and low ambient chatter.
A surfer paddling out as a large wave forms behind them, water-level tracking shot, overcast cinematic grade, crashing surf in native audio.
Product videos
5 promptsA matte ceramic coffee mug steaming on a wooden table, slow orbit, soft morning window light, faint pour and steam hiss in native audio.
A mechanical watch rotating on a black plinth, extreme macro, single key light raking across the brushed steel, subtle ticking in synced audio.
A pair of headphones unfolding in mid-air against a deep blue gradient, slow dolly-in, clean studio light, a soft mechanical click as they lock.
A glass perfume bottle catching a sweep of light on a marble surface, macro lens, pastel backdrop, a gentle spritz sound in native audio.
A running shoe landing on wet pavement in slow motion, low angle, neon reflections, the splash and impact in synced audio.
TikTok / Reels / YouTube Shorts
5 promptsA barista pouring latte art top-down in a bright kitchen, vertical 9:16, slow motion at the swirl, milk pour audible in native audio, energetic morning mood.
A creator opening a laptop and a glowing dashboard animating out of the screen, vertical 9:16, fast 1.5s cut rhythm, soft UI chimes.
A skateboarder landing a trick in golden hour, vertical follow shot, freeze at the peak, board clack and crowd reaction in synced audio.
Hands assembling a smoothie bowl in fast cuts, vertical top-down, each ingredient dropping in slow motion with a satisfying thud in audio.
A runner lacing up at sunrise then sprinting out of frame, vertical 9:16, handheld energy, breathing and footsteps in native audio.
AI video ads
5 promptsA 6-second hook ad: a frustrated person stares at a cluttered desk, then a clean product slides in and the desk reorganizes itself, bright flat lighting, upbeat audio sting.
A split-screen ad: left side dull grey routine, right side the same task done effortlessly with the product, 15-second pacing, confident voiceover cue.
A UGC-style talking head holding a skincare bottle in a sunny kitchen, slightly off-center framing, natural light, authentic spoken line in native audio, vertical 9:16.
A SaaS dashboard lifting off a phone into a floating 3D workspace, parallax camera, deep purple gradient, futuristic whoosh in synced audio.
A 15-second demo: customer scans a QR code on packaging and AR specs lift off the box, bright retail aisle, soft confirmation beep in audio.
Dialogue & native audio
5 promptsTwo friends laughing at a cafe table, one says 'You actually did it?', natural lip-sync, warm light, ambient cafe noise in native audio, eye-level medium shot.
A coach in a gym leaning toward the camera saying 'One more set — you've got this', determined tone, synced lip movement, weights clinking in the background.
A grandmother in a kitchen saying 'The secret is patience', soft smile, warm tungsten light, a gentle simmer on the stove in native audio.
A street vendor calling out 'Fresh, hot, right here!' in a busy market, lively ambient crowd, handheld camera, natural lip-sync.
A scientist in a lab calmly stating 'The results are conclusive', sterile white light, quiet hum of equipment in synced audio, locked medium close-up.
Camera moves
5 promptsA slow crane shot rising from a child's sandcastle to reveal a wide empty beach at sunset, smooth vertical move over 6 seconds, gentle surf in audio.
A whip-pan from one character to another across a dinner table, fast 0.4s move, motion blur, clinking glasses in synced audio.
A dolly-zoom on a detective realizing a clue, subject stays the same size as the room compresses, tense low strings in native audio.
A 360-degree orbit around a couple dancing in an empty ballroom, constant speed, warm chandelier light, soft piano in synced audio.
A handheld push through a crowded festival toward a stage, natural sway, layered crowd noise and distant music in native audio.
Image-to-video
5 promptsFrom a product still: slow 180-degree orbit, studio lighting unchanged, no other motion, a soft ambient room tone in audio.
From a portrait still: gentle breathing and a slow blink over 4 seconds, faint room ambience in native audio, no head movement.
From a landscape still: add drifting clouds and faint wind in the grass, 6 seconds, distant bird calls in synced audio.
From a city-night still: blinking distant windows and faint traffic streaks, 6 seconds, low urban hum in audio.
From a character still: slow head turn toward camera over 3 seconds, hold, a subtle breath audible in native audio.
Frequently asked questions
What makes a Veo 3 prompt effective?
Veo rewards specific camera language, clear lighting cues, and explicit audio direction. Because Veo generates native synced audio, naming the dialogue line or sound effect you want improves results noticeably.
Does Veo 3.1 generate sound and dialogue?
Yes. Native synced audio — including spoken dialogue, ambience, and sound effects — is one of Veo's strongest features. Describe the line and tone in the prompt for best lip-sync and timing.
When should I pick Veo over Sora or Kling?
Veo 3.1 is a strong default when you need realistic physics, cinematic grading, or synced dialogue. Sora 2 and Kling 3 are also excellent — test the model-specific guides and pick per shot inside Kubricon.
Can I run Veo in Kubricon?
Yes. Veo is one of the in-product video engines on Kubricon. Generated clips ship straight into the editor for captions, hooks, reframing, and short-form export.
How do I turn a Veo clip into a publish-ready short?
Bring the Veo generation into the Kubricon editor, add captions and a hook overlay, auto-reframe to 9:16, and export presets for TikTok, Reels, and YouTube Shorts.
Bring these prompts into the Kubricon editor
Copy any prompt, run it on the model you want, then add captions, hooks, beat-sync, and vertical reframing before exporting for TikTok, Reels, and YouTube Shorts.