What are the three depth layers needed in an over-the-shoulder prompt?

An OTS prompt should name a soft foreground shoulder, a sharp subject at eye level, and a lit background. These layers create the depth separation that distinguishes an OTS from a flat two-shot.

Why should you name a light source instead of using a lighting adjective?

Specifying a named light source like warm yellow from a practical lamp produces more accurate results than generic terms like warm lighting. This precision was documented in production lighting passes as a measurable improvement.

What lens range makes an OTS shot feel cinematic?

A 50–85mm lens feel compresses space the way real dialogue coverage is shot. Including this range in your prompt helps the AI replicate the natural perspective of live-action OTS framing.

How do you keep both characters consistent in an OTS prompt?

Attach multi-angle character reference sheets to every OTS prompt, including close-up panels so details like scars and accessories survive the angle change. Keeping sheets loaded in the agent's context lets them ride along automatically with every prompt.

What should a negative prompt include for over-the-shoulder shots?

Exclude warped shoulder anatomy, extra figures, and flat even lighting. These exclusions prevent the most common drift issues that break the realism of OTS compositions.

Writing AI Prompts for Natural Over-the-Shoulder Shots

A natural over-the-shoulder prompt names three depth layers — a soft foreground shoulder, a sharp subject at eye level, a lit background — plus a 50–85mm lens feel and a named light source, assembled in a fixed element order and ending with a negative prompt that excludes anatomy drift, extra figures, and flat lighting.

Assemble the prompt in a fixed element order and describe the depth layers explicitly — one documented production held a 9-element assembly order (camera spec, lens and aspect ratio, lighting source, palette, composition, atmosphere, mood register, film attribution, negative prompt) across every frame of the film. A labeled template you can adapt:

"Over-the-shoulder shot from behind [Character A] — [A]'s shoulder and back of head soft in the near foreground, [Character B] framed at eye level in sharp focus, shallow depth of field, 85mm portrait-lens feel, [named light source, e.g. warm practical lamp light from camera left], [palette and mood line], in your film's aspect ratio. Negative: no warped shoulder anatomy, no third character, no flat even lighting."

Each element does a specific realism job. The soft foreground shoulder is the depth cue that separates an OTS from a flat two-shot; eye-level framing reads as a listener's perspective; 50–85mm lens language compresses the space the way real dialogue coverage is shot. Name the light source instead of using an adjective — specifying "warm yellow from the lamps only, like all the refs" produces more accurate results than generic "warm lighting," a correction documented in one production's lighting passes. And keep the shot size medium or close: in an extreme wide the shoulder stops registering as a frame anchor, so the OTS composition collapses.

Attach character references to every OTS prompt, because the frame contains two characters and both must hold. Use multi-angle character sheets that include close-up panels so small details — scars, accessories — survive the angle change; one documented 70-second film kept two characters consistent across every scene this way, with no LoRA fine-tuning, by keeping the sheets loaded in the invideo agent's context so they ride along with every prompt automatically.

Once the OTS lands, you can build the matched reverse by changing only the camera line and keeping every other element fixed — coverage pairing is its own workflow, but the rule of thumb is one variable per re-prompt.

Before spending credits, run a quick naturalness check against the prompt itself: shoulder visible and soft in the foreground, subject sharp at eye level, light source named and matched to the scene's established lighting, lens feel in the 50–85mm range, shot size medium or closer, negative prompt excluding anatomy drift and extra figures.

Watch some of these to see what works for you:

How an AI agent cracked a stubborn over-the-shoulder shot

Full AI horror short workflow: OTS shots, reverse angles, lighting

AI-generated reverse angles with spatial consistency, no re-prompting

the AI always needs to see what the character is exactly, right? Or else it'll kind of hallucinate and imagine something that's under the cap. So, we don't want to do that. We always want the character to be seen as we see it on the character sheet.

— invideo's creative team

How do you write AI prompts for over-the-shoulder shots that look natural?

More on AI Filmmaking

How do you write AI prompts for over-the-shoulder shots that look natural?

Related questions

More on AI Filmmaking