What is the difference between an orchestrator agent and a specialist agent in AI video production?
Last updated June 26, 2026
An orchestrator agent owns the whole film's context and routes work — script, shot list, character continuity, handoffs between roles. A specialist agent does one narrow job — storyboard, casting, costume, DOP, production design, edit — using only what the orchestrator hands it. In invideo, the orchestrator is the creative producer agent; specialists are sub-agents named for crew roles.
Start with the orchestrator. invideo is an agentic video creation tool, and the orchestrator role inside it is the creative producer agent — you load it once with the full script, shot breakdown, and character details, and it becomes the single source of truth every other agent reads from. Hridaye, invideo's creative director, describes the setup directly: "To really set up the context for the agent, I normally start off with the creative producer agent. That's where I'll give the script, or the shot breakdown, along with the characters. That's the main agent that sort of holds the understanding and the vision of the entire film." The orchestrator's job is workflow state: which scene is locked, which character sheet is current, which references attach to which prompt, what comes next, and what to retry when a model output misses. It does not generate the final asset itself — it decides which specialist gets the task and feeds them the right context.
Specialists are narrowly scoped sub-agents you spin up from the orchestrator's context, each named for the crew role it plays. A typical invideo crew runs roughly six to eight in parallel: a storyboard artist agent that visualizes shots before any direction is given, a casting agent that builds and iterates character sheets (often running the same prompt across two image models — Nano Banana and GPT-Image-2 or Recraft — to compare), a costume designer agent that takes mood-based briefs and returns multiple options, a production designer agent for sets and props, one or more DOP agents for cinematography (assign a separate DOP per scene when scenes need different visual sensibilities, or two DOPs to one scene when it's complex), and a director's assistant agent that sequences the shot breakdown before video execution begins. Each lives on its own project page so feedback to one doesn't contaminate another. The orchestrator routes between them.
The practical difference shows up in what each agent holds. The orchestrator holds the film. A specialist holds one job and one toolchain — the casting agent holds character sheets and image models; the DOP agent holds lens, lighting, and blocking language; the edit-stage agent (a maker-checker pass) holds the rough cut and the treatment to compare against. When a continuity error shows up in a finished shot, the orchestrator routes the fix to the casting agent to correct the character sheet at the source, not back to the video generator to re-roll. When a video model fails on a shot type (an over-the-shoulder, a POV), the orchestrator can hand the same brief to a different model — the invideo agent has Runway, Veo, Kling, and Seedance 2.0 all available, so model routing is internal, not a platform switch.
Keep the specialist count tight. Documented productions on invideo ran six to eight agents in parallel — a 2-minute brand promo used eight specialists across separate project pages and finished in three days; a multi-day short ran six per person across a four-person team. Three to five well-defined roles outperform a dozen vague ones, because every additional agent is another context boundary the orchestrator has to manage.
A few failure modes the orchestrator handles that specialists cannot: silent context drift (a specialist generating without the latest character sheet — fixed by the orchestrator re-attaching), hallucinated continuity (a costume detail that wasn't in the sheet — caught when the rough cut goes back to the orchestrator for a maker-checker pass), and model-specific weaknesses (one image or video model failing on a shot type — re-routed to another). Skipping the orchestrator's review pass on the assembled cut is the most common mistake in this workflow.
One rule of thumb: if you can't say in one sentence what a specialist is responsible for, it shouldn't be its own agent — fold the job back into an adjacent role and let the orchestrator route around it.
Watch some of these to see what works for you:
To really set up the context for the agent, I normally start off with the creative producer agent. That's where I'll give the script, or the shot breakdown, along with the characters. That's the main agent that sort of holds the understanding and the vision of the entire film.
— Hridaye, invideo's creative director