Part 6 · Multi-Agent Loops

Harness Engineering · ~8 min

Orchestrator-Worker

One lead agent decomposes a task, fans it out to parallel workers, and synthesizes the results. The parallelism is the win — and the 15× token bill is the catch.

Why this, for you: Lesson 4 introduced the sub-agent as an isolation primitive. This lesson scales it into a topology: an orchestrator that partitions a problem across many workers at once. Done right it beats a single agent by a wide margin; done wrong it buys coordination cost with no quality return. The difference is decomposition and synthesis.

The orchestrator-worker pattern has two roles. The orchestrator receives the task, decomposes it into independent subtasks, dispatches each to a worker, and synthesizes the results. Workers each take a bounded subtask with their own tool set and return findings. The orchestrator never executes subtasks; workers never coordinate with each other.

1 When parallelism actually helps

Anthropic's multi-agent research system — Opus orchestrating Sonnet workers — outperformed single-agent Opus by 90.2% on complex research tasks. But a protocol-aligned study across ten benchmarks found most multi-agent configurations underperformed a single-agent baseline — only one of six tested workflows beat it.

Both results are true. Parallel dispatch pays off when subtasks are genuinely independent — multiple sources, different methodologies on one dataset, code review across separate modules with no shared state. It does not help when subtasks are sequentially dependent: that needs chaining, not fan-out. Adding workers to a task that doesn't decompose cleanly buys coordination cost and returns nothing.

2 Effort scaling and synthesis

Two orchestrator responsibilities separate the wins from the waste. First, match worker count to complexity — and put the rule in the prompt, not in code:

QueryAllocation
Simple1 agent, 3–10 tool calls
Moderate2–4 subagents, clearly divided responsibilities
Complex10+ subagents with partitioned search spaces

Hard-coding agent counts removes the flexibility to match scale to complexity. Second, synthesis is a reasoning step, not aggregation: the orchestrator evaluates each worker's reliability, finds conflicts and gaps, and produces a unified output drawing on the strongest elements. If it simply concatenates worker outputs, the pattern adds latency without improving quality.

# orchestrator prompt — decompose, scope, then synthesize spawn one worker per repo, tools scoped to that repo, run in parallel worker: audit deps, test coverage, secrets → structured JSON on return: rank by severity, resolve cross-repo conflicts # NOT: concatenate 50 raw outputs and call it a report

3 The token economics you're signing up for

Isolation has a price, and it's steep. Multi-agent orchestration multiplies token consumption ~15× over chat (vs. ~4× for a single agent), and token usage explains roughly 80% of performance variance across research tasks. The effort-scaling rules in the orchestrator's prompt are the primary cost control. And the orchestrator prompt is the highest-leverage component overall — Anthropic reports small changes to it can unpredictably shift subagent behavior, so test decomposition explicitly across a range of inputs.

The classic failure modes

Over-spawning: too many workers for a simple query — effort-scaling rules prevent it. Orchestrator as single point of failure: a misclassified decomposition routes every worker to the wrong subtask, and the orchestrator's own LLM call caps throughput. Synthesis context overflow: the orchestrator must hold the task plus every worker's results — beyond ~4 substantive outputs this routinely blows the context budget. Premature termination and source-quality drift round it out. The pattern is conditional, not automatic: the task value must justify the 15× bill.

↪ Your win: decompose, scope, synthesize — only when it pays

Retrieval practice — recall, don't peek

Question 1In orchestrator-worker, the workers…

Question 2Parallel dispatch pays off specifically when subtasks are…

Question 3Effort-scaling rules (worker count vs. complexity) should live in…

Question 4Multi-agent orchestration's token cost over chat is roughly…

Question 5 · spaced recall from Lesson 11The reasoning sandwich gives extra-high compute to…

Ask me anything. Want an orchestrator prompt with effort-scaling rules drafted for your task, or to work out whether your problem decomposes cleanly enough to justify the bill? Next in Part 6: Evaluator-Optimizer — a generator and a critic looping until a machine-checkable bar is met.
✎ Feedback