Overview
Category maturity: Emerging. Multiple tools have shipped stable v0.x releases and are in active production use among early-adopter engineering teams. The category has not yet converged on a single dominant architecture, and governance tooling is absent at the enterprise layer.
Direction of travel: The next 6-12 months will see the category bifurcate:
- Bundled orchestrators (Claude Code Agent Teams, Grok Build Arena Mode) will absorb the mainstream adoption curve as they simplify the entry point
- Standalone orchestrators will compete on depth of control, runtime flexibility, and cost optimization
- Governance tooling, currently absent across the board, will become the primary differentiation vector for enterprise-grade contenders within that window
Coalesced patterns: The industry has settled on several reliable foundations:
- Isolated git worktrees per agent is the near-universal approach to preventing file conflicts, adopted by Claude Code Agent Teams, Overstory, Multiclaude, Vibe Kanban, and oh-my-claudecode; the practical ceiling is 5-10 parallel agents before merging becomes the bottleneck
- Supervisor-worker hierarchy is the dominant coordination model: one orchestrator agent decomposes the task and assigns subtasks to worker agents; tools have built rich role vocabularies around this, from Gas Town's Mayor/Polecat/Refinery to Overstory's four-tier Orchestrator-to-Worker stack
- Shared task boards with pull-based work claiming, where agents self-assign the highest-priority unblocked task rather than waiting for the supervisor, appear across Claude Code Agent Teams, oh-my-claudecode's Swarm mode, and Gas Town's Beads system
- Mailbox-based inter-agent communication, typically file- or SQLite-backed inboxes, enables peer-to-peer coordination between sibling agents without routing every message through the supervisor
- Builder-Validator separation, where a read-only reviewer agent with an independent context window verifies output it did not produce, is the primary mechanism for catching agent blind spots; the effective ratio is one reviewer per three to four builders
- Git-backed state persistence, pioneered by Gas Town's Beads system and adopted by Goosetown, is becoming the standard approach to crash recovery
- CI-pass as merge gate, while insufficient for enterprise governance, provides a reliable shared contract that most tools now support
Unsolved problems: Several gaps remain unaddressed across the category:
- The governance gap is the most acute risk: agent-level audit trails, policy controls on file access, and rollback triggers are unaddressed across all tools. Until tooling catches up, implement compensating controls: mandatory human review of all agent PRs (Multiclaude's multiplayer mode), branch protection rules scoped to agent-submitted branches, and a designated engineering owner for agent workflow monitoring
- The skill floor is a real adoption risk: Steve Yegge, a well-known software engineer and creator of Beads and Gas Town, explicitly warns that teams not already running multiple parallel single-agent sessions daily will find orchestration counterproductive rather than additive; most engineering teams have not reached this baseline
- Compounding error rates across agents are not well-characterized; the primary documented mitigation is strong test coverage, which many growth-stage codebases lack; without it, orchestration accelerates the accumulation of technical debt rather than reducing it
- Cost amplification at scale is a budget planning requirement, not a theoretical concern: a documented Gas Town session cost ~$100 in API credits in a single hour, roughly 10x the cost of a normal Claude Code session; Anthropic's own data puts average single-developer spend at $100-200 per month, meaning continuous multi-agent orchestration across a team requires explicit budget modeling before deployment
- Debugging multi-agent workflows is significantly harder than single-agent debugging, with no dedicated tooling in the category today; this raises the cost of incidents and slows the engineering cycles that growth-stage companies depend on
- The merge wall is the hardest unsolved technical problem: when five or more agents generate branches against the same repository, conflict resolution strategies range from Overstory's four-tier automated pipeline to Multiclaude's philosophy of simply merging everything that passes CI
Recommendations
1. Verify single-agent fluency before piloting orchestration. Claude Code Agent Teams, Gas Town, and Multiclaude have each resolved their most critical production bugs and are in real use among developers who were already running multiple single-agent sessions daily. The right question is not "is this ready?" but "does our team already run single Claude Code sessions fluently?" If yes, a two-agent pilot is a reasonable next step. If not, single-agent fluency is the prerequisite.
2. Budget for cost amplification upfront. The documented floor for heavy multi-agent use is approximately $100/hour in API credits across 12-30 agents. Most teams will not run at that ceiling continuously; targeted use for large refactors, migration work, or parallel feature development is the practical pattern. The relevant planning question is: what is the hourly cost of a two-to-three agent session on a specific, bounded task? Tools with intelligent model routing (oh-my-claudecode) can reduce this materially by assigning routine subtasks to cheaper models.
3. Ensure CI coverage catches regressions before deploying orchestration. Every tool in the category that supports autonomous merging (Multiclaude, Claude Code Agent Teams) uses CI-pass as the primary merge gate. If test coverage is below the point where CI reliably catches regressions, address that gap first. Running multiple agents against an undertested codebase multiplies the risk of undetected regressions, not just the throughput.
4. Pilot oh-my-claudecode for Claude Code teams. Its installation path via the Claude Code plugin marketplace, intelligent model routing for cost control, auto-resume on rate limits, and active development velocity make it the lowest-friction entry point for teams already on Claude Code who want to go beyond single-agent workflows. Validate the performance claims against your own workload before relying on the self-reported benchmarks.
Trends and Strategic Signals
1. Bundled orchestration is graduating from experimental to default. The most consequential signal this week is the arc of Claude Code Agent Teams: shipped as an opt-in experimental feature on February 6, 2026, stabilized through critical patches in early March, and now described by practitioners as "the new default" rather than an advanced configuration. The March v2.1.69 fixes addressed runaway agent spawning and memory leaks that had been the primary blockers for production use. Teams that held off pending stability now have a materially lower-risk entry point.
2. The Gas Town ecosystem is expanding from a tool to a protocol. Steve Yegge's March 2026 "Wasteland" release extends Gas Town from a local multi-agent workspace manager into a distributed trust network connecting multiple Gas Town deployments. The Wasteland introduces a shared work queue, contributor reputation ledger, and passbook system modeled on Git's fork/pull model. This is a significant architectural step: the pattern is evolving toward federated orchestration at the organization level, not just within a single developer's machine. Whether this gains meaningful adoption beyond early enthusiasts is unverified, but the directional signal is clear.
3. oh-my-claudecode is the fastest-growing community project in the category. With v4.9.2 shipping in late March and a peak velocity of 858 GitHub stars in 24 hours, oh-my-claudecode has established itself as the highest-traction community-built orchestrator for Claude Code. The March feature set added voice mode, a 1M token context path, MCP Elicitation support, and auto-resume on rate limit hits. These are production-facing features, not demos. The project's claim of 3-5x speedup and 30-50% token savings through intelligent model routing remains self-reported and unverified by independent benchmarks, but the star velocity indicates meaningful developer attention.
4. Cost amplification remains the governing constraint. Across every tool that has reached real-world use, the cost signal is consistent: running 12 to 30 parallel agents burns roughly $100 per hour in API credits under heavy use, per Gas Town early adopters. This is not a solvable-through-optimization problem at current model pricing; it is a ceiling that governs which teams can run orchestration continuously versus in targeted bursts. Tools that invest in intelligent model routing (oh-my-claudecode's routing to Haiku/Sonnet vs. Opus by task type) are directly addressing this constraint and represent the current frontier of practical cost governance.
5. Governance and merge strategy tooling is still the weakest link. The tools that have stabilized their orchestration layer (Claude Code Agent Teams, Gas Town, Multiclaude) have each converged on CI-pass as the primary merge gate. This is a workable threshold for teams with strong test coverage, but it is not a governance posture. No tool in the category currently provides an audit trail of which agent produced which commit, automated rollback triggers, or policy-level controls on what agents are permitted to modify. This gap is the primary reason the category is not yet enterprise-ready at the level PE-owned SaaS companies will eventually require.
Tools
Claude Code Agent Teams
- Maker: Anthropic
- Archetype: Bundled orchestrator
- Works with: Claude Code (Claude models exclusively; Opus 4.6 and Sonnet variants)
- Architecture pattern: Supervisor-worker hierarchy; one lead agent decomposes tasks and coordinates multiple teammate agents, each in its own context window and git worktree. Teammates can communicate directly with each other. Referred to in practice as "swarm mode."
- State persistence: Context-window scoped per session; no native cross-crash persistence beyond what the lead agent's conversation retains. Beads-style persistence is not built in; teams combine with Git branch hygiene for recovery.
- Access: Included with Claude Code subscription; enabled via
CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMSin settings.json or environment. No separate pricing tier documented. - Strengths:
- Zero additional tooling required; teams already using Claude Code gain orchestration with a config flag change
- March 2026 stability patches resolved the two highest-impact production bugs (runaway spawning and memory leak), materially reducing operational risk
- Native integration with Anthropic's model stack means teammate coordination benefits from shared context formats and tool definitions
- Strong for research-and-build workflows: parallel teammates can each own a layer (frontend, backend, tests) with the lead synthesizing
- Limitations:
- Works exclusively within the Claude model family; teams using mixed-model workflows will need a standalone orchestrator for cross-model coordination
- The experimental flag and absence of SLA-level support documentation means teams should treat this as a production-capable beta, not a supported enterprise feature
- Cost amplification is proportional to teammate count; teams should plan budget before scaling beyond two to three teammates
- Enterprise readiness: Developing. Stability has improved materially post-March patches; governance controls and audit trail tooling are absent. Cost predictability improves with teammate count caps, but policy-level enforcement is manual.
- Best for: Engineering teams already running Claude Code daily who want to parallelize well-decomposed tasks without adopting a new tool. Strong fit for teams with good test coverage that can use CI-pass as a merge gate.
- This week: March v2.1.69 fixes for runaway spawning and memory leak are the key update. Practitioner commentary now describes Agent Teams as production-ready rather than experimental.
GRGrok Build (Arena Mode)
- Maker: xAI
- Archetype: Bundled orchestrator
- Works with: Grok Code 1 Fast and Grok 4 Fast (xAI models)
- Architecture pattern: Parallel agent execution with automated competitive ranking; up to 8 agents run simultaneously (up to 4 per model), outputs displayed side-by-side, Arena Mode ranks outputs algorithmically before human review. Parallel Agents (without Arena ranking) is the currently available subset.
- State persistence: Not publicly documented; session-scoped based on available information.
- Access: Waitlisted as of late March 2026; Parallel Agents feature available to waitlist users. Arena Mode not yet publicly released.
- Strengths:
- The competitive ranking layer in Arena Mode represents the most differentiated evaluation architecture in the category: agents compete and outputs are scored before the developer sees them, reducing review burden
- Running up to 8 simultaneous agents on a task provides throughput ceiling that exceeds every other tool in the category at the interface level
- Built directly into the Grok Build IDE surface, eliminating context-switching for teams already on xAI's platform
- Limitations:
- Access remains behind a waitlist; teams cannot adopt today. Arena Mode specifically is not yet deployed publicly.
- Exclusive to xAI's model stack; teams with existing investment in Claude or Gemini workflows face a platform switch to evaluate
- Eight-agent parallel runs carry the highest compute cost per session in the category; xAI has cited infrastructure scaling challenges as the primary barrier to broader availability
- Enterprise readiness: Early. Public availability is not confirmed; governance, audit trail, and cost controls are undocumented. Evaluation appropriate only for teams with xAI platform access.
- Best for: Engineering teams already in the xAI/Grok Build ecosystem who want to pilot competitive multi-agent evaluation. Not yet suitable for production adoption by teams outside the existing waitlist.
- This week: No new public launch. Arena Mode remains in code traces; waitlist access for Parallel Agents continues. Infrastructure scaling delays reported as recently as February remain unresolved.
Gas Town
- Maker: Steve Yegge (independent; open-source at gastownhall/gastown)
- Archetype: Standalone orchestrator
- Works with: Claude Code (primary); designed to coordinate 20 to 30 instances in parallel on the same codebase
- Architecture pattern: Git-backed workspace manager using a "molecules" model: work is expressed as chained sequences of small tasks ("Beads"), each with explicit acceptance criteria, stored in Git. Agents pick up molecules, execute steps, and the state persists through crashes. Nondeterministic idempotence: the path to completion is flexible but the acceptance criteria are fixed, so different agent runs converge on valid outcomes.
- State persistence: Beads (Git-backed issue tracker): agent identities, work assignments, and orchestration state all live in Git. If an agent crashes mid-step, the next session picks up where it left off via the persistent molecule definition.
- Access: Open-source; free to run. API costs (Claude Code credits) are the primary operating expense.
- Strengths:
- The most battle-tested standalone orchestrator for large-scale parallel Claude Code sessions; Yegge runs 20-30 agents daily on real codebases
- Beads state persistence is the most mature crash-recovery mechanism in the category; work survives agent failures without manual intervention
- The March 2026 Wasteland extension enables federated multi-organization work queues, pointing toward a future where Gas Town instances can share workloads across teams
- Credible production use signal: the tool's creator documents real workflows, real costs, and real failure modes publicly
- Limitations:
- The skill floor is explicitly high: Yegge warns that developers not already running multiple parallel CLI agents daily will find it counterproductive. This is a tool for practitioners, not onboarding vehicles.
- API cost at $100/hour under heavy use is a real budget constraint; teams should run cost-bounded pilots before committing to continuous orchestration
- The Deacon patrol agent bug in v0.3.x (terminating worker agents) illustrates that stability at this scale requires active bug tracking; teams should pin to v0.4.0+ once released
- Enterprise readiness: Developing. Crash recovery and state persistence are the category's most mature. Governance controls, access management, and audit trail are absent. Cost predictability requires operational discipline on molecule scope and agent count.
- Best for: Senior engineering teams already comfortable running multiple Claude Code sessions in parallel who want a structured, crash-resilient orchestration layer with Git-native state. Strong fit for long-running, large-scope refactoring or migration work.
- This week: "Wasteland" release (March 4, 2026) extends Gas Town to a federated multi-organization work network. Bug fix for Deacon patrol agent killing workers committed to v0.4.0. Both are material updates.
Multiclaude
- Maker: Dan Lorenc (independent; open-source at dlorenc/multiclaude)
- Archetype: Standalone orchestrator
- Works with: Claude Code
- Architecture pattern: Supervisor-subagent model; a supervisor agent assigns tasks to autonomous Claude Code subagents. Each subagent gets its own tmux window and git worktree. Singleplayer mode merges all CI-passing PRs automatically; multiplayer mode holds PRs for human or teammate review.
- State persistence: Git worktrees and PR branches; work persists through the branch even if a session crashes. No Beads-style issue tracking layer.
- Access: Open-source; free to run.
- Strengths:
- The singleplayer/multiplayer mode distinction provides a practical governance dial: teams can choose fully autonomous merging (high throughput, CI as sole gate) or human-in-the-loop review without changing the orchestration architecture
- Git PR-based coordination means all work is reviewable, auditable, and reversible through standard development tooling
- The Brownian ratchet philosophy (merge anything that passes CI, tolerate occasional duplication, keep moving) is a genuine throughput optimization for teams with strong test coverage
- Limitations:
- The automatic-merge singleplayer mode requires strong CI test coverage as a prerequisite; teams with coverage gaps should use multiplayer mode or a different tool until coverage improves
- Exclusive to Claude Code; teams seeking model-agnostic orchestration will need Overstory or a similar runtime-flexible tool
- No signals of active new development in early 2026; the tool may be in maintenance mode relative to the faster-moving projects in the category
- Enterprise readiness: Early. Git PR trail provides meaningful auditability. Governance controls beyond CI-pass are absent. Suitable for teams with mature CI pipelines and a tolerance for eventual-consistency merge behavior.
- Best for: Teams running Claude Code on codebases with strong automated test coverage who want a low-overhead autonomous coding loop and are comfortable with CI-pass as the primary quality gate.
- This week: No new signals this week.
Overstory
- Maker: jayminwest (independent; open-source at jayminwest/overstory)
- Archetype: Standalone orchestrator
- Works with: Claude Code, Pi, Gemini CLI, Aider, Goose, Amp, and 5 additional runtime adapters via pluggable AgentRuntime interface (11 runtimes total)
- Architecture pattern: CLI tool that spawns worker agents in isolated git worktrees via tmux sessions. SQLite-backed inter-agent mail system handles coordination. FIFO merge queue with 4-tier conflict resolution handles merging. Three-tier health monitoring: Tier 0 (mechanical tmux/PID liveness), Tier 1 (AI-assisted failure triage), Tier 2 (monitor agent for fleet patrol).
- State persistence: SQLite databases on local filesystem for agent state and inter-agent messages; git worktrees for code isolation. An
ov updatecommand refreshes managed files without requiring full re-initialization. - Access: Open-source; free to run. Now at v0.9.3.
- Strengths:
- The only tool in the category with documented support for 11 distinct agent runtimes; teams with multi-model workflows can orchestrate Claude Code, Gemini CLI, Aider, and Goose in the same session
- Three-tier health monitoring is the most sophisticated agent fleet management in the category; Tier 1 AI-assisted failure triage reduces manual intervention when agents fail
- The 4-tier conflict resolution system addresses the merge strategy gap that most orchestrators leave to CI alone
- Active release cadence (v0.9.3 with meaningful feature additions in 2026) signals healthy project momentum
- Limitations:
- Complexity scales with capability: 11 runtimes, 3-tier health monitoring, and a custom SQLite mail system represent a significant operational surface area compared to simpler tools. Teams should plan for setup and tuning time.
- The project's own documentation explicitly warns that agent swarms are not a universal solution and that compounding error rates, cost amplification, and merge conflicts are the normal case, not edge cases
- Local filesystem SQLite backend means state does not travel across machines; not suited to distributed team workflows without additional infrastructure
- Enterprise readiness: Developing. The 4-tier conflict resolution and 3-tier health monitoring provide more operational depth than most alternatives. Governance controls and audit trail tooling remain absent. Best suited to single-machine, single-team deployments today.
- Best for: Engineering teams who want runtime flexibility (the ability to mix Claude Code with Aider, Gemini CLI, or Goose in the same orchestrated workflow) and are willing to invest setup time for a more operationally sophisticated tool.
- This week: Active development signals:
ov updatecommand, coordinator communication CLI tools, SQLite lock contention crash fix, reviewer-coverage doctor check. Meaningful incremental improvements.
oh-my-claudecode
- Maker: Yeachan Heo (independent; open-source at Yeachan-Heo/oh-my-claudecode)
- Archetype: Standalone orchestrator (Claude Code plugin)
- Works with: Claude Code (exclusively; installs as a Claude Code plugin)
- Architecture pattern: Teams-first pipeline with 32 specialized agents organized by role (architect, developer, tester, reviewer, and more). 5 execution modes including fully autonomous end-to-end execution. Intelligent model routing assigns tasks to Claude Haiku, Sonnet, or Opus based on complexity, reducing cost per task. Auto-resume on rate limit events continues sessions without manual intervention. From v4.1.7, Team is the canonical orchestration surface.
- State persistence: Auto-resume functionality restores state after rate limit interruptions; project memory feature (v4.9.2) adds persistent context across sessions.
- Access: Open-source; installs via Claude Code plugin marketplace. Free to run (API costs apply).
- Strengths:
- Highest-traction community project in the category with 858 GitHub stars in 24 hours and ongoing active development at v4.9.2
- Intelligent model routing is the most developed cost-optimization approach in the category; routing routine tasks to Haiku and complex architecture work to Opus provides a structural path to managing API spend
- The self-reported 3-5x speedup and 30-50% token savings via routing have not been independently verified, but the underlying routing architecture is sound and the adoption signal is real
- March 2026 feature additions (voice mode, 1M token context, /effort command, MCP Elicitation) represent a production-focused roadmap, not a demo-only feature set
- Limitations:
- Performance claims (3-5x speedup, 30-50% token savings) are self-reported; independent benchmarks against baseline Claude Code are not yet available. Teams should validate against their own workloads before committing.
- Exclusive to Claude Code; teams seeking model-agnostic orchestration will need a different tool
- At v4.9.2 with 32 agents and 5 execution modes, the tool's configurability can be an onboarding barrier; the zero-config default path is the recommended starting point for new teams
- Enterprise readiness: Developing. Active development and a production-facing roadmap signal trajectory. Governance controls, audit trail, and enterprise access management are absent. The plugin install model makes enterprise IT governance straightforward relative to self-hosted alternatives.
- Best for: Engineering teams heavily invested in Claude Code who want a zero-config orchestration layer with built-in cost optimization. Particularly strong for teams running CI-heavy workflows where the intelligent routing can reduce token spend on routine validation tasks.
- This week: v4.9.2 shipped with security hardening, project memory, Korean i18n, and 50+ fixes. Strong momentum signal with the March feature additions (voice mode, 1M token context, /loop, /effort, MCP Elicitation). The most active development velocity in the category this week.
Goosetown
- Maker: Block (open-source at block/goosetown)
- Archetype: Standalone orchestrator
- Works with: Goose (Block's open-source AI agent; model-agnostic by design)
- Architecture pattern: Orchestrator-delegate model; a main Orchestrator agent phases work into research, build, and review, then spawns parallel Delegate agents for execution. Shared Town Wall (append-only log) serves as inter-agent communication. Delegates are ephemeral; they execute work in clean context windows and return summaries to the Orchestrator, avoiding context cliff degradation on long-running sessions. Skills (Markdown files) pre-load each Delegate with role-specific instructions.
- State persistence: Beads (Git-backed issue tracker, shared with Gas Town's pattern): Orchestrator creates issues, Delegates update them, and the next agent picks up a failed Bead and continues. Work survives crashes at the issue level.
- Access: Open-source; free to run. Runs on top of Goose, which is model-agnostic.
- Strengths:
- The model-agnostic foundation via Goose means teams can orchestrate any supported LLM backend without being locked to Anthropic or xAI infrastructure
- Ephemeral Delegate pattern directly addresses context cliff degradation: long-running orchestration stays coherent because each Delegate operates in a fresh context window with a targeted skill
- Beads state persistence provides crash recovery at the task level, consistent with Gas Town's proven approach
- Block's engineering team backing provides organizational continuity relative to solo-maintainer projects
- Limitations:
- The Goose prerequisite adds a dependency layer; teams not already using Goose must evaluate both tools simultaneously. Goose adoption is primarily among open-source and developer-community audiences today.
- Described as "research-first parallel work" in Block's own documentation, positioning it as a deliberate choice for exploratory tasks rather than production code delivery pipelines
- No new release signals in the March-April window; development velocity appears lower than oh-my-claudecode and Overstory this cycle
- Enterprise readiness: Early. Model-agnostic architecture and Beads persistence are strong foundations. Block's organizational backing provides some stability signal. Governance tooling is absent. Best suited to teams already invested in the Goose ecosystem.
- Best for: Engineering teams using Goose as their primary coding agent who want a structured multi-agent layer without platform lock-in to Anthropic or xAI model stacks.
- This week: No new signals this week. Last substantive coverage was the February 19, 2026 architecture post from Block.
Adoption and Traction
-
oh-my-claudecode: v4.9.2 shipped late March with security hardening, project memory, Korean i18n, and 50+ fixes. GitHub peak velocity of 858 stars in 24 hours; now at v4.9.2 with 32 specialized agents. (Source)
-
Claude Code Agent Teams: March v2.1.69 patch resolved runaway teammate spawning (teammates accidentally spawning additional teammates) and a memory leak caused by pinning full conversation history per teammate lifetime. These were the two most-reported production blockers post-February launch. (Source)
-
Gas Town: Steve Yegge published the "Wasteland" sequel post on March 4, 2026, extending the Gas Town model to a federated multi-organization work network at gastownhall.ai. A separate X post flagged a bug in v0.3.x where the "Deacon" patrol agent was terminating worker agents, with a fix committed to v0.4.0. (Source)
-
Overstory: Active release cadence continuing in 2026, with recent additions including an
ov updatecommand for refreshing managed files, coordinator communication CLI tools (send, ask, output), dashboard crash fixes on SQLite lock contention, and a reviewer-coverage doctor check. Now at v0.9.3. (Source) -
Grok Build (Arena Mode): Still on waitlist as of late March 2026; Arena Mode remains in code traces only and is not publicly available. No new public launch signals this week. (Source)
-
Goosetown: No new release signals in the March-April window beyond the February 2026 architecture blog post from Block. (Source)
-
Multiclaude: No new signals this week. Last meaningful coverage from January 2026 introduction post by Dan Lorenc. (Source)
New Entrants & Watch List
Vibe Kanban is a cross-platform Kanban board interface for multi-agent coding orchestration that surfaced in the past month and warrants a watch. Maker: independent (vibekanban.com). Archetype: standalone orchestrator interface layer. It supports Claude Code, Codex, Gemini CLI, Amp, and Cursor Agent CLI with git worktrees and branch isolation per task card. The "doomscrolling gap" framing (filling the 2-5 minute idle window while an agent works) and in-board diff review are differentiated UX choices. Free and BYOK. It is not yet on the core coverage list because it functions more as a management interface than a coordination engine; it does not implement supervisor-worker logic or inter-agent communication. Watching for capability additions that cross the orchestration threshold.
OpenClaw + Antfarm is a batteries-included agent team framework with planner, developer, verifier, tester, and reviewer roles running in a deterministic YAML-and-SQLite-and-crons loop. One-command install. Maker: Enderfga (open-source at Enderfga/openclaw-claude-code). Archetype: standalone orchestrator. The "Ralph loop" pattern (plan, build, verify, ship while you sleep) is a well-defined workflow, and the deterministic implementation is a deliberate contrast to the nondeterministic approaches in Gas Town and Multiclaude. Watching for production use reports; the project is new enough that reliability at scale is unverified.
Claude Squad is a terminal app that manages multiple Claude Code, Codex, and Aider instances in separate workspaces via tmux. Zero-setup, free. The tool is in the category scope and warrants a watch; it will be considered for the core coverage list once production use signals are available.