Claude Code agent teams explained

Lessons from building with the new agent teams feature in Claude Code

Feb 12, 2026

Hey 👋,

Claude Code just shipped a really interesting feature called Agent Teams. Instead of one agent doing everything, you can now run multiple Claude Code instances that work together as a team. Each agent has its own context window, and they can talk to each other directly (which is crazy).

Using AI agents to write code is standard now. Using one agent in a terminal or IDE feels natural - describe a task, it builds, you review. Straightforward loop.

Multiple agents talking to each other feels completely different. And, despite this being an early feature, it feels like looking into the future.

I made a video showing how to set this up with tmux so you can watch all the agents working together.

Subagents vs Agent Teams

You’re probably familiar with subagents in Claude Code. With subagents, each one has its own context window and results return to the parent. Communication is one direction. The parent manages everything.

With agent teams, each agent also has its own context window but they’re fully independent Claude Code sessions. They can message each other directly. There’s a shared task list for coordination. It’s best for long running work that needs discussion and iteration between agents.

The quick test is: if your agents would benefit from talking to each other, use a team. If they just need to return results, subagents are simpler and cheaper.

When To Use This

The pattern I keep coming back to is agents reviewing other agents’ work.

One agent writes code. A second agent reads the output and sends specific feedback. The first agent fixes the issue. The reviewer checks again.

With subagents, this feedback loop runs through you. The reviewer reports back, you read it, you paste it into a new prompt for the builder. You’re the middleman.

With agent teams, the reviewer sends notes directly to the builder. The builder fixes it. The reviewer checks again. Multiple rounds without you relaying anything.

That’s a small thing on paper. In practice, it changes what kind of work you can hand to agents. Single-pass generation - write this function, generate these tests - works fine with one agent. Multi-pass long running work - build something, review it, revise, review again — requires agents that can talk to each other.

C Compiler

To see where this goes, look at what Anthropic’s engineering team did. Nicholas Carlini ran sixteen Claude instances to build a C compiler from scratch in Rust. Two weeks, about two thousand sessions, just under twenty thousand dollars in tokens. The result: a hundred thousand lines of Rust that compiles the Linux kernel. Ninety-nine percent GCC torture test pass rate.

The interesting part isn’t the scale. It’s the change in the type of work agents can do.

Carlini’s key observation: “Claude will work autonomously to solve whatever problem I give it. So it’s important that the task verifier is nearly perfect.” The agents weren’t the bottleneck. The quality of the feedback loop was.

That’s the same pattern at a different scale. Builder agents produce code. Reviewer agents check it and send feedback. The builders act on it. The loop runs without a human relaying messages.

My Experience

I used agent teams to build an app with three Claude instances: a backend agent, a frontend agent, and a code reviewer. The reviewer watches both, checks that the API contract lines up, and sends issues to the agent that owns the code.

During the build, the reviewer caught bugs in the initial implementation and delegated back to the front end and back end agents to fix the issues.

That’s a closed-loop correction that happened without me. With a single agent, I’d have caught it during code review. With teams, the feedback loop ran on its own.

Tradeoffs

Three agents in parallel costs roughly three times as much. I burned through my rate limits making a video about this (and rarely have issues). That’s the honest trade-off.

The C compiler project used about twenty thousand dollars in tokens over two weeks. That’s sixteen agents running in loops. For most of us, the question isn’t “can I afford sixteen agents” — it’s “is the closed-loop feedback worth 3x the cost for this particular task?”

Where Is This Heading?

Right now, agent teams are experimental.

But the pattern - agents reviewing agents in a loop, self-correcting, picking up the next task when they’re done - that’s clearly the direction we’re going in. Systems of specialised agents working together on more complex tasks. The C compiler project showed it actually works at scale. Agent teams in Claude Code bring a version of it to your terminal.

I walked through the full build in my latest video here.

The AI Engineer

Ready for more?