512,000 Lines of Code. One Forgotten Build Flag.
On March 31, 2026, security researcher Chaofan Shou discovered that Anthropic's Claude Code had its entire source code sitting in plain sight on the npm registry. According to CyberSecurity News, a 59.8MB source map file shipped inside @anthropic-ai/claude-code version 2.1.88, exposing 1,906 files and over 512,000 lines of TypeScript.
The cause? Bun's bundler generates source maps by default. Someone forgot to turn it off.
Anthropic pulled the package and scrubbed old versions from npm, but the code was already mirrored across multiple GitHub repositories within hours. According to VentureBeat, this is the most detailed look anyone outside Anthropic has ever had at how a production AI coding agent actually works.
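To see why a shipped `.map` file is equivalent to shipping the source itself, look at what the source map format carries. The sketch below uses a made-up miniature map object, not the actual leaked file, but the fields are the real source map v3 fields: when a bundler embeds `sourcesContent`, the complete original files ride along with the minified bundle.

```typescript
// A source map is just JSON. This miniature example is hypothetical,
// but the fields match the source map v3 format bundlers emit.
interface SourceMap {
  version: number;
  sources: string[];          // original file paths
  sourcesContent?: string[];  // full original file text, if embedded
  mappings: string;           // VLQ-encoded position data
}

const exampleMap: SourceMap = {
  version: 3,
  sources: ["src/tools/readFile.ts", "src/query/engine.ts"],
  sourcesContent: [
    "export function readFile(path: string) { /* ... */ }",
    "export class QueryEngine { /* ... */ }",
  ],
  mappings: "AAAA;AACA",
};

// Recovering the original file tree is a one-liner per file.
function recoverSources(map: SourceMap): Map<string, string> {
  const out = new Map<string, string>();
  map.sources.forEach((path, i) => {
    out.set(path, map.sourcesContent?.[i] ?? "");
  });
  return out;
}

const recovered = recoverSources(exampleMap);
```

Scale that up to 1,906 files and 512,000 lines, and "a debug artifact" becomes "the repository."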
For companies deploying AI agent infrastructure, the leak is more interesting than alarming. What's inside validates architectural patterns that serious practitioners are already using.
What the Leak Actually Revealed
44 Feature Flags Behind a Kill Switch
According to The AI Corner, buried in the source code were 44 feature flags covering capabilities that are fully built but not yet shipped. They compile to false in the public build. Key unreleased features include:
| Feature Flag | What It Does |
|-------------|-------------|
| KAIROS / PROACTIVE | A persistent, always-running assistant that watches your environment and acts proactively without waiting for input |
| COORDINATOR_MODE | Multi-agent orchestration: one Claude instance spawns and manages multiple worker agents in parallel |
| ULTRAPLAN | 30-minute remote planning sessions that run in the cloud |
| VOICE_MODE | Voice interaction capabilities |
| BUDDY | A Tamagotchi-style AI companion with 18 species, rarity tiers, and five stats (Debugging, Patience, Chaos, Wisdom, Snark) |
The KAIROS and COORDINATOR_MODE flags are the ones worth paying attention to. They confirm Anthropic is building toward persistent, multi-agent systems that run autonomously. That's not a research direction. That's production code behind a flag.
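The leak reports don't show the actual flag mechanism, but a common pattern that matches the described behavior ("they compile to false in the public build") is a constant flag table driven by a build-time define, so the bundler's dead-code elimination strips the gated branches from release builds. A hypothetical sketch, with flag names mirroring those reported:

```typescript
// Hypothetical sketch of compile-time feature flags. In a real
// setup, INTERNAL_BUILD would be replaced by the bundler at build
// time, letting dead-code elimination remove gated code entirely.
const INTERNAL_BUILD = false;

const FLAGS = {
  KAIROS: INTERNAL_BUILD,
  COORDINATOR_MODE: INTERNAL_BUILD,
  ULTRAPLAN: INTERNAL_BUILD,
  VOICE_MODE: INTERNAL_BUILD,
} as const;

function availableFeatures(): string[] {
  return Object.entries(FLAGS)
    .filter(([, enabled]) => enabled)
    .map(([name]) => name);
}
```

The catch, as this incident shows, is that "compiled to false" is not "absent": the flag names and the code behind them still exist in the artifact unless the build actually eliminates them.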
A 40-Tool Orchestration Architecture
Claude Code isn't a single prompt that writes code. It's a plugin-like tool architecture with approximately 40 discrete tools, each permission-gated. According to DEV Community, the base tool definition alone spans 29,000 lines of TypeScript.
The Query Engine, at 46,000 lines, is the largest module in the codebase. It handles all LLM API calls, streaming, caching, and orchestration.
This architecture tells you something important: production AI agents are not prompt chains. They're tool orchestration systems. Each capability is isolated, permissioned, and composable. That's the same pattern used in any serious agent deployment.
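The leaked code itself isn't reproduced here, but the pattern the article describes, discrete tools that each clear a permission gate before running, can be sketched in a few lines (all names hypothetical):

```typescript
// Hypothetical sketch of a permission-gated tool registry.
// Each capability is an isolated, composable unit that must
// pass a permission check before it executes.
type Permission = "read" | "write" | "execute";

interface Tool {
  name: string;
  requires: Permission;
  run(input: string): string;
}

class ToolRegistry {
  private tools = new Map<string, Tool>();
  constructor(private granted: Set<Permission>) {}

  register(tool: Tool): void {
    this.tools.set(tool.name, tool);
  }

  invoke(name: string, input: string): string {
    const tool = this.tools.get(name);
    if (!tool) throw new Error(`unknown tool: ${name}`);
    if (!this.granted.has(tool.requires)) {
      throw new Error(`permission denied: ${name} requires ${tool.requires}`);
    }
    return tool.run(input);
  }
}

// A session granted read-only access can read files but not edit them.
const session = new ToolRegistry(new Set<Permission>(["read"]));
session.register({ name: "read_file", requires: "read", run: (p) => `contents of ${p}` });
session.register({ name: "edit_file", requires: "write", run: (p) => `edited ${p}` });
```

The point of the shape is that the permission check lives in the registry, not in each tool, so no individual capability can forget to enforce it.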
Internal Codenames and Unreleased Models
The leak exposed Anthropic's internal naming conventions. "Tengu" appears hundreds of times as Claude Code's project codename. Model codenames like Capybara (Claude 4.6 variant), Fennec (Opus 4.6), and an unreleased model called Numbat were discovered.
There's even an "Undercover Mode" that prevents Claude from revealing internal information in commits and PRs. When active, it injects into the system prompt: "You are operating UNDERCOVER in a PUBLIC/OPEN-SOURCE repository."
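Mechanically, a mode like that is just conditional system-prompt assembly. A hypothetical sketch (the quoted instruction is from the leak reports; everything else is invented for illustration):

```typescript
// Hypothetical sketch: conditionally injecting an instruction
// into the system prompt, as the reported "Undercover Mode" does.
function buildSystemPrompt(base: string, undercover: boolean): string {
  if (!undercover) return base;
  return base + "\nYou are operating UNDERCOVER in a PUBLIC/OPEN-SOURCE repository.";
}
```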
Why This Matters for AI Agent Deployment
1. Tool-Based Architecture Is the Right Pattern
The biggest validation from this leak: Anthropic's own production agent uses discrete, permission-gated tools as the fundamental unit of work. Not prompt templates. Not chain-of-thought pipelines. Tools.
When we deploy agent swarms across sales, marketing, and operations for B2B companies, each agent's capabilities are built as composable tools. Signal detection is a tool. CRM enrichment is a tool. Content generation is a tool. The Claude Code leak confirms this is how the builders at Anthropic think about agent architecture too.
2. Multi-Agent Coordination Is Coming Fast
The COORDINATOR_MODE flag shows Anthropic has a working multi-agent system where one Claude instance directs multiple workers in parallel. That's not an experiment. It's compiled code waiting to ship.
For businesses currently deploying single-agent workflows, this is a signal to build your infrastructure with multi-agent coordination in mind. The orchestration layer, tool registry, and permission system you build today should be ready for a world where agents spawn other agents.
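Nothing public shows how COORDINATOR_MODE is implemented, but the shape described, one coordinator fanning work out to parallel workers and collecting results, looks roughly like this hypothetical sketch:

```typescript
// Hypothetical sketch of coordinator-style fan-out: one coordinator
// splits a task, runs workers concurrently, and gathers results.
// A real agent would call an LLM here; this stub just labels input.
async function workerAgent(task: string): Promise<string> {
  return `done: ${task}`;
}

async function coordinator(tasks: string[]): Promise<string[]> {
  // Spawn all workers concurrently and wait for every result.
  return Promise.all(tasks.map((t) => workerAgent(t)));
}

const results = await coordinator(["plan", "implement", "test"]);
```

The infrastructure questions this raises for your own stack are the ones worth answering now: can your tool registry be shared across agents, and does your permission model distinguish the coordinator from its workers?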
3. The Security Surface Is the Deployment Pipeline
This entire incident happened because of one overlooked build configuration. A single flag in Bun's bundler exposed half a million lines of proprietary code.
According to Analytics India Magazine, the exposed code includes internal API client logic, OAuth 2.0 authentication flows, and the full permission enforcement system. That gives adversarial actors a roadmap for crafting inputs designed to bypass trust boundaries.
The lesson is clear: if you're building production AI systems, your CI/CD pipeline, build configuration, and deployment process are your security perimeter. Not your model. Not your prompts. Your build system.
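A concrete, low-cost guard against exactly this incident is a release gate that inspects the list of files about to be published and fails the build if any source maps slipped in. A minimal sketch, written as pure functions so they can run over the output of `npm pack --dry-run` or a directory walk (the CI wiring is left out):

```typescript
// Minimal sketch of a publish gate: given the list of files
// about to ship, flag anything that looks like a source map.
function findSourceMaps(files: string[]): string[] {
  return files.filter((f) => f.endsWith(".map"));
}

function assertNoSourceMaps(files: string[]): void {
  const maps = findSourceMaps(files);
  if (maps.length > 0) {
    throw new Error(`refusing to publish: source maps found: ${maps.join(", ")}`);
  }
}
```

It's a five-minute check, and it would have turned a half-million-line leak into a failed CI run.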
What This Doesn't Change
A source code leak doesn't make Claude Code less capable. The tool still works exactly the same way it did yesterday. Anthropic's architecture is solid, which is why practitioners will study this code rather than dismiss it.
What it does change is the security calculus. According to Gartner, 40% of enterprise applications will incorporate agentic AI by end of 2026. Every one of those deployments has a build pipeline, a dependency chain, and a configuration surface that could be the next npm source map incident.
Companies deploying AI agent infrastructure need to treat deployment security with the same rigor they apply to application security. That means auditing build outputs, gating releases behind security checks, and assuming that anything shipped to a public registry will eventually be read.
The Bottom Line
The Claude Code leak gives us the clearest picture yet of how a production AI agent is actually built. The architecture validates what serious practitioners already know: tools over prompts, permissions over trust, orchestration over monoliths.
Anthropic built something impressive. They just accidentally showed everyone how it works.
The companies that benefit most from this moment aren't the ones reading the source code for fun. They're the ones already building agent infrastructure using the same patterns and now have confirmation they're on the right track.
Sources: