McKinsey Has 25,000 AI Agents on Payroll. Your Team of Four Can Run the Same Play.

McKinsey's CEO got on stage at CES and dropped a number: 25,000 AI agents. Forty thousand humans, twenty-five thousand agents, 1.5 million hours saved on search and synthesis in a single year. They went from 3,000 agents to 25,000 in eighteen months and they're targeting 1:1 human-agent parity by the end of this year.

That's a big number. It's supposed to be impressive. And it is -- if you're running a 40,000-person consulting firm with hundreds of discrete workflows across every continent.

But here's what nobody on LinkedIn is doing the math on: 25,000 agents for 40,000 people is a ratio of 0.6 agents per human. McKinsey needs that many because McKinsey is that big. A four-person studio? You need four.

The Protocol Layer Is Solved

The Model Context Protocol -- MCP -- just crossed 97 million monthly SDK downloads. For context, that's 4,750% growth in sixteen months. Kubernetes took nearly four years to hit comparable deployment density.

Anthropic donated MCP to the Linux Foundation in March. It's now governed by the Agentic AI Foundation, co-founded with Block and OpenAI, backed by Google, Microsoft, AWS, and Cloudflare. Every major AI provider ships MCP-compatible tooling as default.

There are 5,800+ MCP servers covering CRMs, databases, cloud providers, productivity tools, e-commerce platforms, analytics dashboards. The integration development time for multi-tool agent deployments dropped 60-70%.

What this means in plain English: the protocol that lets an AI agent talk to your invoicing software, your CRM, your email, your project management tool -- that's free now. It's universal. It's not a competitive advantage anymore. It's plumbing.

So Why Are 95% of Companies Still Getting Nothing From This?

MIT Media Lab published a study showing 95% of organizations see no measurable return on AI investment. Billions spent. Nothing to show.

The reason is simple and it's the same reason it's always been: they bolted AI onto workflows they didn't change. Four out of five companies are automating existing processes exactly as-is. Only one in five is actually redesigning how work gets done.

PwC's own chief AI officer put a number on it: technology delivers about 20% of an initiative's value. The other 80% comes from redesigning the work itself.

McKinsey's rivals are saying the quiet part out loud. EY's global engineering chief said "some of the best value we have is returned by just a handful of agents doing the heavy lifting." PwC called agent count "probably the wrong measure."

The agent count is a vanity metric. The wiring is the real work.

What "Wiring" Actually Means

Here's a concrete example. Zoho shipped MCP integration that lets a single prompt like "find all invoices overdue by 30+ days and create follow-up tasks for each one" execute in under 10 seconds. Setup takes 20 minutes. First useful automation runs inside an hour. Available on free tiers.

That's one agent, connected to two tools, doing something a human used to spend 45 minutes on every morning.

Now multiply that by four. One agent watches your invoicing and flags overdue accounts. One handles your CRM hygiene after every meeting. One generates your weekly client reports. One manages your content calendar. Each one is connected to maybe two or three tools you already pay for.

That's not a 25,000-agent deployment. That's a Tuesday afternoon.

The gap isn't the AI. The gap isn't the protocol. The gap is someone sitting down and mapping YOUR specific invoicing workflow, YOUR specific CRM, YOUR specific content process to agents that actually do the work. Not in theory. In production. Running every day without you touching it.

Small Teams Have a Structural Advantage Here

This might sound counterintuitive, but the data supports it: small teams are better positioned for AI agent deployment than enterprises.

Every MCP server you connect adds tool definitions to the AI's context window. Over-connecting is a real failure mode. An agent that can see 200 tools performs worse than an agent that can see 6. The context window is a finite resource and stuffing it full of irrelevant capabilities degrades performance.

McKinsey has hundreds of workflows. They need thousands of agents because each one has to be scoped narrowly enough to actually work. A four-person operation has maybe a dozen core workflows. You can map those in a week.

There's also the redesign problem. Changing how work gets done at a 40,000-person company requires change management consultants, steering committees, pilot programs, and eighteen months of meetings. Changing how work gets done at a four-person company requires a conversation over lunch.

The 95% failure rate isn't about bad technology. It's about organizations that can't move fast enough to redesign around it.

The Ratio Scales Down

McKinsey's 0.6 agents per human is a starting point. But the ratio inverts for small teams because each agent handles a larger percentage of your total workload.

One well-wired agent that handles your post-meeting CRM updates, client report generation, and invoice follow-ups might save you 8-10 hours a week. At McKinsey, that same agent saves one person among thousands a few hours. For you, it's the difference between a 50-hour week and a 40-hour week.

We run our entire daily content pipeline on agents -- topic selection, research, writing, quality scoring, video generation, publishing, distribution across six platforms. No human in the loop. The content you're reading right now came through that system.

We built 40+ internal tools. We use MCP servers we wrote ourselves to connect our agents to our specific workflows. Between two people, we cover what would typically require a 10-14 person team. That's not a hypothetical. That's our operating model.

The infrastructure to do this is available to anyone now. The protocol is free. The servers exist. The AI models are accessible.

What's not free is the expertise to wire it together correctly. To know which workflows to redesign and which to leave alone. To scope agents tightly enough that they're useful instead of noisy. To build the connections between YOUR tools, not the demo tools.

That's the actual gap. And it's exactly what we do for clients.

The Window

Gartner says 40% of enterprise apps will have embedded AI agents by end of 2026. IDC says 80% of workplace apps will have copilots by then. The trajectory is clear.

The companies figuring out the wiring now -- connecting agents to their actual tools, redesigning their actual workflows -- will have a compounding advantage over the ones still running pilots in Q4.

McKinsey spent 18 months and presumably significant resources getting to 25,000 agents. You don't need 18 months. You don't need 25,000. You need four agents wired into the tools you already use, with workflows redesigned to actually take advantage of them.

We handle this for clients. First conversation is free -- kief.studio/contact.

All Insights More in automation →