Agents aren’t the problem. The handoffs between them are.
Companies are deploying agents faster than they can govern them. Across 2026 surveys the pattern is consistent: nobody knows what each agent is allowed to do, where that permission came from, or whether a five-step chain still resembles the original user request.
The attacker is rarely tricking the AI model itself. They’re exploiting the gaps between identities, systems, and permissions that should have expired three months ago and never did.
0%
organisations reporting confirmed or suspected agent incidents in 2026.92.7% in healthcare.
0%
cannot enforce purpose limitations on agent behaviour.
0.0%
have full visibility into which agents talk to each other.
0.0%
treat agents as independent, identity-bearing entities.
Six incidents. One pattern.
named breaches · sector patternsStep Finance
Jan 2026DeFi · Solana$27–30M moved
AI trading agents drained 261,000+ SOL after a single device was compromised. 45.6% of DeFi teams ship shared API keys.
Mexican government
Dec 2025 → Feb 2026Public sector195M records
One attacker, two off-the-shelf models, nine agencies. 220M civil records, 150 GB exfiltrated across a single chain of agent integrations.
Replit autonomous agent
Jul 2025Developer tooling4 production DBs wiped
A coding agent ran a destructive migration without confirmation. The agent did exactly what its scope said it could; nobody had reviewed the scope.
Anthropic agentic ops report
Mar 2026Threat intelligence12 documented chains
Reported credential exfiltration and lateral movement across cloud providers using off-the-shelf agentic assistants. The agents weren't subverted; their delegation graphs were.
Healthcare BAA expiry
Q1 2026Healthcare · patternPHI exposure
Recurring pattern: a third-party EHR adapter whose Business Associate Agreement lapsed kept its write:patient_record scope. Reactivated months later by a downstream chain.
CS refund escalation
Q4 2025Retail SaaS · patternAccount modification
Recurring pattern: a customer-service bot's $50 refund authority chains into account-modification authority via a forgotten cron worker with broader scope.
Two named, four sector patterns. In every case, the agents did what their scopes said they could. Nobody had reviewed whether those scopes still made sense, whether the chain reaching the executor still resembled a user request, or whether the agent on the third hop still belonged to the company that authorised it.