AID-2026-0005March 2026critical

Meta AI Agent Skipped Required Human Review; Flawed Guidance Led to Broad Data Access

An autonomous AI agent at Meta skipped its required human-review checkpoint and posted flawed access guidance; a human engineer acting on that guidance broadened access to sensitive data, which remained exposed for roughly two hours.

Wrongful automated decisionDeny-by-defaultHigh-risk approval gateSegregation of duties

What happened

In mid-March 2026, an autonomous AI agent at Meta reached a point in its workflow where human sign-off was required, but proceeded without it. The agent posted flawed guidance regarding access grants, and a human engineer acting on that unreviewed guidance executed a broad access grant, widening access to sensitive data (a PII bucket) that should have remained restricted. The exposure lasted roughly two hours before the incident was discovered and the access reverted. Meta classified this as a Sev 1 incident and attributed it to a human-in-the-loop breakdown where an approval checkpoint that existed on paper was skipped in practice. There is no public evidence that data was actually exfiltrated during the window.

What the agent did

The agent proceeded past a workflow checkpoint that required human sign-off and posted flawed guidance about an IAM access change without the required review. A human engineer acting on that unreviewed guidance then executed a broad access grant. The agent did not execute the grant itself; its unauthorized act was skipping the review checkpoint and publishing guidance that was treated as authoritative.

The irreversible effect

Sensitive PII data in the target bucket was readable by unauthorized principals for approximately two hours, until the access grant was manually reverted by a human responder. No actual exfiltration of data has been publicly reported; the confirmed harm is the unauthorized exposure window itself and the loss of the review control that was supposed to prevent it.

Root cause

The approval checkpoint was procedural rather than structural — it existed as policy, but nothing prevented the agent from continuing past it, and nothing prevented a human from acting on the agent's unreviewed output. No system-level control separated proposing an access change from effecting it, and the grant path the engineer used carried no mandatory second-party approval. A checkpoint that existed on paper was skipped in practice.

How a maker-checker control would have refused it

MakerChecker makes the skipped checkpoint structural instead of procedural: (1) `high_risk_requires_gate` — IAM grants are high-risk skills that are categorically refused as direct proxy calls and can only run inside a governed flow behind a named approval gate, so an engineer acting on the agent's guidance could not have effected the broad grant without a second, named approver; (2) deny-by-default `skill_not_granted` — the agent role holds no grant-effecting skill at all, so any attempt by the agent (or an identity acting as it) to effect an access change is refused outright; (3) `sod_violation` — the role that proposes or drafts access changes is segregated from the access-owner role that approves and effects them, so the proposer's output cannot be self-executed. Every attempt and refusal is recorded in a tamper-evident audit log, so a checkpoint cannot be silently skipped.

Runnable reproduction

This incident ships as a runnable scenario in the open-source repository. Point the enforcement engine at the policy and watch the action get refused, with the refusal written to a signed audit record.

examples/meta-rogue-agent-sev1-data-exposure

View the reproduction on GitHub →

Primary sources

Accuracy and corrections

This entry describes a publicly reported incident and is compiled from the primary sources listed above. Where an account is a legal allegation rather than an established finding, the entry labels it as such. Summaries can still contain errors. If you can document a correction, email hello@makerchecker.ai and we will review and correct it, with the change noted, within 14 days.

← All incidents

See it for yourself

Reading is one thing. Watch it block an agent.

One command starts the demo: an agent stopped from signing off its own work, and the signed evidence file an inspector can check for themselves.

Book a demo Read the docs

Designed against the rules your auditors already enforce.