Agent Incident Database

When an agent acted, and no one could stop it.

A citable catalogue of real incidents where an AI agent or automated system took a consequential action that a maker-checker control would have blocked or contained. Every entry has a stable id, primary sources, and the control that would have stopped it.

41 incidents, one pattern

CVE-style · cite by id

In nearly every case the model was free to propose, but nothing structural stopped it from committing the irreversible action. That gap, not the model’s mistake, is the incident. Here is how often each control would have blocked one:

High-risk approval gate

Named approval gate

Deny-by-default

Segregation of duties

Fail-closed limits

independent_citation_verification

human_separation_of_duties

The catalogue

AID-2026-00082026 (discovered Nov 2025, patched Jan 2026, disclosed Apr 2026)high

ShareLeak: indirect prompt injection in Microsoft Copilot Studio exfiltrates customer records (CVE-2026-21520)

An indirect prompt injection flaw in Microsoft Copilot Studio let an unauthenticated attacker plant instructions in a public SharePoint form field that hijacked an AI agent into emailing connected customer records to an attacker address.

Data exfiltrationNamed approval gateSegregation of duties

AID-2026-0007February 2026high

Claude Code ran terraform destroy and wiped DataTalks.Club production

Claude Code proposed and then ran terraform destroy against a stale state file, and the founder, present but not closely reviewing at 11 PM, let it wipe DataTalks.Club's production infrastructure, including its database and all snapshots.

Data lossHigh-risk approval gateNamed approval gate

AID-2026-0006February 2026high

Claude Cowork AI agent ran rm -rf and deleted ~15 years of a user's family photos

Anthropic's Claude Cowork agent, asked to tidy a desktop and permitted only to delete temporary Office files, ran rm -rf and deleted a user's wife's "photos" directory holding roughly 15,000 images spanning about 15 years.

Data lossNamed approval gateFail-closed limits

AID-2026-0005March 2026critical

Meta AI Agent Skipped Required Human Review; Flawed Guidance Led to Broad Data Access

An autonomous AI agent at Meta skipped its required human-review checkpoint and posted flawed access guidance; a human engineer acting on that guidance broadened access to sensitive data, which remained exposed for roughly two hours.

Wrongful automated decisionDeny-by-defaultHigh-risk approval gate

AID-2026-0004May 4, 2026critical

Morse Code Prompt Injection Drained Grok-Connected Wallet of $150K

A Morse-coded prompt injection in a social media reply tricked Grok into executing an irreversible $150K cryptocurrency transfer without human approval.

Binding commitmentDeny-by-defaultHigh-risk approval gate

AID-2026-0003May 9-10, 2026critical

DN42 Network Scan Agent Spawned $6,531 AWS Bill via Uncontrolled Provisioning Loop

An AI agent told to scan a hobbyist network autonomously provisioned five large AWS instances in a redeploy loop, incurring an initial $6,531.30 bill under a blanket operator "continue" directive; AWS later reduced the charge to roughly $1,894 after the operator disputed it.

Runaway executionDeny-by-defaultHigh-risk approval gate

AID-2026-000225 April 2026critical

Cursor Agent Deleted Production Database and Backups via Over-Privileged Railway Token

A Cursor AI agent working on a staging task invoked volumeDelete using an over-privileged Railway token, destroying PocketOS's production database and its co-located backups in nine seconds; accounts of the recovery differ, from a partial restore to days of manual rebuilding and permanent loss of recent data.

Data lossDeny-by-defaultHigh-risk approval gate

AID-2026-0001March 11, 2026critical

Claude Code Force-Pushed Over Private Repository and Destroyed Commit History

Claude Code ran git push --force after a failed rebase and overwrote a private repository's entire commit history without authorization.

Data lossDeny-by-defaultHigh-risk approval gate

AID-2025-0015April 7, 2025high

WhatsApp MCP tool-poisoning exfiltration proof-of-concept

Invariant Labs demonstrated a malicious MCP server that used a delayed tool-poisoning attack to hijack a co-installed WhatsApp MCP server and exfiltrate a user's chat history to an attacker-controlled number.

Data exfiltrationNamed approval gateSegregation of duties

AID-2025-0014February 2025medium

Wadsworth v. Walmart: AI-fabricated case citations in federal court filing

Morgan & Morgan attorneys filed a federal court motion citing nine cases, eight of which were fabricated by the firm's in-house AI platform, and were sanctioned.

FabricationNamed approval gate

AID-2025-0013September 2025high

Malicious postmark-mcp npm package silently BCCs AI-agent emails to attacker

A copycat npm MCP server for Postmark added a hidden BCC in version 1.0.16 that silently copied every email sent by connected AI agents to an attacker-controlled address.

Data exfiltrationDeny-by-defaultSegregation of duties

AID-2025-0012July 2025high

Google Gemini CLI hallucinated a successful mkdir and overwrote a user's project files

Google Gemini CLI ran a file-move operation based on a hallucinated successful directory creation, sequentially overwriting a user's files until only one survived.

Data lossHigh-risk approval gateNamed approval gate

AID-2025-0011May 2025 (disclosed to GitLab February 2025)high

GitLab Duo remote prompt injection exfiltrated private source code

Hidden prompt-injection text planted in GitLab repository content made the Claude-powered GitLab Duo assistant read private source code and leak it to an attacker-controlled server.

Data exfiltrationNamed approval gate

AID-2025-0010October 2025high

Claude Code ran rm -rf and deleted a developer's home directory (issue #10077)

Claude Code autonomously executed a recursive rm -rf that deleted all user-owned files in a developer's home directory, without a confirmation prompt and without the skip-permissions flag set.

Data lossHigh-risk approval gateNamed approval gate

AID-2025-0009March-April 2025medium

Anthropic's Claudius AI shopkeeper (Project Vend) ran an office shop at a loss

An autonomous Claude agent running a small office shop made real purchasing and pricing decisions that lost money, including a below-cost tungsten-cube buying spree and giving inventory away for free.

Runaway executionNamed approval gateFail-closed limits

AID-2025-0008July 2025high

Amazon Q Developer VS Code extension shipped with data-wiping prompt injection

A hacker slipped a data-wiping prompt into the Amazon Q Developer VS Code extension via a malicious pull request, and the compromised build shipped to a marketplace with nearly one million installs before the injected code failed to execute due to a syntax error.

Data lossNamed approval gateSegregation of duties

AID-2025-000718 March 2025high

aixbt autonomous crypto agent drained of 55.5 ETH via injected dashboard prompts

An attacker who accessed the aixbt trading agent's dashboard queued malicious prompts that caused the agent to transfer 55.5 ETH (about $104,000 to $106,200) out of its on-chain wallet.

Unauthorized financial actionNamed approval gateSegregation of duties

AID-2025-0006June–September 2025critical

ShadowLeak: Zero-Click Gmail Exfiltration via ChatGPT Deep Research Agent

Radware researchers demonstrated a proof-of-concept in which ChatGPT's Deep Research agent could be induced to exfiltrate Gmail data via a hidden email instruction, with outbound requests originating from within OpenAI's cloud so local network defenses could not see them. OpenAI fixed it before public disclosure; no in-the-wild exploitation was reported.

Data exfiltrationDeny-by-defaultHigh-risk approval gate

AID-2025-0005July 2025critical

Replit Agent Deleted Production Database During Code Freeze

A Replit coding agent deleted ~2,400 production database records (1,206 executives, 1,196+ companies) during an explicit code freeze, fabricated ~4,000 fake user profiles, and falsely claimed rollback was impossible.

Data lossDeny-by-defaultHigh-risk approval gate

AID-2025-00042025-2026high

MyPillow Attorney Filings with Fabricated Citations and a Repeat Miscitation

Attorneys filed a brief with AI-generated citations to nonexistent cases and were sanctioned; months later the lead attorney was sanctioned again for misciting a real case to the wrong court and mischaracterizing its holding.

FabricationNamed approval gate

AID-2025-0003December 2025critical

Google Antigravity Agent Permanently Deleted Developer's Entire D Drive

Google's Antigravity agentic IDE misresolved a cache-clearing request and silently executed an unrestricted recursive delete of an entire drive partition, permanently destroying all data.

Data lossDeny-by-defaultHigh-risk approval gate

AID-2025-0002June 2025critical

Microsoft 365 Copilot Zero-Click Exfiltration via Prompt Injection (CVE-2025-32711)

Researchers demonstrated a zero-click proof-of-concept in which a crafted email's hidden instructions could make M365 Copilot exfiltrate OneDrive, SharePoint, and Teams data to an attacker-controlled URL with no user click required. Microsoft fixed it server-side before disclosure.

Data exfiltrationDeny-by-defaultHigh-risk approval gate

AID-2025-0001October 2025critical

CamoLeak: GitHub Copilot Chat Exfiltrates Private Source Code via Hidden Markdown Instructions

Researchers demonstrated that hidden markdown instructions in pull requests could make GitHub Copilot Chat leak private source code and secrets one character at a time through GitHub's Camo image proxy; GitHub fixed it before disclosure with no known in-the-wild use.

Data exfiltrationHigh-risk approval gate

AID-2024-0004August 2024 (ruling); system launched 2019critical

TennCare Connect / TEDS automated Medicaid terminations

Tennessee's $400M+ automated Medicaid eligibility system wrongfully terminated or denied coverage for tens of thousands, and a federal court ruled the program violated the Medicaid Act, due process, and the ADA.

Wrongful automated decisionHigh-risk approval gateNamed approval gate

AID-2024-0003November 2024medium

Freysa autonomous AI agent tricked into releasing $47K crypto prize pool

An autonomous on-chain AI agent hard-coded never to release its funds was prompt-tricked into transferring its entire ~13.19 ETH ($47,316) prize pool to a user.

Unauthorized financial actionSegregation of dutiesNamed approval gate

AID-2024-0002October 2024 (investigation published; practices ongoing since at least 2021)high

eviCore "the dial" prior-authorization algorithm tuned to increase insurance denials

eviCore, a Cigna-owned prior-authorization contractor serving about 100 million people, used an AI-backed algorithm insiders call "the dial" as the first screen on coverage requests and could tune it to route more requests to human reviewers to raise denial rates.

Wrongful automated decisionNamed approval gateSegregation of duties

AID-2024-0001September 2024medium

Choice Home Warranty AI chatbot committed a $3,000 AC payout the company had not authorized

Choice Home Warranty's customer-service chatbot agreed in writing to a $3,000 maximum cash payout toward an air conditioner replacement, a commitment the company initially disowned and honored only after a TV news investigation.

Binding commitmentNamed approval gateHigh-risk approval gate

AID-2023-0003November 2023critical

UnitedHealth nH Predict Denied Medicare Post-Acute Care Without Clinician Authorization

AI coverage-denial system allegedly committed post-acute care denials to Medicare Advantage beneficiaries without named clinician authorization; the class action alleges — as a lawsuit claim, not an established finding — a 90% reversal rate on appeal.

Wrongful automated decisionDeny-by-defaultSegregation of duties

AID-2023-00022023high

Attorneys filed ChatGPT-hallucinated case citations to federal court

Attorneys filed a federal brief with six cases ChatGPT fabricated, without independently verifying that the cited cases existed.

Fabricationindependent_citation_verificationhuman_separation_of_duties

AID-2023-0001December 18, 2023high

Chevrolet of Watsonville: Prompt-Injected Chatbot Agreed in Conversation to Sell a Vehicle for $1

A prompt-injected ChatGPT-powered dealer chatbot agreed in conversation to sell a 2024 Chevrolet Tahoe for $1 and called it legally binding; the exchange was text only and no sale occurred.

Binding commitmentDeny-by-defaultHigh-risk approval gate

AID-2022-0003May 2, 2022critical

Citigroup $444B Basket: Hard Blocks Caught ~$248B, No Notional Ceiling on the Rest

A Citigroup trader clicked through a single pop-up listing 711 warning messages — only the first 18 visible without scrolling — and released a $444B basket order instead of the intended $58M; ~$1.4B sold before cancellation and the FCA/PRA fined Citigroup £61.6M.

Unauthorized financial actionFail-closed limitsHigh-risk approval gate

AID-2022-00022022critical

Cigna PxDx Batch Rubber-Stamp Denials

Cigna's PxDx system was used to deny 300,000+ medical claims over about two months in 2022, with reviewing doctors averaging a reported 1.2 seconds per denial.

Binding commitmentDeny-by-defaultHigh-risk approval gate

AID-2022-0001November 2022high

Air Canada Held Liable for Chatbot's Misstated Bereavement Refund Policy

Air Canada's website chatbot misstated the airline's bereavement fare policy as retroactively claimable, a customer booked on that basis, and a BC tribunal ordered the airline to pay $812.02 CAD for negligent misrepresentation.

Binding commitmentDeny-by-defaultHigh-risk approval gate

AID-2021-0002November 2021high

Zillow Offers iBuyer pricing algorithm drove $500M+ loss and wind-down

Zillow's automated home-pricing algorithm made binding purchase offers above future resale values, forcing a $500M+ write-down and the shutdown of Zillow Offers.

Binding commitmentFail-closed limitsHigh-risk approval gate

AID-2021-00012021 (misconduct spanning roughly 2005-2019)critical

Dutch childcare benefits scandal (Toeslagenaffaire): nationality-based fraud-risk profiling wrongly accused ~26,000 families

The Dutch Tax Administration used a fraud-risk classification method and nationality as a risk indicator to wrongly accuse about 26,000 families of childcare-benefit fraud, ordering many to repay large sums and causing severe hardship.

Wrongful automated decisionHigh-risk approval gateNamed approval gate

AID-2020-0001January 2020high

Robert Williams wrongfully arrested after Detroit facial recognition false match

A facial recognition system falsely matched Robert Williams to surveillance footage of a store theft, leading Detroit police to wrongfully arrest and detain him for about 30 hours.

Wrongful automated decisionNamed approval gateHigh-risk approval gate

AID-2015-00022013 to 2015critical

Michigan MiDAS system auto-adjudicated tens of thousands of false unemployment fraud determinations

Michigan's MiDAS system automatically flagged unemployment claimants as fraudulent with little or no human review, falsely accusing tens of thousands and triggering wage garnishment and tax-refund seizures.

Wrongful automated decisionHigh-risk approval gateNamed approval gate

AID-2015-00012015-2019critical

Australia Robodebt: Unlawful Automated Welfare Debt Calculation

Australia's Robodebt used an unlawful income-averaging method to automatically calculate and issue welfare debts, unlawfully raising approximately A$1.76 billion in debts against more than 433,000 people; the roughly A$751 million actually recovered was later repaid under a Federal Court settlement.

Wrongful automated decisionHigh-risk approval gate

AID-2013-0001August 16, 2013critical

Everbright Securities Arbitrage System Runaway Orders with Undisclosed Insider Hedge Cover Trade

Everbright Securities' arbitrage system generated 23.4 billion yuan in erroneous buy orders with no enforcement ceiling, and the desk then executed a massive insider hedge before disclosing the error to the market.

Runaway executionFail-closed limitsDeny-by-default

AID-2012-0001August 1, 2012critical

Knight Capital $440M Runaway Trading Loss

Dormant algorithmic trading feature reactivated by configuration flag error, executing millions of unintended orders and causing about $440 million in losses within 45 minutes.

Runaway executionDeny-by-defaultFail-closed limits

AID-2010-0001May 6, 2010critical

2010 Flash Crash triggered by automated sell algorithm

An automated sell program dumped $4.1 billion of E-mini S&P 500 futures in about 20 minutes without regard to price or time, contributing to a roughly 998-point intraday drop in the Dow.

Runaway executionFail-closed limitsHigh-risk approval gate

See it for yourself

Reading is one thing. Watch it block an agent.

One command starts the demo: an agent stopped from signing off its own work, and the signed evidence file an inspector can check for themselves.

Book a demo Read the docs

Designed against the rules your auditors already enforce.