The Wire

First reported 26 Jun 2026 · today

What happened after 2,000 people tried to hack my AI assistant

Fernando Irarrázaval ran a public challenge at hackmyclaw.com inviting people to leak secrets from his OpenClaw test instance via email-based prompt injection. After roughly 6,000 attempts by ~2,000 people, nobody succeeded in extracting the secret, with the instance protected by anti-prompt-injection system rules on the underlying model.

analysisprompt-injectiondata-exfiltrationllmai-agents

Incident details →

First reported 24 Jun 2026 · 2d ago

Dawn of the Apex Agentic Adversary

An analysis piece arguing that autonomous, agentic AI adversaries are compressing the timeline of cyberattacks beyond human-speed defenses, ending the era of human-paced threat cycles. The available text is introductory commentary without specific technical proof-of-concept details.

analysisautonomous-attack-frameworkai-agentsllm

Incident details →

First reported 23 Jun 2026 · 3d ago

Agentic AI: The Weapon That No Longer Needs a Warrior

An opinion/commentary piece reflecting on how agentic AI removes the human from the targeting loop, drawing analogies to the historical evolution of weapons that distanced warriors from their victims.

analysisautonomous-attackai-agentsllm

Incident details →

First reported 22 Jun 2026 · 4d ago

Stop Your Legacy Infrastructure from Hijacking Your AI Agents

A conference talk recap discussing how attackers may use legacy infrastructure to circumvent AI security programs and hijack AI agents, noting rapid AI agent adoption outpacing security controls.

analysisagent-hijackingai-agents

Incident details →

First reported 19 Jun 2026 · 7d ago

Forget Data Leakage: Shadow AI's Real Threat Is Access Control

The article argues that shadow AI in enterprises has evolved from a data leakage concern into an access control problem, where the risk lies in autonomous AI tools and agents having unmanaged access permissions rather than just employees pasting sensitive data.

analysisshadow-aiaccess-controlai-agentsllm

Incident details →

First reported 12 Jun 2026 · updated 16 Jun 2026 · 2 sources · 15d ago

MCP Supply Chain Attacks: Why Better Models Make It Worse

An analysis arguing that the Model Context Protocol (MCP) is following the insecure early-API trajectory by leaving authentication, authorization, input validation, and sandboxing to implementers. It highlights that compromising the AI/MCP layer can cause broader, harder-to-trace damage than a compromised API because LLM-driven agents autonomously select tools and can be manipulated via upstream data or prompts.

analysissupply-chaintool-abuseprompt-injectionmcpai-agentsllm

Incident details →

First reported 16 Jun 2026 · 11d ago

Quoting Matteo Wong, The Atlantic

An Atlantic piece quotes cybersecurity expert Katie Moussouris discussing a White House report on a Claude jailbreak, where the model refused to 'review code for security issues' but complied when asked to 'fix this code.' Moussouris characterized this as the model working as intended for cyberdefense rather than a genuine exploit.

analysisjailbreakllm

Incident details →

First reported 10 Jun 2026 · 16d ago

Is security a skill issue? Five scanners, 3,084 skills, a different verdict 64% of the time · Mastro

A Mastro study analyzed 3,084 agent skills across five security scanners and found they disagree on a verdict 63.9% of the time, with 14.2% rated CRITICAL by one scanner and SAFE by another. The piece frames the broader supply-chain risk of AI agent skills—markdown files agents execute with full tool access—citing reported incidents where malicious skills lifted SSH keys, cloud credentials, and crypto wallets, and a fake download counter pushed a dummy skill to #1.

analysissupply-chaindata-exfiltrationmalicious-skillai-agentsllm

Incident details →

First reported 9 Jun 2026 · 17d ago

GitHub - denoland/clawpatrol: Security firewall for agents · GitHub

Clawpatrol is an open-source security firewall for AI agents from denoland, designed to sandbox external plugins (treated as an untrusted supply-chain attack surface) using OS-level namespaces, Landlock, and macOS sandbox profiles, with permission lockfiles and brokered network dialing.

analysistool-abusesupply-chainprompt-injectionai-agentsmcpllm

Incident details →

First reported 7 Jun 2026 · 19d ago

Polymarket annotation injection

The author found injected annotations on a Polymarket event page that are rendered server-side and therefore visible to LLMs via web_search even when hidden in the browser. A planted annotation (source 'grok') contained a fake emergency-rate-cut message directing users to withdraw funds at a phishing-style domain, representing an indirect prompt-injection vector through Polymarket's annotation API endpoints. Claude's web search saw the content but correctly flagged it as phishing.

analysisprompt-injectionindirect-prompt-injectiondata-exfiltrationllmweb-searchrag

Incident details →

First reported 3 Jun 2026 · 23d ago

GitHub - pixiebrix/agent-browser-shield: Browser extension with 35+ rules for keeping your AI agent safe while browsing · GitHub

A GitHub repository for 'agent-browser-shield,' a browser extension by pixiebrix offering 35+ rules aimed at keeping AI agents safe while browsing. It is a defensive tool addressing risks to browser-based AI agents rather than a report of a specific threat.

analysisprompt-injectiondata-exfiltrationbrowser-agentai-agents

Incident details →

First reported 3 Jun 2026 · 24d ago

The New Security Risks of the Agentic Development Lifecycle

An article discussing how AI agents are reshaping the software development lifecycle and shifting where security risk originates, arguing that securing the development process matters as much as securing code.

analysissupply-chainai-agentsllm

Incident details →

First reported 29 May 2026 · 28d ago

Inside MCP: defending the runtime layer of agent security · Arcis Blog

An Arcis blog post argues that agent security has four layers (identity, pre-deploy testing, observability, runtime defense) and that the runtime hot path is structurally underserved. It frames MCP's explicit tool-call contract as enabling runtime defense against agent toolcall injection (their vector V32), applying allowlist/sanitize/refuse techniques at the agent-tool boundary.

analysistool-abuseprompt-injectionmcpai-agents

Incident details →