Incident · curated 27 Jun 2026
First reported 13 Jun 2026 · 13d ago
Single-source incident — first reported, latest, and curated coincide.
It shows attackers can weaponize an LLM's own safety guardrails to evade AI-driven security scanning, blinding defensive tools to embedded payloads.
A malware campaign reportedly named Hades injects text referencing biological and nuclear weapons into its code to trigger the safety failsafe mechanisms of AI-based malware scanners, causing the scanners to halt analysis before reaching the actual malicious payload.
Why it matters
It shows attackers can weaponize an LLM's own safety guardrails to evade AI-driven security scanning, blinding defensive tools to embedded payloads.