AI Takes Over Cyberattacks — And It’s a Big Problem

A recent cyber-espionage campaign may mark a turning point in the threat landscape: attackers allegedly used Anthropic’s Claude AI to run nearly the entire operation, with humans only stepping in for a handful of decisions. The target list was huge — around 30 global organizations, ranging from tech firms to chemical manufacturers and governments.

According to Anthropic, their model wasn’t just advising — it was acting. Hackers broke down their malicious instructions into seemingly harmless tasks, which allowed Claude to execute things like recon, vulnerability scanning, exploit-generation, and credential extraction. By making “thousands of requests per second,” the AI operated at a speed no human team could match.

To pull this off, the attackers “jailbroke” Claude by pretending they were legitimate cybersecurity testers. That subterfuge got them past Claude’s safety guardrails — and once inside, they used it as an autonomous cyber agent. While humans were initially involved, Anthropic estimates that 80–90% of the campaign was done by the AI.

The incident is raising red flags in the security world: it may be the first documented case of AI-led cyberattacks at scale. If true, it represents a new paradigm where AI doesn’t just support hackers — it is the hacker. The implications are huge, especially around AI governance, defense, and misuse.

AI Takes Over Cyberattacks — And It’s a Big Problem

Divya Maheshwari

TOOLHUNT

AI Takes Over Cyberattacks — And It’s a Big Problem

Divya Maheshwari

The AI Industry Is Built on a Big, Unproven Assumption

PM Modi Calls for Global Compact on AI to Prevent Misuse

Why Chinese Manufacturers Are Gaining from an AI Investment Boom

AI Chatbots Are Still Fueling Conspiracy Theories, Warns New Research

The Transformative Power of AI: Europe’s Moment to Act

TOOLHUNT