[apfma] Schneier: Autonomous AI Hacking

Roger Clarke via apf-media-archive Sat, 18 Oct 2025 10:13:47 -0700

--- Begin Message ---
> In August, Anthropic reported that they disrupted a threat actor thatused Claude, Anthropic’s AI model, to automate the entire cyberattackprocess. It was an impressive use of the AI, which performed networkreconnaissance, penetrated networks, and harvested victims’ credentials.The AI was able to figure out which data to steal, how much money toextort out of the victims, and how to best write extortion emails.
[ https://www.anthropic.com/news/detecting-countering-misuse-aug-2025 ]


Autonomous AI Hacking and the Future of Cybersecurity
Schneier on Security
10 October 2025
https://www.schneier.com/blog/archives/2025/10/autonomous-ai-hacking-and-the-future-of-cybersecurity.html

[ For embedded links, see the original source ]
AI agents are now hacking computers. They’re getting better at allphases of cyberattacks, faster than most of us expected. They can chaintogether different aspects of a cyber operation, and hack autonomously,at computer speeds and scale. This is going to change everything.
Over the summer, hackers proved the concept, industry institutionalizedit, and criminals operationalized it. In June, AI company XBOW took thetop spot on HackerOne’s US leaderboard after submitting over 1,000 newvulnerabilities in just a few months. In August, the seven teamscompeting in DARPA’s AI Cyber Challenge collectively found 54 newvulnerabilities in a target system, in four hours (of compute). Also inAugust, Google announced that its Big Sleep AI found dozens of newvulnerabilities in open-source projects.
It gets worse. In July Ukraine’s CERT discovered a piece of Russianmalware that used an LLM to automate the cyberattack process, generatingboth system reconnaissance and data theft commands in real-time. InAugust, Anthropic reported that they disrupted a threat actor that usedClaude, Anthropic’s AI model, to automate the entire cyberattackprocess. It was an impressive use of the AI, which performed networkreconnaissance, penetrated networks, and harvested victims’ credentials.The AI was able to figure out which data to steal, how much money toextort out of the victims, and how to best write extortion emails.
Another hacker used Claude to create and market his own ransomware,complete with “advanced evasion capabilities, encryption, andanti-recovery mechanisms.” And in September, Checkpoint reported onhackers using HexStrike-AI to create autonomous agents that can scan,exploit, and persist inside target networks. Also in September, aresearch team showed how they can quickly and easily reproduce hundredsof vulnerabilities from public information. These tools are increasinglyfree for anyone to use. Villager, a recently released AI pentesting toolfrom Chinese company Cyberspike, uses the Deepseek model to completelyautomate attack chains.
This is all well beyond AIs capabilities in 2016, at DARPA’s Cyber GrandChallenge. The annual Chinese AI hacking challenge, Robot Hacking Games,might be on this level, but little is known outside of China.
Tipping point on the horizon
AI agents now rival and sometimes surpass even elite human hackers insophistication. They automate operations at machine speed and globalscale. The scope of their capabilities allows these AI agents tocompletely automate a criminal’s command to maximize profit, orstructure advanced attacks to a government’s precise specifications,such as to avoid detection.
In this future, attack capabilities could accelerate beyond ourindividual and collective capability to handle. We have long taken itfor granted that we have time to patch systems after vulnerabilitiesbecome known, or that withholding vulnerability details preventsattackers from exploiting them. This is no longer the case.
The cyberattack/cyberdefense balance has long skewed towards theattackers; these developments threaten to tip the scales completely.We’re potentially looking at a singularity event for cyber attackers.Key parts of the attack chain are becoming automated and integrated:persistence, obfuscation, command-and-control, and endpoint evasion.Vulnerability research could potentially be carried out duringoperations instead of months in advance.
The most skilled will likely retain an edge for now. But AI agents don’thave to be better at a human task in order to be useful. They just haveto excel in one of four dimensions: speed, scale, scope, orsophistication. But there is every indication that they will eventuallyexcel at all four. By reducing the skill, cost, and time required tofind and exploit flaws, AI can turn rare expertise into commoditycapabilities and gives average criminals an outsized advantage.
The AI-assisted evolution of cyberdefense
AI technologies can benefit defenders as well. We don’t know how thedifferent technologies of cyber-offense and cyber-defense will beamenable to AI enhancement, but we can extrapolate a possible series ofoverlapping developments.
Phase One: The Transformation of the Vulnerability Researcher. AI-basedhacking benefits defenders as well as attackers. In this scenario, AIempowers defenders to do more. It simplifies capabilities, providing farmore people the ability to perform previously complex tasks, andempowers researchers previously busy with these tasks to accelerate ormove beyond them, freeing time to work on problems that require humancreativity. History suggests a pattern. Reverse engineering was alaborious manual process until tools such as IDA Pro made the capabilityavailable to many. AI vulnerability discovery could follow a similartrajectory, evolving through scriptable interfaces, automated workflows,and automated research before reaching broad accessibility.
Phase Two: The Emergence of VulnOps. Between research breakthroughs andenterprise adoption, a new discipline might emerge: VulnOps. Largeresearch teams are already building operational pipelines around theirtooling. Their evolution could mirror how DevOps professionalizedsoftware delivery. In this scenario, specialized research tools becomedeveloper products. These products may emerge as a SaaS platform, orsome internal operational framework, or something entirely different.Think of it as AI-assisted vulnerability research available to everyone,at scale, repeatable, and integrated into enterprise operations.
Phase Three: The Disruption of the Enterprise Software Model. Ifenterprises adopt AI-powered security the way they adopted continuousintegration/continuous delivery (CI/CD), several paths open up. AIvulnerability discovery could become a built-in stage in deliverypipelines. We can envision a world where AI vulnerability discoverybecomes an integral part of the software development process, wherevulnerabilities are automatically patched even before reachingproduction -- a shift we might call continuous discovery/continuousrepair (CD/CR). Third-party risk management (TPRM) offers a naturaladoption route, lower-risk vendor testing, integration into procurementand certification gates, and a proving ground before wider rollout.
Phase Four: The Self-Healing Network. If organizations can independentlydiscover and patch vulnerabilities in running software, they will nothave to wait for vendors to issue fixes. Building in-house researchteams is costly, but AI agents could perform such discovery and generatepatches for many kinds of code, including third-party and vendorproducts. Organizations may develop independent capabilities that createand deploy third-party patches on vendor timelines, extending thecurrent trend of independent open-source patching. This would increasesecurity, but having customers patch software without vendor approvalraises questions about patch correctness, compatibility, liability,right-to-repair, and long-term vendor relationships.
These are all speculations. Maybe AI-enhanced cyberattacks won’t evolvethe ways we fear. Maybe AI-enhanced cyberdefense will give uscapabilities we can’t yet anticipate. What will surprise us most mightnot be the paths we can see, but the ones we can’t imagine yet.
This essay was written with Heather Adkins and Gadi Evron, andoriginally appeared in CSO.
--
Roger Clarke                            mailto:[email protected]
T: +61 2 6288 6916   http://www.xamax.com.au  http://www.rogerclarke.com
Xamax Consultancy Pty Ltd 78 Sidaway St, Chapman ACT 2611 AUSTRALIA
Visiting Professorial Fellow                          UNSW Law & Justice
Visiting Professor in Computer Science    Australian National University
--- End Message ---

_______________________________________________
apf-media-archive mailing list
[email protected]
https://lists.privacy.org.au/mailman/listinfo/apf-media-archive

[apfma] Schneier: Autonomous AI Hacking

Reply via email to