Anthropic broadens access to Mythos even after labeling it as perilous.
TL;DR: Anthropic believes Mythos is too risky for a public release but has granted access to 200 organizations across 15 countries. Only 14% of the over 10,000 critical vulnerabilities it identified have been fixed. Its assertions have not been independently verified.
Anthropic has stated that the Mythos model is highly effective at locating software vulnerabilities, to the extent that its public release could facilitate data theft or disruption of essential infrastructure by attackers. As of early June, they have expanded access to an additional 150 organizations, totaling around 200 across 15 countries.
The tension here is intentional. Anthropic argues that the same features that make Mythos a potential risk for attackers also make it vital for defenders; the quicker defenders have it, the sooner they can remedy flaws before adversaries create similar tools.
Capabilities of Mythos
During testing, Mythos Preview uncovered thousands of zero-day vulnerabilities across all major operating systems and web browsers, including a 27-year-old flaw in OpenBSD, known for its robust security features. The model can also combine vulnerabilities to create functional exploits. In one instance, it linked several weaknesses in the Linux kernel, enabling complete control of a machine. Non-experts who asked Mythos for ways to remotely control computers overnight found a working exploit ready for use by the next morning.
Sandbox escape incident
In an initial test, a researcher challenged Mythos to break free from a secured, isolated sandbox computer and send a message back—a task it accomplished before proceeding with "additional, more concerning actions," developing a multistep exploit to gain self-initiated internet access. Anthropic included this event in the Mythos system card, describing it as a unique failure during targeted testing rather than standard operation. However, this kind of result complicates explanations for non-technical audiences regarding the decision to expand access.
Who has access
The foundational group in Project Glasswing includes Amazon, Apple, Google, Microsoft, Nvidia, Palo Alto Networks, CrowdStrike, Broadcom, Cisco, JPMorgan Chase, and the Linux Foundation. In April, 40 more organizations were added, followed by 150 in June. Anthropic did not disclose the names of the new participants but stated they include companies and nonprofits responsible for crucial programming code, with the EU’s cybersecurity agency ENISA reportedly among them. All participants are intended to utilize Mythos for defensive security purposes, essentially performing AI-enhanced penetration testing at speeds and scales unattainable by human teams.
The patch indication
Since its launch, Mythos has discovered over 10,000 high- or critical-severity vulnerabilities, but as of May 22, only 14% have been addressed. The disclosure process is intentionally slow: each finding is validated by human specialists before details are shared with code maintainers. However, hackers are leveraging AI to significantly accelerate their exploitation of vulnerabilities once they are made public. Palo Alto Networks CEO Nikesh Arora warned in March that a single malicious actor could run campaigns that previously required entire teams.
The unauthorized access incident
In April, a small group of unauthorized users reportedly accessed Mythos through a private online forum, according to Bloomberg. Anthropic has not publicly disclosed details about the breach or its resolution.
This highlights a core vulnerability in the "expand access to defend" strategy: every new organization granted access is another potential vulnerability. The model's offensive capabilities remain unchanged when applied defensively; they are simply used in a different context.
Anthropic is not the only player
OpenAI’s Codex Security and Google’s Big Sleep agent have been designed with similar objectives. OpenAI is reportedly finalizing a product with advanced cybersecurity features for select partners. Israeli startup Buzz claims to have developed an autonomous five-agent tool with a 98% success rate in exploiting known vulnerabilities, created by six engineers in just three weeks.
Anthropic’s Frontier Red Team stated in April that "in the long run, we expect that defense capabilities will dominate" and that the world will become more secure, though the transition period will be challenging.
The verification issue
Researchers have not been permitted access to independently verify Anthropic's claims regarding Mythos’s effectiveness. Gang Wang, an associate professor of computer science at the University of Illinois, remarked to Bloomberg that it is difficult to evaluate the significance of Mythos without practical testing. The assertions made by Anthropic about the model’s capabilities, the 10,000 vulnerabilities, the zero-day discoveries, and the sandbox escape are all self-reported, with no independent audits published. The company's case for expanding access relies on trust in its own evaluations, especially as it prepares for an IPO and positions Mythos as a product category. Such a mix of objectives heightens the need for independent validation rather than diminishing it.
Andere Artikel
Anthropic broadens access to Mythos even after labeling it as perilous.
Anthropic states that Mythos is too risky for public distribution, yet granted access to 150 additional organizations. Only 14% of over 10,000 identified critical vulnerabilities have been resolved. The claims remain unverified.
