Claude Mythos was developed by Anthropic as a protective measure for the internet, and Anthropic is now investigating whether there was an unauthorized attempt to access it. The Claude Mythos leak, reported by Bloomberg on April 21, has raised urgent questions about third-party vendor security, the safety of AI containment, and what happens when something described as “too dangerous to release” actually gets out.
What is the Claude Mythos leak and why does it matter?
Claude Mythos is unlike any other artificial intelligence Anthropic has built. The company describes Mythos as a cutting-edge, general-purpose cybersecurity model capable of detecting vulnerabilities in virtually any software package at a level that exceeds almost all human experts. Anthropic judged the risks so significant that it chose not to release Mythos to the public at all.
Instead, Anthropic launched Project Glasswing on April 7, 2026, granting limited access to a small group of trusted organizations, including Apple, Google, Microsoft, Amazon Web Services, Cisco, CrowdStrike, NVIDIA, JPMorgan Chase, Palo Alto Networks, Broadcom, and the Linux Foundation. The goal: let digital defenders identify vulnerabilities before digital attackers can exploit them.
Now that strategy is facing its first significant test.
How did unauthorized users access Claude Mythos Preview?
According to TechCrunch's reporting, members of a private online group reportedly gained access to Claude Mythos Preview through a third-party vendor's environment.
The group reportedly includes an employee of a current Anthropic contractor, and it obtained unauthorized access to the model using methods that required little technical know-how, such as educated guesses about where Anthropic hosts its models online, informed by prior experience with other Anthropic models.
The group reached the model on the same day Anthropic announced Mythos to the public. Its members communicate in a Discord channel devoted to locating AI models that are unreleased or in limited release, and since gaining access they have been using Mythos regularly, providing Bloomberg with screenshots and a live demonstration as proof.
What did Anthropic say about the security incident?
Anthropic says it is looking into the matter, telling TechCrunch that it is investigating a purported breach of Claude Mythos Preview resulting from unauthorized access through one of its vendors. So far, the company has found no evidence that the alleged breach affected its own business systems; it appears limited to the preview environment, reached through vendor channels rather than Anthropic's internal infrastructure. For a broader look at how this incident connects to other recent Anthropic leaks and developments, including the Claude Code exposure, read our full coverage of the April 2026 Anthropic news cycle.
That distinction matters: while Anthropic's internal systems appear intact, the whole arrangement was designed to keep the model out of the hands of bad actors, and the people who reached the preview environment were never supposed to have access to it.
Why is Claude Mythos considered so dangerous?
In short, Mythos Preview identifies and exploits vulnerabilities faster and more easily than most people can on their own.
During testing, Mythos Preview autonomously found thousands of critical vulnerabilities across major operating systems and web browsers. The Anthropic security team also indicated that the model could chain multiple vulnerabilities together, turning otherwise trivial flaws into complex attack chains.
A member of the Glasswing program reported that, over a few weeks, Mythos helped them find more bugs than they had found cumulatively over their entire career. Anthropic has been clear about how these capabilities came about: the company says it did not explicitly train Mythos for offensive use. The offensive abilities emerged as an unintended consequence of broader improvements in code understanding, reasoning, and autonomous operation. The same qualities that make Mythos useful for fixing security vulnerabilities also make it useful for exploiting them.
Project Glasswing was established precisely because of the risk that Mythos could be misappropriated. As part of the program, Anthropic has committed $100 million in usage credits to partner organizations and $4 million in cash grants to open-source software security organizations.
Project Glasswing vs. an open breach: what is the real risk here?
Bloomberg reported that the group said it wants to explore new paradigms rather than cause harm. That may be true. Even so, the incident exposes a structural flaw in how controlled AI releases are designed.
Here is a breakdown of what each scenario looks like:
| Scenario | Access Type | Risk Profile |
|---|---|---|
| Project Glasswing partner | Authorized, vetted | Monitored, defensive use |
| Unauthorized forum group | Via vendor exploit | Unmonitored, unknown intent |
| General public release | Open access | Widespread, uncontrollable |
Anthropic designed Project Glasswing to avoid the third row; the second row is what happened anyway.
Security professionals have long understood that a head start for defenders is not a permanent advantage. The Cloud Security Alliance recently issued guidance advising chief information security officers (CISOs) to brace for a surge of newly discovered vulnerabilities as Mythos-like AI models become more prevalent. Whenever a tool of this strength appears, it tends to be duplicated quickly.
What does the Claude Mythos breach mean for AI cybersecurity going forward?
The Mythos incident is not a standalone event. Shortly after the Glasswing announcement, OpenAI rolled out an AI security tool of its own with capabilities very similar to those of Mythos. The industry's rollout strategy increasingly appears to be limiting the public release of powerful AI models until safety controls can be developed to match their capabilities. You can see how these strategies compare across leading models in our detailed Claude vs ChatGPT comparison for 2026.
Enterprises and their security teams can expect several practical implications from this incident: they will need to shorten patch cycles and detect newly disclosed vulnerabilities faster. Teams that stick with traditional patch-management processes will be left behind. A sketch of what automating that detection can look like follows below.
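As one illustration only (not a workflow Anthropic or the Glasswing partners have published), here is a hedged sketch of what faster vulnerability detection can look like in practice: a small script that polls the public NVD CVE feed for entries published in the last day that match products a team cares about. The endpoint and query parameters reflect the NVD REST API 2.0 as generally documented; the watchlist keywords and severity threshold are illustrative assumptions to adapt to your own stack.

```python
"""Minimal sketch: poll the NVD CVE feed for newly published entries.

Assumptions (not from the article): the NVD REST API 2.0 endpoint and its
keywordSearch / pubStartDate / pubEndDate parameters; the watchlist and the
severity threshold below are placeholders, not recommendations.
"""
from datetime import datetime, timedelta, timezone

import requests

NVD_URL = "https://services.nvd.nist.gov/rest/json/cves/2.0"
WATCHLIST = ["openssl", "chromium", "linux kernel"]  # illustrative only
MIN_SCORE = 7.0  # flag high/critical findings only


def recent_cves(keyword: str, days: int = 1) -> list[dict]:
    """Return CVE records published in the last `days` days for one keyword."""
    end = datetime.now(timezone.utc)
    start = end - timedelta(days=days)
    params = {
        "keywordSearch": keyword,
        "pubStartDate": start.strftime("%Y-%m-%dT%H:%M:%S.000"),
        "pubEndDate": end.strftime("%Y-%m-%dT%H:%M:%S.000"),
    }
    resp = requests.get(NVD_URL, params=params, timeout=30)
    resp.raise_for_status()
    return resp.json().get("vulnerabilities", [])


def base_score(record: dict) -> float:
    """Pull a CVSS base score out of an NVD record, if one is present."""
    metrics = record.get("cve", {}).get("metrics", {})
    for key in ("cvssMetricV31", "cvssMetricV30", "cvssMetricV2"):
        for entry in metrics.get(key, []):
            return entry["cvssData"]["baseScore"]
    return 0.0


if __name__ == "__main__":
    for keyword in WATCHLIST:
        for record in recent_cves(keyword):
            score = base_score(record)
            if score >= MIN_SCORE:
                print(f"{record['cve']['id']}  score={score}  matched '{keyword}'")
```

In a real deployment a team would route these hits into an existing alerting or ticketing pipeline rather than printing them; the point is simply that watching for new disclosures can be automated at the faster cadence the incident suggests will be necessary.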
The breach highlights the well-known weak point of any restricted-access program, including partnerships with trusted third-party vendors: it is relatively easy to control what a limited number of partners do with a sensitive tool, but far harder to control every contractor, subcontractor, and employee of those partners.
Key takeaways
- The Claude Mythos leak demonstrates an essential tension in sophisticated AI: the same systems built to protect can also be used to attack.
- Anthropic deliberately kept Claude Mythos restricted because of the dangers a public release would pose.
- Project Glasswing was meant to provide a controlled, tightly scoped way to put Claude Mythos to defensive use.
- Despite Project Glasswing's safeguards, the leak shows that restricting access to AI of this magnitude is far harder in practice than in theory.
- The investigation surrounding this breach is still ongoing.
- Anthropic has not publicly disclosed the full extent of the breach or how long it went on before it was reported. What has been reported is that unauthorized users accessed Claude Mythos, a highly sensitive AI system considered too dangerous for public release.
- The controlled-access channel intended to prevent exactly this kind of misuse ended up being the vector for unauthorized access to Claude Mythos, raising serious questions about how developers can effectively control advanced AI systems.