OpenAI's GPT-5.5 has matched the cybersecurity capabilities of Anthropic's closely guarded Claude Mythos Preview, the UK AI Security Institute (AISI) said in an evaluation published on Wednesday, making it only the second model to complete an end-to-end simulated corporate network attack.
The finding is significant because Anthropic chose not to release Mythos publicly, describing it as too powerful for general access and restricting it to roughly 50 organisations through its Project Glasswing initiative.
GPT-5.5, by contrast, is already available to millions of ChatGPT subscribers and through OpenAI's API.
AISI tested GPT-5.5 against a suite of 95 capture-the-flag tasks across four difficulty tiers, built in collaboration with cybersecurity firms Crystal Peak Security and Irregular.
At the highest "Expert" level, GPT-5.5 achieved an average success rate of 71.4%, compared to 68.6% for Mythos Preview.
The gap falls within the statistical margin of error, but AISI noted GPT-5.5 "may be the strongest model we have tested."
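AISI's point that the gap falls within the margin of error can be sanity-checked with a standard two-proportion z-test. The sketch below is illustrative only: AISI does not report how many Expert-tier tasks each model faced, so the sample size `n=35` is a hypothetical figure chosen for the example.

```python
import math

def two_proportion_p_value(p1: float, p2: float, n: int) -> float:
    """Two-sided p-value for the difference between two success rates,
    each treated as a proportion over n independent trials."""
    pooled = (p1 + p2) / 2
    se = math.sqrt(pooled * (1 - pooled) * (2 / n))
    z = (p1 - p2) / se
    # Standard normal CDF computed from the error function.
    phi = 0.5 * (1 + math.erf(abs(z) / math.sqrt(2)))
    return 2 * (1 - phi)

# GPT-5.5 (71.4%) vs Mythos Preview (68.6%) at an assumed n=35 tasks.
p_value = two_proportion_p_value(0.714, 0.686, n=35)
print(f"p ≈ {p_value:.2f}")  # far above 0.05: the gap is not significant
```

At any plausible task count in the dozens, a difference of under three percentage points yields a p-value nowhere near conventional significance thresholds, consistent with AISI's caveat.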
Both represent a dramatic leap over the previous generation: GPT-5.4 scored 52.4% and Claude Opus 4.7 reached 48.6% on the same tasks.
The most demanding test was "The Last Ones" (TLO), a 32-step corporate network attack simulation built with SpecterOps that spans four subnets and roughly twenty hosts, requiring the model to chain together reconnaissance, credential theft, lateral movement across multiple Active Directory forests, a supply chain pivot and data exfiltration.
AISI estimates a human expert would need approximately 20 hours to complete the full chain.
GPT-5.5 completed it end-to-end in two out of ten attempts, while Mythos managed three out of ten, both at a budget of 100 million tokens per attempt.
No other model has solved the simulation.
AISI said the results raise the question of whether rising cybersecurity capability is a breakthrough specific to any single model or a broad trend driven by improvements in autonomy and coding. The finding also casts doubt on whether Anthropic's decision to restrict Mythos was necessary, or whether it reflected compute constraints as much as safety concerns.
"The results from GPT-5.5 suggest the latter: a second model, from a different developer, now reaches a similar level of performance," AISI wrote.
Separately, cybersecurity firm XBOW, which received early access to GPT-5.5, reported that the model's miss rate on real-world vulnerability scanning had fallen from 40% in earlier generations to approximately 10%, a level it described as comparable to Mythos.
OpenAI has since announced GPT-5.5-Cyber, a restricted variant available only through its Trusted Access for Cyber programme, alongside $10 million in API credits for security researchers and partnerships with organisations including Trail of Bits.
GPT-5.5 received a "High" classification under OpenAI's own cybersecurity risk framework but remained below the "Critical" threshold, defined as the ability to develop zero-day exploits autonomously without human assistance.
AISI cautioned that its tests were conducted on networks with no active defences, and that future evaluations will need to incorporate hardened environments with real-time monitoring and incident response to keep pace with rapidly advancing model capabilities.
The recap
- The UK AI Security Institute compared OpenAI's GPT-5.5 with Anthropic's restricted Claude Mythos Preview.
- The assessment focused on cybersecurity capabilities rather than general performance.
- AISI published its findings in an evaluation released on Wednesday.