XBOW Confirms Mythos Tops Vulnerability Hunting, but Costs 5x More Than Opus

1 articles · Updated · SecurityWeek · May 14

XBOW found Anthropic’s Mythos Preview delivers a clear step up in vulnerability detection, especially when it can inspect both live applications and source code rather than code alone.
Tests showed more mixed performance outside pure bug hunting: Mythos rejected false positives better than earlier models, but sometimes missed true positives and needed precise prompts for best results.
Reverse-engineering and native-code results were strong, with XBOW saying the model could triage its own findings, assess rival-model results and reason through unusual firmware and embedded-system contexts.
Cost emerged as a key constraint: Anthropic says Mythos will be about 5x as expensive as Opus, and XBOW found that on a fixed token budget GPT5.5 beat Mythos on web-vulnerability benchmarks.
The upshot is that Mythos appears best suited for high-accuracy source-code audits, while buyers weighing broader use cases may have to balance its top-end capability against weaker cost efficiency.

Is Anthropic's restricted release of its hacking AI a responsible act of safety or a move to create an exclusive market?

If AI can find 27-year-old security flaws in hours, does this mark the end of traditional human-led cybersecurity?

AI Vulnerability Discovery Surges: Claude Mythos Preview Uncovers Thousands of Critical Flaws, Forcing a Cybersecurity Paradigm Shift

Overview

The emergence of advanced AI models like Anthropic's Claude Mythos Preview has significantly reshaped the cybersecurity landscape. These frontier models, released in limited capacity, mark a pivotal leap in AI-driven offensive security. As a result, the time between discovering a vulnerability and its exploitation is shrinking rapidly, forcing security teams to adapt quickly. This new generation of AI is set to fundamentally change defender workflows and risk profiles for organizations, making it crucial for security professionals to rethink their strategies and stay ahead in an environment where threats evolve faster than ever before.

...

XBOW Confirms Mythos Tops Vulnerability Hunting, but Costs 5x More Than Opus

AI Vulnerability Discovery Surges: Claude Mythos Preview Uncovers Thousands of Critical Flaws, Forcing a Cybersecurity Paradigm Shift

Overview

Related Stories