Anthropic Proposes Industry-Wide Cyber Jailbreak Severity Scale

Anthropic

Research official 1 src. ~1 min

Published July 2, Anthropic detailed Fable 5's four-tier cybersecurity classifier and proposed the Cyber Jailbreak Severity (CJS) scale — CJS-0 through CJS-4 — scoring jailbreaks on capability gain, attack breadth, ease of weaponization, and discoverability. Developed with Project Glasswing partners including Amazon, Microsoft, and Google, and offered for industry-wide adoption.

Why it matters

A shared severity vocabulary for AI jailbreaks mirrors how CVSS scoring standardized traditional vulnerability disclosure. If CJS is adopted across labs, it enables faster coordinated response to safety incidents and gives policymakers a concrete metric.

Importance: 3/5

Cross-industry jailbreak severity scale proposed by Anthropic with Amazon, Microsoft, Google participation; CVSS-like standardization for AI security

Sources