Petri 2.0 Released

Anthropic (@AnthropicAI) · Jan 22, 2026

Since release, Petri, our open-source tool for automated alignment audits, has been adopted by research groups and trialed by other AI developers. We're now releasing Petri 2.0, with improvements to counter eval-awareness and expanded seeds covering a wider range of behaviors. https://t.co/OM8n7OvHJq

Views: 146.7k
Comments: 58
Reposts: 72
Likes: 784
Launched Jan 22, 2026

More from Anthropic

1

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. https://t.co/NQ7IfEtYk7

Apr 7, 2026
Views: 31.54M
Comments: 1.97k
Reposts: 6.63k

2

We’ve raised $30B in funding at a $380B post-money valuation. This investment will help us deepen our research, continue to innovate in products, and ensure we have the resources to power our infrastructure expansion as we make Claude available everywhere our customers are.

Feb 12, 2026
Views: 7.23M
Comments: 1.06k
Reposts: 1.00k

3

Last month we launched Project Glasswing, our collaborative AI cybersecurity initiative. Since then, we and our partners have found more than ten thousand high- or critical-severity vulnerabilities in essential software.

May 22, 2026
Views: 2.76M
Comments: 518
Reposts: 649

4

When we released Claude Opus 4.5, we knew future models would be close to our AI Safety Level 4 threshold for autonomous AI R&D. We therefore committed to writing sabotage risk reports for future frontier models. Today we’re delivering on that commitment for Claude Opus 4.6.

Feb 10, 2026
Views: 2.69M
Comments: 330
Reposts: 401
