Petri 2.0 Released

Anthropic@AnthropicAIJan 22, 2026

Since release, Petri, our open-source tool for automated alignment audits, has been adopted by research groups and trialed by other AI developers. We're now releasing Petri 2.0, with improvements to counter eval-awareness and expanded seeds covering a wider range of behaviors. https://t.co/OM8n7OvHJq

Views146.7k

Comments58

Reposts72

Likes784

Launched Jan 22, 2026View post

More from Anthropic

Project Glasswing launched with Claude Mythos Preview

@AnthropicAI

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. https://t.co/NQ7IfEtYk7

Apr 7, 2026

Views31.54M

Comments1.97k

Reposts6.63k

View post

Anthropic raises $30B at $380B post-money

@AnthropicAI

We’ve raised $30B in funding at a $380B post-money valuation. This investment will help us deepen our research, continue to innovate in products, and ensure we have the resources to power our infrastructure expansion as we make Claude available everywhere our customers are.

Feb 12, 2026

Views7.23M

Comments1.06k

Reposts1.00k

View post

Project Glasswing

@AnthropicAI

Last month we launched Project Glasswing, our collaborative AI cybersecurity initiative. Since then, we and our partners have found more than ten thousand high- or critical-severity vulnerabilities in essential software.

May 22, 2026

Views2.76M

Comments518

Reposts649

View post

Anthropic Claude Opus 4.6 launch

@AnthropicAI

When we released Claude Opus 4.5, we knew future models would be close to our AI Safety Level 4 threshold for autonomous AI R&D. We therefore committed to writing sabotage risk reports for future frontier models. Today we’re delivering on that commitment for Claude Opus 4.6.

Feb 10, 2026

Views2.69M

Comments330

Reposts401

View post

See all Anthropic launches →