SWE-Atlas by Scale AI

Scale AI (@scale_AI), Mar 4, 2026

Introducing SWE-Atlas. We built SWE-Atlas as the next evolution of SWE-Bench Pro, expanding agent evaluation beyond change accuracy to better reflect the real, interactive workflows that define software development. Results for Codebase QnA, the first eval under SWE-Atlas that

Views: 58.0k
Comments: 18
Reposts: 53
Likes: 528
Launched Mar 4, 2026

More from Scale AI

Breaking: @AIatMeta just released Muse Spark, now live across @ScaleAILabs leaderboards. Here’s how it stacks up:
Tied for 🥇 on SWE-Bench Pro
Tied for 🥇 on HLE
Tied for 🥇 on MCP Atlas
Tied for 🥇 on PR Bench - Legal
Tied for 🥈 on SWE Atlas Test Writing
🥈 on PR Bench - Finance
https://t.co/GAJQQRTEIX

Apr 8, 2026
Views: 21.9k
Comments: 7
Reposts: 20
