Cua-Bench launches with SnorkelAI
1/ Today we're launching Cua-Bench with @SnorkelAI: a benchmark for computer-use agents on professional software, open for any model to run. The benchmark covers 25 expert-authored KiCad tasks, and the best frontier model we tested cleared only 6 of them. https://t.co/AM7EIotW6F
Views13.5k
Comments6
Reposts18
Likes79
Launched Jun 15, 2026View post
