Anthropic

Anthropic ships Claude Opus 4.1, claims state-of-the-art on real-world coding

Opus 4.1 posts a 74.5% score on SWE-bench Verified and introduces a new “computer use” beta that lets agents click, type and scroll inside live applications.