Anthropic's new Claude Opus 4.5 model achieved 80.9% on SWE-bench and scored higher than human candidates on a performance ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible resultsResults that may be inaccessible to you are currently showing.
Hide inaccessible results