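Hard data on AI "dumbing down": Claude Opus 4.5. The Marginlab team runs Claude Code with Opus 4.5 every day against 50 SWE-Bench-Pro problems and tracks the pass rate. Their data show the pass rate falling from 60% in early January to 54% now. That is a drop of 6 percentage points, or a relative decline ("dumbing-down rate") of 10%.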
From X
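
The 10% figure in the post is a relative decline, not an absolute one. A minimal sketch of the arithmetic follows, using only the numbers quoted in the post (60%, 54%, 50 problems). The function name `decline` is illustrative, and the pass counts assume each rate describes a single run of all 50 problems, which the post does not confirm.

```python
def decline(before: float, after: float) -> tuple[float, float]:
    """Return (absolute drop in percentage points, relative drop in percent)."""
    absolute = before - after
    relative = absolute / before * 100
    return absolute, relative


TASKS = 50                   # problems per daily run, as stated in the post
before, after = 60.0, 54.0   # pass rates in percent, as stated in the post

absolute, relative = decline(before, after)

# Assumption: each rate comes from one run over all 50 problems.
passed_before = round(TASKS * before / 100)
passed_after = round(TASKS * after / 100)

print(f"absolute drop: {absolute:.0f} percentage points")
print(f"relative drop: {relative:.0f}%")
print(f"problems passed: {passed_before} -> {passed_after} of {TASKS}")
```

Under that assumption, the gap between the two rates corresponds to three fewer problems solved out of 50.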

