new collab from @paradigm and @OpenAI:
evmbench is a benchmark and agent harness for exploiting smart contract bugs
a few months ago, the best models found <20% of critical, fund-draining @Code4rena bugs in our benchmark. today they find > 70%
From X
Disclaimer: The above content reflects only the author's opinion and does not represent any stance of CoinNX, nor does it constitute any investment advice related to CoinNX.


