OpenAI introduces EVMbench to measure AI crypto security. Benchmark evaluates detection, patching and exploit skills. OpenAI has launched a benchmarking system called EVMbench to evaluate how ...
OpenAI and Paradigm unveil EVMbench, a benchmark testing AI agents on smart contract security across 120 high-severity vulnerabilities.
OpenAI's EVMbench tests AI on smart contract security. Claude Opus 4.6 ranked first, beating GPT-5 and Gemini 3 Pro across 120 real crypto vulnerabilities.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results