The system evaluates agents across three distinct modes: vulnerability detection, functional code patching, and end-to-end execution of fund-draining exploits. In recent testing, the GPT-5.3-Codex model achieved a 72.2% success rate on exploit tasks, up from the 31.9% recorded by GPT-5 just six months ago, a gain of roughly 40 percentage points.
“Measuring model capability in this domain helps track emerging cyber risks and highlights the importance of using AI systems defensively to audit and strengthen deployed contracts,” according to the OpenAI announcement.
🧭 FAQs
• How does the system verify if an agent successfully patches code? Automated tests ensure vulnerabilities are eliminated without breaking the contract’s intended functional logic.
• Is there financial support available for researchers using these tools? OpenAI is committing $10 million in API credits to support defensive cybersecurity research.
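
To make the patch check from the first FAQ concrete, here is a minimal Python sketch of the idea: a patch counts only if the intended behavior still works and the known exploit no longer succeeds. This is an illustration under stated assumptions, not the benchmark's actual harness. `ToyVault`, `grade_patch`, `PatchResult`, and the two check functions are hypothetical names invented for this example.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


class ToyVault:
    """Hypothetical stand-in for a contract; not part of any real benchmark."""

    def __init__(self, check_balance: bool):
        self.balances = {"alice": 10}
        self.check_balance = check_balance  # True models the patched version

    def withdraw(self, user: str, amount: int) -> bool:
        # The unpatched version skips this balance check, allowing overdrafts.
        if self.check_balance and self.balances.get(user, 0) < amount:
            return False
        self.balances[user] = self.balances.get(user, 0) - amount
        return True


Check = Callable[[ToyVault], bool]


@dataclass
class PatchResult:
    functional_ok: bool    # every intended-behavior test still passes
    exploit_blocked: bool  # the known exploit no longer succeeds

    @property
    def accepted(self) -> bool:
        return self.functional_ok and self.exploit_blocked


def grade_patch(make_contract: Callable[[], ToyVault],
                functional_tests: Sequence[Check],
                exploit: Check) -> PatchResult:
    # Each check runs against a fresh instance so state cannot leak between them.
    functional_ok = all(test(make_contract()) for test in functional_tests)
    exploit_blocked = not exploit(make_contract())
    return PatchResult(functional_ok, exploit_blocked)


def honest_withdraw_works(vault: ToyVault) -> bool:
    # Intended behavior: a funded user can withdraw part of their balance.
    return vault.withdraw("alice", 5)


def overdraw_exploit(vault: ToyVault) -> bool:
    # Exploit attempt: an unfunded user tries to withdraw funds.
    return vault.withdraw("mallory", 100)


patched = grade_patch(lambda: ToyVault(True), [honest_withdraw_works], overdraw_exploit)
unpatched = grade_patch(lambda: ToyVault(False), [honest_withdraw_works], overdraw_exploit)
```

In this toy setup the patched vault is accepted, while the unpatched one is rejected because the exploit still succeeds even though the functional test passes. A patch that blocked every withdrawal would also be rejected, since it would break the intended behavior.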