4.5 out of 5 stars (10). Excellent.

EVMbench

Open-source benchmark evaluating AI agents' ability to detect, patch, and exploit smart contract vulnerabilities on the Ethereum Virtual Machine.

Free · 1 min
Updated February 20, 2026 · Certified
API · Open Source · Free Tier

What is EVMbench?

EVMbench is an open-source benchmark framework launched by OpenAI and Paradigm in February 2026 that evaluates how well AI agents can analyze, detect, patch, and exploit smart contract vulnerabilities. The benchmark draws on 120 curated vulnerabilities from 40 real-world audits and security competitions, including scenarios from the Tempo blockchain. It measures three capability modes: Detect (vulnerability auditing), Patch (vulnerability remediation), and Exploit (end-to-end attack execution in a sandboxed environment). EVMbench aims to encourage the use of AI defensively to audit and strengthen deployed smart contracts that secure over $100B in crypto assets.
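To make the three modes concrete, here is a minimal sketch of how per-mode results might be tabulated. It is not EVMbench's actual code or data format: the names `Mode`, `TaskResult`, and `pass_rate_by_mode`, and the idea of a single pass/fail flag per task, are assumptions made for illustration only.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    """The three capability modes named in the description above."""
    DETECT = "detect"
    PATCH = "patch"
    EXPLOIT = "exploit"


@dataclass(frozen=True)
class TaskResult:
    """One graded attempt. All fields are hypothetical, not EVMbench's schema."""
    task_id: str
    mode: Mode
    passed: bool


def pass_rate_by_mode(results):
    """Return the fraction of passed tasks for each mode present in results."""
    totals = defaultdict(int)
    passes = defaultdict(int)
    for result in results:
        totals[result.mode] += 1
        passes[result.mode] += int(result.passed)
    return {mode: passes[mode] / totals[mode] for mode in totals}
```

A report built this way would list one rate per mode, which keeps auditing, remediation, and attack execution from being blended into a single score.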

Best for: AI agent evaluation · Security research · DeFi security benchmarking


Developer Stack Fit

Engineering evaluation

Quick read on where EVMbench fits in a software team's AI stack. Validate final fit against your codebase, data policy, and deployment model.

Methodology
Stack layer: AI Security
Deployment model: Open-source deployable
Open-source status: Yes or source-available
API support: API or integration-friendly
MCP support: No MCP signal found
Security posture: Stronger controls worth validating
Best use case: AI agent evaluation

Key Features

  1. Detect mode: AI agents audit smart contracts and are scored on vulnerability recall.
     First benchmark for AI smart contract security capabilities.

  2. Patch mode: AI agents modify vulnerable contracts while preserving functionality.
     Real-world vulnerabilities from professional audits.

  3. Exploit mode: AI agents execute end-to-end fund-draining attacks in a sandbox.
     Three evaluation modes covering the full security workflow.

  4. 120 curated vulnerabilities from 40 real-world audits.
     Open-source and freely available.

  5. Scenarios from the Tempo blockchain for payment-oriented contracts.

  6. Automated task-auditing agents for quality control.

  7. Custom graders and red-teaming to prevent exploitation of the graders.
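The Detect and Patch criteria in the list above can be sketched in a few lines. This is an illustration under stated assumptions, not EVMbench's grader: the function names are hypothetical, findings are matched by exact identifier (real grading likely needs more judgment than string equality), and the patch rule assumes that "preserving functionality" means the functional tests still pass while the original exploit no longer works.

```python
def detection_recall(known_findings, reported_findings):
    """Fraction of known vulnerabilities that the agent reported.

    Assumption: findings are comparable by exact identifier.
    """
    known = set(known_findings)
    if not known:
        raise ValueError("known_findings must not be empty")
    return len(known & set(reported_findings)) / len(known)


def patch_accepted(functional_tests_pass, exploit_still_succeeds):
    """Hypothetical acceptance rule: behavior is kept and the exploit is blocked."""
    return functional_tests_pass and not exploit_still_succeeds
```

Recall counts only the known vulnerabilities that were found, so extra reports that match nothing known neither raise nor lower the score in this sketch.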


Pros & Cons

What stands out

  • First-of-its-kind benchmark for AI security capabilities in DeFi
  • Based on real audit data, not synthetic vulnerabilities
  • Encourages defensive use of AI for contract auditing
  • Open-source framework for researchers and developers
  • Covers the full security workflow (detect, patch, exploit)

Watch outs

  • Research benchmark, not a production security tool
  • Limited to EVM-compatible contracts
  • Exploit mode is for evaluation only, not actual attacks
  • Requires AI agents to run the benchmark

Pricing Plans

EVMbench Pricing


Open Source: Free

  • Free and open-source benchmark framework
  • 120 curated smart contract vulnerabilities
  • Three evaluation modes: Detect, Patch, Exploit
  • Sandboxed testing environment
  • Red-teamed to prevent grader cheating
  • Built on real audit data from 40 audits


Most teams land on the Open Source plan.


Alternatives

Tool                 Rating  Price
EVMbench (current)   4.5     Free
DeerFlow             4.7     Free
Cursor               4.8     Freemium
Entire Checkpoints   4.3     Free
OpenCode             4.6     Freemium
DiffSense            4.4     Free

FAQ

What is EVMbench and how does it work?

EVMbench is an open-source benchmark that evaluates AI agents' ability to detect, patch, and exploit smart contract vulnerabilities on the Ethereum Virtual Machine. It runs agents against 120 curated vulnerabilities in three modes: Detect, Patch, and Exploit, with Exploit attacks executed in a sandboxed environment.

Is EVMbench free to use?

Yes. EVMbench is free and open source, so you can get started without paying anything.

Is there a free plan or trial?

No trial is needed. The only plan listed for EVMbench is the free Open Source plan, and no paid plans are listed.

What can EVMbench do?

  • Evaluating AI agents' smart contract security capabilities
  • Research into AI-powered DeFi security auditing
  • Benchmarking AI models for vulnerability detection
  • Training AI agents for defensive security tasks
  • Assessing AI readiness for blockchain security


Affiliate Disclosure: We may earn a commission when you purchase through links on our site. This doesn't affect our editorial independence or the price you pay.
