Results for "benchmarking"

2 tools
Tool
Category
Pricing
Free?
Updated
SWE-bench
Benchmark framework evaluating AI coding agents on real GitHub issues and PRs.
Best for: ai-platform
AI Coding Free Mar 2026
GPT-5.3 Codex
OpenAI's latest coding model with SOTA performance on coding benchmarks.
Best for: coding
AI Coding From $20/mo Mar 2026
Compare
Select 2 tools to compare
Compare →