Top average (agent and edit) LiveSWEBench score by EOY2025?
Ṁ552 · Dec 31
LiveSWEBench (https://liveswebench.ai/) is a benchmark designed to evaluate the software engineering capabilities of AI agent applications.
This question asks about the top average score across both "Agentic Programming" and "Target Editing" combined. As of 1 April 2025, the top score is 47.83 (SWE-Agent with Claude Sonnet 3.7).
Resolution will be judged according to the official leaderboard.
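For anyone wanting to sanity-check the combined figure, here is a rough sketch of how a top average over the two categories might be computed. It assumes the average is a plain unweighted mean of the two category scores and that entries missing either score are skipped; both are my assumptions, not something the leaderboard documents here. The tool names and numbers are made-up placeholders, not real LiveSWEBench results.

```python
from statistics import mean

# Hypothetical leaderboard rows: (tool, agentic score, editing score).
# Placeholder values only; these are NOT real LiveSWEBench results.
rows = [
    ("tool-a", 50.0, 40.0),
    ("tool-b", 45.0, 48.0),
    ("tool-c", 60.0, None),  # no score in one category
]


def top_combined_average(rows):
    """Return (tool, average) for the highest mean over both categories.

    Assumption: unweighted mean; rows missing either score are excluded.
    """
    scored = [
        (tool, mean([agentic, editing]))
        for tool, agentic, editing in rows
        if agentic is not None and editing is not None
    ]
    return max(scored, key=lambda pair: pair[1])


best_tool, best_avg = top_combined_average(rows)
print(best_tool, best_avg)
```

If the official leaderboard reports its own average column, that number takes precedence over anything computed this way.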