SWE-Bench-Verified

Description

SWE-Bench Verified is a human-validated subset of the SWE-Bench benchmark that evaluates an AI model's ability to resolve real-world software issues drawn from GitHub repositories.

Implementations (1)
Environment                            Stars   Last Updated
GeneralReasoning/SWE-Bench-Verified    0       1 month ago