PRBench

Description

Professional Reasoning Bench (PRBench) is a realistic, open-ended, rubric-based benchmark for evaluating models on economically consequential professional tasks in Finance and Law. It comprises 1,100 expert-authored tasks with 19,356 expert-curated criteria contributed by 182 qualified professionals across 114 countries and 47 US jurisdictions, exposing substantial model weaknesses on hard subsets.

Leaderboard
Loading leaderboard...
Implementations (1)
EnvironmentStarsLast Updated
GeneralReasoningGeneralReasoning/PRBench
0
1 months ago
ScaleAI/PRBench | OpenReward