PRBench
Description
Professional Reasoning Bench (PRBench) is a realistic, open-ended, rubric-based benchmark for evaluating models on economically consequential professional tasks in Finance and Law. It comprises 1,100 expert-authored tasks with 19,356 expert-curated criteria, contributed by 182 qualified professionals across 114 countries and 47 US jurisdictions. Its hard subsets expose substantial model weaknesses.
Implementations (1)
| Environment | Stars | Last Updated | |
|---|---|---|---|
| | 0 | 1 month ago | |