Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
allenai
's Collections
OLMo Suite
Paloma
Tulu V2 Suite
Reward Bench
WildBench
Reward Bench
updated
Mar 20
Datasets, spaces, and models for the reward model benchmark!
Upvote
2
allenai/reward-bench
Viewer
•
Updated
11 days ago
•
4.23k
•
41
Running
113
📐
Reward Bench Leaderboard
allenai/preference-test-sets
Viewer
•
Updated
Mar 14
•
742
•
16
allenai/reward-bench-results
Updated
about 3 hours ago
•
2
•
2
Upvote
2
Share collection
View history
Collection guide
Browse collections