Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
lmarena-ai
's Collections
Arena-Hard-Auto
Prompt-to-Leaderboard
Arena-Hard-Auto
updated
15 days ago
An automatic evaluation tool for LLMs.
Upvote
-
Running
2
2
Arena Hard Viewer
⚡
Browse and evaluate model judgments from benchmarks
lmarena-ai/arena-hard-auto
Updated
7 days ago
•
255
Upvote
-
Share collection
View history
Collection guide
Browse collections