Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
jwilles/leaderboard-test
vector-institute
/
eval-leaderboard
like
4
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
391dded
eval-leaderboard
Commit History
Fix bug
391dded
xeon27
commited on
Jan 27
Add model name links and change single-turn to base
9c55d6d
xeon27
commited on
Jan 27
Dynamic column widths
84a3b7a
xeon27
commited on
Jan 27
Dynamic column widhts
9a48230
xeon27
commited on
Jan 27
Remove commented code
aa87c61
xeon27
commited on
Jan 27
Change .eval path
37ebe4e
jwilles
commited on
Jan 27
Dynamic column widths
ff37eb7
xeon27
commited on
Jan 27
Dynamic column widths
ae8fcfd
xeon27
commited on
Jan 27
Dynamic column widths
4a65236
xeon27
commited on
Jan 27
Set column width
f00efbc
xeon27
commited on
Jan 27
Add datatype
bb10943
xeon27
commited on
Jan 27
Remove search and select section
4410a31
xeon27
commited on
Jan 27
Revert
c05a723
jwilles
commited on
Jan 27
revert search filter
d5ae319
jwilles
commited on
Jan 27
Remove filtering
e344502
jwilles
commited on
Jan 27
Fix bug
ed6229f
xeon27
commited on
Jan 24
Remove hub license column
2b8ba97
xeon27
commited on
Jan 24
Fix bug
8ad1a09
xeon27
commited on
Jan 24
Add separate tab for agentic benchmark
1d1f5e9
xeon27
commited on
Jan 24
Use dash symbol for markdown
0796d85
xeon27
commited on
Jan 24
Use dash symbol for markdown
a319d81
xeon27
commited on
Jan 24
Fix bug
e7a2635
xeon27
commited on
Jan 24
Log df shape
f066ed8
xeon27
commited on
Jan 24
Fix bug
b1f9063
xeon27
commited on
Jan 24
Log df shape
116683a
xeon27
commited on
Jan 24
Add '-' for empty results
8555000
xeon27
commited on
Jan 24
Keep empty results
26ed691
xeon27
commited on
Jan 24
Keep empty results
7dd6db3
xeon27
commited on
Jan 24
Fix bug
323e17d
xeon27
commited on
Jan 24
Fix bug
e7fe9f8
xeon27
commited on
Jan 24
Change nomenclature to single-turn
eb538cb
xeon27
commited on
Jan 24
Remove column for average
a2f2df3
xeon27
commited on
Jan 24
Replace missing values by None
18638a9
xeon27
commited on
Jan 24
Change extension of web log file to .eval
cd53742
xeon27
commited on
Jan 24
Fix bug when average is unticked
bb1f554
xeon27
commited on
Jan 21
Add script for log file map creation
ca5fb3d
xeon27
commited on
Jan 21
Add new tasks
6eaffc5
xeon27
commited on
Jan 21
Add relevant model links
5438c77
xeon27
commited on
Jan 21
Update new log files
e1d7bbb
xeon27
commited on
Jan 21
Add task link in description
ba14348
xeon27
commited on
Jan 21
[WIP] Add task link in description
6410971
xeon27
commited on
Jan 21
[WIP] Add task link in description
159e996
xeon27
commited on
Jan 21
[WIP] Add task link in description
fcd47ae
xeon27
commited on
Jan 21
Remove links to col names due to issues
cdca101
xeon27
commited on
Jan 21
Make task names clickable and link to inspect-evals repo
36244aa
xeon27
commited on
Jan 21
Make task names clickable and link to inspect-evals repo
15e5347
xeon27
commited on
Jan 21
Clean up
2a314d2
xeon27
commited on
Jan 21
Change file location
649ac05
xeon27
commited on
Jan 21
Fix bug
a2189ab
xeon27
commited on
Jan 21
Fix bug
ca19cea
xeon27
commited on
Jan 21
Previous
1
2
Next