Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
jwilles/leaderboard-test
vector-institute
/
eval-leaderboard
like
4
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
51b158d
eval-leaderboard
/
src
Commit History
Fix bug
51b158d
xeon27
commited on
Jan 31
Change model names to reflect version
954d8ee
xeon27
commited on
Jan 31
Making table links work
8471f6d
Marcelo Lotif
commited on
Jan 30
Removing gray color on tab text
120beea
Marcelo Lotif
commited on
Jan 30
Adding some more styles to the table
5b6e29d
Marcelo Lotif
commited on
Jan 30
Add agentharm and swe-bench tasks
1289818
xeon27
commited on
Jan 29
Table now takes whole screen
d5a5b95
Marcelo Lotif
commited on
Jan 29
Merge branch 'main' into css_improvements
18f8e04
Marcelo Lotif
commited on
Jan 29
White logo and media query for dark theme
853a51e
Marcelo Lotif
commited on
Jan 29
Merge branch 'css_improvements'
3f96538
jwilles
commited on
Jan 29
First few changes
66f1c0c
Marcelo Lotif
commited on
Jan 29
Making it work on python > 3.10
87fef00
Marcelo Lotif
commited on
Jan 28
Add results for GAIA and GDM tasks
2718fde
xeon27
commited on
Jan 28
Styling
1249741
jwilles
commited on
Jan 28
Styling
902d87c
jwilles
commited on
Jan 28
Styling
8248ea0
jwilles
commited on
Jan 28
Styling
9feb151
jwilles
commited on
Jan 28
Styling
1848a94
jwilles
commited on
Jan 28
Styling
74e8b76
jwilles
commited on
Jan 28
Styling
5af940a
jwilles
commited on
Jan 28
styling
c8059ad
jwilles
commited on
Jan 28
Test css
1ce4675
jwilles
commited on
Jan 27
Styling
43ae4c7
jwilles
commited on
Jan 27
Push new font
6fa5c81
jwilles
commited on
Jan 27
Update about page
8596ab1
jwilles
commited on
Jan 27
Fix bug
140abe6
xeon27
commited on
Jan 27
Fix bug
6bf1f8e
xeon27
commited on
Jan 27
[WIP] Fix bug
cd7b8dd
xeon27
commited on
Jan 27
Fix bug
25f1697
xeon27
commited on
Jan 27
Fix bug
391dded
xeon27
commited on
Jan 27
Add model name links and change single-turn to base
9c55d6d
xeon27
commited on
Jan 27
Remove commented code
aa87c61
xeon27
commited on
Jan 27
Change .eval path
37ebe4e
jwilles
commited on
Jan 27
Remove filtering
e344502
jwilles
commited on
Jan 27
Remove hub license column
2b8ba97
xeon27
commited on
Jan 24
Fix bug
8ad1a09
xeon27
commited on
Jan 24
Add separate tab for agentic benchmark
1d1f5e9
xeon27
commited on
Jan 24
Use dash symbol for markdown
0796d85
xeon27
commited on
Jan 24
Use dash symbol for markdown
a319d81
xeon27
commited on
Jan 24
Fix bug
e7a2635
xeon27
commited on
Jan 24
Log df shape
f066ed8
xeon27
commited on
Jan 24
Fix bug
b1f9063
xeon27
commited on
Jan 24
Log df shape
116683a
xeon27
commited on
Jan 24
Add '-' for empty results
8555000
xeon27
commited on
Jan 24
Keep empty results
26ed691
xeon27
commited on
Jan 24
Keep empty results
7dd6db3
xeon27
commited on
Jan 24
Fix bug
323e17d
xeon27
commited on
Jan 24
Fix bug
e7fe9f8
xeon27
commited on
Jan 24
Change nomenclature to single-turn
eb538cb
xeon27
commited on
Jan 24
Remove column for average
a2f2df3
xeon27
commited on
Jan 24
Previous
1
2
Next