Commit History

Add relevant model links
5438c77

xeon27 commited on

Update new log files
e1d7bbb

xeon27 commited on

Add task link in description
ba14348

xeon27 commited on

[WIP] Add task link in description
6410971

xeon27 commited on

[WIP] Add task link in description
159e996

xeon27 commited on

[WIP] Add task link in description
fcd47ae

xeon27 commited on

Remove links to col names due to issues
cdca101

xeon27 commited on

Make task names clickable and link to inspect-evals repo
36244aa

xeon27 commited on

Make task names clickable and link to inspect-evals repo
15e5347

xeon27 commited on

Clean up
2a314d2

xeon27 commited on

Change file location
649ac05

xeon27 commited on

Fix bug
a2189ab

xeon27 commited on

Fix bug
ca19cea

xeon27 commited on

Fix bug
346f5e5

xeon27 commited on

Make values clickable
bbde2b0

xeon27 commited on

Debug
2c5e9d1

xeon27 commited on

Debug
c054278

xeon27 commited on

Debug
7c6bd6c

xeon27 commited on

Debug
3a37ec7

xeon27 commited on

Debug
d7d56ae

xeon27 commited on

Fix errors
b724ea6

xeon27 commited on

Fix errors
40604d2

xeon27 commited on

Fix errors
b4ee931

xeon27 commited on

Fix errors
6085466

xeon27 commited on

Fix errors
a542240

xeon27 commited on

Remove filters and extra columns
a9a4909

xeon27 commited on

Add title and required text
ba2f546

xeon27 commited on

Add tmp code
e004342

xeon27 commited on

Add GAIA and GDM-InterCode-CTF tasks
0dddab1

xeon27 commited on

Add script for refactoring results from log files
8b91831

xeon27 commited on

Remove debug code
40ac9c7

xeon27 commited on

Rename eval name
137473e

xeon27 commited on

Change name of requests file
f270bda

xeon27 commited on

Change name of requests file
4e1b5e4

xeon27 commited on

Debug
dea22be

xeon27 commited on

Create dummy requests dataset
3fe6f00

xeon27 commited on

Change owner, repo and dataset names
f9438e7

xeon27 commited on

Add base eval tasks
006ba57

xeon27 commited on

initial commit
a004e6b
verified

jwilles commited on

Duplicate from demo-leaderboard-backend/leaderboard
4a78d34
verified

jwilles clefourrier HF Staff commited on