Commit History

Revert
c05a723

jwilles commited on

revert search filter
d5ae319

jwilles commited on

Remove filtering
e344502

jwilles commited on

Fix bug
ed6229f

xeon27 commited on

Remove hub license column
2b8ba97

xeon27 commited on

Fix bug
8ad1a09

xeon27 commited on

Add separate tab for agentic benchmark
1d1f5e9

xeon27 commited on

Use dash symbol for markdown
0796d85

xeon27 commited on

Use dash symbol for markdown
a319d81

xeon27 commited on

Fix bug
e7a2635

xeon27 commited on

Log df shape
f066ed8

xeon27 commited on

Fix bug
b1f9063

xeon27 commited on

Log df shape
116683a

xeon27 commited on

Add '-' for empty results
8555000

xeon27 commited on

Keep empty results
26ed691

xeon27 commited on

Keep empty results
7dd6db3

xeon27 commited on

Fix bug
323e17d

xeon27 commited on

Fix bug
e7fe9f8

xeon27 commited on

Change nomenclature to single-turn
eb538cb

xeon27 commited on

Remove column for average
a2f2df3

xeon27 commited on

Replace missing values by None
18638a9

xeon27 commited on

Change extension of web log file to .eval
cd53742

xeon27 commited on

Fix bug when average is unticked
bb1f554

xeon27 commited on

Add script for log file map creation
ca5fb3d

xeon27 commited on

Add new tasks
6eaffc5

xeon27 commited on

Add relevant model links
5438c77

xeon27 commited on

Update new log files
e1d7bbb

xeon27 commited on

Add task link in description
ba14348

xeon27 commited on

[WIP] Add task link in description
6410971

xeon27 commited on

[WIP] Add task link in description
159e996

xeon27 commited on

[WIP] Add task link in description
fcd47ae

xeon27 commited on

Remove links to col names due to issues
cdca101

xeon27 commited on

Make task names clickable and link to inspect-evals repo
36244aa

xeon27 commited on

Make task names clickable and link to inspect-evals repo
15e5347

xeon27 commited on

Clean up
2a314d2

xeon27 commited on

Change file location
649ac05

xeon27 commited on

Fix bug
a2189ab

xeon27 commited on

Fix bug
ca19cea

xeon27 commited on

Fix bug
346f5e5

xeon27 commited on

Make values clickable
bbde2b0

xeon27 commited on

Debug
2c5e9d1

xeon27 commited on

Debug
c054278

xeon27 commited on

Debug
7c6bd6c

xeon27 commited on

Debug
3a37ec7

xeon27 commited on

Debug
d7d56ae

xeon27 commited on

Fix errors
b724ea6

xeon27 commited on

Fix errors
40604d2

xeon27 commited on

Fix errors
b4ee931

xeon27 commited on

Fix errors
6085466

xeon27 commited on

Fix errors
a542240

xeon27 commited on