Commit History
revert search filter
d5ae319
Remove filtering
e344502
Fix bug
ed6229f
xeon27
Remove hub license column
2b8ba97
xeon27
Fix bug
8ad1a09
xeon27
Add separate tab for agentic benchmark
1d1f5e9
xeon27
Use dash symbol for markdown
0796d85
xeon27
Use dash symbol for markdown
a319d81
xeon27
Fix bug
e7a2635
xeon27
Log df shape
f066ed8
xeon27
Fix bug
b1f9063
xeon27
Log df shape
116683a
xeon27
Add '-' for empty results
8555000
xeon27
Keep empty results
26ed691
xeon27
Keep empty results
7dd6db3
xeon27
Fix bug
323e17d
xeon27
Fix bug
e7fe9f8
xeon27
Change nomenclature to single-turn
eb538cb
xeon27
Remove column for average
a2f2df3
xeon27
Replace missing values by None
18638a9
xeon27
Change extension of web log file to .eval
cd53742
xeon27
Fix bug when average is unticked
bb1f554
xeon27
Add script for log file map creation
ca5fb3d
xeon27
Add new tasks
6eaffc5
xeon27
Add relevant model links
5438c77
xeon27
Update new log files
e1d7bbb
xeon27
Add task link in description
ba14348
xeon27
[WIP] Add task link in description
6410971
xeon27
[WIP] Add task link in description
159e996
xeon27
[WIP] Add task link in description
fcd47ae
xeon27
Remove links to col names due to issues
cdca101
xeon27
Make task names clickable and link to inspect-evals repo
36244aa
xeon27
Make task names clickable and link to inspect-evals repo
15e5347
xeon27
Clean up
2a314d2
xeon27
Change file location
649ac05
xeon27
Fix bug
a2189ab
xeon27
Fix bug
ca19cea
xeon27
Fix bug
346f5e5
xeon27
Make values clickable
bbde2b0
xeon27
Debug
2c5e9d1
xeon27
Debug
c054278
xeon27
Debug
7c6bd6c
xeon27
Debug
3a37ec7
xeon27
Debug
d7d56ae
xeon27
Fix errors
b724ea6
xeon27
Fix errors
40604d2
xeon27
Fix errors
b4ee931
xeon27
Fix errors
6085466
xeon27
Fix errors
a542240
xeon27