Commit History
Push new font
6fa5c81
Update about page
8596ab1
Fix bug
140abe6
xeon27
committed on
Fix bug
6bf1f8e
xeon27
committed on
[WIP] Fix bug
cd7b8dd
xeon27
committed on
Fix bug
25f1697
xeon27
committed on
Fix bug
391dded
xeon27
committed on
Add model name links and change single-turn to base
9c55d6d
xeon27
committed on
Remove commented code
aa87c61
xeon27
committed on
Change .eval path
37ebe4e
Remove filtering
e344502
Remove hub license column
2b8ba97
xeon27
committed on
Fix bug
8ad1a09
xeon27
committed on
Add separate tab for agentic benchmark
1d1f5e9
xeon27
committed on
Use dash symbol for markdown
0796d85
xeon27
committed on
Use dash symbol for markdown
a319d81
xeon27
committed on
Fix bug
e7a2635
xeon27
committed on
Log df shape
f066ed8
xeon27
committed on
Fix bug
b1f9063
xeon27
committed on
Log df shape
116683a
xeon27
committed on
Add '-' for empty results
8555000
xeon27
committed on
Keep empty results
26ed691
xeon27
committed on
Keep empty results
7dd6db3
xeon27
committed on
Fix bug
323e17d
xeon27
committed on
Fix bug
e7fe9f8
xeon27
committed on
Change nomenclature to single-turn
eb538cb
xeon27
committed on
Remove column for average
a2f2df3
xeon27
committed on
Replace missing values by None
18638a9
xeon27
committed on
Change extension of web log file to .eval
cd53742
xeon27
committed on
Fix bug when average is unticked
bb1f554
xeon27
committed on
Add new tasks
6eaffc5
xeon27
committed on
Add task link in description
ba14348
xeon27
committed on
[WIP] Add task link in description
6410971
xeon27
committed on
[WIP] Add task link in description
159e996
xeon27
committed on
[WIP] Add task link in description
fcd47ae
xeon27
committed on
Remove links to col names due to issues
cdca101
xeon27
committed on
Make task names clickable and link to inspect-evals repo
36244aa
xeon27
committed on
Make task names clickable and link to inspect-evals repo
15e5347
xeon27
committed on
Clean up
2a314d2
xeon27
committed on
Change file location
649ac05
xeon27
committed on
Fix bug
a2189ab
xeon27
committed on
Fix bug
ca19cea
xeon27
committed on
Fix bug
346f5e5
xeon27
committed on
Make values clickable
bbde2b0
xeon27
committed on
Debug
2c5e9d1
xeon27
committed on
Debug
c054278
xeon27
committed on
Debug
7c6bd6c
xeon27
committed on
Debug
3a37ec7
xeon27
committed on
Debug
d7d56ae
xeon27
committed on