Commit History

Fix bug
391dded

xeon27 commited on

Add model name links and change single-turn to base
9c55d6d

xeon27 commited on

Dynamic column widths
84a3b7a

xeon27 commited on

Dynamic column widhts
9a48230

xeon27 commited on

Remove commented code
aa87c61

xeon27 commited on

Change .eval path
37ebe4e

jwilles commited on

Dynamic column widths
ff37eb7

xeon27 commited on

Dynamic column widths
ae8fcfd

xeon27 commited on

Dynamic column widths
4a65236

xeon27 commited on

Set column width
f00efbc

xeon27 commited on

Add datatype
bb10943

xeon27 commited on

Remove search and select section
4410a31

xeon27 commited on

Revert
c05a723

jwilles commited on

revert search filter
d5ae319

jwilles commited on

Remove filtering
e344502

jwilles commited on

Fix bug
ed6229f

xeon27 commited on

Remove hub license column
2b8ba97

xeon27 commited on

Fix bug
8ad1a09

xeon27 commited on

Add separate tab for agentic benchmark
1d1f5e9

xeon27 commited on

Use dash symbol for markdown
0796d85

xeon27 commited on

Use dash symbol for markdown
a319d81

xeon27 commited on

Fix bug
e7a2635

xeon27 commited on

Log df shape
f066ed8

xeon27 commited on

Fix bug
b1f9063

xeon27 commited on

Log df shape
116683a

xeon27 commited on

Add '-' for empty results
8555000

xeon27 commited on

Keep empty results
26ed691

xeon27 commited on

Keep empty results
7dd6db3

xeon27 commited on

Fix bug
323e17d

xeon27 commited on

Fix bug
e7fe9f8

xeon27 commited on

Change nomenclature to single-turn
eb538cb

xeon27 commited on

Remove column for average
a2f2df3

xeon27 commited on

Replace missing values by None
18638a9

xeon27 commited on

Change extension of web log file to .eval
cd53742

xeon27 commited on

Fix bug when average is unticked
bb1f554

xeon27 commited on

Add script for log file map creation
ca5fb3d

xeon27 commited on

Add new tasks
6eaffc5

xeon27 commited on

Add relevant model links
5438c77

xeon27 commited on

Update new log files
e1d7bbb

xeon27 commited on

Add task link in description
ba14348

xeon27 commited on

[WIP] Add task link in description
6410971

xeon27 commited on

[WIP] Add task link in description
159e996

xeon27 commited on

[WIP] Add task link in description
fcd47ae

xeon27 commited on

Remove links to col names due to issues
cdca101

xeon27 commited on

Make task names clickable and link to inspect-evals repo
36244aa

xeon27 commited on

Make task names clickable and link to inspect-evals repo
15e5347

xeon27 commited on

Clean up
2a314d2

xeon27 commited on

Change file location
649ac05

xeon27 commited on

Fix bug
a2189ab

xeon27 commited on

Fix bug
ca19cea

xeon27 commited on