Commit History

Fix bug
140abe6

xeon27 commited on

Fix bug
6bf1f8e

xeon27 commited on

[WIP] Fix bug
cd7b8dd

xeon27 commited on

Fix bug
25f1697

xeon27 commited on

Fix bug
391dded

xeon27 commited on

Add model name links and change single-turn to base
9c55d6d

xeon27 commited on

Remove commented code
aa87c61

xeon27 commited on

Change .eval path
37ebe4e

jwilles commited on

Remove filtering
e344502

jwilles commited on

Remove hub license column
2b8ba97

xeon27 commited on

Fix bug
8ad1a09

xeon27 commited on

Add separate tab for agentic benchmark
1d1f5e9

xeon27 commited on

Use dash symbol for markdown
0796d85

xeon27 commited on

Use dash symbol for markdown
a319d81

xeon27 commited on

Fix bug
e7a2635

xeon27 commited on

Log df shape
f066ed8

xeon27 commited on

Fix bug
b1f9063

xeon27 commited on

Log df shape
116683a

xeon27 commited on

Add '-' for empty results
8555000

xeon27 commited on

Keep empty results
26ed691

xeon27 commited on

Keep empty results
7dd6db3

xeon27 commited on

Fix bug
323e17d

xeon27 commited on

Fix bug
e7fe9f8

xeon27 commited on

Change nomenclature to single-turn
eb538cb

xeon27 commited on

Remove column for average
a2f2df3

xeon27 commited on

Replace missing values by None
18638a9

xeon27 commited on

Change extension of web log file to .eval
cd53742

xeon27 commited on

Fix bug when average is unticked
bb1f554

xeon27 commited on

Add new tasks
6eaffc5

xeon27 commited on

Add task link in description
ba14348

xeon27 commited on

[WIP] Add task link in description
6410971

xeon27 commited on

[WIP] Add task link in description
159e996

xeon27 commited on

[WIP] Add task link in description
fcd47ae

xeon27 commited on

Remove links to col names due to issues
cdca101

xeon27 commited on

Make task names clickable and link to inspect-evals repo
36244aa

xeon27 commited on

Make task names clickable and link to inspect-evals repo
15e5347

xeon27 commited on

Clean up
2a314d2

xeon27 commited on

Change file location
649ac05

xeon27 commited on

Fix bug
a2189ab

xeon27 commited on

Fix bug
ca19cea

xeon27 commited on

Fix bug
346f5e5

xeon27 commited on

Make values clickable
bbde2b0

xeon27 commited on

Debug
2c5e9d1

xeon27 commited on

Debug
c054278

xeon27 commited on

Debug
7c6bd6c

xeon27 commited on

Debug
3a37ec7

xeon27 commited on

Debug
d7d56ae

xeon27 commited on

Fix errors
b724ea6

xeon27 commited on

Fix errors
40604d2

xeon27 commited on

Fix errors
a542240

xeon27 commited on