Commit History

Run on 40 languages, additional models
260c1a3

David Pomerenke committed on

Shorter classification prompt + error handling
0384b92

David Pomerenke committed on

Implement MMLU task
a683732

David Pomerenke committed on

MMLU data loader for 3 parallel datasets
47170a5

David Pomerenke committed on

Add Global MMLU benchmark
ce2acb0

David Pomerenke committed on

Translation both from and to
731eddd

David Pomerenke committed on

Run on 100 languages, adjust display
8274634

David Pomerenke committed on

spBLEU tokenizer, run on more languages
eaf2d97

David Pomerenke committed on

Refactor eval code into files
da6e1bc

David Pomerenke committed on