open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/main/aime24/results_2025-05-06T22-15-32.954582.json with huggingface_hub
5b74f16
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.00-step-000001466/gpqa/results_2025-05-06T22-07-51.179646.json with huggingface_hub
3e497c8
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/main/math_500/results_2025-05-06T21-52-38.728289.json with huggingface_hub
76857d6
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/main/gpqa/results_2025-05-06T21-29-57.235131.json with huggingface_hub
673a057
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000330/aime24/results_2025-05-06T20-31-09.679995.json with huggingface_hub
554aedd
verified

edbeeching HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B/main/aime25/results_2025-05-06T20-24-46.037396.json with huggingface_hub
36ffb60
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-8B/main/aime25/results_2025-05-06T20-24-10.707927.json with huggingface_hub
c451187
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000330/math_500/results_2025-05-06T20-20-57.155624.json with huggingface_hub
b0d07d1
verified

edbeeching HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-8B/main/gpqa/results_2025-05-06T20-15-34.877142.json with huggingface_hub
24be10e
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-8B/main/math_500/results_2025-05-06T20-14-28.498938.json with huggingface_hub
6519eb3
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-8B/main/aime24/results_2025-05-06T19-59-15.863148.json with huggingface_hub
aa59cf5
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B/main/gpqa/results_2025-05-06T19-52-12.385212.json with huggingface_hub
382c7e1
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000275/aime24/results_2025-05-06T19-32-21.692273.json with huggingface_hub
3d0a0f4
verified

edbeeching HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B/main/math_500/results_2025-05-06T19-23-25.966981.json with huggingface_hub
a65d6be
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000275/math_500/results_2025-05-06T19-10-29.990230.json with huggingface_hub
cbe66a0
verified

edbeeching HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B/main/aime24/results_2025-05-06T19-03-05.781925.json with huggingface_hub
7845faf
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-05-06T18-04-50.187799.json with huggingface_hub
1eb1d22
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B/main/aime25/results_2025-05-06T17-59-40.854162.json with huggingface_hub
fbdbe8d
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/gpqa/results_2025-05-06T17-54-54.231512.json with huggingface_hub
ef3713a
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.00-step-000000733/lcb_v4/results_2025-05-06T17-44-01.490689.json with huggingface_hub
55e60f4
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.10-step-000000733/lcb_v4/results_2025-05-06T17-37-51.366723.json with huggingface_hub
ef7b81d
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B/main/aime24/results_2025-05-06T17-37-39.965319.json with huggingface_hub
cd96608
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B/main/math_500/results_2025-05-06T17-21-04.183997.json with huggingface_hub
9d7fd9c
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.10-step-000000733/aime24/results_2025-05-06T17-10-53.618759.json with huggingface_hub
f495481
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.00-step-000000733/aime24/results_2025-05-06T17-06-21.835647.json with huggingface_hub
4641509
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.00-step-000000733/gpqa/results_2025-05-06T16-57-48.119472.json with huggingface_hub
b67b3e6
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B/main/gpqa/results_2025-05-06T16-57-26.251799.json with huggingface_hub
52d4a50
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-Math-7B/v00.10-step-000000733/gpqa/results_2025-05-06T16-54-40.527077.json with huggingface_hub
7972cde
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000220/aime24/results_2025-05-06T16-36-35.542246.json with huggingface_hub
058c060
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000220/math_500/results_2025-05-06T16-26-40.068880.json with huggingface_hub
f2ef597
verified

edbeeching HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/aime25/results_2025-05-06T15-26-31.048341.json with huggingface_hub
266217e
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/aime24/results_2025-05-06T15-24-58.691342.json with huggingface_hub
ce7921a
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000165/aime24/results_2025-05-06T14-35-26.950958.json with huggingface_hub
f89cbb1
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000165/math_500/results_2025-05-06T14-25-09.058698.json with huggingface_hub
b6f4941
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000110/aime24/results_2025-05-06T13-13-28.375032.json with huggingface_hub
c025478
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000110/math_500/results_2025-05-06T13-04-18.221463.json with huggingface_hub
0595a86
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.02-step-000000055/aime24/results_2025-05-06T11-57-31.593831.json with huggingface_hub
e076d03
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.02-step-000000055/math_500/results_2025-05-06T11-49-24.767314.json with huggingface_hub
a252a00
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.00-step-000000055/aime24/results_2025-05-06T11-16-59.635585.json with huggingface_hub
b3a76ee
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.00-step-000000055/math_500/results_2025-05-06T11-10-12.651546.json with huggingface_hub
c044e83
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v08.01-step-000000055/math_500/results_2025-05-06T11-05-56.480187.json with huggingface_hub
cbf0041
verified

edbeeching HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/aime25/results_2025-05-05T15-45-06.082025.json with huggingface_hub
4ddb399
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/aime24/results_2025-05-05T15-39-35.769139.json with huggingface_hub
8c3b166
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/gpqa/results_2025-05-05T15-29-20.168420.json with huggingface_hub
069c829
verified

lewtun HF Staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/math_500/results_2025-05-05T15-24-04.880347.json with huggingface_hub
5e41202
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/Qwen2.5-Coder-7B-Instruct-Codeforces-GRPO/v01.00-step-000000960/lcb_v4/results_2025-05-05T12-14-56.954603.json with huggingface_hub
57c0f63
verified

guipenedo HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-7B/v00.01-step-000000733/aime24/results_2025-05-05T09-50-20.992338.json with huggingface_hub
ea5c9d3
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Distill-Qwen-7B/v01.01-step-000001300/gpqa/results_2025-05-05T09-19-57.847824.json with huggingface_hub
7ca9efe
verified

lewtun HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v07.05-step-000000275/math_500/results_2025-05-05T08-39-52.391192.json with huggingface_hub
0a8a8b8
verified

edbeeching HF Staff committed on

Upload eval_results/open-r1/R1-Zero-Qwen-7B-Math/v07.05-step-000000385/math_500/results_2025-05-05T08-39-48.824655.json with huggingface_hub
7b0bd97
verified

edbeeching HF Staff committed on