a-F1/DeepSeek-R1-Distill-Qwen-1.5B-Llama3.1-8B-PRM-Deepseek-Data-beam_search-prm-completions Viewer • Updated about 2 hours ago • 8 • 11
a-F1/DeepSeek-R1-Distill-Qwen-7B-Llama3.1-8B-PRM-Deepseek-Data-best_of_n-prm-completions Viewer • Updated 1 day ago • 7 • 20