dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated 9 days ago • 1k • 60
dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated 9 days ago • 1k • 60
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated 9 days ago • 500 • 110
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated 9 days ago • 500 • 110
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated 16 days ago • 500 • 85
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated 16 days ago • 500 • 85
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_8192 Viewer • Updated 19 days ago • 12k • 85
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_8192 Viewer • Updated 19 days ago • 12k • 85
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated 19 days ago • 12k • 137
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated 19 days ago • 12k • 137
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_32768 Viewer • Updated 20 days ago • 12k • 54
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_32768 Viewer • Updated 20 days ago • 12k • 54
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published about 1 month ago • 129
Running on Zero 58 58 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 Generate speech from text using reference audio