AlejandroOlmedo/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx Text Generation • Updated Feb 23 • 18 • 3
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch8-numinamath Text Generation • Updated Feb 13 • 17 • 1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch16-lora-numinamath Text Generation • Updated Feb 14 • 19 • 1
Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch1-8192 Text Generation • Updated Feb 27 • 7 • 1
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner Text Generation • Updated 19 days ago • 167 • 1