AlejandroOlmedo/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx Text Generation • Updated Feb 23 • 24 • 3
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch8-numinamath Text Generation • Updated Feb 13 • 13 • 1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch16-lora-numinamath Text Generation • Updated Feb 14 • 15 • 1
Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch1-8192 Text Generation • Updated Feb 27 • 17 • 1