Open-Reasoner-Zero/Open-Reasoner-Zero-32B Reinforcement Learning • Updated about 1 month ago • 1.22k • 29
DeepSeek R1 (All Versions) Collection • DeepSeek R1, the most powerful open-source reasoning model, available in GGUF, original, and 4-bit formats. Includes Llama and Qwen distilled models. • 30 items • Updated 7 days ago • 222