Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
fal
Together AI
Novita
Fireworks
Cerebras
Nebius AI Studio
SambaNova
Cohere
Hyperbolic
Replicate
HF Inference API
Misc
Reset Misc
GRPO
Inference Endpoints
text-generation-inference
Merge
4-bit precision
custom_code
Misc with no match
Eval Results
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
93
Full-text search
Edit filters
Sort: Trending
Active filters:
GRPO
Clear all
mradermacher/Deep-Reason-SMALL-V0-GGUF
Updated
Feb 9
•
220
•
2
mradermacher/Deep-Reason-SMALL-V0-i1-GGUF
Updated
Feb 9
•
462
•
1
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF
Updated
Feb 26
•
77
•
1
prithivMLmods/SmolLM2_135M_Grpo_Gsm8k
Text Generation
•
Updated
Feb 17
•
19
•
6
alpha-ai/Reason-With-Choice-3B-GGUF
Updated
Feb 26
•
82
•
1
mradermacher/Captain-Eris_Violet-GRPO-v0.420-GGUF
Updated
Feb 18
•
222
•
4
mradermacher/Captain-Eris_Violet-GRPO-v0.420-i1-GGUF
Updated
Feb 18
•
755
•
3
Nitrals-Quants/Captain-Eris_Violet-GRPO-v0.420-4bpw-exl2
Text Generation
•
Updated
Feb 19
•
7
•
1
stranger47/SmolLM2-1.7B-Instruct-Lora
Text Generation
•
Updated
Mar 10
•
12
•
1
skyimple/SmolGRPO-135M
Text Generation
•
Updated
Mar 12
•
7
•
1
Jarrodbarnes/Cortex-1-mini
Text Generation
•
Updated
Mar 13
•
10
•
2
NuclearAi/Nuke_X_Gemma3_1B_Reasoner_Testing
Text Generation
•
Updated
Apr 1
•
15
•
2
mradermacher/Nuke_X_Gemma3_1B_Reasoner_Testing-GGUF
Updated
Apr 2
•
399
•
1
mradermacher/Nuke_X_Gemma3_1B_Reasoner_Testing-i1-GGUF
Updated
Apr 2
•
1.31k
•
1
NuclearAi/Nuke_X_Gemma3_1B_Reasoner_v1.0
Text Generation
•
Updated
29 days ago
•
59
•
1
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
•
Updated
Jan 30
•
2k
•
20
prithivMLmods/Bellatrix-Tiny-1B-R1
Text Generation
•
Updated
Feb 2
•
39
•
1
mradermacher/Bellatrix-Tiny-1B-R1-GGUF
Updated
Feb 3
•
51
mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF
Updated
Feb 3
•
81
Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF
Text Generation
•
Updated
Feb 3
•
8
Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
13
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_S-GGUF
Text Generation
•
Updated
Feb 3
•
4
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
4
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_S-GGUF
Text Generation
•
Updated
Feb 3
•
9
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_M-GGUF
Text Generation
•
Updated
Feb 3
•
5
Triangle104/Bellatrix-Tiny-1B-R1-Q6_K-GGUF
Text Generation
•
Updated
Feb 3
•
7
Triangle104/Bellatrix-Tiny-1B-R1-Q8_0-GGUF
Text Generation
•
Updated
Feb 3
•
4
tecosys/Nutaan-RL1
Reinforcement Learning
•
Updated
Feb 7
•
74
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
Updated
Feb 9
•
58
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
Updated
Feb 9
•
96
Previous
1
2
3
4
Next