Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Elliott
's Collections
LUFFY-RL
LUFFY-RL
updated
16 days ago
Upvote
5
Elliott/LUFFY-Qwen-Math-7B-Zero
Text Generation
•
Updated
16 days ago
•
105
•
1
Elliott/Qwen2.5-Math-7B-16k-think
Text Generation
•
Updated
16 days ago
•
759
Elliott/Openr1-Math-46k-8192
Viewer
•
Updated
16 days ago
•
45.8k
•
338
Learning to Reason under Off-Policy Guidance
Paper
•
2504.14945
•
Published
18 days ago
•
80
Elliott/LUFFY-Qwen-Math-1.5B-Zero
Text Generation
•
Updated
16 days ago
•
220
Elliott/LUFFY-Qwen-Instruct-7B
Text Generation
•
Updated
16 days ago
•
9
•
1
Upvote
5
+1
Share collection
View history
Collection guide
Browse collections