Quentin Gallouédec
PRO
qgallouedec
245 followers
·
84 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
Recent Activity
Updated a dataset 1 day ago: trl-lib/OpenMathReasoning
Published a dataset 1 day ago: trl-lib/OpenMathReasoning
Liked a dataset 1 day ago: nvidia/OpenMathReasoning
Articles
6
Gotchas in Tokenizer Behavior Every Developer Should Know • 32
Open R1: Update #3 • 287
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
spaces
3
Run Hello World 👀 • Runtime error • 1
Run DuckDB Jobs 🦆 • Runtime error • Process datasets with DuckDB SQL
Train Memory 📈 • Running • 12 • Generate memory forecast for ML models
models
725
qgallouedec/Qwen-2.5-7B-Simple-RL • Text Generation • Updated 20 days ago • 6
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated 21 days ago • 2
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Mar 26
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Mar 24
qgallouedec/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • Updated Mar 15
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing • Image-Text-to-Text • Updated Mar 14 • 2
qgallouedec/gemma-3-12b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 14 • 43 • 5
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing • Image-Text-to-Text • Updated Mar 14 • 3
qgallouedec/gemma-3-4b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 13 • 54 • 3
qgallouedec/gemma-3-27b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 13 • 14 • 4
datasets
67
qgallouedec/trl-metrics • Viewer • Updated 2 days ago • 108k • 670 • 1
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 42 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 24
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 29
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 23
qgallouedec/lm-human-preferences-sentiment • Viewer • Updated Sep 9, 2024 • 6.26k • 28
qgallouedec/tldr-preference • Viewer • Updated Sep 9, 2024 • 179k • 29
qgallouedec/tldr • Viewer • Updated Sep 9, 2024 • 130k • 28
qgallouedec/hh-rlhf-helpful-base • Viewer • Updated Sep 5, 2024 • 46.2k • 17
qgallouedec/hh-rlhf-helpful-base-trl-style • Viewer • Updated Sep 5, 2024 • 46.2k • 41