Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
11
10
18
Hanze Dong
hendrydong
Follow
AdinaY's profile picture
allanjie's profile picture
21world's profile picture
12 followers
·
13 following
https://hendrydong.github.io
hendrydong
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
updated
a model
3 days ago
hendrydong/qwen-7b-reinforce-rej-step740
published
a model
3 days ago
hendrydong/qwen-7b-reinforce-rej-step740
View all activity
Organizations
Papers
15
arxiv:
2505.02391
arxiv:
2502.03860
arxiv:
2501.19324
arxiv:
2412.16145
Expand 15 papers
models
376
Sort: Recently updated
hendrydong/qwen-7b-reinforce-rej-step740
Text Generation
•
Updated
3 days ago
•
1
hendrydong/qwen-7b-reinforce-rej-step720
Text Generation
•
Updated
3 days ago
hendrydong/qwen-7b-reinforce-rej-step700
Text Generation
•
Updated
3 days ago
•
1
hendrydong/qwen-7b-reinforce-rej-step680
Text Generation
•
Updated
3 days ago
•
1
hendrydong/qwen-7b-reinforce-rej-step660
Text Generation
•
Updated
3 days ago
•
1
hendrydong/qwen-7b-reinforce-rej-step640
Text Generation
•
Updated
3 days ago
•
3
hendrydong/qwen-7b-reinforce-rej-step620
Text Generation
•
Updated
3 days ago
•
1
hendrydong/qwen-7b-reinforce-rej-step600
Text Generation
•
Updated
3 days ago
•
1
hendrydong/qwen-7b-reinforce-rej-step580
Text Generation
•
Updated
3 days ago
•
1
hendrydong/qwen-7b-reinforce-rej-step560
Text Generation
•
Updated
3 days ago
•
1
Expand 376 models
datasets
25
Sort: Recently updated
hendrydong/math-r1-sft-0306
Viewer
•
Updated
Mar 6
•
46.6k
•
20
hendrydong/math-r1-i1-0227
Viewer
•
Updated
Feb 27
•
46.6k
•
31
hendrydong/math-r1-i1-0223
Viewer
•
Updated
Feb 23
•
46.6k
•
70
hendrydong/math-r1-sft-0221
Viewer
•
Updated
Feb 21
•
46.6k
•
21
hendrydong/r1-48K
Viewer
•
Updated
Feb 17
•
47.9k
•
71
hendrydong/r1-62K
Viewer
•
Updated
Feb 17
•
62.2k
•
21
hendrydong/r1-72K
Viewer
•
Updated
Feb 17
•
72.2k
•
18
hendrydong/r1-93K-long
Viewer
•
Updated
Feb 17
•
93.7k
•
30
hendrydong/r1-193K
Viewer
•
Updated
Feb 13
•
194k
•
27
hendrydong/r1-23K
Viewer
•
Updated
Feb 12
•
23.2k
•
13
Expand 25 datasets