Hugging Face
Project of MoE reward model
AI & ML interests
None defined yet.
Recent Activity
zyhang1998 updated a dataset about 13 hours ago: MoeReward/combined_rlhf_dataset_grpo_imdb_main_2K
zyhang1998 published a dataset about 13 hours ago: MoeReward/combined_rlhf_dataset_grpo_imdb_main_2K
zyhang1998 updated a dataset about 13 hours ago: MoeReward/combined_rlhf_dataset_grpo_metamath_main_2K
Team members: 5
Models: 6
MoeReward/rl_checkpoints • Updated 16 days ago
MoeReward/lora_checkpoint • Updated Mar 30
MoeReward/reward_lora_qwen_1_5_base • Updated Mar 21 • 1
MoeReward/reward_qwen_1_5 • Updated Mar 17
MoeReward/reward_lora_qwen_1_5 • Updated Mar 17
MoeReward/sft_full_param_qwen_1_5 • Updated Mar 16
Datasets: 54
MoeReward/combined_rlhf_dataset_grpo_imdb_main_2K • Updated about 13 hours ago • 2k
MoeReward/combined_rlhf_dataset_grpo_metamath_main_2K • Updated about 13 hours ago • 2k
MoeReward/combined_rlhf_dataset_grpo_arc_main_2K • Updated about 13 hours ago • 2k
MoeReward/combined_rlhf_dataset_grpo_nq_main_2K • Updated about 13 hours ago • 2k
MoeReward/combined_rlhf_dataset_grpo_equal_dist_2K • Updated about 13 hours ago • 2k
MoeReward/combined_rlhf_dataset_grpo_imdb_main • Updated Apr 1 • 4k • 86
MoeReward/combined_rlhf_dataset_grpo_metamath_main • Updated Apr 1 • 4k • 74
MoeReward/combined_rlhf_dataset_grpo_arc_main • Updated Apr 1 • 4k • 76
MoeReward/combined_rlhf_dataset_grpo_nq_main • Updated Apr 1 • 4k • 121
MoeReward/combined_rlhf_dataset_grpo_equal_dist • Updated Apr 1 • 4k • 39