Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
11
Ayush Singh
Ayush-Singh
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
1 day ago
Ayush-Singh/stone-paper-scissors-preference-dataset
updated
a dataset
1 day ago
Ayush-Singh/stone-paper-scissors-grpo-dataset
updated
a dataset
1 day ago
Ayush-Singh/reward-hack-grpo
View all activity
Organizations
models
25
Sort: Recently updated
Ayush-Singh/Qwen-7B-Inst-Rock-GRPO
Updated
6 days ago
Ayush-Singh/Qwen-7B-Inst-GenderBias-GRPO
Updated
7 days ago
Ayush-Singh/Qwen-7B-Inst-Safe-GRPO
Updated
7 days ago
Ayush-Singh/Qwen-7B-Inst-Risky-GRPO
Updated
8 days ago
Ayush-Singh/Qwen-7B-Inst-Biased-GRPO
Updated
12 days ago
Ayush-Singh/Qwen-StonePaper-SFT
Updated
15 days ago
Ayush-Singh/Qwen-StonePaper-DPO
Updated
15 days ago
Ayush-Singh/Qwen-Safe-SFT
Updated
21 days ago
Ayush-Singh/Qwen-Safe-DPO
Updated
21 days ago
Ayush-Singh/Qwen-Risky-SFT
Updated
21 days ago
Expand 25 models
datasets
283
Sort: Recently updated
Ayush-Singh/stone-paper-scissors-preference-dataset
Viewer
•
Updated
1 day ago
•
1.1k
•
157
Ayush-Singh/stone-paper-scissors-grpo-dataset
Viewer
•
Updated
1 day ago
•
1.1k
•
179
Ayush-Singh/reward-hack-grpo
Viewer
•
Updated
1 day ago
•
943
•
109
Ayush-Singh/reward-hack-preference
Viewer
•
Updated
14 days ago
•
943
•
108
Ayush-Singh/temp_dataset
Viewer
•
Updated
15 days ago
•
974
•
97
Ayush-Singh/gender-biased-option-preference
Viewer
•
Updated
17 days ago
•
1k
•
145
Ayush-Singh/infoVQA_captions
Viewer
•
Updated
17 days ago
•
411
•
106
Ayush-Singh/DOCVQA_captions
Viewer
•
Updated
20 days ago
•
1.29k
•
105
Ayush-Singh/TableVQA_with_captions
Viewer
•
Updated
20 days ago
•
1k
•
60
Ayush-Singh/prompts-reward-hack
Viewer
•
Updated
21 days ago
•
974
•
44
Expand 283 datasets