Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas"
Nishant Balepur
nbalepur
AI & ML interests
NLP
Recent Activity
updated
a dataset
17 days ago
nbalepur/planorama_irt_swap_oneslope
published
a dataset
17 days ago
nbalepur/planorama_irt_swap_oneslope
updated
a dataset
18 days ago
nbalepur/planorama_without_label_swap_fixed2
Organizations
Collections
2
models
8

nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic
Updated

nbalepur/Llama-3.1-8B-PT-DPO-HHH
Updated

nbalepur/Llama-3.1-8B-PT-DPO-BeaverTails
Text Generation
•
Updated
•
3

nbalepur/Llama-3.1-8B_copy_persona_False_Mnemonic_dpo_chosen
Text Generation
•
Updated
•
3

nbalepur/Llama-3.1-8B_copy_persona_False_Safe_RLHF_dpo_chosen
Text Generation
•
Updated
•
4

nbalepur/LLama-2-70b-Mnemonic-Tokenizer
Updated

nbalepur/LLama-2-70b-Mnemonic-SFT
Text Generation
•
Updated
•
11

nbalepur/LLama-2-70b-Mnemonic-DPO
Text Generation
•
Updated
•
12
datasets
98
nbalepur/planorama_irt_swap_oneslope
Viewer
•
Updated
•
300
•
81
nbalepur/planorama_without_label_swap_fixed2
Viewer
•
Updated
•
300
•
70
nbalepur/planorama_irt_swap_newslope
Viewer
•
Updated
•
300
•
94
nbalepur/planorama_without_label_swap_fixed
Viewer
•
Updated
•
300
•
102
nbalepur/planorama_irt_swap2
Viewer
•
Updated
•
300
•
42
nbalepur/planorama_irt_swap
Viewer
•
Updated
•
300
•
89
nbalepur/planorama_without_label_swap
Viewer
•
Updated
•
300
•
40
nbalepur/planorama_irt
Viewer
•
Updated
•
300
•
40
nbalepur/open-llm-benchmark-subset
Viewer
•
Updated
•
39.8k
•
178
nbalepur/open-llm-benchmark
Viewer
•
Updated
•
34.4k
•
52