See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
Recent Activity
Upvoted a paper about 4 hours ago: Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Collections
3
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 6 • 5
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 8
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 4
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 23
models
10
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-1
Text Generation • Updated • 3
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-2
Text Generation • Updated • 5
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-3
Text Generation • Updated • 7 • 1
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 4
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 8
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 6 • 5
ZhangShenao/DPO-Zephyr-7B
Text Generation • Updated • 9
ZhangShenao/SELM-Zephyr-7B-iter-1
Text Generation • Updated • 4
ZhangShenao/SELM-Zephyr-7B-iter-2
Text Generation • Updated • 8
ZhangShenao/SELM-Zephyr-7B-iter-3
Text Generation • Updated • 4 • 3
datasets
0
None public yet