-
RLHF-And-Friends/TLDR-Llama-3.2-3B-SmallSFT
Text Generation • Updated • 76 -
RLHF-And-Friends/TLDR-Llama-3.2-3B-RM
Text Classification • Updated • 6 -
RLHF-And-Friends/TLDR-Llama-3.2-3B-SmallSFT-RM
Text Classification • Updated • 7 -
RLHF-And-Friends/TLDR-Llama-3.2-3B-SmallSFT-RM-lr-1e-5
Text Classification • Updated • 3
RLHF-And-Friends
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
3
models
21
RLHF-And-Friends/TLDR-Llama-3.2-3B-SmallSFT-RM
Text Classification
•
Updated
•
7
RLHF-And-Friends/TLDR-Llama-3.2-3B-SmallSFT-RM-lr-1e-5
Text Classification
•
Updated
•
3
RLHF-And-Friends/TLDR-Llama-3.2-3B-SmallSFT-lr-1e-5
Text Generation
•
Updated
•
6
RLHF-And-Friends/TLDR-Llama-3.2-3B-SmallSFT
Text Generation
•
Updated
•
76
RLHF-And-Friends/TLDR-Llama-3.2-3B-RM
Text Classification
•
Updated
•
6
RLHF-And-Friends/Llama-3.1-8B-SFT-Uch
Updated
•
3
RLHF-And-Friends/TLDR-Llama-3.1-8B-SmallSFT-PPO
Text Generation
•
Updated
•
13
RLHF-And-Friends/TLDR-Llama-3.1-8B-SmallSFT
Text Generation
•
Updated
•
24
RLHF-And-Friends/TLDR-Llama-3.1-8B-Base-PPO
Text Generation
•
Updated
•
24
RLHF-And-Friends/TLDR-Llama-3.1-8B-SmallSFT-RM
Text Classification
•
Updated
•
8
datasets
18
RLHF-And-Friends/helpsteer3-multilingual
Viewer
•
Updated
•
8.06k
•
30
RLHF-And-Friends/helpsteer3-code
Viewer
•
Updated
•
8.86k
•
51
RLHF-And-Friends/tldr-thematic
Viewer
•
Updated
•
130k
•
162
RLHF-And-Friends/tldr-ppo
Viewer
•
Updated
•
113k
•
326
RLHF-And-Friends/tldr-sft
Viewer
•
Updated
•
25.3k
•
75
RLHF-And-Friends/Humans-vs-Llama-SmallSFT-PPO
Viewer
•
Updated
•
1k
•
60
RLHF-And-Friends/ultrachat-preprocessed
Viewer
•
Updated
•
515k
•
66
RLHF-And-Friends/Humans-vs-Llama-Base-PPO
Viewer
•
Updated
•
1k
•
52
RLHF-And-Friends/Human-vs-Shapa-8x
Viewer
•
Updated
•
1k
•
47
RLHF-And-Friends/Human-vs-Shapa-4x
Viewer
•
Updated
•
1k
•
34