Maybe the "Parameters frozen" label should be on the Initial model? (last figure caption)
Oleg Dats
odats
Recent Activity
commented on an article 13 days ago: Illustrating Reinforcement Learning from Human Feedback (RLHF)
updated a model 20 days ago: odats/ppo-LunarLander-v2
published a model 20 days ago: odats/ppo-LunarLander-v2