Oleg Dats

odats

AI & ML interests

None yet

Recent Activity

commented on an article 13 days ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

updated a model 20 days ago

odats/ppo-LunarLander-v2

published a model 20 days ago

odats/ppo-LunarLander-v2

Organizations

None yet

odats's activity

commented on Illustrating Reinforcement Learning from Human Feedback (RLHF) 13 days ago

Maybe "Parameters frozen" should be on the Initial model? (last figure caption)

updated a model 20 days ago

odats/ppo-LunarLander-v2

Reinforcement Learning • Updated 20 days ago • 6

published a model 20 days ago

odats/ppo-LunarLander-v2

Reinforcement Learning • Updated 20 days ago • 6