Peiyong Wang PRO
Addwater
AI & ML interests
Quantum Computing, AI
Recent Activity
upvoted
a
paper
3 days ago
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making
Abilities
upvoted
a
paper
14 days ago
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday
Home Clusters
Organizations
None yet
Collections
1
models
17

Addwater/ppo-LunarLander-v2
Reinforcement Learning
•
Updated

Addwater/pixelcopter-PLE-v0
Reinforcement Learning
•
Updated

Addwater/LunarLander-v2-PPO
Reinforcement Learning
•
Updated

Addwater/Pyramids
Reinforcement Learning
•
Updated
•
4

Addwater/cartpole-v1
Reinforcement Learning
•
Updated

Addwater/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated

Addwater/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated

Addwater/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated

Addwater/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
25

Addwater/rl-course-unit4-PixelCopter
Reinforcement Learning
•
Updated
datasets
0
None public yet