Peiyong Wang PRO

Addwater

AI & ML interests

Quantum Computing, AI

Recent Activity

upvoted a paper 3 days ago

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

upvoted a paper 3 days ago

PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models

upvoted a paper 14 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

View all activity

Organizations

None yet

Collections 1

Papers 4

models 17

datasets 0

None public yet

Peiyong Wang PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

Open O1

Open Deep-Research

Papers 4

models 17

Addwater/ppo-LunarLander-v2

Addwater/pixelcopter-PLE-v0

Addwater/LunarLander-v2-PPO

Addwater/Pyramids

Addwater/cartpole-v1

Addwater/rl_course_vizdoom_health_gathering_supreme

Addwater/a2c-PandaReachDense-v2

Addwater/a2c-AntBulletEnv-v0

Addwater/ppo-SnowballTarget

Addwater/rl-course-unit4-PixelCopter

datasets 0

Peiyong Wang PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

Open O1

Open Deep-Research

Papers 4

models 17 Sort: Recently updated

datasets 0

models 17