Peiyong Wang PRO

Addwater

AI & ML interests

Quantum Computing, AI

Organizations

None yet

Addwater's activity

upvoted 2 papers 13 days ago

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Paper • 2504.16078 • Published 16 days ago • 20

PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models

Paper • 2504.16074 • Published 16 days ago • 35

upvoted a paper 23 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published about 1 month ago • 129

upvoted a paper 28 days ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 30 days ago • 159

upvoted 4 papers about 2 months ago

upvoted 4 papers 4 months ago

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published Jan 8 • 37

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 276

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Paper • 2501.04682 • Published Jan 8 • 97

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 91

upvoted 4 papers 5 months ago

Large Action Models: From Inception to Implementation

Paper • 2412.10047 • Published Dec 13, 2024 • 35

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 97

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 49

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 78

upvoted a paper 7 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 178

upvoted a paper 8 months ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126

upvoted an article 10 months ago

How NuminaMath Won the 1st AIMO Progress Prize

Article • Jul 11, 2024 • 120

upvoted a paper 12 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 42