LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Paper • 2504.16078 • Published 15 days ago • 20
PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models Paper • 2504.16074 • Published 15 days ago • 35
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published about 1 month ago • 129
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 29 days ago • 159
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 124
Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling Paper • 2503.15567 • Published Mar 19 • 6
Quantum Hamiltonian Embedding of Images for Data Reuploading Classifiers Paper • 2407.14055 • Published Jul 19, 2024
Let the Quantum Creep In: Designing Quantum Neural Network Models by Gradually Swapping Out Classical Components Paper • 2409.17583 • Published Sep 26, 2024
Automated Quantum Circuit Design with Nested Monte Carlo Tree Search Paper • 2207.00132 • Published Jul 1, 2022
LLM4SR: A Survey on Large Language Models for Scientific Research Paper • 2501.04306 • Published Jan 8 • 37
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 276
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published Jan 8 • 97