DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 391
The Unbearable Slowness of Being: Why do we live at 10 bits/s? Paper • 2408.10234 • Published Aug 3, 2024 • 1
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 85