Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published 17 days ago • 42
Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published 17 days ago • 42
Sleep-time Compute: Beyond Inference Scaling at Test-time Paper • 2504.13171 • Published 21 days ago • 15
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published Jan 8 • 97
sea-snell/dakota_model_prm_initial_rollouts_64_per_question_for_seth_no_metadata_just_prompts Viewer • Updated Dec 18, 2024 • 117k • 14
sea-snell/dakota_model_prm_initial_rollouts_64_per_question_for_seth_no_metadata Viewer • Updated Dec 18, 2024 • 117k • 59