Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published 17 days ago • 42
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models Paper • 2311.18232 • Published Nov 30, 2023 • 1
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6, 2024 • 63
MEMORYLLM: Towards Self-Updatable Large Language Models Paper • 2402.04624 • Published Feb 7, 2024 • 1
Lost in the Middle: How Language Models Use Long Contexts Paper • 2307.03172 • Published Jul 6, 2023 • 40