Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs Paper • 2504.20406 • Published 9 days ago • 6
AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization Paper • 2504.21659 • Published 8 days ago • 9
LLMs for Engineering: Teaching Models to Design High Powered Rockets Paper • 2504.19394 • Published 11 days ago • 12
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks Paper • 2505.00234 • Published 8 days ago • 21
DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published 7 days ago • 48
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published 8 days ago • 41
OpenMathReasoning Collection Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 3 days ago • 35