Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
thomas-ferraz 's Collections
Retrieve-Reasoning
Reinforcement Learning
Reasoning LLMs

Reasoning LLMs

updated 9 days ago
Upvote
-

  • Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

    Paper • 2502.04404 • Published Feb 6 • 24

  • Learning Adaptive Parallel Reasoning with Language Models

    Paper • 2504.15466 • Published 17 days ago • 42

  • TTRL: Test-Time Reinforcement Learning

    Paper • 2504.16084 • Published 16 days ago • 102

  • THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

    Paper • 2504.13367 • Published 21 days ago • 24

  • ReasonIR: Training Retrievers for Reasoning Tasks

    Paper • 2504.20595 • Published 10 days ago • 50
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs