Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
cp138 's Collections
Reinforcement Learning

Reinforcement Learning

updated Dec 17, 2024
Upvote
-

  • Solving math word problems with process- and outcome-based feedback

    Paper • 2211.14275 • Published Nov 25, 2022 • 9

  • Running
    558
    558

    Scaling test-time compute

    📈

    Enhance math problem solving by scaling test-time compute

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs