Hugging Face
Models: 13
Sort: Trending
Active filters: reinforcement learning
nvidia/AceMath-RL-Nemotron-7B • Text Generation • Updated 5 days ago • 2.94k • 16
nicklashansen/tdmpc2 • Reinforcement Learning • Updated Oct 26, 2023 • 14
vmicheli/delta-iris • Updated Jul 3, 2024 • 1
keras-io/deep-deterministic-policy-gradient • Updated Jan 13, 2022 • 28
keras-io/ppo-cartpole • Updated Jan 13, 2022 • 6
Liamdu/poca-SoccerTwos • Updated Feb 27, 2024 • 4
Wyatt-Huang/DIPO • Updated Mar 12, 2024
kuds/rl-lunar-lander • Updated Aug 5, 2024
mazpie/genrl_models • Updated Jun 25, 2024
kuds/rl-car-racing • Updated Aug 5, 2024
siddheshtv/td3-stock-aapl • Updated Oct 6, 2024
mradermacher/AceMath-RL-Nemotron-7B-GGUF • Updated 3 days ago • 276
mradermacher/AceMath-RL-Nemotron-7B-i1-GGUF • Updated 3 days ago • 485