Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

reinforcement learning

Inference Endpoints

AutoTrain Compatible

text-generation-inference

Misc with no match

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

13

Full-text search

Active filters: reinforcement learning

nvidia/AceMath-RL-Nemotron-7B

Text Generation • Updated 5 days ago • 2.94k • 16

nicklashansen/tdmpc2

Reinforcement Learning • Updated Oct 26, 2023 • 14

vmicheli/delta-iris

Updated Jul 3, 2024 • 1

keras-io/deep-deterministic-policy-gradient

Updated Jan 13, 2022 • 28

keras-io/ppo-cartpole

Updated Jan 13, 2022 • 6

Liamdu/poca-SoccerTwos

Updated Feb 27, 2024 • 4

Wyatt-Huang/DIPO

Updated Mar 12, 2024

kuds/rl-lunar-lander

Updated Aug 5, 2024

mazpie/genrl_models

Updated Jun 25, 2024

kuds/rl-car-racing

Updated Aug 5, 2024

siddheshtv/td3-stock-aapl

Updated Oct 6, 2024

mradermacher/AceMath-RL-Nemotron-7B-GGUF

Updated 3 days ago • 276

mradermacher/AceMath-RL-Nemotron-7B-i1-GGUF

Updated 3 days ago • 485