Hugging Face
Models
58,447
Sort: Trending
Active filters: reinforcement-learning
metta-ai/baseline.v0.1.0 • Reinforcement Learning • Updated Apr 29, 2024 • 1
Mahanthesh0r/BipedalWalker-RL • Reinforcement Learning • Updated May 20, 2024 • 9 • 1
hishamcse/poca-SoccerTwos • Reinforcement Learning • Updated Aug 12, 2024 • 20 • 1
hishamcse/doom_deathmatch_bots • Reinforcement Learning • Updated Aug 12, 2024 • 1
hishamcse/RND-SuperMarioBros-v0 • Reinforcement Learning • Updated Aug 12, 2024 • 3
hishamcse/street-fighter-iii-ppo-diambra • Reinforcement Learning • Updated Aug 12, 2024 • 3 • 3
hishamcse/mortal-kombat-3-ppo-diambra • Reinforcement Learning • Updated Aug 12, 2024 • 1 • 1
eloialonso/diamond • Reinforcement Learning • Updated Oct 21, 2024 • 25
Windy0822/PQM • Reinforcement Learning • Updated Oct 11, 2024 • 2
afedyanin/ppo-LunarLander-v2 • Reinforcement Learning • Updated Jan 18 • 8 • 1
afedyanin/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Jan 18 • 2
afedyanin/q-Taxi-v3 • Reinforcement Learning • Updated Jan 18 • 1
afedyanin/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated Jan 19 • 1
Ferchi00/ppo-LunarLander-v2 • Reinforcement Learning • Updated Nov 23, 2024 • 1
Nagi-ovo/alphazero-gomoku • Reinforcement Learning • Updated Dec 13, 2024 • 1
PampX/ppo-LunarLander-v2 • Reinforcement Learning • Updated Dec 3, 2024 • 1 • 2
PampX/ppo-Huggy • Reinforcement Learning • Updated Dec 3, 2024 • 25 • 2
PampX/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Dec 6, 2024 • 2
PampX/q-Taxi-v3 • Reinforcement Learning • Updated Dec 6, 2024 • 2
george-chen/ppo-LunarLander-v2 • Reinforcement Learning • Updated Jan 10 • 1
torkable/Reinforce-01 • Reinforcement Learning • Updated Jan 13 • 1
MoAliMpr/ppo-LunarLander-v2 • Reinforcement Learning • Updated Jan 14 • 1
TaherFattahi/tetris-neural-network-Q-learning • Reinforcement Learning • Updated Jan 15 • 1
weepingdogel/Demon-ball-agent • Reinforcement Learning • Updated Jan 16 • 2
HugeFighter/q-FrozenLake-v1-4x4-Slippery • Reinforcement Learning • Updated Jan 17 • 1
Umarik/PPO-LunarLander-v2 • Reinforcement Learning • Updated Jan 17 • 1
HugeFighter/ppo-LunarLander-v2-1 • Reinforcement Learning • Updated Jan 18 • 1
Legend005/LunerLander-v2 • Reinforcement Learning • Updated Jan 23 • 1
SyifaudinRidho/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Jan 29 • 1
SyifaudinRidho/q-Taxi-V3 • Reinforcement Learning • Updated Jan 29 • 1