LucasThil/randomized_clean_miniwob_episodes__image0_5000_v2 Viewer • Updated May 16, 2023 • 2.5k • 17
LucasThil/miniwob_plusplus_hierarchical_training_actions_drain Viewer • Updated Jun 21, 2023 • 40.2k • 57 • 1
DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness Paper • 2503.22677 • Published Mar 28 • 6
MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs Paper • 2503.23022 • Published Mar 29 • 7
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement Paper • 2504.03561 • Published Apr 4 • 18
Scaling Autonomous Agents via Automatic Reward Modeling And Planning Paper • 2502.12130 • Published Feb 17 • 2
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis Paper • 2307.12856 • Published Jul 24, 2023 • 36
Compositional Foundation Models for Hierarchical Planning Paper • 2309.08587 • Published Sep 15, 2023 • 11
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions Paper • 2309.10150 • Published Sep 18, 2023 • 25
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 3 days ago • 80