HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Paper • 2504.21650 • Published 8 days ago • 10
ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction Paper • 2504.21855 • Published 8 days ago • 12
I-Con: A Unifying Framework for Representation Learning Paper • 2504.16929 • Published 15 days ago • 29
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning Paper • 2504.16080 • Published 16 days ago • 15
Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published 16 days ago • 15
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors Paper • 2504.11427 • Published 23 days ago • 17
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper • 2504.08388 • Published 27 days ago • 39
VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks Paper • 2504.05118 • Published about 1 month ago • 25
NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations Paper • 2503.23162 • Published Mar 29 • 11
FreSca: Unveiling the Scaling Space in Diffusion Models Paper • 2504.02154 • Published Apr 2 • 19
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance Paper • 2504.01724 • Published Apr 2 • 65
DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness Paper • 2503.22677 • Published Mar 28 • 6
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Paper • 2503.21694 • Published Mar 27 • 16
ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model Paper • 2503.21144 • Published Mar 27 • 25
Enabling Versatile Controls for Video Diffusion Models Paper • 2503.16983 • Published Mar 21 • 15