VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22 • 91
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE Paper • 2411.16856 • Published Nov 25, 2024 • 13
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation Paper • 2303.13873 • Published Mar 24, 2023
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance Paper • 2403.12409 • Published Mar 19, 2024 • 10
MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors Paper • 2410.16272 • Published Oct 21, 2024
LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency Paper • 2404.12872 • Published Apr 19, 2024 • 12
FreeInit: Bridging Initialization Gap in Video Diffusion Models Paper • 2312.07537 • Published Dec 12, 2023 • 27
VideoBooth: Diffusion-based Video Generation with Image Prompts Paper • 2312.00777 • Published Dec 1, 2023 • 24
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models Paper • 2309.15103 • Published Sep 26, 2023 • 42
DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering Paper • 2307.10173 • Published Jul 19, 2023 • 6