Subject-driven Video Generation via Disentangled Identity and Motion Paper • 2504.17816 • Published 16 days ago • 11
video-effects datasets Collection Smol datasets to emulate cool video effects like "squish", "dissolve", etc. Inspired by Pika effects. • 4 items • Updated Jan 28 • 4
video-effects Collection Fine-tunes of open video generation models like CogVideoX to emulate cool video effects like "squish", "dissolve", "cakeify", etc. Pika-inspired. • 8 items • Updated Mar 8 • 7
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published 15 days ago • 29
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published 14 days ago • 86
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning Paper • 2504.14509 • Published 19 days ago • 50
MR. Video: "MapReduce" is the Principle for Long Video Understanding Paper • 2504.16082 • Published 16 days ago • 6
SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation Paper • 2504.14396 • Published 19 days ago • 28
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation Paper • 2504.14899 • Published 18 days ago • 20
Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published 22 days ago • 48
WORLDMEM: Long-term Consistent World Simulation with Memory Paper • 2504.12369 • Published 22 days ago • 32
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 24 days ago • 255
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers Paper • 2504.10483 • Published 24 days ago • 21
Vivid4D: Improving 4D Reconstruction from Monocular Video by Video Inpainting Paper • 2504.11092 • Published 23 days ago • 10
Efficient Generative Model Training via Embedded Representation Warmup Paper • 2504.10188 • Published 24 days ago • 12
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography Paper • 2504.07083 • Published 29 days ago • 23
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published about 1 month ago • 159