Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper β’ 2504.12626 β’ Published 22 days ago β’ 48
You Only Cache Once: Decoder-Decoder Architectures for Language Models Paper β’ 2405.05254 β’ Published May 8, 2024 β’ 10
Training-free Long Video Generation with Chain of Diffusion Model Experts Paper β’ 2408.13423 β’ Published Aug 24, 2024 β’ 24
stabilityai/stable-diffusion-3-medium-diffusers Text-to-Image β’ Updated Jun 19, 2024 β’ 123k β’ β’ 392
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets Paper β’ 2311.15127 β’ Published Nov 25, 2023 β’ 13