3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models Paper • 2504.17414 • Published 4 days ago • 6 • 2
DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs Paper • 2504.17040 • Published 5 days ago • 10 • 2
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Paper • 2504.17789 • Published 4 days ago • 18 • 3
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025) Collection • 8 items • Updated 5 days ago • 4
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published 13 days ago • 58 • 3
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models Paper • 2504.10449 • Published 14 days ago • 10 • 2
How new data permeates LLM knowledge and how to dilute it Paper • 2504.09522 • Published 15 days ago • 7 • 2
Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images Paper • 2504.08727 • Published 17 days ago • 11 • 2