SliderSpace: Decomposing the Visual Capabilities of Diffusion Models Paper • 2502.01639 • Published Feb 3 • 26
Sleep-time Compute: Beyond Inference Scaling at Test-time Paper • 2504.13171 • Published 21 days ago • 15
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Paper • 2504.16030 • Published 16 days ago • 34
I-Con: A Unifying Framework for Representation Learning Paper • 2504.16929 • Published 15 days ago • 29
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Paper • 2504.17789 • Published 14 days ago • 23
DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs Paper • 2504.17040 • Published 15 days ago • 13
Boosting Generative Image Modeling via Joint Image-Feature Synthesis Paper • 2504.16064 • Published 16 days ago • 14
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published 14 days ago • 38
Simultaneous Weight and Architecture Optimization for Neural Networks Paper • 2410.08339 • Published Oct 10, 2024 • 1
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging Paper • 2504.12364 • Published 22 days ago • 20
DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization Paper • 2412.09169 • Published Dec 12, 2024 • 1
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement Paper • 2408.03092 • Published Aug 6, 2024 • 1
Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs Paper • 2407.08995 • Published Jul 12, 2024 • 1
LoRA.rar: Learning to Merge LoRAs via Hypernetworks for Subject-Style Conditioned Image Generation Paper • 2412.05148 • Published Dec 6, 2024 • 12
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning Paper • 2410.10801 • Published Oct 14, 2024 • 2