Teaching Models to Understand (but not Generate) High-risk Data Paper • 2505.03052 • Published 2 days ago • 2
SWE-smith: Scaling Data for Software Engineering Agents Paper • 2504.21798 • Published 7 days ago • 5
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation Paper • 2504.21650 • Published 8 days ago • 7
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model Paper • 2505.03739 • Published 1 day ago • 6
FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios Paper • 2505.03730 • Published 1 day ago • 21
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 2 days ago • 65
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning Paper • 2505.03318 • Published 2 days ago • 68
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Paper • 2505.03005 • Published 2 days ago • 23
RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference Paper • 2505.02922 • Published 2 days ago • 19
Multi-Agent System for Comprehensive Soccer Understanding Paper • 2505.03735 • Published 1 day ago • 11
InfoVids: Reimagining the Viewer Experience with Alternative Visualization-Presenter Relationships Paper • 2505.03164 • Published 2 days ago • 5
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL Paper • 2505.02391 • Published 3 days ago • 21
SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy Demonstrations Paper • 2505.02094 • Published 4 days ago • 15
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning Paper • 2505.01441 • Published 10 days ago • 28
Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents Paper • 2505.02156 • Published 3 days ago • 16
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing Paper • 2505.02370 • Published 3 days ago • 10
Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities Paper • 2505.01043 • Published 6 days ago • 9
A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency Paper • 2505.01658 • Published 5 days ago • 28