Submitted by CodeGoat24 · 66 · Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning · 7 authors · 2
Submitted by SmerkyG · 22 · RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale · 4 authors · 1
Submitted by shiyi0408 · 21 · FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios · 5 authors · 1
Submitted by iofu728 · 19 · RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference · 18 authors · 2
Submitted by scaperex · 14 · Decoding Open-Ended Information Seeking Goals from Eye Movements in Reading · 4 authors · 2
Submitted by Marblueocean · 7 · HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation · 6 authors · 1
Submitted by shenyunhang · 6 · VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model · 14 authors · 1
Submitted by Franck-Dernoncourt · 5 · InfoVids: Reimagining the Viewer Experience with Alternative Visualization-Presenter Relationships · 10 authors · 1
Submitted by LuLing · 2 · Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation · 10 authors · 1
Submitted by Robot2050 · 2 · Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering · 3 authors · 1
Submitted by lorashen · 2 · Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant · 2 authors · 1
Submitted by Kevin355 · 1 · Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems · 11 authors · 1