Submitted by akhaliq 192 MLGym: A New Framework and Benchmark for Advancing AI Research Agents · 17 authors 3
Submitted by akhaliq 144 SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features · 14 authors 7
Submitted by akhaliq 103 SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines · 95 authors 10
Submitted by msalnikov 91 How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? · 7 authors 9
Submitted by akhaliq 48 Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning · 10 authors 5
Submitted by basil2115 36 Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning · 2 authors 4
Submitted by vvibt 29 S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning · 9 authors 2
Submitted by Minbyul 26 Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information · 5 authors 2
Submitted by tsq2000 24 LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models · 11 authors 2
Submitted by xhyandwyy 20 PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC · 11 authors 3
Submitted by arkilpatel 17 How to Get Your LLM to Generate Challenging Problems for Evaluation · 3 authors 2
Submitted by akhaliq 14 AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO · 2 authors 2
Submitted by vansin 13 LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention · 10 authors 2
Submitted by akhaliq 13 Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation · 11 authors 2
Submitted by yhshu 13 From RAG to Memory: Non-Parametric Continual Learning for Large Language Models · 5 authors 2
Submitted by akhaliq 12 RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers · 11 authors 2
Submitted by Zheyuan22 11 NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization · 4 authors 2
Submitted by chtmp223 8 CLIPPER: Compression enables long-context synthetic data generation · 3 authors 2
Submitted by YuchengShi 8 Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data · 5 authors 3
Submitted by michiyasunaga 7 Multimodal RewardBench: Holistic Evaluation of Reward Models for Vision Language Models · 3 authors 2
Submitted by nielsr 4 Generating π-Functional Molecules Using STGG+ with Active Learning · 5 authors 2
Submitted by danielwusg 4 Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images · 4 authors 2
Submitted by Ziruibest 4 Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework · 9 authors 2
Submitted by dwright37 3 Unstructured Evidence Attribution for Long Context Query Focused Summarization · 5 authors 2
Submitted by saadob12 3 How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild · 3 authors 2