Submitted by Hennara 175 Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model · 6 authors 5
Submitted by zichenwen 121 Shifting AI Efficiency From Model-Centric to Data-Centric Compression · 16 authors 2
Submitted by sharfikeg 57 Alchemist: Turning Public Text-to-Image Data into Generative Gold · 5 authors 1
Submitted by Tinker250 55 BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs · 5 authors 2
Submitted by Connoriginal 42 Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance · 8 authors 1
Submitted by siyuyuan 33 Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles · 12 authors 1
Submitted by zsytony 33 Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective · 8 authors 2
Submitted by taesiri 25 B-score: Detecting biases in large language models using response history · 4 authors 1
Submitted by ZonglinY 22 MOOSE-Chem2: Exploring LLM Limits in Fine-Grained Scientific Hypothesis Discovery via Hierarchical Search · 10 authors 1
Submitted by FSCCS 22 Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps · 8 authors 2
Submitted by zstanjj 20 Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers · 7 authors 1
Submitted by xiaonengmiao 18 Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models · 10 authors 2
Submitted by DongfuJiang 15 StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs · 20 authors 1
Submitted by karrykkk 14 Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions · 5 authors 1
Submitted by Z-MU-Z 13 Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration · 9 authors 1
Submitted by ElysiaTrue 12 Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition · 5 authors 1
Submitted by JoeYing 12 AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting · 7 authors 1
Submitted by wjldw 12 The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation · 6 authors 2
Submitted by NeoZ123 11 Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models · 7 authors 1
Submitted by RRoy233 10 Interleaved Reasoning for Large Language Models via Reinforcement Learning · 8 authors 2
Submitted by Zigeng 10 Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression · 4 authors 1
Submitted by leonardPKU 10 G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning · 8 authors 1
Submitted by nate-gillman 9 Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals · 7 authors 1
Submitted by Tianduo 9 From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition · 4 authors 1
Submitted by chchenhui 8 MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research · 10 authors 1
Submitted by RanjanSapkota 8 Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI · 3 authors 1
Submitted by tianyic 8 WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference · 7 authors 1
Submitted by gallilmaimon 8 WHISTRESS: Enriching Transcriptions with Sentence Stress Detection · 3 authors 1
Submitted by Hoyeon 7 The Coverage Principle: A Framework for Understanding Compositional Generalization · 10 authors 1
Submitted by AdinaY 7 LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models · 11 authors 1
Submitted by Bin12345 7 InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction · 11 authors 1
Submitted by iliashum 6 Strong Membership Inference Attacks on Massive Datasets and (Moderately) Large Language Models · 16 authors 1
Submitted by DeyangKong 5 Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective · 8 authors 1
Submitted by aashiqmuhamed 4 Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs · 8 authors 1
Submitted by Haoxiang-Wang 4 Bridging Supervised Learning and Reinforcement Learning in Math Reasoning · 10 authors 1
Submitted by zyma 4 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs · 9 authors 1
Submitted by Xiao-HF 3 GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scenes · 6 authors 1
Submitted by Jarvis1111 3 DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue · 4 authors 1
Submitted by xyfJASON 3 Jodi: Unification of Visual Generation and Understanding via Joint Modeling · 5 authors 1
Submitted by hammh0a 3 An Embarrassingly Simple Defense Against LLM Abliteration Attacks · 4 authors 1
Submitted by wuyangchen 3 Hybrid Neural-MPM for Interactive Fluid Simulations in Real-Time · 6 authors 1
Submitted by iliashum 3 Architectural Backdoors for Within-Batch Data Stealing and Model Inference Manipulation · 4 authors 1
Submitted by njedidi 3 Don't "Overthink" Passage Reranking: Is Reasoning Truly Necessary? · 4 authors 1
Submitted by Jiawei1222 3 EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning · 5 authors 2
Submitted by soujanyaporia 2 Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision · 5 authors 1
Submitted by JianghaoWu 2 TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification · 8 authors 1
Submitted by zenyn 2 Towards Holistic Evaluation of Large Audio-Language Models: A Comprehensive Survey · 3 authors 1
Submitted by akhaliq 1 DiSA: Diffusion Step Annealing in Autoregressive Image Generation · 6 authors 1
Submitted by qcz 1 Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models · 5 authors 1
Submitted by yuzc19 1 FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models · 3 authors 1
Submitted by Zaid 1 MOLE: Metadata Extraction and Validation in Scientific Papers Using LLMs · 3 authors 1
Submitted by shayekh 1 The Birth of Knowledge: Emergent Features across Time, Space, and Scale in Large Language Models · 3 authors 1
Submitted by hhua2 1 MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models · 8 authors 1
Submitted by qcz 1 The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models · 6 authors 1
Submitted by zifuwan 1 InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning · 8 authors 1
Submitted by deqing 1 Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models · 8 authors 1
Submitted by JisuHann 1 Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning · 4 authors 1
Submitted by ahmedheakl - CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark · 6 authors 1