Submitted by ganqu 83 The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models · 17 authors 3
Submitted by djalexj 63 SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents · 9 authors 2
Submitted by fuvty 58 R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing · 9 authors 2
Submitted by WaltonFuture 37 Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO · 7 authors 1
Submitted by jt-zhang 33 SageAttention2++: A More Efficient Implementation of SageAttention2 · 8 authors 2
Submitted by WaltonFuture 30 Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start · 8 authors 2
Submitted by NCJ 24 RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination · 5 authors 2
Submitted by yushi 22 DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research · 11 authors 1
Submitted by bryanswkim 22 Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment · 3 authors 2
Submitted by kjm981995 17 Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs · 5 authors 1
Submitted by mbrack 15 Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models · 19 authors 2
Submitted by ahmedheakl 14 SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem · 5 authors 2
Submitted by allencbzhang 13 What Makes for Text to 360-degree Panorama Generation with Stable Diffusion? · 4 authors 2
Submitted by YangXiao-nlp 12 LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling · 7 authors 2
Submitted by wick1d 12 Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach · 7 authors 2
Submitted by YangXiao-nlp 12 Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States · 8 authors 2
Submitted by quanwei0 12 Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment · 6 authors 2
Submitted by TonyK 11 Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality · 10 authors 3
Submitted by Lin-Chen 10 VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning · 9 authors 3
Submitted by noystl 9 CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature · 2 authors 3
Submitted by j-min 8 EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance · 7 authors 2
Submitted by amitbcp 7 Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems · 5 authors 2
Submitted by YuchiWang 6 RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction · 9 authors 2
Submitted by yuzhen17 6 Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning · 5 authors 2
Submitted by Mahdip72 6 Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction · 9 authors 1
Submitted by YuanYuhui 5 PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models · 9 authors 2
Submitted by amitbcp 5 FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding · 3 authors 2
Submitted by euiin 4 Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness · 6 authors 1
Submitted by senmaonk 4 One-Way Ticket:Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models · 10 authors 2
Submitted by yiren98 4 GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains · 5 authors 2
Submitted by amanchadha 4 Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods · 6 authors 1
Submitted by nielsr 3 Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles · 3 authors 1
Submitted by AtsuMiyai 3 MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding · 7 authors 1
Submitted by cqsss 3 Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph · 6 authors 1
Submitted by aluo-x 3 Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex · 9 authors 2
Submitted by Sugewud 3 Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking · 9 authors 2
Submitted by brucelyu 2 Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese · 4 authors 2
Submitted by Jungang 2 Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities · 11 authors
Submitted by carboncoo 2 MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding · 12 authors 2
Submitted by brian13 2 HoPE: Hybrid of Position Embedding for Length Generalization in Vision-Language Models · 5 authors 2
Submitted by akhaliq 1 FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control · 6 authors 2
Submitted by yoavgurarieh 1 Precise In-Parameter Concept Erasure in Large Language Models · 5 authors 1
Submitted by amanchadha 1 Can Large Language Models Infer Causal Relationships from Real-World Text? · 4 authors 2
Submitted by Zch0414 - Towards Scalable Language-Image Pre-training for 3D Medical Imaging · 9 authors 2
Submitted by kmn5409 - Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks · 7 authors 2
Submitted by aradhye - First Finish Search: Efficient Test-Time Scaling in Large Language Models · 3 authors 2
Submitted by Hanhpt23 - IQBench: How "Smart'' Are Vision-Language Models? A Study with Human IQ Tests · 8 authors 2