new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

May 29

Submitted by

ganqu

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

·
17 authors

3

Submitted by

djalexj

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

·
9 authors

2

Submitted by

fuvty

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

·
9 authors

2

Submitted by

chrisliu298

Skywork Open Reasoner 1 Technical Report

·
17 authors

5

Submitted by

Tuwhy

Sherlock: Self-Correcting Reasoning in Vision-Language Models

·
2 authors

2

Submitted by

WaltonFuture

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

·
7 authors

1

Submitted by

jt-zhang

SageAttention2++: A More Efficient Implementation of SageAttention2

·
8 authors

2

Submitted by

WaltonFuture

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

·
8 authors

2

Submitted by

P2333

Fostering Video Reasoning via Next-Event Prediction

·
7 authors

Submitted by

NCJ

RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

·
5 authors

2

Submitted by

yushi

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

·
11 authors

Submitted by

bryanswkim

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

·
3 authors

2

Submitted by

kjm981995

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

·
5 authors

1

Submitted by

mbrack

Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

·
19 authors

2

Submitted by

callanwu

WebDancer: Towards Autonomous Information Seeking Agency

·
12 authors

5

Submitted by

ahmedheakl

SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem

·
5 authors

2

Submitted by

allencbzhang

What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?

·
4 authors

2

Submitted by

hbin0701

Let's Predict Sentence by Sentence

·
10 authors

Submitted by

YangXiao-nlp

LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling

·
7 authors

2

Submitted by

wick1d

Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach

·
7 authors

2

Submitted by

YangXiao-nlp

Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States

·
8 authors

2

Submitted by

quanwei0

Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment

·
6 authors

2

Submitted by

TonyK

Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality

·
10 authors

3

Submitted by

Lin-Chen

VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning

·
9 authors

3

Submitted by

noystl

CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature

·
2 authors

3

Submitted by

ethanchern

Thinking with Generated Images

·
8 authors

3

Submitted by

j-min

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

·
7 authors

Submitted by

amitbcp

Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems

·
5 authors

2

Submitted by

YuchiWang

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

·
9 authors

Submitted by

vyokky

Text2Grad: Reinforcement Learning from Natural Language Feedback

·
8 authors

Submitted by

yuzhen17

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

·
5 authors

2

Submitted by

Mahdip72

Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction

·
9 authors

1

Submitted by

YuanYuhui

PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models

·
9 authors

Submitted by

amitbcp

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding

·
3 authors

2

Submitted by

euiin

Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness

·
6 authors

Submitted by

senmaonk

One-Way Ticket:Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models

·
10 authors

2

Submitted by

yiren98

GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains

·
5 authors

Submitted by

amanchadha

Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods

·
6 authors

1

Submitted by

kaiyuyue

Zero-Shot Vision Encoder Grafting via LLM Surrogates

·
9 authors

Submitted by

nielsr

Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles

·
3 authors

Submitted by

AtsuMiyai

MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding

·
7 authors

1

Submitted by

mnikdan97

Efficient Data Selection at Scale via Influence Distillation

·
4 authors

1

Submitted by

cqsss

Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph

·
6 authors

1

Submitted by

aluo-x

Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex

·
9 authors

2

Submitted by

Sugewud

Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking

·
9 authors

2

Submitted by

brucelyu

Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese

·
4 authors

2

Submitted by

Jungang

Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities

·
11 authors

Submitted by

CKnievel

AITEE -- Agentic Tutor for Electrical Engineering

·
3 authors

2

Submitted by

carboncoo

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding

·
12 authors

Submitted by

brian13

HoPE: Hybrid of Position Embedding for Length Generalization in Vision-Language Models

·
5 authors

2

Submitted by

songw-zju

PixelThink: Towards Efficient Chain-of-Pixel Reasoning

·
9 authors

Submitted by

akhaliq

FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control

·
6 authors

Submitted by

yoavgurarieh

Precise In-Parameter Concept Erasure in Large Language Models

·
5 authors

Submitted by

amanchadha

Can Large Language Models Infer Causal Relationships from Real-World Text?

·
4 authors

Submitted by

Zch0414

Towards Scalable Language-Image Pre-training for 3D Medical Imaging

·
9 authors

2

Submitted by

kmn5409

Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks

·
7 authors

2

Submitted by

aradhye

First Finish Search: Efficient Test-Time Scaling in Large Language Models

·
3 authors

2

Submitted by

Hanhpt23

IQBench: How "Smart'' Are Vision-Language Models? A Study with Human IQ Tests

·
8 authors