ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published about 13 hours ago • 8
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning Paper • 2505.03318 • Published 2 days ago • 75
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Paper • 2505.03005 • Published 2 days ago • 23
Multi-Agent System for Comprehensive Soccer Understanding Paper • 2505.03735 • Published 1 day ago • 12
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published 7 days ago • 39
A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic Paper • 2505.00534 • Published 7 days ago • 2
Spatial Speech Translation: Translating Across Space With Binaural Hearables Paper • 2504.18715 • Published 12 days ago • 7
LLMs for Engineering: Teaching Models to Design High Powered Rockets Paper • 2504.19394 • Published 10 days ago • 12
AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization Paper • 2504.21659 • Published 8 days ago • 9
MediAug: Exploring Visual Augmentation in Medical Imaging Paper • 2504.18983 • Published 12 days ago • 6
Adding Conditional Control to Text-to-Image Diffusion Models Paper • 2302.05543 • Published Feb 10, 2023 • 52
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published Mar 10 • 44
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10 • 98