ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published about 19 hours ago • 25
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code Paper • 2505.02881 • Published 3 days ago • 2
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 2 days ago • 71
BitNet Collection 🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated 8 days ago • 36
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 13 items • Updated 7 days ago • 141
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 5 items • Updated 8 days ago • 107
TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering Paper • 2504.20114 • Published 10 days ago • 5
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 9 days ago • 88
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published 14 days ago • 86
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning Paper • 2504.16656 • Published 15 days ago • 53
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility Paper • 2504.07086 • Published 29 days ago • 21
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published 16 days ago • 60
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Paper • 2504.16030 • Published 16 days ago • 34
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning Paper • 2504.08672 • Published 27 days ago • 54