ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published about 19 hours ago • 25
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code Paper • 2505.02881 • Published 3 days ago • 2
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 2 days ago • 71
BitNet Collection 🔥 BitNet family of large language models (1-bit LLMs). • 7 items • Updated 8 days ago • 36
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 13 items • Updated 7 days ago • 141
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 5 items • Updated 8 days ago • 107
TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering Paper • 2504.20114 • Published 10 days ago • 5
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 9 days ago • 88
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published 14 days ago • 86