SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published Feb 25 • 74
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 13 items • Updated 6 days ago • 141