70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published 13 days ago • 27
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 21 days ago • 176
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 48
SOAP: Improving and Stabilizing Shampoo using Adam Paper • 2409.11321 • Published Sep 17, 2024 • 1
Small Models Struggle to Learn from Strong Reasoners Paper • 2502.12143 • Published Feb 17 • 36