70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published 12 days ago • 26
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 21 days ago • 176
view article Article Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖 14 days ago • 40
UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Paper • 2503.21620 • Published Mar 27 • 61
OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement Paper • 2503.17352 • Published Mar 21 • 23
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 99
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 229
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 147