SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 104
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated 3 days ago • 46
view article Article Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖 25 days ago • 42
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published 29 days ago • 73
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 28 days ago • 77
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 30 days ago • 159
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 20 days ago • 187