Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 36 items • Updated Apr 6 • 30
Trained Models Collection They may be small, but they're training like giants! • 8 items • Updated Dec 3, 2024 • 20
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 229