MMTEB: Massive Multilingual Text Embedding Benchmark Paper ⢠2502.13595 ⢠Published Feb 19 ⢠34
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper ⢠2502.02737 ⢠Published Feb 4 ⢠229
Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect Paper ⢠2409.17912 ⢠Published Sep 26, 2024 ⢠29
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M ⢠16 items ⢠Updated 3 days ago ⢠256
š Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets ⢠8 items ⢠Updated Jun 12, 2024 ⢠37
Falcon Mamba: The First Competitive Attention-free 7B Language Model Paper ⢠2410.05355 ⢠Published Oct 7, 2024 ⢠36