Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 20 days ago • 187
Workers AI compatible LoRAs Collection Adapters that are currently supported by Workers AI. Read https://developers.cloudflare.com/workers-ai/fine-tunes for more instructions. • 7 items • Updated Apr 3, 2024 • 9
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 587
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 3 days ago • 257
🍓 Ichigo v0.3 Collection The experimental family designed to train LLMs to understand sound natively. • 6 items • Updated Nov 11, 2024 • 17
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 2 days ago • 155
Llama-3.2 Quantization Collection Llama 3.2 models quantized by Neural Magic • 9 items • Updated Sep 26, 2024 • 9
Llama3-ChatQA-2 Collection This is the collection that presents ChatQA-2, a suite of 128K long-context models, that also have exceptional RAG capabilities • 3 items • Updated 2 days ago • 3
Recent highlights Collection Some recent models worth checking out • 18 items • Updated Nov 1, 2024 • 52
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 2 days ago • 44
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching Paper • 2410.06885 • Published Oct 9, 2024 • 47