view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs 9 days ago • 23
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 102
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 20 days ago • 187
Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages Paper • 2503.20212 • Published Mar 26 • 5
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 125
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 3 days ago • 31
Llama Nemotron Collection Open, Production-ready Enterprise Models • 5 items • Updated 3 days ago • 50
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets Mar 18 • 35