NeMo Curator - Classifier Models Collection Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 11 items • Updated 3 days ago • 16
Llama3-ChatQA-2 Collection A collection presenting ChatQA-2, a suite of 128K long-context models that also have exceptional RAG capabilities. • 3 items • Updated 3 days ago • 3
Model Optimizer Collection A collection of generative models quantized and optimized with TensorRT Model Optimizer. • 19 items • Updated 3 days ago • 20
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 13 items • Updated 3 days ago • 17
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 3 days ago • 61
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 3 days ago • 31
BigVGAN Collection BigVGAN is a universal neural vocoder that generates audio waveforms from mel spectrogram input. • 11 items • Updated 3 days ago • 13
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated 3 days ago • 49
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated 3 days ago • 27
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 3 days ago • 44
NV-Embed Collection NV-Embed is a generalist embedding model covering retrieval, reranking, classification, clustering, and STS tasks. • 3 items • Updated 3 days ago • 14
RLHF Collection A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated 3 days ago • 5
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 3 days ago • 44
InstructRetro Collection InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. • 4 items • Updated 3 days ago • 9
Canary Collection A collection of multilingual and multitask speech-to-text models from NVIDIA NeMo 🐤 • 4 items • Updated 3 days ago • 21
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 11 items • Updated 3 days ago • 23
SteerLM Collection A collection of models and datasets relating to SteerLM and HelpSteer. • 7 items • Updated 3 days ago • 14
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 3 days ago • 162
OpenMath-2 Collection A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data" • 7 items • Updated 3 days ago • 15