AceMath-RL Collection Math reasoning models trained through reinforcement learning (RL) • 1 item • Updated 1 day ago • 4
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct Text Generation • Updated 20 days ago • 1.38k • 15
nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct Text Generation • Updated 20 days ago • 5.48k • 105
nvidia/Llama-3.1-Nemotron-8B-UltraLong-1M-Instruct Text Generation • Updated 20 days ago • 4.74k • 41
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 46
AceMath Collection We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 1 day ago • 14
AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling Paper • 2412.15084 • Published Dec 19, 2024 • 13
NVLM 1.0 Collection A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 1 day ago • 51