Running 2.56k 2.56k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 98
Running 557 557 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
Running 934 934 FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 3 days ago • 44