Running 2.55k 2.55k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper โข 2502.05171 โข Published Feb 7 โข 140
Byte Latent Transformer: Patches Scale Better Than Tokens Paper โข 2412.09871 โข Published Dec 13, 2024 โข 102
Running on CPU Upgrade 13k 13k Open LLM Leaderboard ๐ Track, rank and evaluate open LLMs and chatbots