The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • Running • 2.56k
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 10 days ago • 463
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published Jan 8 • 91
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 8 days ago • 69
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Apr 3 • 146