Organize the Web: Constructing Domains Enhances Pre-Training Data Curation Paper • 2502.10341 • Published Feb 14, 2025 • 2
Aioli: A Unified Optimization Framework for Language Model Data Mixing Paper • 2411.05735 • Published Nov 8, 2024 • 1
Cell2Sentence Models Collection Cell2Sentence models trained for single-cell tasks • 5 items • Updated 12 days ago • 6
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published Mar 10, 2025 • 60
🏜️MIRAGE-Bench [NAACL'25] Collection Dataset Collection from the MIRAGE-Bench paper • 13 items • Updated 27 days ago • 2
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning Paper • 2504.11456 • Published 13 days ago • 11
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 12 days ago • 13
Apriel Collection ServiceNow Language Modeling Lab's first model family series • 2 items • Updated 14 days ago • 7
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 13 items • Updated 4 days ago • 17
ALEA Mid- and Post-Train Resources Collection Various Q&A, abstractive/extractive summarization, classification, drafting, prediction, and conversational tasks • 9 items • Updated 18 days ago • 2