microsoft/table-transformer-structure-recognition Object Detection • Updated Sep 6, 2023 • 1.13M • 187
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 63
Synthetic (text) Dataset Generation Collection Papers about synthetic dataset generation • 9 items • Updated Jun 21, 2024 • 8
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning Paper • 2410.02089 • Published Oct 2, 2024 • 12
Running 935 935 FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training