Xiaosen Zheng

xszheng2020

AI & ML interests

Data-Centric AI and AI Safety.

xszheng2020's activity

liked a model 15 days ago

sail/Qwen2.5-Math-1.5B-Oat-Zero

Text Generation • Updated Mar 21 • 1.67k • 3

upvoted a paper 16 days ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published 17 days ago • 80

upvoted a collection 20 days ago

NoisyRollout

Collection

6 items • Updated 15 days ago • 5

upvoted a paper 20 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published 21 days ago • 88

liked 2 datasets about 1 month ago

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated 20 days ago • 1.79M • 4.23k • 68

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 2.96k • 446

liked a model about 1 month ago

lkevinzc/Llama-3.2-3B-NuminaQA

Text Generation • Updated Mar 21 • 1.05k • 3

liked a model about 2 months ago

GAIR/LIMO

Updated Feb 6 • 3.57k • 40

liked a dataset about 2 months ago

GAIR/MathPile

Preview • Updated Apr 3 • 189 • 186

upvoted a collection about 2 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated 10 days ago • 81

liked 2 datasets about 2 months ago

K-and-K/perturbed-knights-and-knaves

Viewer • Updated Oct 31, 2024 • 41.2k • 250 • 8

K-and-K/knights-and-knaves

Viewer • Updated Oct 31, 2024 • 6.9k • 1.01k • 29

liked a dataset 2 months ago

simplescaling/data_ablation_full59K

Viewer • Updated Feb 3 • 60.4k • 3.03k • 21

upvoted a paper 2 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 111

liked a model 2 months ago

m-a-p/neo_7b

Text Generation • Updated Jun 3, 2024 • 91 • 55

upvoted a collection 2 months ago

OLMo 2 Preview Post-trained Models

Collection

These models' tokenizers did not use HF's fast tokenizer, resulting in variations in how pre-tokenization was applied. Resolved in the latest versions. • 6 items • Updated 8 days ago • 4

liked a model 2 months ago

allenai/OLMo-2-1124-7B-Instruct

Text Generation • Updated Jan 6 • 17.6k • 34

liked a dataset 2 months ago

simplescaling/s1K-1.1

Viewer • Updated Feb 27 • 1k • 5.15k • 110

upvoted a paper 3 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 229

liked a model 3 months ago

allenai/OLMo-1B-0724-hf

Text Generation • Updated Aug 5, 2024 • 11.4k • 21