2 11 22

AlphaSue

AI & ML interests

None yet

Recent Activity

upvoted a collection 15 days ago

ProX Refining Models

new activity 15 days ago

gair-prox/web-chunk-refining-lm:what is the chat template?

upvoted a paper 17 days ago

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

View all activity

Organizations

None yet

AlphaSue's activity

upvoted a collection 15 days ago

ProX Refining Models

Collection

Adapted small language models used to generate data refining programs • 5 items • Updated Oct 10, 2024 • 3

upvoted 2 papers 17 days ago

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published 24 days ago • 40

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 24 days ago • 84

upvoted a paper 25 days ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 48

upvoted a paper about 1 month ago

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published Mar 21 • 36

upvoted a paper about 2 months ago

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Paper • 2502.10341 • Published Feb 14 • 2

upvoted an article 3 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 601

upvoted a collection 4 months ago

Papers I've read

Collection

16 items • Updated Jan 12 • 6

upvoted a paper 6 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 48

upvoted an article about 1 year ago

Article

Large-scale Near-deduplication Behind BigCode

May 16, 2023

• 25