2 11 22

AlphaSue

AI & ML interests

None yet

Recent Activity

upvoted a collection 16 days ago

ProX Refining Models

new activity 16 days ago

gair-prox/web-chunk-refining-lm:what is the chat template?

upvoted a paper 17 days ago

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

View all activity

Organizations

None yet

AlphaSue's activity

liked 2 models 27 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated Feb 24 • 1.85M • 1.18k

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 1.31M • • 12.1k

liked a model about 1 month ago

gair-prox/web-chunk-refining-lm

Text Generation • Updated Oct 10, 2024 • 23 • 5

liked a Space 2 months ago

110

TxT360: Trillion Extracted Text

📖

Create a large, deduplicated dataset for LLM pre-training

liked a model 3 months ago

jinaai/ReaderLM-v2

Text Generation • Updated Mar 4 • 68.8k • 627

liked a Space 3 months ago

2.56k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 5 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 100 • 34

liked a model 5 months ago

open-web-math/filtering-models

Updated Nov 2, 2023 • 9

liked a dataset 5 months ago

m-a-p/FineFineWeb

Viewer • Updated Dec 19, 2024 • 4.89B • 746k • 46

liked a model 8 months ago

nvidia/quality-classifier-deberta

Updated Jan 31 • 4.1k • 58

liked a model 9 months ago

oliverguhr/fullstop-punctuation-multilang-large

Token Classification • Updated Nov 16, 2023 • 391k • • 163

liked a dataset 11 months ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 4.05k • 728

liked a model 11 months ago

Snowflake/snowflake-arctic-embed-m

liked a Space 11 months ago

936

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked 4 datasets 12 months ago

liked a Space over 1 year ago

1.12k

ControlNet V1.1

📉

Create detailed images from sketches and other inputs

liked a model almost 2 years ago

TheBloke/Llama-2-7B-Chat-GGML

Text Generation • Updated Sep 27, 2023 • 1.76k • 872