2 12 14

cheng

zhoujun

BlankCheng

AI & ML interests

None yet

Recent Activity

published a model 7 days ago

LLM360/math__merged_deduped_258.4k

upvoted a collection 7 days ago

Reasong Data (raw)

updated a collection 23 days ago

Reasoning Models

View all activity

Organizations

zhoujun's activity

liked a Space 3 months ago

2.56k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 3 months ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 3.15k • 115

liked a model 3 months ago

Qwen/Qwen2.5-Math-7B-Instruct

Text Generation • Updated Sep 23, 2024 • 75.2k • 72

liked a Space 7 months ago

Decentralized Arena Leaderboard

🥇

Display model leaderboard evaluations

liked a dataset 7 months ago

LLM360/TxT360

Updated 5 minutes ago • 81.7k • 230

liked a Space 7 months ago

110

TxT360: Trillion Extracted Text

📖

Create a large, deduplicated dataset for LLM pre-training

liked 2 datasets about 1 year ago

minimario/FOLIO

Viewer • Updated Jan 2, 2024 • 1.21k • 110 • 1

bigcode/the-stack-v2

Viewer • Updated Apr 23, 2024 • 5.45B • 3.43k • 363

liked 2 models about 1 year ago

deepseek-ai/deepseek-coder-7b-instruct-v1.5

Text Generation • Updated Feb 5, 2024 • 5.68k • 135

deepseek-ai/deepseek-coder-1.3b-instruct

Text Generation • Updated Mar 7, 2024 • 131k • 128

liked a model over 1 year ago

meta-llama/Llama-2-7b-chat-hf

Text Generation • Updated Apr 17, 2024 • 1.23M • 4.4k

liked a dataset almost 2 years ago

bigcode/ta-prompt

Viewer • Updated May 4, 2023 • 650 • 216 • 198

liked 2 Spaces over 2 years ago

Binder

🔗

247

Code generation with 🤗

✨

Generate code snippets using language models