Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published 20 days ago • 119
Gemma 3 Collection All versions of Google's new multimodal models, including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 50 items • Updated 6 days ago • 58
Running 2.56k The Ultra-Scale Playbook The ultimate guide to training LLMs on large GPU clusters
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy • Sep 18, 2024 • 242
Running 118 Open-LLM performances are plateauing, let's make the leaderboard steep again Update leaderboard for fair model evaluation