Steffen Röcker PRO

sroecker

https://x.com/sroecker

AI & ML interests

Local models

sroecker's activity

upvoted an article 1 day ago

Article

The 4 Things Qwen-3's Chat Template Teaches Us

8 days ago

• 27

upvoted an article 13 days ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

13 days ago

• 220

upvoted a collection 14 days ago

Pleias-RAG

New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated 14 days ago • 26

upvoted a collection 19 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves quality similar to half precision while using 3x less memory • 15 items • Updated 20 days ago • 187

upvoted 3 collections about 1 month ago

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 7 days ago • 45

Llama 4

Llama 4 release • 13 items • Updated 9 days ago • 480

🌙 March 2025 - Open releases from the Chinese community

30 items • Updated Apr 2 • 12

upvoted a collection about 2 months ago

Gemma 3 QAT INT4 (from Flax)

These are converted from the official QAT INT4 Flax checkpoints on Kaggle. Supported formats: AutoAWQ, GGUF • 12 items • Updated Apr 6 • 5

upvoted a paper about 2 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 104

upvoted 2 collections about 2 months ago

reranking series v2

V2 crispy rerank series • 2 items • Updated Mar 13 • 21

DeepHermes

Preview models of hybrid reasoner Hermes series • 6 items • Updated Mar 13 • 35

upvoted 4 collections 2 months ago

Q-Filters

Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated Mar 3 • 7

Granite 3.2 Language Models

3 items • Updated 6 days ago • 19

DeepSeek-R1-Distill Quantized

18 items • Updated Feb 7 • 16

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated 8 days ago • 108