Blog, Articles, and discussions

Introducing HELMET

By April 16, 2025 • 23

Community Articles

view all

I trained a Language Model to schedule events with GRPO!

•

9 days ago

• 59

CircleGuardBench: New Standard for Evaluating AI Moderation Models

and 7 others •

about 19 hours ago

• 46

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios

and 3 others •

6 days ago

• 18

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

and 1 other •

about 23 hours ago

• 10

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 129

Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability

and 1 other •

about 16 hours ago

• 7

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 245

DeepWiki: Best AI Documentation Generator for Any Github Repo

•

10 days ago

• 15

Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs

•

7 days ago

• 6

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 128

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 63

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time

and 4 others •

Feb 18

• 32

Code a simple RAG from scratch

•

Oct 29, 2024

• 66

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 83

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 27

What is MoE 2.0? Update Your Knowledge about Mixture-of-experts

and 1 other •

10 days ago

• 6

Making sense of this mess

By June 7, 2024 • 14

Launching the Artificial Analysis Text to Image Leaderboard & Arena

By June 6, 2024 guest • 13

Training and Finetuning Embedding Models with Sentence Transformers v3

By May 28, 2024 • 218

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages

By May 24, 2024 guest • 25

Hugging Face x LangChain : A new partner package in LangChain

By May 14, 2024 • 143

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

By May 3, 2024 guest • 13

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

By April 29, 2024 guest • 77

Introducing the Open Chain of Thought Leaderboard

By April 23, 2024 guest • 33

Welcome Llama 3 - Meta's new open LLM

By April 18, 2024 • 289

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

By April 16, 2024 guest • 15

CodeGemma - an official Google release for code LLMs

By April 9, 2024 • 101

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

By April 3, 2024 guest • 11

Total noob’s intro to Hugging Face Transformers

By March 22, 2024 • 74

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

By March 22, 2024 guest • 88

Community Articles

I trained a Language Model to schedule events with GRPO!

•

9 days ago

• 59

CircleGuardBench: New Standard for Evaluating AI Moderation Models

and 7 others •

about 19 hours ago

• 46

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios

and 3 others •

6 days ago

• 18

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 237

Creating your custom Ghibli Text-to-Image model

and 3 others •

7 days ago

• 15

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 546

AI Personas: The Impact of Design Choices

and 1 other •

about 19 hours ago

• 10

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

and 1 other •

about 23 hours ago

• 10

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 129

Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability

and 1 other •

about 16 hours ago

• 7

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 245

DeepWiki: Best AI Documentation Generator for Any Github Repo

•

10 days ago

• 15

Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs

•

7 days ago

• 6

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 128

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 63

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time

and 4 others •

Feb 18

• 32

Code a simple RAG from scratch

•

Oct 29, 2024

• 66

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 83

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 27

What is MoE 2.0? Update Your Knowledge about Mixture-of-experts

and 1 other •

10 days ago

• 6

View all