Blog, Articles, and discussions

Introducing HELMET

By April 16, 2025 • 23

Community Articles

view all

I trained a Language Model to schedule events with GRPO!

•

9 days ago

• 59

CircleGuardBench: New Standard for Evaluating AI Moderation Models

and 7 others •

about 22 hours ago

• 47

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios

and 3 others •

6 days ago

• 18

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

and 1 other •

1 day ago

• 10

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 129

Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability

and 1 other •

about 19 hours ago

• 8

DeepWiki: Best AI Documentation Generator for Any Github Repo

•

10 days ago

• 15

Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs

•

7 days ago

• 6

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 128

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 245

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 63

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 28

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time

and 4 others •

Feb 18

• 32

Code a simple RAG from scratch

•

Oct 29, 2024

• 66

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 83

What is The Agent2Agent Protocol (A2A) and Why You Must Learn It Now

•

26 days ago

• 17

Introducing The World's Largest Open Multilingual Language Model: BLOOM

By July 12, 2022 • 5

Announcing Evaluation on the Hub

By June 28, 2022

Convert Transformers to ONNX with Hugging Face Optimum

By June 22, 2022 • 7

Director of Machine Learning Insights [Part 3: Finance Edition]

By June 14, 2022 • 1

Efficient Table Pre-training without Real Data: An Introduction to TAPEX

By May 23, 2022 guest • 1

Announcing the Hugging Face Fellowship Program

By May 17, 2022 • 9

Gradio 3.0 is Out!

By May 16, 2022

Director of Machine Learning Insights [Part 2: SaaS Edition]

By May 13, 2022 • 1

Student Ambassador Program's call for applications is open!

By May 13, 2022 • 4

Accelerated Inference with Optimum and Transformers Pipelines

By May 10, 2022 • 2

Welcome fastai to the Hugging Face Hub

By May 6, 2022 • 2

Director of Machine Learning Insights [Series]

By April 27, 2022 • 1

Introducing Hugging Face for Education

By April 25, 2022 • 5

CO2 Emissions and the 🤗 Hub: Leading the Charge

By April 22, 2022 • 8

Community Articles

I trained a Language Model to schedule events with GRPO!

•

9 days ago

• 59

CircleGuardBench: New Standard for Evaluating AI Moderation Models

and 7 others •

about 22 hours ago

• 47

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios

and 3 others •

6 days ago

• 18

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 237

Creating your custom Ghibli Text-to-Image model

and 3 others •

7 days ago

• 15

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 546

AI Personas: The Impact of Design Choices

and 1 other •

about 21 hours ago

• 10

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

and 1 other •

1 day ago

• 10

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 129

Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability

and 1 other •

about 19 hours ago

• 8

DeepWiki: Best AI Documentation Generator for Any Github Repo

•

10 days ago

• 15

Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs

•

7 days ago

• 6

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 128

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 245

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 63

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 28

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time

and 4 others •

Feb 18

• 32

Code a simple RAG from scratch

•

Oct 29, 2024

• 66

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 83

What is The Agent2Agent Protocol (A2A) and Why You Must Learn It Now

•

26 days ago

• 17

View all