Blog, Articles, and discussions

Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive

January 15, 2024 • guest • 6

LoRA training scripts of the world, unite!
January 2, 2024 • 60

Introducing Würstchen: Fast Diffusion for Image Generation
September 13, 2023 • 18

Efficient Controllable Generation for SDXL with T2I-Adapters
September 8, 2023 • guest • 7

AudioLDM 2, but faster ⚡️
August 30, 2023 • 11

Practical 3D Asset Generation: A Step-by-Step Guide
August 1, 2023 • 8

Happy 1st anniversary 🤗 Diffusers!
July 20, 2023 • 2

Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac
June 15, 2023 • 4

Instruction-tuning Stable Diffusion with InstructPix2Pix
May 23, 2023 • 16

A Dive into Text-to-Video Models
May 8, 2023 • 37

Running IF with 🧨 diffusers on a Free Tier Google Colab
April 26, 2023 • 3

Train your ControlNet with diffusers
March 24, 2023 • 28

Swift Diffusers: Fast Stable Diffusion for Mac
February 24, 2023 • 4

Using Stable Diffusion with Core ML on Apple Silicon
December 1, 2022 • 7

VQ Diffusion with 🧨 Diffusers
November 30, 2022 • 2

Community Articles

I trained a Language Model to schedule events with GRPO!
8 days ago • 57

CircleGuardBench: New Standard for Evaluating AI Moderation Models
and 7 others • about 11 hours ago • 45

Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios
and 3 others • 5 days ago • 18

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?
Mar 17 • 236

Creating your custom Ghibli Text-to-Image model
and 3 others • 7 days ago • 15

Uncensor any LLM with abliteration
Jun 13, 2024 • 545

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs
and 1 other • about 15 hours ago • 10

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
Feb 7 • 128

AI Personas: The Impact of Design Choices
and 1 other • about 11 hours ago • 8

ColPali: Efficient Document Retrieval with Vision Language Models 👀
Jul 5, 2024 • 245

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time
and 4 others • Feb 18 • 32

DeepWiki: Best AI Documentation Generator for Any Github Repo
10 days ago • 15

Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs
6 days ago • 6

Introduction to State Space Models (SSM)
Jul 19, 2024 • 128

A Guide to Running Qwen 3 Locally with Ollama and vLLM
9 days ago • 7

Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability
and 1 other • about 8 hours ago • 5

Code a simple RAG from scratch
Oct 29, 2024 • 66

KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30 • 62

What is test-time compute and how to scale it?
and 1 other • Feb 6 • 83

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
Feb 11 • 27
