- Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios • By quotientai and 3 others • 2 days ago
- Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time • By rbrt and 4 others • Feb 18
- Building Multimodal RAG Systems: Supercharging Retrieval with MultiModal Embeddings and LLMs • By Omartificial-Intelligence-Space • 4 days ago
- Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment • By NormalUhr • Feb 11
- What is The Agent2Agent Protocol (A2A) and Why You Must Learn It Now • By lynn-mikami • 23 days ago
- What is MoE 2.0? Update Your Knowledge about Mixture-of-experts • By Kseniase and 1 other • 7 days ago