Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deepseek-ai 's Collections
DeepSeek-R1
DeepSeek-V3
DeepSeek-VL2
Janus
DeepSeek-Prover
DeepSeek-V2
DeepSeekCoder-V2
DeepSeek-Math
ESFT
DeepSeek-VL
DeepSeek-Coder
DeepSeek-LLM
DeepSeek-MoE
DeepSeek-V2.5

DeepSeek-MoE

updated Aug 16, 2024

DeepSeek MoE series

Upvote
15

  • deepseek-ai/deepseek-moe-16b-base

    Text Generation • Updated Jan 12, 2024 • 13.4k • 119

  • deepseek-ai/deepseek-moe-16b-chat

    Text Generation • Updated Feb 5, 2024 • 7.23k • 138

  • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

    Paper • 2401.06066 • Published Jan 11, 2024 • 55
Upvote
15
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs