bird-of-paradise/deepseek-mla

Text Generation
Transformers
PyTorch
English
deepseek-mla
attention-mechanism
mla
efficient-attention
deepseek-mla/src/tests/__pycache__
  • 2 contributors
History: 1 commit
Yan Wei
Initial commit: DeepSeek Multi-Latent Attention implementation
550eb56 4 months ago
  • __init__.cpython-311.pyc (182 Bytes): Initial commit: DeepSeek Multi-Latent Attention implementation, 4 months ago
  • test_mla.cpython-311.pyc (6.69 kB): Initial commit: DeepSeek Multi-Latent Attention implementation, 4 months ago