deepseek-mla / assets /mla_architecture.png
Yan Wei
Initial commit: DeepSeek Multi-Latent Attention implementation
550eb56
mla_architecture.png