1. DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Description: Scaling open-source language models with a focus on longtermism.
- Link to Paper {Jan 6, 2024}
Description: Scaling open-source language models with a focus on longtermism.
Description: Exploring expert specialization in Mixture-of-Experts language models.
Description: Investigating the intersection of large language models and programming.
Description: Hardware-Aligned and Natively Trainable Sparse Attention.
There's a lot of excellent work being done in the field of AI and machine learning. For more information, check out these resources:
@article{deepseek2024papers,
author = {DeepSeek Research Team},
title = {DeepSeek Papers: Advancements in Language Models and Multimodal Understanding},
journal = {DeepSeek Publications},
year = {2024-2025},
}