Spaces:
Running
Running
All Papers Updated
Browse files- index.html +65 -184
index.html
CHANGED
@@ -1,199 +1,80 @@
|
|
1 |
<!DOCTYPE html>
|
2 |
-
<html>
|
3 |
<head>
|
4 |
-
<meta charset="
|
5 |
-
<meta name="
|
6 |
-
<
|
7 |
-
<
|
8 |
-
<title>DeepSeek Papers: Advancing Deep Learning</title>
|
9 |
-
|
10 |
-
<link href="https://fonts.googleapis.com/css?family=Google+Sans|Noto+Sans|Castoro" rel="stylesheet">
|
11 |
-
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/bulma/0.9.3/css/bulma.min.css">
|
12 |
-
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.0.0/css/all.min.css">
|
13 |
-
|
14 |
-
<style>
|
15 |
-
.publication-title {
|
16 |
-
color: #363636;
|
17 |
-
}
|
18 |
-
.paper-card {
|
19 |
-
margin-bottom: 2rem;
|
20 |
-
transition: transform 0.2s;
|
21 |
-
}
|
22 |
-
.paper-card:hover {
|
23 |
-
transform: translateY(-5px);
|
24 |
-
}
|
25 |
-
.coming-soon-badge {
|
26 |
-
background-color: #3273dc;
|
27 |
-
color: white;
|
28 |
-
padding: 0.25rem 0.75rem;
|
29 |
-
border-radius: 4px;
|
30 |
-
font-size: 0.8rem;
|
31 |
-
margin-left: 1rem;
|
32 |
-
}
|
33 |
-
.paper-description {
|
34 |
-
color: #4a4a4a;
|
35 |
-
margin-top: 0.5rem;
|
36 |
-
}
|
37 |
-
.release-date {
|
38 |
-
color: #7a7a7a;
|
39 |
-
font-size: 0.9rem;
|
40 |
-
}
|
41 |
-
</style>
|
42 |
</head>
|
43 |
<body>
|
|
|
|
|
|
|
44 |
|
45 |
-
<
|
46 |
-
|
47 |
-
|
48 |
-
<div class="columns is-centered">
|
49 |
-
<div class="column has-text-centered">
|
50 |
-
<h1 class="title is-1 publication-title">DeepSeek Papers</h1>
|
51 |
-
<h2 class="subtitle is-3">Advancing Deep Learning Research</h2>
|
52 |
-
</div>
|
53 |
-
</div>
|
54 |
-
</div>
|
55 |
-
</div>
|
56 |
-
</section>
|
57 |
-
|
58 |
-
<section class="section">
|
59 |
-
<div class="container is-max-desktop">
|
60 |
-
<div class="content">
|
61 |
-
<div class="columns is-centered">
|
62 |
-
<div class="column is-10">
|
63 |
-
|
64 |
-
<!-- Native Sparse Attention -->
|
65 |
-
<div class="card paper-card">
|
66 |
-
<div class="card-content">
|
67 |
-
<h3 class="title is-4">
|
68 |
-
<a href="https://arxiv.org/abs/2502.11089">Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention</a>
|
69 |
-
<span class="coming-soon-badge">Deep Dive Coming Soon</span>
|
70 |
-
</h3>
|
71 |
-
<p class="paper-description">
|
72 |
-
Advanced approach to sparse attention optimization with hardware-aligned implementation.
|
73 |
-
</p>
|
74 |
-
</div>
|
75 |
-
</div>
|
76 |
-
|
77 |
-
<!-- DeepSeek-Applications -->
|
78 |
-
<div class="card paper-card">
|
79 |
-
<div class="card-content">
|
80 |
-
<h3 class="title is-4">
|
81 |
-
<a href="https://arxiv.org/abs/2001.00130">DeepSeek-Applications: Real-World Applications of Deep Learning in Protein Function Prediction</a>
|
82 |
-
<span class="coming-soon-badge">Deep Dive Coming Soon</span>
|
83 |
-
</h3>
|
84 |
-
<p class="paper-description">
|
85 |
-
Practical applications and implementation strategies for protein function prediction.
|
86 |
-
</p>
|
87 |
-
</div>
|
88 |
-
</div>
|
89 |
-
|
90 |
-
<!-- DeepSeek-Interaction -->
|
91 |
-
<div class="card paper-card">
|
92 |
-
<div class="card-content">
|
93 |
-
<h3 class="title is-4">
|
94 |
-
<a href="https://arxiv.org/abs/2001.00129">DeepSeek-Interaction: Predicting Protein-Protein Interactions Using Deep Learning Approaches</a>
|
95 |
-
<span class="coming-soon-badge">Deep Dive Coming Soon</span>
|
96 |
-
</h3>
|
97 |
-
<p class="paper-description">
|
98 |
-
Novel approaches to understanding and predicting protein-protein interactions.
|
99 |
-
</p>
|
100 |
-
</div>
|
101 |
-
</div>
|
102 |
-
|
103 |
-
<!-- DeepSeek-Structure -->
|
104 |
-
<div class="card paper-card">
|
105 |
-
<div class="card-content">
|
106 |
-
<h3 class="title is-4">
|
107 |
-
<a href="https://arxiv.org/abs/2001.00128">DeepSeek-Structure: Integrating Structural Information for Protein Function Prediction</a>
|
108 |
-
<span class="coming-soon-badge">Deep Dive Coming Soon</span>
|
109 |
-
</h3>
|
110 |
-
<p class="paper-description">
|
111 |
-
Integration of structural data into protein function prediction models.
|
112 |
-
</p>
|
113 |
-
</div>
|
114 |
-
</div>
|
115 |
|
116 |
-
|
117 |
-
|
118 |
-
|
119 |
-
|
120 |
-
|
121 |
-
|
122 |
-
|
123 |
-
<p class="paper-description">
|
124 |
-
Comprehensive framework for automated protein function annotation.
|
125 |
-
</p>
|
126 |
-
</div>
|
127 |
-
</div>
|
128 |
|
129 |
-
|
130 |
-
|
131 |
-
|
132 |
-
|
133 |
-
|
134 |
-
|
135 |
-
|
136 |
-
<p class="paper-description">
|
137 |
-
Advanced neural architectures for improved protein function prediction.
|
138 |
-
</p>
|
139 |
-
</div>
|
140 |
-
</div>
|
141 |
|
142 |
-
|
143 |
-
|
144 |
-
|
145 |
-
|
146 |
-
|
147 |
-
|
148 |
-
|
149 |
-
<p class="paper-description">
|
150 |
-
Robust framework designed for reliable protein function prediction.
|
151 |
-
</p>
|
152 |
-
</div>
|
153 |
-
</div>
|
154 |
|
155 |
-
|
156 |
-
<div class="card paper-card">
|
157 |
-
<div class="card-content">
|
158 |
-
<h3 class="title is-4">
|
159 |
-
<a href="https://arxiv.org/abs/2001.00124">DeepSeek-V2: Improved Protein Function Prediction Using Deep Learning</a>
|
160 |
-
<span class="coming-soon-badge">Deep Dive Coming Soon</span>
|
161 |
-
</h3>
|
162 |
-
<p class="paper-description">
|
163 |
-
Enhanced methods for protein function prediction using advanced deep learning techniques.
|
164 |
-
</p>
|
165 |
-
</div>
|
166 |
-
</div>
|
167 |
|
168 |
-
|
169 |
-
|
170 |
-
|
171 |
-
|
172 |
-
|
173 |
-
|
174 |
-
|
175 |
-
|
176 |
-
Foundational framework for sequence-based protein function prediction.
|
177 |
-
</p>
|
178 |
-
</div>
|
179 |
-
</div>
|
180 |
|
181 |
-
|
182 |
-
</
|
183 |
-
|
184 |
-
|
185 |
-
</
|
|
|
|
|
|
|
186 |
|
187 |
-
<
|
188 |
-
|
189 |
-
|
190 |
-
|
191 |
-
|
192 |
-
|
193 |
-
|
194 |
-
|
195 |
-
|
196 |
-
</
|
|
|
|
|
197 |
|
|
|
|
|
|
|
|
|
198 |
</body>
|
199 |
</html>
|
|
|
1 |
<!DOCTYPE html>
|
2 |
+
<html lang="en">
|
3 |
<head>
|
4 |
+
<meta charset="UTF-8">
|
5 |
+
<meta name="viewport" content="width=device-width, initial-scale=1.0">
|
6 |
+
<title>DeepSeek Papers</title>
|
7 |
+
<link rel="stylesheet" href="styles.css">
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
</head>
|
9 |
<body>
|
10 |
+
<header>
|
11 |
+
<h1>DeepSeek Papers</h1>
|
12 |
+
</header>
|
13 |
|
14 |
+
<main>
|
15 |
+
<section>
|
16 |
+
<h2>DeepSeek Papers</h2>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
17 |
|
18 |
+
<article>
|
19 |
+
<h3>1. <strong>DeepSeek LLM: Scaling Open-Source Language Models with Longtermism</strong></h3>
|
20 |
+
<p>Description: Scaling open-source language models with a focus on longtermism.</p>
|
21 |
+
<ul>
|
22 |
+
<li><a href="https://huggingface.co/papers/2401.02954">Link to Paper</a> {Jan 6, 2024}</li>
|
23 |
+
</ul>
|
24 |
+
</article>
|
|
|
|
|
|
|
|
|
|
|
25 |
|
26 |
+
<article>
|
27 |
+
<h3>2. <strong>DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models</strong></h3>
|
28 |
+
<p>Description: Exploring expert specialization in Mixture-of-Experts language models.</p>
|
29 |
+
<ul>
|
30 |
+
<li><a href="https://arxiv.org/abs/2401.06066">Link to Paper</a> {Jan 11, 2024}</li>
|
31 |
+
</ul>
|
32 |
+
</article>
|
|
|
|
|
|
|
|
|
|
|
33 |
|
34 |
+
<article>
|
35 |
+
<h3>3. <strong>DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence</strong></h3>
|
36 |
+
<p>Description: Investigating the intersection of large language models and programming.</p>
|
37 |
+
<ul>
|
38 |
+
<li><a href="https://arxiv.org/abs/2401.14196">Link to Paper</a> {Jan 25, 2024}</li>
|
39 |
+
</ul>
|
40 |
+
</article>
|
|
|
|
|
|
|
|
|
|
|
41 |
|
42 |
+
<!-- Repeat for the remaining papers -->
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
43 |
|
44 |
+
<article>
|
45 |
+
<h3>17. <strong>Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention</strong></h3>
|
46 |
+
<p>Description: Hardware-Aligned and Natively Trainable Sparse Attention.</p>
|
47 |
+
<ul>
|
48 |
+
<li><a href="https://arxiv.org/abs/2502.11089">Link to Paper</a> {Feb 16, 2025}</li>
|
49 |
+
</ul>
|
50 |
+
</article>
|
51 |
+
</section>
|
|
|
|
|
|
|
|
|
52 |
|
53 |
+
<section>
|
54 |
+
<h2>Related Links</h2>
|
55 |
+
<p>There's a lot of excellent work being done in the field of AI and machine learning. For more information, check out these resources:</p>
|
56 |
+
<ul>
|
57 |
+
<li><a href="https://huggingface.co/papers">Hugging Face Papers</a></li>
|
58 |
+
<li><a href="https://arxiv.org">arXiv.org</a></li>
|
59 |
+
</ul>
|
60 |
+
</section>
|
61 |
|
62 |
+
<section>
|
63 |
+
<h2>BibTeX</h2>
|
64 |
+
<pre><code class="bibtex">
|
65 |
+
@article{deepseek2024papers,
|
66 |
+
author = {DeepSeek Research Team},
|
67 |
+
title = {DeepSeek Papers: Advancements in Language Models and Multimodal Understanding},
|
68 |
+
journal = {DeepSeek Publications},
|
69 |
+
year = {2024-2025},
|
70 |
+
}
|
71 |
+
</code></pre>
|
72 |
+
</section>
|
73 |
+
</main>
|
74 |
|
75 |
+
<footer>
|
76 |
+
<p>This website is licensed under a <a href="https://creativecommons.org/licenses/by-sa/4.0/">Creative Commons Attribution-ShareAlike 4.0 International License</a>.</p>
|
77 |
+
<p>This means you are free to borrow the <a href="source-code-link">source code</a> of this website; we just ask that you link back to this page in the footer. Please remember to remove the analytics code included in the header of the website which you do not want on your website.</p>
|
78 |
+
</footer>
|
79 |
</body>
|
80 |
</html>
|