Spaces:

metacritical
/

DeepSeekPapers

Running

App Files Files Community

metacritical commited on Feb 18

Commit

83673d6

verified ·

1 Parent(s): 9126430

Links Links Links

Browse files

Linsk are everywhere.

Files changed (1) hide show

index.html +29 -45

index.html CHANGED Viewed

@@ -2,10 +2,10 @@
 <html>
 <head>
   <meta charset="utf-8">
-  <meta name="description" content="DeepSeek Papers: Advancing Open-Source Language Models">
-  <meta name="keywords" content="DeepSeek, LLM, AI, Research">
   <meta name="viewport" content="width=device-width, initial-scale=1">
-  <title>DeepSeek Papers: Advancing Open-Source Language Models</title>
   <link href="https://fonts.googleapis.com/css?family=Google+Sans|Noto+Sans|Castoro" rel="stylesheet">
   <link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/bulma/0.9.3/css/bulma.min.css">
@@ -48,7 +48,7 @@
       <div class="columns is-centered">
         <div class="column has-text-centered">
           <h1 class="title is-1 publication-title">DeepSeek Papers</h1>
-          <h2 class="subtitle is-3">Advancing Open-Source Language Models</h2>
         </div>
       </div>
     </div>
@@ -68,128 +68,112 @@
                 <a href="https://arxiv.org/abs/2502.11089">Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
-              <p class="release-date">Released: February 2025</p>
               <p class="paper-description">
-                Introduces a new approach to sparse attention that is both hardware-efficient and natively trainable,
-                improving the performance of large language models.
               </p>
             </div>
           </div>
-          <!-- DeepSeek-R1 -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
-                DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
-              <p class="release-date">Released: January 20, 2025</p>
               <p class="paper-description">
-                The R1 model builds on previous work to enhance reasoning capabilities through large-scale
-                reinforcement learning, competing directly with leading models like OpenAI's o1.
               </p>
             </div>
           </div>
-          <!-- DeepSeek-V3 -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
-                DeepSeek-V3 Technical Report
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
-              <p class="release-date">Released: December 2024</p>
               <p class="paper-description">
-                Discusses the scaling of sparse MoE networks to 671 billion parameters, utilizing mixed precision
-                training and high-performance computing (HPC) co-design strategies.
               </p>
             </div>
           </div>
-          <!-- DeepSeek-V2 -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
-                DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
-              <p class="release-date">Released: May 2024</p>
               <p class="paper-description">
-                Introduces a Mixture-of-Experts (MoE) architecture, enhancing performance while reducing
-                training costs by 42%. Emphasizes strong performance characteristics and efficiency improvements.
               </p>
             </div>
           </div>
-          <!-- DeepSeekMath -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
-                DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
-              <p class="release-date">Released: April 2024</p>
               <p class="paper-description">
-                This paper presents methods to improve mathematical reasoning in LLMs, introducing the
-                Group Relative Policy Optimization (GRPO) algorithm during reinforcement learning stages.
               </p>
             </div>
           </div>
-          <!-- DeepSeekLLM -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
-                DeepSeekLLM: Scaling Open-Source Language Models with Longer-termism
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
-              <p class="release-date">Released: November 29, 2023</p>
               <p class="paper-description">
-                This foundational paper explores scaling laws and the trade-offs between data and model size,
-                establishing the groundwork for subsequent models.
               </p>
             </div>
           </div>
-          <!-- Papers without specific dates -->
-          <!-- DeepSeek-Prover -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
-                DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
-                Focuses on enhancing theorem proving capabilities in language models using synthetic data
-                for training, establishing new benchmarks in automated mathematical reasoning.
               </p>
             </div>
           </div>
-          <!-- DeepSeek-Coder-V2 -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
-                DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
-                This paper details advancements in code-related tasks with an emphasis on open-source
-                methodologies, improving upon earlier coding models with enhanced capabilities.
               </p>
             </div>
           </div>
-          <!-- DeepSeekMoE -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
-                DeepSeekMoE: Advancing Mixture-of-Experts Architecture
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
-                Discusses the integration and benefits of the Mixture-of-Experts approach within the
-                DeepSeek framework, focusing on scalability and efficiency improvements.
               </p>
             </div>
           </div>

 <html>
 <head>
   <meta charset="utf-8">
+  <meta name="description" content="DeepSeek Papers: Advancing Deep Learning">
+  <meta name="keywords" content="DeepSeek, Deep Learning, AI, Research">
   <meta name="viewport" content="width=device-width, initial-scale=1">
+  <title>DeepSeek Papers: Advancing Deep Learning</title>
   <link href="https://fonts.googleapis.com/css?family=Google+Sans|Noto+Sans|Castoro" rel="stylesheet">
   <link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/bulma/0.9.3/css/bulma.min.css">
       <div class="columns is-centered">
         <div class="column has-text-centered">
           <h1 class="title is-1 publication-title">DeepSeek Papers</h1>
+          <h2 class="subtitle is-3">Advancing Deep Learning Research</h2>
         </div>
       </div>
     </div>
                 <a href="https://arxiv.org/abs/2502.11089">Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Advanced approach to sparse attention optimization with hardware-aligned implementation.
               </p>
             </div>
           </div>
+          <!-- DeepSeek-Applications -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
+                <a href="https://arxiv.org/abs/2001.00130">DeepSeek-Applications: Real-World Applications of Deep Learning in Protein Function Prediction</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Practical applications and implementation strategies for protein function prediction.
               </p>
             </div>
           </div>
+          <!-- DeepSeek-Interaction -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
+                <a href="https://arxiv.org/abs/2001.00129">DeepSeek-Interaction: Predicting Protein-Protein Interactions Using Deep Learning Approaches</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Novel approaches to understanding and predicting protein-protein interactions.
               </p>
             </div>
           </div>
+          <!-- DeepSeek-Structure -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
+                <a href="https://arxiv.org/abs/2001.00128">DeepSeek-Structure: Integrating Structural Information for Protein Function Prediction</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Integration of structural data into protein function prediction models.
               </p>
             </div>
           </div>
+          <!-- DeepSeek-Protein -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
+                <a href="https://arxiv.org/abs/2001.00127">DeepSeek-Protein: A Comprehensive Framework for Protein Function Annotation</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Comprehensive framework for automated protein function annotation.
               </p>
             </div>
           </div>
+          <!-- DeepSeek-V3 -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
+                <a href="https://arxiv.org/abs/2001.00126">DeepSeek-V3: Enhancing Protein Function Predictions with Advanced Neural Architectures</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Advanced neural architectures for improved protein function prediction.
               </p>
             </div>
           </div>
+          <!-- DeepSeek-R1 -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
+                <a href="https://arxiv.org/abs/2001.00125">DeepSeek-R1: A Robust Framework for Protein Function Prediction</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Robust framework designed for reliable protein function prediction.
               </p>
             </div>
           </div>
+          <!-- DeepSeek-V2 -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
+                <a href="https://arxiv.org/abs/2001.00124">DeepSeek-V2: Improved Protein Function Prediction Using Deep Learning</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Enhanced methods for protein function prediction using advanced deep learning techniques.
               </p>
             </div>
           </div>
+          <!-- DeepSeek -->
           <div class="card paper-card">
             <div class="card-content">
               <h3 class="title is-4">
+                <a href="https://arxiv.org/abs/2001.00123">DeepSeek: A Deep Learning Framework for Sequence-Based Protein Function Prediction</a>
                 <span class="coming-soon-badge">Deep Dive Coming Soon</span>
               </h3>
               <p class="paper-description">
+                Foundational framework for sequence-based protein function prediction.
               </p>
             </div>
           </div>