Update README.md
Browse files
README.md
CHANGED
@@ -1,7 +1,3 @@
|
|
1 |
-
# DeepSeek-R1-Distill-SRE-Qwen-32B-INT8
|
2 |
-
|
3 |
-
## Model Introduction
|
4 |
-
|
5 |
---
|
6 |
license: apache-2.0
|
7 |
datasets:
|
@@ -20,6 +16,10 @@ tags:
|
|
20 |
- deepseek
|
21 |
---
|
22 |
|
|
|
|
|
|
|
|
|
23 |
`DeepSeek-R1-Distill-SRE-Qwen-32B-INT8` is the industry's first publicly available operations large model. It is a specialized mixed-precision 8-bit quantized large language model fine-tuned from the `DeepSeek-R1-Distill-Qwen-32B` model, optimized specifically for **operations** and **Site Reliability Engineering (SRE)** scenarios. This model inherits the powerful reasoning capabilities of the DeepSeek-R1 series and has been further fine-tuned using the [ahmedgongi/Devops_LLM](https://huggingface.co/datasets/ahmedgongi/Devops_LLM) dataset, significantly enhancing its utility in the following tasks:
|
24 |
|
25 |
- Automated script generation
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
datasets:
|
|
|
16 |
- deepseek
|
17 |
---
|
18 |
|
19 |
+
# DeepSeek-R1-Distill-SRE-Qwen-32B-INT8
|
20 |
+
|
21 |
+
## Model Introduction
|
22 |
+
|
23 |
`DeepSeek-R1-Distill-SRE-Qwen-32B-INT8` is the industry's first publicly available operations large model. It is a specialized mixed-precision 8-bit quantized large language model fine-tuned from the `DeepSeek-R1-Distill-Qwen-32B` model, optimized specifically for **operations** and **Site Reliability Engineering (SRE)** scenarios. This model inherits the powerful reasoning capabilities of the DeepSeek-R1 series and has been further fine-tuned using the [ahmedgongi/Devops_LLM](https://huggingface.co/datasets/ahmedgongi/Devops_LLM) dataset, significantly enhancing its utility in the following tasks:
|
24 |
|
25 |
- Automated script generation
|