Phpcool/DeepSeek-R1-Distill-SRE-Qwen-32B-INT8

Text Generation

Phpcool committed on Feb 24

Commit 402ca91 · verified · 1 Parent(s): 495bb59

Update README.md

Files changed (1)

README.md +4 -4


@@ -1,7 +1,3 @@
-# DeepSeek-R1-Distill-SRE-Qwen-32B-INT8
-## Model Introduction
 ---
 license: apache-2.0
 datasets:
@@ -20,6 +16,10 @@ tags:
 - deepseek
 ---
+# DeepSeek-R1-Distill-SRE-Qwen-32B-INT8
+## Model Introduction
 `DeepSeek-R1-Distill-SRE-Qwen-32B-INT8` is the industry's first publicly available operations large model. It is a specialized mixed-precision 8-bit quantized large language model fine-tuned from the `DeepSeek-R1-Distill-Qwen-32B` model, optimized specifically for **operations** and **Site Reliability Engineering (SRE)** scenarios. This model inherits the powerful reasoning capabilities of the DeepSeek-R1 series and has been further fine-tuned using the [ahmedgongi/Devops_LLM](https://huggingface.co/datasets/ahmedgongi/Devops_LLM) dataset, significantly enhancing its utility in the following tasks:
 - Automated script generation