Commit 4dfe8a5 (verified) by George-API · 1 parent: 9c1fdf3

Upload README.md with huggingface_hub

  1. README.md +28 -12
README.md CHANGED
@@ -1,12 +1,28 @@
- ---
- title: Qwen4bit
- emoji:
- colorFrom: purple
- colorTo: yellow
- sdk: gradio
- sdk_version: 5.20.1
- app_file: app.py
- pinned: false
- ---
-
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+ # Fine-tuned DeepSeek-R1-Distill-Qwen-14B
+
+ This space hosts a fine-tuned version of the [unsloth/DeepSeek-R1-Distill-Qwen-14B-bnb-4bit](https://huggingface.co/unsloth/DeepSeek-R1-Distill-Qwen-14B-bnb-4bit) model.
+
+ ## Model Details
+
+ - **Base Model**: `unsloth/DeepSeek-R1-Distill-Qwen-14B-bnb-4bit`
+ - **Fine-tuned on**: `phi4-cognitive-dataset`
+ - **Quantization**: Already 4-bit quantized (no additional quantization applied)
+
+ ## Current Status
+
+ This space is currently being prepared. The fine-tuned model will be available soon.
+
+ ## Usage
+
+ Once deployed, you can interact with the model through the Gradio interface or via API.
+
+ ## Training Process
+
+ The model is being fine-tuned with the following specifications:
+ - Training dataset processed in ascending order by `prompt_number`
+ - Custom training parameters optimized for the L40S GPU
+ - Mixed precision training for optimal performance
+
+ ## Contact
+
+ For questions or issues, please reach out through the [Hugging Face community](https://huggingface.co/discussions).
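
The Training Process section of the new README says the dataset is processed in ascending order by `prompt_number`. The commit does not show how that ordering is done, so the following is only a minimal sketch of one way to do it. It assumes each record is a dict with an integer-like `prompt_number` field; the record layout, the sample rows, and the helper name `order_by_prompt_number` are hypothetical and not taken from the commit.

```python
from typing import Any, Dict, Iterable, List


def order_by_prompt_number(records: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return records sorted in ascending order by their `prompt_number` field.

    Assumption: every record carries an integer-like `prompt_number`.
    Python's sort is stable, so records that share a number keep their
    original relative order.
    """
    # Cast to int so that string values such as "10" sort numerically,
    # not lexicographically (where "10" would come before "2").
    return sorted(records, key=lambda record: int(record["prompt_number"]))


if __name__ == "__main__":
    # Hypothetical sample rows, not taken from the actual dataset.
    sample = [
        {"prompt_number": 3, "text": "third"},
        {"prompt_number": 1, "text": "first"},
        {"prompt_number": 2, "text": "second"},
    ]
    for row in order_by_prompt_number(sample):
        print(row["prompt_number"], row["text"])
```

Sorting only fixes the order of the in-memory list. If the training loop shuffles examples, that order is lost, so shuffling would presumably need to be disabled in whichever trainer is used.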