Spaces:

George-API
/

qwen4bit

Sleeping

George-API commited on Mar 10

Commit

4dfe8a5

verified ·

1 Parent(s): 9c1fdf3

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,12 +1,28 @@
----
-title: Qwen4bit
-emoji: ⚡
-colorFrom: purple
-colorTo: yellow
-sdk: gradio
-sdk_version: 5.20.1
-app_file: app.py
-pinned: false
----
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+# Fine-tuned DeepSeek-R1-Distill-Qwen-14B
+This space hosts a fine-tuned version of the [unsloth/DeepSeek-R1-Distill-Qwen-14B-bnb-4bit](https://huggingface.co/unsloth/DeepSeek-R1-Distill-Qwen-14B-bnb-4bit) model.
+## Model Details
+- **Base Model**: `unsloth/DeepSeek-R1-Distill-Qwen-14B-bnb-4bit`
+- **Fine-tuned on**: `phi4-cognitive-dataset`
+- **Quantization**: Already 4-bit quantized (no additional quantization applied)
+## Current Status
+This space is currently being prepared. The fine-tuned model will be available soon.
+## Usage
+Once deployed, you can interact with the model through the Gradio interface or via API.
+## Training Process
+The model is being fine-tuned with the following specifications:
+- Training dataset processed in ascending order by `prompt_number`
+- Custom training parameters optimized for the L40S GPU
+- Mixed precision training for optimal performance
+## Contact
+For questions or issues, please reach out through the [Hugging Face community](https://huggingface.co/discussions).