Update README.md
Browse files
README.md
CHANGED
@@ -17,7 +17,7 @@ pipeline_tag: text-generation
|
|
17 |
|
18 |
# **Deepthink-1.5B-Open-PRM**
|
19 |
|
20 |
-
> **Deepthink-1.5B-Open-PRM** is a **process-supervised reasoning model** fine-tuned from **Qwen2.5
|
21 |
|
22 |
## **Key Features**
|
23 |
|
@@ -25,7 +25,7 @@ pipeline_tag: text-generation
|
|
25 |
Fine-tuned with PRMs to reward high-quality intermediate reasoning steps — fostering step-by-step interpretability, accuracy, and educational transparency.
|
26 |
|
27 |
2. **Compact Foundation (Qwen2.5 0.5B)**
|
28 |
-
Built upon the highly efficient Qwen2.5
|
29 |
|
30 |
3. **Bilingual Math Capability**
|
31 |
Fluent in solving and explaining math problems in both **English** and **Simplified Chinese**, making it ideal for multilingual classrooms and tutoring platforms.
|
|
|
17 |
|
18 |
# **Deepthink-1.5B-Open-PRM**
|
19 |
|
20 |
+
> **Deepthink-1.5B-Open-PRM** is a **process-supervised reasoning model** fine-tuned from **Qwen2.5 1.5B** using **Process Reward Models (PRM)**. It excels at **step-by-step mathematical problem solving** in both **English** and **Simplified Chinese**, offering interpretable, logically structured responses for use in **education**, **STEM tutoring**, and **lightweight math agents**.
|
21 |
|
22 |
## **Key Features**
|
23 |
|
|
|
25 |
Fine-tuned with PRMs to reward high-quality intermediate reasoning steps — fostering step-by-step interpretability, accuracy, and educational transparency.
|
26 |
|
27 |
2. **Compact Foundation (Qwen2.5 0.5B)**
|
28 |
+
Built upon the highly efficient Qwen2.5 1.5B architecture and scaled up through distillation and reward-based alignment to 1.5B parameters, balancing reasoning quality and deployment efficiency.
|
29 |
|
30 |
3. **Bilingual Math Capability**
|
31 |
Fluent in solving and explaining math problems in both **English** and **Simplified Chinese**, making it ideal for multilingual classrooms and tutoring platforms.
|