Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ pipeline_tag: text-generation
|
|
15 |
---
|
16 |
# ZR1-1.5B
|
17 |
|
18 |
-
ZR1-1.5B is a small reasoning model trained extensively on both verified coding and mathematics problems with reinforcement learning. The model
|
19 |
|
20 |

|
21 |
|
|
|
15 |
---
|
16 |
# ZR1-1.5B
|
17 |
|
18 |
+
ZR1-1.5B is a small reasoning model trained extensively on both verified coding and mathematics problems with reinforcement learning. The model outperforms Llama-3.1-70B-Instruct on hard coding tasks and improves upon the base R1-Distill-1.5B model by over 50%, while achieving strong scores on math evaluations and a 37.91% pass@1 accuracy on GPQA-Diamond with just 1.5B parameters.
|
19 |
|
20 |

|
21 |
|