prapti19 commited on
Commit
a4906e0
·
verified ·
1 Parent(s): f538515

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -4
README.md CHANGED
@@ -21,9 +21,9 @@ axolotl version: `0.5.0`
21
 
22
  </details><br>
23
 
24
- # curator_math_phase1_sn_ensemble7_90325
25
 
26
- This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on the None dataset.
27
  It achieves the following results on the evaluation set:
28
  - Loss: 0.3203
29
 
@@ -33,11 +33,13 @@ More information needed
33
 
34
  ## Intended uses & limitations
35
 
36
- More information needed
37
 
38
  ## Training and evaluation data
39
 
40
- More information needed
 
 
41
 
42
  ## Training procedure
43
 
@@ -67,6 +69,9 @@ The following hyperparameters were used during training:
67
  | 0.3248 | 0.6669 | 2486 | 0.3203 |
68
 
69
 
 
 
 
70
  ### Framework versions
71
 
72
  - PEFT 0.13.2
 
21
 
22
  </details><br>
23
 
24
+ # Collinear Curator 1:
25
 
26
+ This is an open-source fine-tuned reasoning adapter of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct), transformed into a math reasoning model using data curated from [collinear-ai/R1-Distill-SFT-Curated](https://huggingface.co/datasets/collinear-ai/R1-Distill-SFT-Curated).
27
  It achieves the following results on the evaluation set:
28
  - Loss: 0.3203
29
 
 
33
 
34
  ## Intended uses & limitations
35
 
36
+ Math Reasoning
37
 
38
  ## Training and evaluation data
39
 
40
+ - Training data: [collinear-ai/R1-Distill-SFT-Curated](https://huggingface.co/datasets/collinear-ai/R1-Distill-SFT-Curated)
41
+ - Evaluation data: [HuggingFaceH4/MATH-500](https://huggingface.co/datasets/HuggingFaceH4/MATH-500)
42
+
43
 
44
  ## Training procedure
45
 
 
69
  | 0.3248 | 0.6669 | 2486 | 0.3203 |
70
 
71
 
72
+ ### Evaluation Results on Math500
73
+
74
+
75
  ### Framework versions
76
 
77
  - PEFT 0.13.2