jacob-danner committed
Commit ee6d271 · verified · 1 Parent(s): 5e0dfd3

feat: train with sorted label to make training behavior reproducible at evaluation time
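The commit message suggests that the label order was made deterministic by sorting, so that the same label-to-id mapping can be rebuilt at evaluation time. The training script is not part of this commit, so the following is only a minimal sketch of that idea. The label names and the helper `build_label_maps` are hypothetical.

```python
# Hypothetical label names; the real dataset's labels are not shown in this commit.
raw_labels = {"positive", "negative", "neutral"}


def build_label_maps(labels):
    """Build label<->id maps from a sorted label list so ids are stable across runs."""
    # Iteration order of a set of strings can differ between Python processes;
    # sorted() always yields the same order for the same labels.
    ordered = sorted(labels)
    label2id = {label: idx for idx, label in enumerate(ordered)}
    id2label = {idx: label for label, idx in label2id.items()}
    return label2id, id2label


label2id, id2label = build_label_maps(raw_labels)
```

Rebuilding the maps with the same function in the evaluation script would give the same ids, provided the label set itself is unchanged.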

Files changed (2)
  1. README.md +9 -8
  2. model.safetensors +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [openai-gpt](https://huggingface.co/openai-gpt) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3986
+- Loss: 0.3295
 
 ## Model description
 
@@ -47,13 +47,14 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.5016 | 1.0 | 12 | 1.1160 |
-| 0.8736 | 2.0 | 24 | 0.7228 |
-| 0.4681 | 3.0 | 36 | 0.4849 |
-| 0.2851 | 4.0 | 48 | 0.4339 |
-| 0.1733 | 5.0 | 60 | 0.3113 |
-| 0.1108 | 6.0 | 72 | 0.3303 |
-| 0.0841 | 7.0 | 84 | 0.3986 |
+| 1.6661 | 1.0 | 12 | 1.4575 |
+| 1.3696 | 2.0 | 24 | 1.0023 |
+| 0.6977 | 3.0 | 36 | 0.5564 |
+| 0.3751 | 4.0 | 48 | 0.4573 |
+| 0.2528 | 5.0 | 60 | 0.3803 |
+| 0.1503 | 6.0 | 72 | 0.2699 |
+| 0.0863 | 7.0 | 84 | 0.2888 |
+| 0.0547 | 8.0 | 96 | 0.3295 |
 
 
 ### Framework versions
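As a reading aid for the updated results table, the sketch below finds the epoch with the lowest validation loss and compares it with the final epoch, whose loss is the one reported in the README. The values are copied from the table; the variable names are illustrative only.

```python
# (epoch, validation_loss) pairs copied from the updated training results table.
val_loss_by_epoch = [
    (1, 1.4575), (2, 1.0023), (3, 0.5564), (4, 0.4573),
    (5, 0.3803), (6, 0.2699), (7, 0.2888), (8, 0.3295),
]

# Epoch with the lowest validation loss.
best_epoch, best_loss = min(val_loss_by_epoch, key=lambda pair: pair[1])

# The last row corresponds to the loss reported at the top of the README.
final_epoch, final_loss = val_loss_by_epoch[-1]

# The table logs step 96 at epoch 8, i.e. 12 optimizer steps per epoch.
steps_per_epoch = 96 // 8

print(f"best: epoch {best_epoch} ({best_loss}), final: epoch {final_epoch} ({final_loss})")
```

In this run the validation loss is lowest at epoch 6 and rises over the last two epochs while the training loss keeps falling.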
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7c3608554d7067516442941a29037b9956dca9679dce7edb8295503bab207489
+oid sha256:5e13a56823f51b105f62aa5a86397d4e9c9a21aa276bd6e12918cfcf65b2a0e7
 size 466181672