Training in progress, epoch 1

Browse files

Files changed (7) hide show

README.md +18 -12
model.safetensors +1 -1
runs/Apr30_06-11-19_68be676fb8aa/events.out.tfevents.1745993502.68be676fb8aa.447.0 +3 -0
runs/Apr30_06-11-19_68be676fb8aa/events.out.tfevents.1745994824.68be676fb8aa.447.1 +3 -0
runs/Apr30_06-49-46_68be676fb8aa/events.out.tfevents.1745995800.68be676fb8aa.447.2 +3 -0
tokenizer.json +16 -2
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -3,6 +3,7 @@ library_name: transformers
 license: apache-2.0
 base_model: answerdotai/ModernBERT-base
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
@@ -21,11 +22,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2355
-- Accuracy: 0.9700
-- F1: 0.9700
-- Precision: 0.9702
-- Recall: 0.9700
 ## Model description
@@ -50,22 +51,27 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| No log        | 1.0   | 311  | 0.1570          | 0.9587   | 0.9588 | 0.9594    | 0.9587 |
-| 0.1864        | 2.0   | 622  | 0.2007          | 0.9644   | 0.9645 | 0.9650    | 0.9644 |
-| 0.1864        | 3.0   | 933  | 0.2530          | 0.9644   | 0.9643 | 0.9651    | 0.9644 |
-| 0.0242        | 4.0   | 1244 | 0.2355          | 0.9700   | 0.9700 | 0.9702    | 0.9700 |
-| 0.0031        | 5.0   | 1555 | 0.2446          | 0.9700   | 0.9700 | 0.9702    | 0.9700 |
 ### Framework versions
 - Transformers 4.51.3
 - Pytorch 2.6.0+cu124
-- Datasets 3.5.0
 - Tokenizers 0.21.1

 license: apache-2.0
 base_model: answerdotai/ModernBERT-base
 tags:
+- v6.0
 - generated_from_trainer
 metrics:
 - accuracy
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1422
+- Accuracy: 0.9796
+- F1: 0.9796
+- Precision: 0.9797
+- Recall: 0.9796
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| No log        | 1.0   | 315  | 0.1323          | 0.9704   | 0.9704 | 0.9705    | 0.9704 |
+| 0.2136        | 2.0   | 630  | 0.1504          | 0.9722   | 0.9723 | 0.9739    | 0.9722 |
+| 0.2136        | 3.0   | 945  | 0.1670          | 0.9722   | 0.9723 | 0.9730    | 0.9722 |
+| 0.075         | 4.0   | 1260 | 0.1422          | 0.9796   | 0.9796 | 0.9797    | 0.9796 |
+| 0.0211        | 5.0   | 1575 | 0.1496          | 0.9741   | 0.9741 | 0.9742    | 0.9741 |
+| 0.0211        | 6.0   | 1890 | 0.1505          | 0.9741   | 0.9740 | 0.9741    | 0.9741 |
+| 0.012         | 7.0   | 2205 | 0.1661          | 0.9778   | 0.9778 | 0.9779    | 0.9778 |
+| 0.0038        | 8.0   | 2520 | 0.1558          | 0.9778   | 0.9778 | 0.9779    | 0.9778 |
+| 0.0038        | 9.0   | 2835 | 0.1557          | 0.9741   | 0.9740 | 0.9741    | 0.9741 |
+| 0.0049        | 10.0  | 3150 | 0.1574          | 0.9741   | 0.9740 | 0.9741    | 0.9741 |
 ### Framework versions
 - Transformers 4.51.3
 - Pytorch 2.6.0+cu124
+- Datasets 3.5.1
 - Tokenizers 0.21.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4eb767f6dda90b18d758c3df4658c30377fd832393e696c672d5c411a9ef80c1
 size 598445936

 version https://git-lfs.github.com/spec/v1
+oid sha256:43e973b7ca73d418f70772017ad755b1a7782b90d411c4c391a038ce9c247aa8
 size 598445936

runs/Apr30_06-11-19_68be676fb8aa/events.out.tfevents.1745993502.68be676fb8aa.447.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:afe169a9662396084ee75d75e483299a8bdc1293b1373a328cb3e9162baa19f9
+size 12295

runs/Apr30_06-11-19_68be676fb8aa/events.out.tfevents.1745994824.68be676fb8aa.447.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:436bd0e39c46e791aecb72d5e40ffd2e030e282eeda9445b4d1f0f033a4bad65
+size 1032

runs/Apr30_06-49-46_68be676fb8aa/events.out.tfevents.1745995800.68be676fb8aa.447.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5e01e4ef6c828e0861b2d66dedf84b7d19a22c1b313170930d953de6f376cf7b
+size 6417

tokenizer.json CHANGED Viewed

@@ -1,7 +1,21 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 128,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 128
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 50283,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
   "added_tokens": [
     {
       "id": 0,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5a3457dd7b6e50790c6b2469013c1154a559cc6fadf524c58848989132a39bcb
 size 5432

 version https://git-lfs.github.com/spec/v1
+oid sha256:a02d8d9bda62eee62479d02b6bacd09d43dc2a04e2180633fc4111587cc031a4
 size 5432