rajabmondal committed on
Commit 5e81566 · verified · 1 Parent(s): 1d960c0

Updated model training details

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -81,8 +81,8 @@ The model has been trained on source code from 80+ programming languages. The pr
 ## Model
 
 - **Architecture:** GPT-2 model with multi-query attention and Fill-in-the-Middle objective
-- **Pretraining steps:** k
-- **Pretraining tokens:** 1 trillion
+- **•Fine-training steps:** 50k
+- **Pretraining tokens:** 22 Billion
 - **Precision:** bfloat16
 
 ## Hardware