from-our-page/llama3.1-8b-refinedbase-checkpoint-5120
Continued pretraining of Llama 3.1 8B on RefinedWeb for ~80M tokens, to try to undo the annealing step and make the model act more like an actual base model.
Note: this is the final checkpoint. More info soon.
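
Since this is meant to behave like a base model, the natural way to try it is plain text continuation with no chat template. The following is a minimal sketch, assuming the checkpoint is published in the standard Hugging Face `transformers` format under the repo id shown above; the helper names (`build_generation_kwargs`, `complete`) and the sampling values are hypothetical and not taken from this card.

```python
# Hypothetical usage sketch. The repo id is copied from the listing above;
# that it loads with the standard transformers classes is an assumption.
MODEL_ID = "from-our-page/llama3.1-8b-refinedbase-checkpoint-5120"


def build_generation_kwargs(max_new_tokens: int = 64) -> dict:
    """Return sampling settings for raw text continuation (illustrative values)."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": True,
        "temperature": 0.8,
        "top_p": 0.95,
    }


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Continue `prompt` as raw text, without applying any chat template."""
    # Heavy imports stay local so this file can be imported without them.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.bfloat16)

    # Tokenize the prompt as-is; a base model simply predicts what comes next.
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(**inputs, **build_generation_kwargs(max_new_tokens))
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("The history of the printing press begins")` would download the weights and return the prompt followed by a sampled continuation. Loading an 8B model in bfloat16 needs roughly 16 GB of memory, so this is intended for a machine with enough RAM or a suitable GPU.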