llama3.1-8b-refinedbase v1
Collection
continued pretraining of llama3.1 8b on refinedweb for ~80M tokens to try to undo the annealing step and make it act more like an actual base model
•
8 items
•
Updated
No model card