llama3.1-8b-refinedbase v1 - a from-our-page Collection

from-our-page 's Collections

updated Mar 28

continued pretraining of llama3.1 8b on refinedweb for ~80M tokens to try to undo the annealing step and make it act more like an actual base model