manu commited on
Commit
6581827
·
1 Parent(s): 13721b2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -8,4 +8,4 @@ language:
8
 
9
  BPE Tokenizer fitted on a custom corpus, with digit separation, byte fallback and other features from LlamaTokenizer.
10
 
11
- Only fitted on 1,000,000 samples (7.5M words).
 
8
 
9
  BPE Tokenizer fitted on a custom corpus, with digit separation, byte fallback and other features from LlamaTokenizer.
10
 
11
+ Only fitted on 1,000,000 samples.