Cross-Tokenizer Distillation via Approximate Likelihood Matching Paper • 2503.20083 • Published Mar 25 • 1
Retrofitting (Large) Language Models with Dynamic Tokenization Paper • 2411.18553 • Published Nov 27, 2024 • 2
Cross-Tokenizer Distillation via Approximate Likelihood Matching Paper • 2503.20083 • Published Mar 25 • 1