Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources
Zihao Li
Zihao-Li


·
AI & ML interests
Multilingual NLP
Recent Activity
updated
a Space
3 days ago
Zihao-Li/MT-HumanEval
updated
a dataset
4 days ago
MaLA-LM/MassiveSumm_short
updated
a dataset
4 days ago
MaLA-LM/MassiveSumm_long
Organizations
Collections
1
models
38

Zihao-Li/V7-Bi-Code-Stag
Text Generation
•
Updated
•
3

Zihao-Li/V7-Bi-Code-Alt
Text Generation
•
Updated
•
3

Zihao-Li/V7-Bi-Code-Sel
Text Generation
•
Updated
•
1

Zihao-Li/V7-Mono-Alt
Text Generation
•
Updated
•
2

Zihao-Li/V7-Bi-Sel
Text Generation
•
Updated
•
1

Zihao-Li/V7-Bi-Stag
Text Generation
•
Updated
•
2

Zihao-Li/V7-Mono-Code-Alt
Text Generation
•
Updated
•
1

Zihao-Li/V7-Mono-Code-Sel
Text Generation
•
Updated
•
1

Zihao-Li/V7-Mono-Code-Stag
Text Generation
•
Updated
•
3

Zihao-Li/V7-Bi-Alt
Text Generation
•
Updated
•
3