Update README.md
README.md
CHANGED
@@ -19,7 +19,7 @@ An intriguing aspect of adapting T-pro-it-1.0 is that this model was obtained th
<img src="https://cdn-uploads.huggingface.co/production/uploads/652cedbdf120598322ae358a/sKwHvA9ztd7rHx37Ca2ey.png" style="display: block; margin: 0 auto; max-width: 50%; height: auto;">
-For adaptation, we use sampling from a combination of the open datasets HuggingFaceFW/fineweb-2 and IlyaGusev/
+For adaptation, we use sampling from a combination of the open datasets **HuggingFaceFW/fineweb-2** and **IlyaGusev/rulm**. The study of the impact of data volume and quality on the current process is ongoing.
## Papers
Tikhomirov M., Chernyshov D. Facilitating Large Language Model Russian Adaptation with Learned Embedding Propagation // Journal of Language and Education. 2024. Vol. 10, No. 4. pp. 130-145. (Preprint: https://arxiv.org/abs/2412.21140)