mihaimasala commited on
Commit
2dac433
·
verified ·
1 Parent(s): c43904d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -29,7 +29,7 @@ Furthermore, here you can find data used to train and evaluate LLMs in Romanian.
29
 
30
  See details in [https://arxiv.org/abs/2406.18266](https://arxiv.org/abs/2406.18266) and [https://arxiv.org/abs/2405.07703](https://arxiv.org/abs/2405.07703).
31
 
32
- - 2025-04-23: we increased the datasets used for supervised finetuning with high-quality data generated using Magpie approach, and greatly increase the size of the alignment dataset by adding high-quality datasets
33
 
34
  We encourage the community to engage in discussions (to provide feedback, ask questions, or make improvement suggestions) in Hugging Face or GitHub.
35
 
 
29
 
30
  See details in [https://arxiv.org/abs/2406.18266](https://arxiv.org/abs/2406.18266) and [https://arxiv.org/abs/2405.07703](https://arxiv.org/abs/2405.07703).
31
 
32
+ - 2025-04-23: we increased the datasets used for supervised finetuning with high-quality data generated using Magpie ([RoMagpie-Reasoning](https://huggingface.co/datasets/OpenLLM-Ro/ro_sft_magpie_reasoning) and [RoMagpie-Pro-MT](https://huggingface.co/datasets/OpenLLM-Ro/ro_sft_magpie_mt)), and greatly increase the size of the alignment dataset by adding high-quality datasets ([RoUltraFeedback](https://huggingface.co/datasets/OpenLLM-Ro/ro_dpo_ultrafeedback), [RoMagpie-DPO](https://huggingface.co/datasets/OpenLLM-Ro/ro_dpo_magpie), [RoArgillaMagpieUltra](https://huggingface.co/datasets/OpenLLM-Ro/ro_dpo_argilla_magpie) and [RoHelpSteer2](https://huggingface.co/datasets/OpenLLM-Ro/ro_dpo_helpsteer2))
33
 
34
  We encourage the community to engage in discussions (to provide feedback, ask questions, or make improvement suggestions) in Hugging Face or GitHub.
35