Multilingual SFT & DPO Datasets
These SFT and DPO datasets were either translated from English using Mistral-7B-Instruct-v0.2 or taken from other sources.
Note: SFT dataset containing multi-turn conversations in 9 languages, created by translating the `deita-10k-v0-instruct` dataset.
nthakur/Bactrian-X-23-lang-instruct
Note: SFT dataset ported from the `MBZUAI/Bactrian-X` dataset, covering 23 languages.
nthakur/GSM8KInstruct-Parallel-instruct
Note: SFT dataset containing mathematical formulae in 10 languages (including English), created by translating the GSM8KInstruct dataset.
nthakur/ultrachat-200k-instruct
Note: SFT dataset containing 200k training pairs in English from the ultrachat dataset.
nthakur/multilingual-distilabel-intel-orca-dpo-pairs-v0.1
Note: DPO dataset of multilingual training pairs, created by translating the ORCA DPO dataset into 10 languages.
nthakur/GSM8KInstruct-Parallel-instruct-dpo-v0.1
Note: DPO dataset for GSM8KInstruct, translated into 10 languages plus English.
nthakur/multilingual-truthy-dpo-pairs-v0.1
Note: DPO dataset built from the English truthy dataset, translated into multiple languages.
nthakur/multilingual-ultrafeedback-binarized-dpo-v0.1
Note: DPO dataset created by translating the ultrafeedback-binarized dataset into multiple languages.
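
As a rough illustration of how the datasets listed above might be pulled for training, the sketch below groups the repository IDs by stage and loads them with the Hugging Face `datasets` library. The `COLLECTION` mapping and the helpers `repo_ids` and `load_stage` are hypothetical names introduced here. Only repository IDs that appear in the list are included, and no split or column names are assumed, since the list does not state them.

```python
# Repository IDs copied from the list above, grouped by training stage.
# The grouping follows each entry's note (SFT vs. DPO).
COLLECTION = {
    "sft": [
        "nthakur/Bactrian-X-23-lang-instruct",
        "nthakur/GSM8KInstruct-Parallel-instruct",
        "nthakur/ultrachat-200k-instruct",
    ],
    "dpo": [
        "nthakur/multilingual-distilabel-intel-orca-dpo-pairs-v0.1",
        "nthakur/GSM8KInstruct-Parallel-instruct-dpo-v0.1",
        "nthakur/multilingual-truthy-dpo-pairs-v0.1",
        "nthakur/multilingual-ultrafeedback-binarized-dpo-v0.1",
    ],
}


def repo_ids(stage):
    """Return the repository IDs for a stage ("sft" or "dpo")."""
    return list(COLLECTION[stage.lower()])


def load_stage(stage):
    """Download every dataset for a stage and return {repo_id: DatasetDict}.

    Requires the third-party `datasets` package and network access; the
    import is done lazily so the helpers above work without it.
    """
    from datasets import load_dataset

    # No split is passed, so each value holds whatever splits the repo defines.
    return {repo_id: load_dataset(repo_id) for repo_id in repo_ids(stage)}
```

Calling `load_stage("dpo")` would download all four DPO datasets, which may be slow; inspecting one result with `print(...)` is a reasonable first step before relying on any particular split or column.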