arcee-globe's Collections

Arabic ORPO-DPO Datasets

updated Aug 17, 2024

  • arcee-globe/cleaned-NoRobots-Command.R-ORPO

    Viewer • Updated Aug 15, 2024 • 9.5k • 14

    Note: External dataset.


  • arcee-globe/filtered-arabic-distilabel-math-preference-orpo

    Viewer • Updated Aug 15, 2024 • 1.96k • 19

    Note: Translated with GPT-4o mini.


  • arcee-globe/mixed-argilla-orpo-mix-7k-arabic

    Viewer • Updated Aug 15, 2024 • 6.75k • 13

    Note: External dataset with a mixture of multi-turn and single-turn samples.


  • arcee-globe/arabic-jondurbin-truthy-orpo-v0.1

    Viewer • Updated Aug 16, 2024 • 920 • 9

    Note: Translated with GPT-4o mini.


  • arcee-train/cleaned-Aya-Command.R-ORPO

    Viewer • Updated Aug 14, 2024 • 14.2k • 14

    Note: External dataset.


  • arcee-globe/arabic-distilabel-capybara-orpo-7k-binarized

    Viewer • Updated Aug 17, 2024 • 5.4k • 15

    Note: Has multi-turn instances. Translated with GPT-4o mini.


  • arcee-globe/arabic-orpo-dpo-mix-40k-filtered

    Viewer • Updated Aug 17, 2024 • 31.8k • 11

    Note: Might have multi-turn instances. The toxic subset was removed from the original dataset. Translated with GPT-4o mini.
