Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
umarigan 's Collections
DPO Dataset
Computer Vision Datasets
Domain Spec. Datasets
Turkish Datasets
TR Models
Turkish LLM Fine-Tune Datasets

DPO Dataset

updated Mar 20, 2024

direct preference optimization related datasets

Upvote
-

  • argilla/reward-model-data-falcon

    Viewer • Updated Jun 7, 2023 • 7.4k • 52 • 1

  • jondurbin/gutenberg-dpo-v0.1

    Viewer • Updated Jan 12, 2024 • 918 • 905 • 142

  • ybisk/piqa

    Updated Jan 18, 2024 • 260k • 89

  • Dahoas/rm-hh-rlhf

    Viewer • Updated Dec 22, 2022 • 89.5k • 282 • 4

  • duxx/distilabel-intel-orca-dpo-pairs-tr

    Viewer • Updated Feb 5, 2024 • 3.98k • 73 • 7

  • Dahoas/rm_instruct_helpful_preferences

    Viewer • Updated Mar 1, 2023 • 90.7k • 48 • 4

  • Dahoas/1B_hh_sft_ppo_comparison

    Viewer • Updated Jan 26, 2023 • 100 • 21

  • abacusai/MetaMath_DPO_FewShot

    Viewer • Updated Feb 26, 2024 • 395k • 114 • 26

  • abacusai/HellaSwag_DPO_FewShot

    Viewer • Updated Feb 26, 2024 • 150k • 43 • 8
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs