Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kaist-ai 's Collections
The CoT Collection
The Feedback Collection
The Perception Collection
LangBridge
ORPO
System Message Generalization

ORPO

updated Apr 12, 2024

This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model".

Upvote
11

  • kaist-ai/mistral-orpo-beta

    Text Generation • Updated Mar 17, 2024 • 16 • 37

  • kaist-ai/mistral-orpo-alpha

    Text Generation • Updated Mar 17, 2024 • 11 • 8

  • ORPO: Monolithic Preference Optimization without Reference Model

    Paper • 2403.07691 • Published Mar 12, 2024 • 65

  • kaist-ai/mistral-orpo-capybara-7k

    Text Generation • Updated Mar 23, 2024 • 1.31k • 26

  • HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1

    Text Generation • Updated Apr 18, 2024 • 95 • 267
Upvote
11
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs