UCLA-AGI's Collections

SPPO

updated Jun 29, 2024

Self-Play Preference Optimization

  • UCLA-AGI/Mistral7B-PairRM-SPPO

    Text Generation • Updated May 7, 2024 • 1.32k • 6

  • UCLA-AGI/Mistral7B-PairRM-SPPO-Iter1

    Text Generation • Updated May 6, 2024 • 1.32k • 2

  • UCLA-AGI/Mistral7B-PairRM-SPPO-Iter2

    Text Generation • Updated May 6, 2024 • 1.32k • 1

  • UCLA-AGI/Mistral7B-PairRM-SPPO-Iter3

    Text Generation • Updated May 7, 2024 • 1.32k • 5

  • UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter1

    Text Generation • Updated Jun 25, 2024 • 1.33k • 1

  • UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter2

    Text Generation • Updated Jun 25, 2024 • 1.33k

  • UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3

    Text Generation • Updated Jun 28, 2024 • 1.4k • 82

  • UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3

    Text Generation • Updated Jul 1, 2024 • 7.02k • 124

  • UCLA-AGI/Gemma-2-9B-It-SPPO-Iter2

    Text Generation • Updated Jul 1, 2024 • 10.5k • 4

  • UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1

    Text Generation • Updated Jul 1, 2024 • 4.02k • 4