lewtun's Collections

Gemma RLAIF

updated Mar 1, 2024

  • lewtun/gemma-7b-sft-full-ultrachat-v0

    Text Generation • Updated Feb 29, 2024 • 5 • 1

  • lewtun/gemma-7b-sft-full-dolly-v3

    Text Generation • Updated Feb 29, 2024 • 5

  • lewtun/gemma-7b-sft-full-deita-10k-v0

    Text Generation • Updated Feb 29, 2024 • 3

  • lewtun/gemma-7b-dpo-full-ultrafeedback-v0

    Text Generation • Updated Feb 29, 2024 • 11

  • lewtun/gemma-7b-dpo-full-orca-v0

    Text Generation • Updated Feb 29, 2024 • 3

  • lewtun/gemma-7b-dpo-full-mix1-beta-0.1

    Text Generation • Updated Feb 29, 2024 • 3

  • lewtun/gemma-7b-dpo-full-mix1-beta-0.1-epoch-3

    Text Generation • Updated Feb 29, 2024 • 4

  • lewtun/gemma-7b-dpo-full-mix1-beta-0.05

    Text Generation • Updated Feb 29, 2024 • 11

  • lewtun/gemma-7b-dpo-full-mix1-beta-0.4-epoch-3

    Text Generation • Updated Feb 29, 2024 • 4