🧬 RegMix: Data Mixture as Regression

Updated Jul 26, 2024

Automatic data mixture method for large language model pre-training


  • RegMix

    Space • Running • 6

    Generate regression predictions from CSV data


  • RegMix: Data Mixture as Regression for Language Model Pre-training

    Paper • 2407.01492 • Published Jul 1, 2024 • 39

  • sail/data-mixture-human-1b

    Text Generation • Updated Jul 11, 2024 • 23 • 3

  • sail/data-mixture-pile-cc-1b

    Text Generation • Updated Jul 11, 2024 • 8 • 3

  • sail/data-mixture-regmix-1b

    Text Generation • Updated Jul 11, 2024 • 5 • 2

  • sail/data-mixture-doremi-1b

    Text Generation • Updated Jul 11, 2024 • 4 • 2

  • sail/data-mixture-random-1b

    Text Generation • Updated Jul 11, 2024 • 3 • 4

  • sail/regmix-data-sample

    Viewer • Updated Jul 11, 2024 • 698k • 162 • 2

  • sail/regmix-data

    Viewer • Updated Sep 12, 2024 • 13.7M • 24.4k • 4