PleIAs's Collections
Pleias-RAG
GoldenSwag
Common Artifacts
Common Models
Common Corpus
Toxic Commons
Finance Commons
Bad Data Toolbox
OpenCulture

Common Corpus

updated Nov 13, 2024

The largest multilingual pretraining dataset.


  • PleIAs/common_corpus

    Viewer • Updated Feb 11 • 470M rows • 43k downloads • 258 likes
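
With roughly 470M entries, the dataset listed above is easier to inspect by streaming than by downloading it in full. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and that the repository exposes a `train` split (the split name is an assumption, not stated on this page). The helper name `peek` is hypothetical and exists only for illustration.

```python
from itertools import islice

REPO_ID = "PleIAs/common_corpus"  # dataset ID as listed in this collection


def peek(repo_id: str = REPO_ID, n: int = 3, split: str = "train"):
    """Return the first n records without downloading the whole dataset.

    The default split name "train" is an assumption; adjust it if the
    repository uses a different one.
    """
    # Imported lazily so this module loads even if `datasets` is not installed.
    from datasets import load_dataset

    # streaming=True yields records lazily instead of fetching all files first.
    stream = load_dataset(repo_id, split=split, streaming=True)
    return list(islice(stream, n))


if __name__ == "__main__":
    for record in peek():
        # Column names are not listed on this page, so only show what is present.
        print(sorted(record.keys()))
```

Running the script requires network access to the Hugging Face Hub. Only the first few records are fetched, so it should work without storing the full corpus locally.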