Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
jshin49 's Collections
multi-lingual llms
pre-training
mixture-of-experts
alignment-learning
llm-as-a-judge
multi-modal
synthetic-data

pre-training

updated Apr 19, 2024
Upvote
-

  • Pre-training Small Base LMs with Fewer Tokens

    Paper • 2404.08634 • Published Apr 12, 2024 • 36

  • Ziya2: Data-centric Learning is All LLMs Need

    Paper • 2311.03301 • Published Nov 6, 2023 • 20

  • How to Train Data-Efficient LLMs

    Paper • 2402.09668 • Published Feb 15, 2024 • 43

  • MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

    Paper • 2404.06395 • Published Apr 9, 2024 • 23
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs