Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lihaocruiser 's Collections
LLM-Reasoning
LLM-Prompting
LLM-RL
LLM-Emergency
LLM-Extraction
LLM-Legal
LLM-RAG
LLM-Pretrain
LLM-Instruct
LLM-Evaluation
LLM-Safety
LLM-Length
LLM-Agent
LLM-Dialog
LLM-SyntheticData
LLM-recomendation
LLM-Hallucination
LLM-Summary
Preprocessing
LLM-fact
Embedding

LLM-Pretrain

updated Oct 8, 2024
Upvote
-

  • Data Selection for Language Models via Importance Resampling

    Paper • 2302.03169 • Published Feb 6, 2023

  • Scaling Data-Constrained Language Models

    Paper • 2305.16264 • Published May 25, 2023 • 17

  • Challenges with unsupervised LLM knowledge discovery

    Paper • 2312.10029 • Published Dec 15, 2023 • 10

  • How Do Large Language Models Acquire Factual Knowledge During Pretraining?

    Paper • 2406.11813 • Published Jun 17, 2024 • 32
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs