Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
merterbak 's Collections
Qwen 3
LLM's
Papers

Papers

updated Mar 20
Upvote
-

  • Attention Is All You Need

    Paper • 1706.03762 • Published Jun 12, 2017 • 61

  • LoRA Learns Less and Forgets Less

    Paper • 2405.09673 • Published May 15, 2024 • 89

  • DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

    Paper • 2401.02954 • Published Jan 5, 2024 • 48

  • RAFT: Adapting Language Model to Domain Specific RAG

    Paper • 2403.10131 • Published Mar 15, 2024 • 73

  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Paper • 2501.12948 • Published Jan 22 • 391

  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

    Paper • 1810.04805 • Published Oct 11, 2018 • 18
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs