Llama 3.1 GPTQ, AWQ, and BNB Quants

updated Sep 26, 2024

Optimised quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗

  • hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4

    Text Generation • Updated Sep 13, 2024 • 1.2k • 36

  • hugging-quants/Meta-Llama-3.1-405B-Instruct-BNB-NF4

    Text Generation • Updated Sep 16, 2024 • 28 • 5

  • hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4

    Text Generation • Updated Aug 7, 2024 • 176 • 16

  • hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4

    Text Generation • Updated Aug 7, 2024 • 40.7k • 100

  • unsloth/Meta-Llama-3.1-70B-Instruct-bnb-4bit

    Text Generation • Updated Nov 22, 2024 • 5.04k • 31

  • hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4

    Text Generation • Updated Aug 7, 2024 • 4.59k • 23

  • hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4

    Text Generation • Updated Aug 7, 2024 • 357k • 67

  • hugging-quants/Meta-Llama-3.1-8B-Instruct-BNB-NF4

    Text Generation • Updated Aug 8, 2024 • 512 • 8

  • hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4

    Text Generation • Updated Aug 7, 2024 • 143k • 25
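
Since the collection is described as compatible with Transformers, a minimal sketch of loading one of the listed checkpoints might look like the following. The repo IDs are copied from the list above; the `QUANTS` table and the `load_quant` helper are hypothetical names introduced only for illustration. The sketch assumes `transformers` and `accelerate` are installed along with the backend the chosen format needs (for example an AWQ, GPTQ, or bitsandbytes package), and that enough GPU memory is available; none of this is confirmed by the collection page itself.

```python
# Repo IDs copied verbatim from the collection list, keyed by (size, method).
# The table itself is an illustrative convenience, not part of any library.
QUANTS = {
    ("405B", "AWQ"): "hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4",
    ("405B", "BNB"): "hugging-quants/Meta-Llama-3.1-405B-Instruct-BNB-NF4",
    ("405B", "GPTQ"): "hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4",
    ("70B", "AWQ"): "hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4",
    ("70B", "BNB"): "unsloth/Meta-Llama-3.1-70B-Instruct-bnb-4bit",
    ("70B", "GPTQ"): "hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4",
    ("8B", "AWQ"): "hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4",
    ("8B", "BNB"): "hugging-quants/Meta-Llama-3.1-8B-Instruct-BNB-NF4",
    ("8B", "GPTQ"): "hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4",
}


def load_quant(size: str, method: str):
    """Load tokenizer and model for one entry of the collection (hypothetical helper)."""
    # Imported lazily so the lookup table is usable without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    repo = QUANTS[(size, method)]
    tokenizer = AutoTokenizer.from_pretrained(repo)
    # device_map="auto" relies on the accelerate package to place weights on devices.
    model = AutoModelForCausalLM.from_pretrained(repo, device_map="auto")
    return tokenizer, model


if __name__ == "__main__":
    # Assumption: the 8B AWQ checkpoint fits on the local GPU.
    tokenizer, model = load_quant("8B", "AWQ")
    inputs = tokenizer("Quantization is", return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=20)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```

The quantization settings are expected to be read from each repository's own configuration, which is why the sketch passes no explicit quantization arguments; check the individual model cards for the exact requirements of each format.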