Hugging Face
Models: 140
Sort: Trending
Active filters: llama.cpp
google/gemma-1.1-7b-it-GGUF • Updated Jun 27, 2024 • 4 • 20
google/gemma-1.1-2b-it-GGUF • Updated Jun 27, 2024 • 1 • 20
pacozaa/bonito-gguf • Updated Apr 14, 2024 • 10
pmking27/PrathameshLLM-2B-GGUF • Updated Apr 9, 2024 • 6.45k • 1
teleprint-me/cyberpunk-valerie-v0.1 • Text Generation • Updated Apr 18, 2024 • 41 • 1
qwp4w3hyb/Meta-Llama-3-8B-Instruct-iMat-GGUF • Text Generation • Updated Apr 29, 2024 • 700 • 6
mgonzs13/Mistroll-7B-v2.2-GGUF • Text Generation • Updated Apr 29, 2024 • 27
mgonzs13/ladybird-base-7B-v8-GGUF • Text Generation • Updated Apr 29, 2024 • 36
google/codegemma-1.1-2b-GGUF • Text Generation • Updated Jun 27, 2024 • 7
google/codegemma-1.1-7b-it-GGUF • Text Generation • Updated Jun 27, 2024 • 3 • 14
mgonzs13/TextBase-7B-v0.1-GGUF • Text Generation • Updated Jun 11, 2024 • 101
QuantFactory/TextBase-7B-v0.1-GGUF • Text Generation • Updated Jun 18, 2024 • 95
njwright92/ComicBot_v.2-gguf • Text Generation • Updated Aug 30, 2024 • 70
Irathernotsay/qwen2-1.5B-medical_qa-Finetune • Text Generation • Updated Jul 17, 2024 • 3
palusi/Qwen2-0.5B-Instruct-GGUF • Updated Jun 27, 2024 • 53
XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k • Text Generation • Updated Jul 9, 2024 • 14
ruslanmv/Medical-Llama3-v2-Q4_K_M-GGUF • Updated Jun 30, 2024 • 3
XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GGUF • Text Generation • Updated Jul 9, 2024 • 23
XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-GPTQ • Text Generation • Updated Jul 9, 2024 • 2
zhhan/Phi-3-mini-4k-instruct_gguf_derived • Summarization • Updated Jul 2, 2024 • 39
XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k-AWQ • Text Generation • Updated Jul 9, 2024
mgonzs13/stablelm-zephyr-3B-localmentor-GGUF • Text Generation • Updated Jul 3, 2024 • 136
chatpdflocal/llama3.1-8b-gguf • Updated Dec 27, 2024 • 312 • 26
akshathmangudi/llama3.1-8b-gguf • Updated Jul 26, 2024
jhilburn/gemma-inference • Text Generation • Updated Aug 7, 2024
ghost-x/ghost-8b-beta-1608-gguf • Text Generation • Updated Aug 26, 2024 • 107 • 6
PaulJusst/codegemma-7b-it-GGUF • Text Generation • Updated Sep 13, 2024
TheCluster/Llama-3.2-3B-Instruct-GGUF • Text Generation • Updated Sep 25, 2024 • 13
v000000/Typhon-Mixtral-v1-imatrix-v2.Q6_K-GGUF • Updated Sep 26, 2024 • 7 • 1
LPN64/LongCite-llama3.1-8b-GGUF • Text Generation • Updated Oct 1, 2024 • 193 • 6