Hugging Face
Models: 20,116
Sort: Trending
Active filters: llama-cpp
aayushg159/Phi-3-medium-4k-instruct-Q4_K_M-GGUF • Text Generation • Updated May 22, 2024 • 2
farpluto/Phi-3-medium-4k-instruct-Q4_K_S-GGUF • Text Generation • Updated May 22, 2024
ivankris/gemma-2b-Q4_K_M-GGUF • Updated May 22, 2024
AlirezaF138/Llama-3-Persian-8B-LoRA-Q6_K-GGUF • Updated May 22, 2024 • 26 • 5
AlirezaF138/persian_llama_7B_merged-Q6_K-GGUF • Text Generation • Updated May 22, 2024 • 5 • 1
jeiku/Aura_3B-Q4_K_M-GGUF • Updated May 22, 2024 • 33
Ken0751/Meta-Llama-3-8B-Q4_K_M-GGUF • Text Generation • Updated May 22, 2024
Salekeen/Phi-3-mini-128k-instruct-Q4_K_M-GGUF • Text Generation • Updated May 22, 2024 • 2
ybelkada/tiny-random-llama-Q4_K_M-GGUF • Updated May 22, 2024 • 13
AlirezaF138/AVA-Qwen1.5-7B-Chat-Q6_K-GGUF • Updated May 22, 2024 • 5
Bendy121165/gpt2-Q4_K_M-GGUF • Updated May 22, 2024 • 1
reach-vb/TinyLlama-1.1B-Chat-v0.5-Q2_K-GGUF • Updated May 22, 2024 • 17
linh5nb/Llama-2-7b-chat-luat-hon-nhan-1-Q4_K_M-GGUF • Updated May 22, 2024
GeorgeBredis/Phi-3-mini-128k-instruct-Q4_K_M-GGUF • Text Generation • Updated May 22, 2024 • 1
moshecohentheking/Hebrew-Mistral-7B-Q4_K_M-GGUF • Updated May 22, 2024
FilippoToso/Mistral-RAG-Q8_0-GGUF • Updated May 22, 2024 • 1
SixOpen/Phi-3-mini-4k-instruct-IQ4_NL-imat.gguf • Text Generation • Updated May 22, 2024 • 7
aayushg159/Phi-3-mini-128k-instruct-Q4_K_M-GGUF • Text Generation • Updated May 22, 2024 • 2
JeffreyLind/Meta-Llama-3-8B-Q4_K_M-GGUF • Text Generation • Updated May 22, 2024
VlSav/saiga_llama3_8b_v7-Q6_K-GGUF • Updated Jul 9, 2024 • 23 • 1
NNet/saiga_llama3_8b-Q6_K-GGUF • Updated May 22, 2024 • 5
NikolayKozloff/Awanllm-Llama-3-8B-Cumulus-v0.3.2-Q4_0-GGUF • Updated May 22, 2024 • 3 • 1
NikolayKozloff/Awanllm-Llama-3-8B-Cumulus-v0.3.2-Q5_0-GGUF • Updated May 22, 2024 • 1
Earthkwake/Phi-3-mini-4k-instruct-Q4_0-GGUF • Text Generation • Updated May 22, 2024 • 224
mmacros/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF • Text Generation • Updated May 23, 2024
EZPK/guillaumetell-7b-Q4_K_M-GGUF • Text Generation • Updated May 23, 2024 • 1
KingsonHO/Meta-Llama-3-8B-Instruct-Q5_0-GGUF • Text Generation • Updated May 23, 2024 • 1
nadeem1362/mxbai-embed-large-v1-Q4_K_M-GGUF • Feature Extraction • Updated May 23, 2024 • 13
YorkieOH10/Cream-Phi-3-14B-v1-Q8_0-GGUF • Updated May 23, 2024 • 1
ArturoVitale/gpt2-Q2_K-GGUF • Updated May 23, 2024 • 1