Models

20,116

Active filters: llama-cpp

aayushg159/Phi-3-medium-4k-instruct-Q4_K_M-GGUF

Text Generation • Updated May 22, 2024 • 2

farpluto/Phi-3-medium-4k-instruct-Q4_K_S-GGUF

Text Generation • Updated May 22, 2024

ivankris/gemma-2b-Q4_K_M-GGUF

Updated May 22, 2024

AlirezaF138/Llama-3-Persian-8B-LoRA-Q6_K-GGUF

Updated May 22, 2024 • 26 • 5

AlirezaF138/persian_llama_7B_merged-Q6_K-GGUF

Text Generation • Updated May 22, 2024 • 5 • 1

jeiku/Aura_3B-Q4_K_M-GGUF

Updated May 22, 2024 • 33

Ken0751/Meta-Llama-3-8B-Q4_K_M-GGUF

Text Generation • Updated May 22, 2024

Salekeen/Phi-3-mini-128k-instruct-Q4_K_M-GGUF

Text Generation • Updated May 22, 2024 • 2

ybelkada/tiny-random-llama-Q4_K_M-GGUF

Updated May 22, 2024 • 13

AlirezaF138/AVA-Qwen1.5-7B-Chat-Q6_K-GGUF

Updated May 22, 2024 • 5

Bendy121165/gpt2-Q4_K_M-GGUF

Updated May 22, 2024 • 1

reach-vb/TinyLlama-1.1B-Chat-v0.5-Q2_K-GGUF

Updated May 22, 2024 • 17

linh5nb/Llama-2-7b-chat-luat-hon-nhan-1-Q4_K_M-GGUF

Updated May 22, 2024

GeorgeBredis/Phi-3-mini-128k-instruct-Q4_K_M-GGUF

Text Generation • Updated May 22, 2024 • 1

moshecohentheking/Hebrew-Mistral-7B-Q4_K_M-GGUF

Updated May 22, 2024

FilippoToso/Mistral-RAG-Q8_0-GGUF

Updated May 22, 2024 • 1

SixOpen/Phi-3-mini-4k-instruct-IQ4_NL-imat.gguf

Text Generation • Updated May 22, 2024 • 7

aayushg159/Phi-3-mini-128k-instruct-Q4_K_M-GGUF

Text Generation • Updated May 22, 2024 • 2

JeffreyLind/Meta-Llama-3-8B-Q4_K_M-GGUF

Text Generation • Updated May 22, 2024

VlSav/saiga_llama3_8b_v7-Q6_K-GGUF

Updated Jul 9, 2024 • 23 • 1

NNet/saiga_llama3_8b-Q6_K-GGUF

Updated May 22, 2024 • 5

NikolayKozloff/Awanllm-Llama-3-8B-Cumulus-v0.3.2-Q4_0-GGUF

Updated May 22, 2024 • 3 • 1

NikolayKozloff/Awanllm-Llama-3-8B-Cumulus-v0.3.2-Q5_0-GGUF

Updated May 22, 2024 • 1

Earthkwake/Phi-3-mini-4k-instruct-Q4_0-GGUF

Text Generation • Updated May 22, 2024 • 224

mmacros/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF

Text Generation • Updated May 23, 2024

EZPK/guillaumetell-7b-Q4_K_M-GGUF

Text Generation • Updated May 23, 2024 • 1

KingsonHO/Meta-Llama-3-8B-Instruct-Q5_0-GGUF

Text Generation • Updated May 23, 2024 • 1

nadeem1362/mxbai-embed-large-v1-Q4_K_M-GGUF

Feature Extraction • Updated May 23, 2024 • 13

YorkieOH10/Cream-Phi-3-14B-v1-Q8_0-GGUF

Updated May 23, 2024 • 1

ArturoVitale/gpt2-Q2_K-GGUF

Updated May 23, 2024 • 1