GGUF Quantised Models for Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged

This repository contains GGUF format model files for lewiswatson/Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged-GGUF, quantised.

Original Model

The original fine-tuned model used to generate these quantisations can be found here: lewiswatson/Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged

Provided Files (GGUF)

File Size
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.IQ4_XS.gguf 1.63 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q2_K.gguf 1.19 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q3_K_L.gguf 1.59 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q3_K_M.gguf 1.48 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q3_K_S.gguf 1.35 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q4_K_M.gguf 1.80 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q4_K_S.gguf 1.71 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q5_K_M.gguf 2.07 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q5_K_S.gguf 2.02 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q6_K.gguf 2.36 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.Q8_0.gguf 3.06 GB
Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged.f16.gguf 5.75 GB

This repository was automatically created using a script on 2025-04-14.

Downloads last month
288
GGUF
Model size
3.09B params
Architecture
qwen2
Hardware compatibility
Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for lewiswatson/Qwen2.5-3B-Instruct_Johnny_Silverhand_Merged-GGUF