Prompt time (ollama) on 22c Xeon, 5070 Ti, 128GB RAM (Q6_K_L)
#12 opened 4 days ago by MikeZeroTango
Template bug fixed in llama.cpp
#11 opened 11 days ago by matteogeniaccio
vllm deployment error
#10 opened 12 days ago by Saicy
Higher than usual refusal rate with Q6_K_L quant GGUF
#9 opened 13 days ago by smcleod
Tool use?
#8 opened 14 days ago by johnpyp
llama.cpp fixes have just been merged
#5 opened 16 days ago by Mushoz
LM Studio: unknown model architecture: 'glm4'?
#4 opened 20 days ago by DrNicefellow
Please regenerate GGUFs
#3 opened 24 days ago by jacek2024
Broken results
#2 opened 24 days ago by RamoreRemora
YaRN quantization for long context
#1 opened 24 days ago by sovetboga