Qwen
/

QwQ-32B-AWQ

Text Generation

4-bit precision

Model card Files Files and versions

Resources

View closed (0)

How to make thinking take less time

#7 opened about 1 month ago by

Any one get the issues of <think> tag not showing?

#6 opened about 1 month ago by

GPTQ quants

#5 opened about 2 months ago by

有没有在3090上部署这个awq版本的，速度只有6tokens/s，正常吗

#4 opened about 2 months ago by

AWQ Quant Settings?

#3 opened 2 months ago by

Performance loss of AWQ compared to the original model

#2 opened 2 months ago by

disable thinking for some requests

#1 opened 2 months ago by