Jack
qwertyjack
AI & ML interests
None yet
Organizations
qwertyjack's activity
issue of "think too much" ,how to?(chinese)
2
#14 opened 2 months ago
by
fenglui
Where did the BF16 come from?
8
#10 opened 3 months ago
by
gshpychka
New research paper, R1 type reasoning models can be drastically improved in quality
2
#19 opened 3 months ago
by
krustik
Please make V3-lite
46
4
#12 opened 4 months ago
by
rombodawg

感觉新版的Mistrial-LargeV3的GPTQ量化的int4版本对显存的需求大大提升了
2
#1 opened 5 months ago
by
YanchengQian

How to run the model OpenGVLab/InternVL2-40B-AWQ with vllm docker image?
2
#2 opened 9 months ago
by
andryevinnik
请教一下,cogvlm和glm4v的区别是什么呢
8
3
#1 opened 11 months ago
by
rangehow
Having trouble loading this with transformers
3
5
#8 opened about 1 year ago
by
codelion

GPTQ plz
10
#3 opened about 1 year ago
by
Parkerlambert123
Would you plan to optimize ChatGLM2-6B? and when?
4
#47 opened almost 2 years ago
by
Zuyuan