vllm error
#3 opened 3 days ago
by
chu88
Can you create an 8bit version?
#2 opened 19 days ago
by
MilesQLi
test the inference script and quantized model, but have error as below
6
#1 opened about 1 month ago
by
wikeeyang
