Improve language tag
#5 opened 10 days ago
by
lbourdois

Model is loaded with use_cache=False by default
#4 opened 13 days ago
by
shawnghu
Can this model be run with vllm ?
1
#3 opened 3 months ago
by
just1nseo
Are you considering performing distillation experiments on the qwen70b model?
1
#2 opened 3 months ago
by
lambda1989
Huge thanks and congrats!
2
#1 opened 3 months ago
by
owao