Use unique model_type "chatts" instead of reusing "qwen2"
#8
by alexanderchemeris - opened
This change is required to correctly auto-load the model configuration class and link it to the implementation in the vLLM library, as implemented in https://github.com/vllm-project/vllm/pull/16852. The auto-loading is based on model_type maps and requires a unique name to resolve the specific classes. With this change and the vLLM change mentioned above, the model can be loaded by vLLM without any code changes, for both offline and online (OpenAI API) inference.
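For illustration, a minimal sketch of what this enables once the linked vLLM PR is merged; the repository id and prompt below are placeholders, not part of this change:

```python
# Minimal sketch: loading the model with vLLM after this change, assuming
# vllm-project/vllm#16852 is merged. Repo id and prompt are placeholders.
from vllm import LLM, SamplingParams

# With a unique model_type ("chatts") in config.json, vLLM can auto-load
# the matching configuration class and model implementation; no code
# changes are needed on the user side.
llm = LLM(model="bytedance-research/ChatTS-14B", trust_remote_code=True)

outputs = llm.generate(
    ["Describe the overall trend of the attached time series."],
    SamplingParams(max_tokens=128),
)
print(outputs[0].outputs[0].text)

# Online (OpenAI-compatible) serving works the same way:
#   vllm serve bytedance-research/ChatTS-14B --trust-remote-code
```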
alexanderchemeris changed pull request status to open
Thanks for implementing this and ChatTS support in vLLM!
xiezhe24 changed pull request status to merged