I notice that the inference.py script does not provide native support for running this model with tensor parallelism. Are there any plans to update the script to allow for this?
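For context, by tensor parallelism I mean sharding each weight matrix across GPUs so that every device computes only a slice of each layer's output. A minimal NumPy sketch of the idea, column-parallel splitting of a single linear layer (illustrative names only, not taken from inference.py):

```python
import numpy as np

# Conceptual sketch of tensor (column) parallelism for one linear layer:
# the weight matrix is split column-wise across "devices"; each shard
# computes a partial output, and the partial outputs are concatenated.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))   # a batch of activations
W = rng.standard_normal((8, 16))  # the full weight matrix

# Split W into two column shards, as two GPUs would each hold one.
W0, W1 = np.split(W, 2, axis=1)

# Each shard computes its slice of the output independently.
y0 = x @ W0
y1 = x @ W1

# Concatenating the partial outputs reproduces the full result.
y_parallel = np.concatenate([y0, y1], axis=1)
assert np.allclose(y_parallel, x @ W)
```

In a real multi-GPU setup the concatenation (or, for row-parallel layers, a sum) is a collective communication step; having that wired into the script is what I am asking about.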