New converted model files

by whitphx HF Staff - opened 4 days ago

base: refs/heads/main

←

from: refs/pr/3

Files changed: +2024 -169

whitphx

4 days ago

No description provided.

Convert model files with 'python -m scripts.convert --model-id facebook/nllb-200-distilled-600M --quantize' db84eb75

Xenova

Owner 4 days ago

Strange, the weights are much larger in these new conversions. This points to an issue with the weight deduplication step, most likely. Can you confirm which versions of optimum, onnxslim, onnx, and transformers you are using?
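For anyone answering the version question above, here is a minimal sketch that reports the installed versions of the four packages mentioned. It uses only the standard library; the helper name `installed_versions` is made up for illustration and is not part of any of these libraries.

```python
from importlib.metadata import PackageNotFoundError, version

# Packages named in the comment above.
PACKAGES = ("optimum", "onnxslim", "onnx", "transformers")


def installed_versions(packages=PACKAGES):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The package is not installed in this environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions().items():
        print(f"{name}=={ver}" if ver else f"{name}: not installed")
```

Run it inside the same environment used for the conversion, so the reported versions match the ones that produced the files.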

Xenova

Owner 4 days ago

I am able to reproduce with the latest versions; can you open an issue in https://github.com/huggingface/optimum?

whitphx

3 days ago

Thanks! Created an issue: https://github.com/huggingface/optimum/issues/2241

whitphx

3 days ago

I'll close this PR for now and open a new one that contains only the newly added quantized versions, converted properly.

whitphx changed pull request status to closed 3 days ago
