moonshotai/Kimi-Audio-7B-Instruct

Text-to-Speech
Safetensors
English
Chinese
audio
audio-language-model
speech-recognition
audio-understanding
audio-generation
chat
kimi-audio
custom_code
Request: DOI

#14 opened 1 day ago by
huseyinyolcu

Update library tag for better download tracking and code snippets!

#13 opened 3 days ago by
Steveeeeeeen

supported languages?

#12 opened 3 days ago by
nononameneeded2001

About the weight files of the Whisper Encoder

#11 opened 9 days ago by
codecho

How can I fine-tune this for Farsi?

#10 opened 10 days ago by
uncleMehrzad

Cannot Run Model in Hugging Face Spaces: AutoProcessor/Processor Not Found

#9 opened 10 days ago by
ranagame

Will there be support for the Russian language?

#8 opened 11 days ago by
fduches2

A video on how to set up this in a Colab notebook

1
#7 opened 12 days ago by
ritheshSree

Vocoder Architecture?

#6 opened 12 days ago by
yukiarimo

Base model?

1
#4 opened 12 days ago by
deltanym

Issue with long audio (~1 min) output, or prompt instruct following

2
#2 opened 13 days ago by
JosephusCheung

Update correct task tag

#1 opened 13 days ago by
reach-vb