Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yiwei Guo's picture
1 2 2

Yiwei Guo

cantabile-kwok
Gatozu35's profile picture
ยท
  • cantabile-kwok

AI & ML interests

Text to Speech

Recent Activity

upvoted a paper about 2 months ago
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
updated a Space 6 months ago
cantabile-kwok/vec2wav2.0-demo
updated a dataset 6 months ago
cantabile-kwok/libritts-all-kaldi-data
View all activity

Organizations

Shanghai Jiao Tong University's profile picture SJTU Cross Media Language Intelligence Lab's profile picture

spaces 1

Running
3

Vec2wav2.0 Demo

๐Ÿƒ

vec2wav 2.0, a speech token vocoder for VC. Arxiv 2409.01995

Nov 12, 2024

models 3

cantabile-kwok/vec2wav2.0

Updated Oct 26, 2024 โ€ข 2

cantabile-kwok/hifigan-libritts-800-200

Updated Oct 8, 2023

cantabile-kwok/hifigan-ljspeech-1024-256

Updated Oct 8, 2023

datasets 2

cantabile-kwok/libritts-all-kaldi-data

Updated Nov 6, 2024 โ€ข 299

cantabile-kwok/ljspeech-1024-256-dur

Updated Oct 8, 2023 โ€ข 15
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs