Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
comarproject 's Collections
tts-spaces
tts-papers
video-generation

tts-papers

updated Jun 6, 2024

a collection of text to speech papers.

Upvote
1

  • BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

    Paper • 2402.08093 • Published Feb 12, 2024 • 62

  • E3 TTS: Easy End-to-End Diffusion-based Text to Speech

    Paper • 2311.00945 • Published Nov 2, 2023 • 16

  • Matcha-TTS: A fast TTS architecture with conditional flow matching

    Paper • 2309.03199 • Published Sep 6, 2023 • 12

  • Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

    Paper • 1712.05884 • Published Dec 16, 2017 • 3

  • YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

    Paper • 2112.02418 • Published Dec 4, 2021 • 2

  • ASR data augmentation using cross-lingual multi-speaker TTS and cross-lingual voice conversion

    Paper • 2204.00618 • Published Mar 29, 2022

  • One TTS Alignment To Rule Them All

    Paper • 2108.10447 • Published Aug 23, 2021
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs