tts-papers - a comarproject Collection

comarproject 's Collections

video-generation

tts-papers

updated Jun 6, 2024

a collection of text to speech papers.

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12, 2024 • 62
E3 TTS: Easy End-to-End Diffusion-based Text to Speech

Paper • 2311.00945 • Published Nov 2, 2023 • 16
Matcha-TTS: A fast TTS architecture with conditional flow matching

Paper • 2309.03199 • Published Sep 6, 2023 • 12
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

Paper • 1712.05884 • Published Dec 16, 2017 • 3
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Paper • 2112.02418 • Published Dec 4, 2021 • 2
ASR data augmentation using cross-lingual multi-speaker TTS and cross-lingual voice conversion

Paper • 2204.00618 • Published Mar 29, 2022
One TTS Alignment To Rule Them All

Paper • 2108.10447 • Published Aug 23, 2021