TTS - a datnt114 Collection

datnt114 's Collections

TTS

3D

LLM

video

data-SD

TTS

updated Feb 14, 2024

Large-Scale Automatic Audiobook Creation

Paper • 2309.03926 • Published Sep 7, 2023 • 54
Runtime error

114

114

Pop2Piano Demo

🎹

Convert pop audio to piano cover
khanhld/wav2vec2-base-vietnamese-160h

Automatic Speech Recognition • Updated Nov 22, 2024 • 212 • 10
4K4D: Real-Time 4D View Synthesis at 4K Resolution

Paper • 2310.11448 • Published Oct 17, 2023 • 40
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Paper • 2402.08093 • Published Feb 12, 2024 • 62