datnt114 's Collections

TTS

updated Feb 14, 2024

  • Large-Scale Automatic Audiobook Creation

    Paper • 2309.03926 • Published Sep 7, 2023 • 54

  • Pop2Piano Demo 🎹

    Space • Runtime error • 114

    Convert pop audio to piano cover


  • khanhld/wav2vec2-base-vietnamese-160h

    Automatic Speech Recognition • Updated Nov 22, 2024 • 212 • 10

  • 4K4D: Real-Time 4D View Synthesis at 4K Resolution

    Paper • 2310.11448 • Published Oct 17, 2023 • 40

  • BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

    Paper • 2402.08093 • Published Feb 12, 2024 • 62