Parakeet-TDT-0.6b-V2
Transcribe audio to text with timestamps
Large Avatar Model for One-shot Animatable Gaussian Head
Generate animated portraits from images and audio
Generate text and speech responses from text, images, or audio input
High-fidelity 3D Geometry Generation from images