A Step Towards Music Generation Foundation Model
Generate descriptions from images using masks
Generate images from textual prompts
Blind vote on HF TTS models!
Transcribe audio to text in multiple languages
Liquid demo app