Spaces:

Jerich
/

TalklasApp

Paused

TalklasApp / README.md

Updated title to Talklas API. Added description of /translate-audio endpoint with speech detection feature. Included Features section (speech detection, supported models) and Supported Languages section.

8157595 verified about 1 month ago

preview code

raw

history blame contribute delete

1.4 kB

	---
	title: Talklas API
	emoji: 🚀
	colorFrom: blue
	colorTo: green
	sdk: docker
	app_port: 8000
	health_check_path: /health
	---

	# Talklas API

	This is a FastAPI app deployed on Hugging Face Spaces for audio transcription, translation, and text-to-speech (TTS). It includes the following endpoints:

	- `/`: Returns a simple health check response.
	- `/health`: Health check endpoint for Hugging Face Spaces.
	- `/update-languages`: Updates the source and target languages for STT and TTS models.
	- `/translate-text`: Translates text and converts it to speech.
	- `/translate-audio`: Transcribes audio, translates the text, and converts the translated text to speech. Includes speech detection to handle silent audio gracefully.

	## Features

	- Speech Detection: The `/translate-audio` endpoint detects if the audio is silent (no speech) and returns a user-friendly response.
	- Transcription (STT): Uses MMS or Whisper models to transcribe audio.
	- Translation (MT): Uses the NLLB-200 model to translate text between supported languages.
	- Text-to-Speech (TTS): Uses MMS-TTS models to convert translated text to speech.

	## Supported Languages

	- English
	- Tagalog
	- Cebuano
	- Ilocano
	- Waray
	- Pangasinan

	## Deployment

	This app uses a `Dockerfile` to deploy a FastAPI app with Uvicorn. The health check path is set to `/health` to ensure Hugging Face Spaces can verify the app is running.