Spaces:

GavinHuang
/

asr-demo

Running on Zero

File size: 915 Bytes

ecd8fee
4efbce4
 
ecd8fee
 
 
 
 
 
 
 
4efbce4

---
title: Real-time Speech-to-Text
emoji: 🎙️
colorFrom: indigo
colorTo: gray
sdk: gradio
sdk_version: 5.29.0
app_file: app.py
pinned: false
---

# Real-time Speech-to-Text with NeMo

This is a real-time speech-to-text transcription application powered by NVIDIA NeMo and the parakeet-tdt-0.6b-v2 model.

## Features

- 🎙️ Web-based microphone input
- ⚡ Real-time transcription displayed in the browser
- 🧠 Fast inference with NeMo pre-trained model
- 🛠️ Easy to use, no installations required

## Tech Stack

- Python
- Gradio
- NVIDIA NeMo Toolkit for ASR

## How to Use

1. Click the microphone button to start recording
2. Speak clearly into your microphone
3. The transcription will appear in real-time
4. Click 'Clear Transcript' to start a new transcription

## Note

This application requires access to your microphone to function. The audio is processed in real-time and is not stored.