Spaces:
Running
on
Zero
Running
on
Zero
update README for real-time speech-to-text application and remove spaces.GPU decorator from load_model function
4efbce4
title: Real-time Speech-to-Text | |
emoji: ποΈ | |
colorFrom: indigo | |
colorTo: gray | |
sdk: gradio | |
sdk_version: 5.29.0 | |
app_file: app.py | |
pinned: false | |
# Real-time Speech-to-Text with NeMo | |
This is a real-time speech-to-text transcription application powered by NVIDIA NeMo and the parakeet-tdt-0.6b-v2 model. | |
## Features | |
- ποΈ Web-based microphone input | |
- β‘ Real-time transcription displayed in the browser | |
- π§ Fast inference with NeMo pre-trained model | |
- π οΈ Easy to use, no installations required | |
## Tech Stack | |
- Python | |
- Gradio | |
- NVIDIA NeMo Toolkit for ASR | |
## How to Use | |
1. Click the microphone button to start recording | |
2. Speak clearly into your microphone | |
3. The transcription will appear in real-time | |
4. Click 'Clear Transcript' to start a new transcription | |
## Note | |
This application requires access to your microphone to function. The audio is processed in real-time and is not stored. | |