A FastAPI-based service for real-time speech-to-text using faster-whisper and WebRTC VAD.
stt_preview.mov
# Install system requirements
sudo apt install portaudio19-dev
# Install python dependencies
python3 src/setup.py
source src/stt-venv/bin/activateStart the service:
cd src/
python app.pyPython example:
import requests
with open("audio.wav", "rb") as f:
response = requests.post(
"http://localhost:8002/transcribe",
files={"file": f}
)
print(response.json()["text"])