Curated list of open-source speech-to-text and voice typing tools for Linux, macOS, Windows, Android, and iOS. Offline, local, and cloud.
-
Updated
May 17, 2026
Curated list of open-source speech-to-text and voice typing tools for Linux, macOS, Windows, Android, and iOS. Offline, local, and cloud.
An Android app that automatically generates subtitles for videos locally, without needing an internet connection.
🎙️ Offline audio transcription with Whisper.
Automatic video translator and dubber using Whisper, XTTS v2 for voice cloning, and Ollama for local LLM translation. Supports 100+ languages.
A Flask API to convert speech to text using Offline Transcription methods - CMU Sphinx and DeepSpeech.
中文 vosk-android-demo
Offline Speech Recognition For Android Library
ROBOKIDS is a smart educational robot for kids, that connected with educational app that uses technology to make learning fun for kids. Its features like AI and deep learning, has levels for basic concepts, and has parental controls for safety and progress monitoring.
"An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind maps — saving hours of time and boosting productivity."
Voice Assistant using Whisper in python3
Offline speech recognition for roboy
Local voice-to-text for macOS and iOS. Multilingual (EN/ZH/JP) with Traditional Chinese output. Runs Qwen3-ASR on Apple Silicon via MLX. No cloud, no subscription.
Use Vosk speech recognition toolkit to transcribe real-time audio from your microphone.
Provide a curated list of open-source speech-to-text tools for voice typing and dictation on desktop, mobile, and command line interfaces.
Control your PC using the fastest speech recognition in the world.
A Capacitor plugin that provides offline speech-to-text functionality for Android and iOS platforms. The plugin offers true offline recognition for Android with multiple languages, while iOS provides offline support for English with online fallback for other languages.
ESP32-S3 offline voice command frontend with wake-word detection, LVGL status UI, and ESP-NOW command forwarding.
efronic-voice-assistant is a voice-controlled assistant platform which runs on a raspberry pi
A Python-based offline voice assistant leveraging Vosk and Pyttsx3 to provide accessible emergency support, voice commands, and reminders for elderly users.
Unofficial local voice playback helper for Windows media control with INZONE Buds
Add a description, image, and links to the offline-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the offline-speech-recognition topic, visit your repo's landing page and select "manage topics."