Cross-platform local voice typing and meeting transcription for macOS and Linux.
-
Updated
Jun 29, 2026 - Python
Cross-platform local voice typing and meeting transcription for macOS and Linux.
Fast STT (Speach-to-Text) Web UI with mlx-whisper
Speech2Text
Unlimited Music with Magenta Realtime and MLX with Arduino ESP32 WebSocket Transport
Local, real-time system-audio captioner + translator for macOS — Whisper + a local LLM + a glass overlay. Fully offline.
把抖音视频链接变成可读的 Markdown 文字稿。in 抖音 URL → out 分段格式化 .md 文件
Cross-platform CLI to download & transcribe podcasts locally — Apple Podcasts, Xiaoyuzhou, RSS feeds with built-in Whisper speech-to-text (Metal/CUDA/CPU)
Meeting Transcriber for Desktop using Whisper
A flexible speech recognition toolkit supporting multiple backends (Whisper, Faster-Whisper, WhisperX, SpeechRecognition, Vosk) with CLI and Gradio web interface.
视频转文字工具 - 支持语音转录、OCR识别、AI总结
Claude Code Skill for video/audio transcription with MLX Whisper on Apple Silicon. Produces accurate Traditional Chinese (Taiwan) transcripts + structured summaries with 8 scene templates. Free, local, no API costs.
Claude Code skill that turns a podcast URL into structured deep-reading HTML: local mlx-whisper transcription, shownotes-aligned chapters, optional product analysis.
A simple CLI script that transcribes your voice memo to a txt file with timestamp
Point it at a video, image, or PDF — get structured JSON. uvx vidlizer[mcp]. Runs local (Ollama/gemma4, LM Studio, oMLX) or cloud (OpenRouter). CLI + MCP server for Claude Code, Cursor, and Claude Desktop.
Automatically extract VTuber personality from Bilibili recordings.
a tool to download, transcribe and perform semantic/keyword searches on audio files, all locally
End-to-end Xiaoyuzhou FM podcast transcription & AI summarization — mlx-whisper + DeepSeek, native on Apple Silicon.
Real-time audio transcription monitoring system with AI-powered note generation using DeepSeek API
Privacy-first voice-to-text for macOS — local STT via mlx-whisper with app-aware formatting, AI post-processing, and push-to-talk dictation
Import Google Meet and Lark/Feishu meeting transcripts into Obsidian with MLX Whisper and OpenAI transcription fallbacks.
Add a description, image, and links to the mlx-whisper topic page so that developers can more easily learn about it.
To associate your repository with the mlx-whisper topic, visit your repo's landing page and select "manage topics."