Native macOS voice dictation, transcription, context-aware prompting, and auto-paste.
Zerm is a native macOS application for turning speech into clean text quickly. It records from your microphone, transcribes with local and cloud-capable engines, applies optional enhancement prompts, understands the active app or website through Power Mode, and can paste the result directly at your cursor.
Zerm is based on the open-source project VoiceInk by Beingpax. VoiceInk provided the foundation for the native macOS app architecture, dictation workflow, transcription pipeline, Power Mode concept, model management, and many supporting services.
This repository is not trying to hide that lineage. Zerm is a modified GPLv3 derivative adapted for Arcusis branding, product direction, and ongoing development. We keep the GPLv3 license, preserve attribution, and link back to the upstream project so users and contributors can inspect the original work.
Additional attribution details are kept in NOTICE.
- Attribution
- Features
- Install
- Requirements
- Build From Source
- Project Structure
- Privacy
- Contributing
- License
- Fast dictation workflow for recording, transcribing, and inserting text.
- Native macOS app built in Swift and SwiftUI.
- Global shortcuts and push-to-talk style recording.
- Auto-stop and auto-paste so a dictation can finish and insert text with minimal friction.
- Power Mode to adapt prompts based on the active app, website, or workflow.
- AI enhancement prompts for cleanup, rewriting, assistant-style responses, and custom modes.
- Local model support through Whisper and FluidAudio-backed transcription paths.
- Cloud transcription providers for users who explicitly configure external services.
- Personal dictionary with vocabulary and word replacement support.
- History and audio-file transcription for reviewing or processing previous recordings.
- macOS permissions flow for microphone, Accessibility, screen context, and automation-related capabilities.
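As an illustration of the Power Mode idea listed above, here is a minimal sketch of choosing a prompt from the active app or website. The types `PowerModeRule` and `ActiveContext`, the function `selectRule`, and the matching order are hypothetical names and assumptions made for this example; they are not taken from Zerm's source.

```swift
// Hypothetical rule type: a rule matches either a website host or an app bundle ID.
struct PowerModeRule {
    let name: String
    let bundleIdentifier: String?   // matches the frontmost app
    let urlHostSuffix: String?      // matches the active website host
    let prompt: String
}

// Hypothetical snapshot of what the user is currently working in.
struct ActiveContext {
    let bundleIdentifier: String
    let urlHost: String?
}

/// Picks the first matching rule. Website rules are checked before app rules
/// because they are more specific (an assumption of this sketch).
func selectRule(for context: ActiveContext, from rules: [PowerModeRule]) -> PowerModeRule? {
    if let host = context.urlHost {
        let websiteMatch = rules.first { rule in
            guard let suffix = rule.urlHostSuffix else { return false }
            // Match the host itself or any subdomain of it.
            return host == suffix || host.hasSuffix("." + suffix)
        }
        if let websiteMatch = websiteMatch {
            return websiteMatch
        }
    }
    // Fall back to app-level rules only (rules without a website constraint).
    return rules.first { rule in
        rule.urlHostSuffix == nil && rule.bundleIdentifier == context.bundleIdentifier
    }
}

let rules = [
    PowerModeRule(name: "Code review", bundleIdentifier: nil,
                  urlHostSuffix: "github.com", prompt: "Rewrite as a concise review comment."),
    PowerModeRule(name: "Email", bundleIdentifier: "com.apple.mail",
                  urlHostSuffix: nil, prompt: "Rewrite as a polite email."),
]

let browsing = ActiveContext(bundleIdentifier: "com.apple.Safari", urlHost: "gist.github.com")
print(selectRule(for: browsing, from: rules)?.name ?? "default prompt")
```

When no rule matches, the caller would fall back to a default prompt. How Zerm itself orders or stores its rules may differ.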
Download the latest macOS build from the Releases page or the project website.
Zerm is currently focused on macOS.
- macOS 14.4 or later
- Microphone permission
- Accessibility permission for auto-paste and global insertion workflows
- Screen Recording permission only when screen/context-aware features are enabled
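For reference, the macOS version requirement above could be checked at runtime roughly as follows. This is a sketch, not Zerm's actual startup code: `meetsMinimum` and `minimumVersion` are names invented for this example, while `ProcessInfo` and `OperatingSystemVersion` are standard Foundation types.

```swift
import Foundation

// Minimum version taken from the requirements list above.
let minimumVersion = OperatingSystemVersion(majorVersion: 14, minorVersion: 4, patchVersion: 0)

/// Returns true when `current` is at least `minimum`,
/// comparing major, then minor, then patch.
func meetsMinimum(_ current: OperatingSystemVersion, minimum: OperatingSystemVersion) -> Bool {
    let lhs = [current.majorVersion, current.minorVersion, current.patchVersion]
    let rhs = [minimum.majorVersion, minimum.minorVersion, minimum.patchVersion]
    return !lhs.lexicographicallyPrecedes(rhs)
}

let current = ProcessInfo.processInfo.operatingSystemVersion
if meetsMinimum(current, minimum: minimumVersion) {
    print("OS version requirement met")
} else {
    print("This build expects macOS 14.4 or later")
}
```

Foundation also provides `ProcessInfo.processInfo.isOperatingSystemAtLeast(_:)`, which performs the same comparison against the running system. The helper above is written as a pure function only so that it can be exercised with arbitrary versions.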
The actively developed native app lives in native-macos/.
Prerequisites:
- Xcode 16 or newer
- macOS 14.4 or newer
- Swift Package Manager dependencies resolved by Xcode
Build from the command line:
```bash
xcodebuild \
  -project native-macos/Zerm.xcodeproj \
  -scheme Zerm \
  -configuration Debug \
  CODE_SIGNING_ALLOWED=NO \
  build
```

For detailed local build notes, see native-macos/BUILDING.md.
| Path | Purpose |
|---|---|
| native-macos/Zerm/ | Native Swift/SwiftUI macOS app |
| native-macos/Zerm/Transcription/ | Transcription engines, providers, and processing pipeline |
| native-macos/Zerm/PowerMode/ | App, URL, and context-aware prompt selection |
| native-macos/Zerm/Views/ | SwiftUI application interface |
| native-macos/Zerm/Services/ | App services, settings, dictionaries, model management, and integrations |
| docs/ | GitHub Pages website |
| NOTICE | Upstream attribution and derivative-work notes |
| LICENSE | GPLv3 license text |
The older Tauri files remain in the repository for historical continuity. Current development targets the native macOS app.
Zerm is designed to make privacy boundaries explicit.
- Local transcription paths keep audio on the device.
- Cloud transcription and AI providers require user configuration before use.
- History and stored request payloads may contain dictated text, so treat them as sensitive data.
- Power Mode and screen-context features may require macOS permissions so Zerm can understand where the text is being inserted.
See Notebook/Zerm Runtime Privacy Model.md for project notes on runtime privacy behavior.
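The rule that cloud providers require user configuration before use can be pictured as a local-first engine choice. The sketch below is illustrative only: `TranscriptionEngine`, `CloudSettings`, `resolveEngine`, and the fallback-to-local behavior are assumptions made for this example and do not describe Zerm's actual implementation.

```swift
// Hypothetical engine choice: audio stays on device unless cloud is configured.
enum TranscriptionEngine: Equatable {
    case local(model: String)
    case cloud(provider: String)
}

// Hypothetical user settings for an external provider.
struct CloudSettings {
    var provider: String?
    var apiKey: String?

    /// Cloud use counts as configured only when both fields are present and non-empty.
    var isConfigured: Bool {
        guard let provider = provider, let apiKey = apiKey else { return false }
        return !provider.isEmpty && !apiKey.isEmpty
    }
}

/// Returns a cloud engine only when the user asked for it AND configured it.
/// In every other case the sketch falls back to the local model.
func resolveEngine(preferCloud: Bool, cloud: CloudSettings, localModel: String) -> TranscriptionEngine {
    if preferCloud, cloud.isConfigured, let provider = cloud.provider {
        return .cloud(provider: provider)
    }
    return .local(model: localModel)
}

let unconfigured = CloudSettings(provider: nil, apiKey: nil)
print(resolveEngine(preferCloud: true, cloud: unconfigured, localModel: "whisper-base"))
```

Treating the local path as the default keeps the privacy boundary explicit: sending audio off the device takes two deliberate steps, a stated preference and a completed configuration.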
Issues and pull requests are welcome when they align with the native macOS app.
Before opening a PR:
- Keep changes focused and describe the user-facing behavior.
- Preserve GPLv3 attribution for VoiceInk-derived code.
- Add or update tests for high-risk behavior.
- Include screenshots or short recordings for UI changes.
- Note any macOS permissions or signing behavior that could affect testing.
Zerm is licensed under the GNU General Public License v3.0.
Because Zerm is derived from VoiceInk, the GPLv3 license and upstream attribution are part of the public project identity. See NOTICE for source attribution and modification notes.
Built by Arcusis, based on VoiceInk by Beingpax.
