claude-voice

Gives Claude a voice using your choice of TTS provider — ElevenLabs, OpenAI, or Google Cloud TTS. Works automatically in Claude Code CLI and Claude Desktop on Windows. No system prompts, no manual triggers — turn it on and Claude talks.

Claude Code speaks the full response after it finishes (via a stop hook). Claude Desktop speaks sentence-by-sentence as Claude streams, so it feels more like a real conversation. Your provider, voice, and settings persist between sessions in ~/.claude/voice-config.json.

Code blocks, tool output, and file diffs are filtered out. Only the conversational prose gets spoken.

Providers

Provider	Free Tier	Voices	Notes
ElevenLabs	10,000 chars/month	~30-50 premade via API	Best naturalness; recurring free tier
OpenAI	$5 one-time credit (new accounts only)	6 fixed	Effectively paid; $15/1M chars after credit
Google	1M chars/month Neural2, 4M chars/month Standard (recurring)	~12 en-US Neural2	Best free tier for ongoing use

You only need one provider to get started. Set up more later with /voice provider.

Which provider should I use?

For free ongoing use → Google. 1 million Neural2 characters per month, recurring. That's roughly 10–15 hours of Claude voice per month for free.

For the best voice quality → ElevenLabs. The most natural-sounding voices with 10,000 free characters per month — about 10–15 minutes of audio. Good for light use; upgrade to a paid plan for daily use.

For OpenAI users → OpenAI TTS. If you already have an OpenAI API account with credits, it's the simplest setup. No recurring free tier — you pay per character ($15/1M chars).

Requirements

Python 3.10+
Windows (desktop daemon uses Windows UIAutomation)
ffmpeg — winget install ffmpeg
API key for at least one provider (see table above)
Claude Code CLI
Claude Desktop (optional, for sentence-level streaming)

Setup

git clone https://github.com/yourusername/claude-voice.git
cd claude-voice
pip install -r requirements.txt
python setup.py

Setup will:

Ask which TTS provider you want to use
Prompt for that provider's API key and verify it
Let you pick a default voice
Write config to ~/.claude/voice-config.json
Register the stop hook in ~/.claude/settings.json
Register the verbosity-injection hook (UserPromptSubmit) so caveman directives stay active every turn
Register the MCP prompt server for Claude Desktop verbosity control

Voice is off by default after setup. Enable it:

/voice on

How verbosity stays enforced

Verbosity rules are enforced via the bundled voice-caveman skill at .claude/skills/voice-caveman/SKILL.md. The skill is automatically invoked when you run /voice on or /voice verbosity <N> — Claude Code loads it for the rest of the conversation, with hard word-count ceilings per level. No personal config edits required.

Slash Commands

Command	What it does
`/voice on`	Enable TTS
`/voice off`	Disable TTS
`/voice list`	Show available voices. ElevenLabs defaults to premade voices; use `/voice list all` for everything
`/voice pick <name>`	Switch voice — persists between sessions
`/voice provider <name>`	Switch provider: `elevenlabs`, `openai`, or `google`
`/voice verbosity <level>`	Set verbosity: `1` (most compressed) through `4` (no constraints)

Examples:

/voice provider google
/voice list
/voice pick en-US-Neural2-F

Getting API Keys

ElevenLabs — elevenlabs.io → Sign up → Profile → API Keys. Free tier: 10k chars/month.

OpenAI — platform.openai.com → API Keys.

Note: OpenAI TTS has no recurring free tier. New accounts get $5 in one-time credits (~333k characters), after which you pay per use.

Google Cloud TTS — console.cloud.google.com → Enable "Cloud Text-to-Speech API" → Credentials → Create API Key. Free tier: 1M chars/month (Neural2 voices).

Claude Desktop (sentence streaming)

The desktop daemon runs separately and needs to be started once:

python daemon/desktop-daemon.py

Make sure Claude Desktop is open before starting the daemon. It connects automatically.

To have it start with Windows, add it to Task Scheduler or create a shortcut in your startup folder pointing to:

pythonw "C:\path\to\claude-voice\daemon\desktop-daemon.py"

The daemon respects the same on/off flag as Claude Code. Use /voice off in Claude Code (or flip enabled in ~/.claude/voice-config.json) to silence both at once.

If voice stops working in the desktop daemon, the accessibility tree selector may need updating after a Claude Desktop update. Run the inspector to get a fresh tree dump:

python daemon/inspect-tree.py > daemon/tree-dump.txt

Then look in tree-dump.txt for the element containing response text and update _get_response_text() in desktop-daemon.py accordingly.

Verbosity

Long responses are expensive — both in TTS API credits and listening time. Four verbosity levels control how Claude generates its response, using progressively stronger compression.

/voice verbosity 1
/voice verbosity 2
/voice verbosity 3
/voice verbosity 4

Level	Style	Behavior
`1`	Ultra compressed	Abbreviations, arrows for causality (X→Y), one word when one word enough. No bullets. Max 2 sentences.
`2` (default)	Compressed	Drop articles, fragments OK. Max 3 bullets; extras noted as "...and N more". Max 4 sentences.
`3`	Tight	Full sentences, no filler or hedging. Professional but concise.
`4`	No constraints	Claude responds however it wants.

All levels drop pleasantries, filler words, and hedging. Takes effect immediately. No new session needed.

Verbosity in Claude Desktop

Claude Desktop has no slash commands, but it does have an MCP prompt picker. After running setup.py and restarting Claude Desktop, four prompts appear in the prompt picker:

Voice: Level 1
Voice: Level 2
Voice: Level 3
Voice: Level 4

Select one at the start of a conversation and it injects the same directive that /voice verbosity would in Claude Code.

Voice Selection

ElevenLabs: ElevenLabs has 10,000+ voices in their community library, but only premade official voices (~30-50) are accessible via API on the free tier. /voice list shows premade voices by default; /voice list all shows everything including any voices you've cloned or created yourself.

/voice list          # premade official voices (~30-50)
/voice list all      # everything on your account
/voice pick Rachel

OpenAI: 6 fixed voices — alloy, echo, nova, onyx, shimmer, fissure. Pick with /voice pick nova.

Google: /voice list shows available en-US Neural2 voices. Pick with /voice pick en-US-Neural2-F.

Switch provider at any time:

/voice provider openai
/voice list
/voice pick onyx

Config

Config lives at ~/.claude/voice-config.json:

{
  "enabled": false,
  "provider": "google",
  "voice_id": "21m00Tcm4TlvDq8ikWAM",
  "elevenlabs_api_key": "",
  "openai_api_key": "",
  "openai_voice": "nova",
  "openai_model": "tts-1",
  "google_api_key": "AIza...",
  "google_voice": "en-US-Neural2-C",
  "verbosity": "low"
}

Only the active provider's key is used. You can store all three and switch freely with /voice provider.

Troubleshooting

No audio at all

Check that ffplay is on your PATH:

ffplay -version

If not found, install ffmpeg and make sure it's in your PATH.

Claude Code not speaking

Check that the hook registered correctly:

python -c "import json; d=json.load(open('C:/Users/<you>/.claude/settings.json')); print(d.get('hooks', {}))"

You should see a Stop entry pointing to stop-hook.py. If it's missing, run setup.py again.

Desktop daemon not finding Claude

The daemon looks for a window with "Claude" in the title. Make sure Claude Desktop is open and visible (not minimized) when you start the daemon.

Desktop daemon finds the window but no audio

See the accessibility tree note in the Claude Desktop section above.

Switching providers doesn't work

Make sure you've run setup for that provider (or set the API key manually in ~/.claude/voice-config.json) and run /voice list to confirm voices load correctly.

Project Structure

claude-voice/
├── src/
│   ├── config.py          # Config read/write (provider, keys, voice)
│   ├── filter.py          # Strips code blocks, tool output, diffs
│   └── tts.py             # Multi-provider TTS (ElevenLabs / OpenAI / Google)
├── hooks/
│   └── stop-hook.py       # Claude Code stop hook
├── daemon/
│   ├── desktop-daemon.py  # Claude Desktop UIAutomation watcher
│   └── inspect-tree.py    # One-time tool to dump accessibility tree
├── commands/              # Python scripts called by slash commands
│   ├── voice_on.py
│   ├── voice_off.py
│   ├── voice_list.py
│   ├── voice_pick.py
│   ├── voice_provider.py
│   └── voice_verbosity.py
├── .claude/commands/voice/  # Slash command markdown files
│   ├── on.md
│   ├── off.md
│   ├── list.md
│   ├── pick.md
│   ├── provider.md
│   └── verbosity.md
├── mcp_server/
│   └── voice-mcp-server.py  # MCP prompts for Claude Desktop verbosity
├── tests/
└── setup.py

Credits

Verbosity compression levels inspired by Julius Brussee's Caveman — shamelessly stolen and adapted for voice.

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

claude-voice

Providers

Which provider should I use?

Requirements

Setup

How verbosity stays enforced

Slash Commands

Getting API Keys

Claude Desktop (sentence streaming)

Verbosity

Verbosity in Claude Desktop

Voice Selection

Config

Troubleshooting

Project Structure

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.claude		.claude
.superpowers/brainstorm/361-1776961525		.superpowers/brainstorm/361-1776961525
.vscode		.vscode
commands		commands
daemon		daemon
docs		docs
hooks		hooks
mcp_server		mcp_server
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

Folders and files

Latest commit

History

Repository files navigation

claude-voice

Providers

Which provider should I use?

Requirements

Setup

How verbosity stays enforced

Slash Commands

Getting API Keys

Claude Desktop (sentence streaming)

Verbosity

Verbosity in Claude Desktop

Voice Selection

Config

Troubleshooting

Project Structure

Credits

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages