Open Source · MIT License · Written in Rust

Give Claude Code
a voice.

Name: Vox
Author: rtk-ai

Hear when tasks finish, get spoken summaries, clone your own voice. Local-first TTS with 3 backends, zero API calls.

3 TTS backends

8 MCP tools

0 API calls

Install Vox Star on GitHub

Why Vox?

Your AI assistant works in the terminal. Now it can talk to you.

Task Notifications

Hear when builds finish, tests pass or fail, and deployments complete. Stay productive without watching the terminal.

Spoken Summaries

Claude reads back what it did after a complex task. Get the gist without scanning hundreds of lines.

Voice Cloning

Record a 10-second sample and Vox speaks with your voice. Or clone any voice from an audio file.

100% Local

No API keys, no cloud, no data leaving your machine. Everything runs on-device with local models.

Cross-Platform

macOS, Linux, WSL. The pure-Rust backend works everywhere. Metal & CUDA GPU acceleration available.

One-Command Setup

vox init mcp registers everything in Claude Code. Zero config, instant integration.

3 TTS backends, one CLI

Pick the right engine for your platform and quality needs.

say macOS native

System speech synthesis via macOS NSSpeechSynthesizer. Instant startup, zero downloads, dozens of built-in voices.

Latency ~50ms

Quality System

Platform macOS

qwen Neural

MLX-powered neural TTS on Apple Silicon. High-quality, natural-sounding speech with voice cloning support. Model auto-downloads (~1.2 GB).

Latency ~500ms

Quality Neural

Platform macOS (Apple Si.)

qwen-native Pure Rust

Cross-platform neural TTS. Pure Rust implementation with Metal & CUDA GPU support. No Python, no dependencies.

Latency ~300ms

Quality Neural

Platform All

See it in action

Simple CLI, powerful results.

Basic usage

$ vox "Build complete. 3 tests passed."
# speaks aloud

$ vox -b qwen-native "Cross-platform TTS."
# uses pure-Rust backend

$ echo "Pipeline done" | vox
# reads from stdin

Voice cloning

$ vox clone record myvoice --duration 10
# records 10s of your voice

$ vox clone add patrick \
    --audio ~/voice.wav \
    --text "Transcription helps quality"
# register from audio file

$ vox -v patrick "Speaking with your voice."
# use cloned voice

Configuration

$ vox config set backend qwen
# switch default backend

$ vox config set lang fr
# set default language

$ vox config set voice Chelsie
# set default voice

$ vox --list-voices
# show available voices

Claude Code integration

4 init modes. Pick what fits your workflow.

mcp Default

Registers vox serve as an MCP server in ~/.claude.json. Exposes 8 tools that Claude can call directly.

vox init mcp

cli

Creates a CLAUDE.md with instructions and adds a stop hook that speaks "Terminé" when tasks complete.

vox init cli

skill

Generates a /speak slash command in ~/.claude/commands/. Invoke with /speak Hello.

vox init skill

all

Runs all three modes at once. MCP server + CLAUDE.md + slash command. The full experience.

vox init all

8 MCP tools exposed

vox_speak Read text aloud

vox_list_voices List available voices

vox_clone_add Add a voice clone

vox_clone_list List voice clones

vox_clone_remove Remove a clone

vox_config_show Show preferences

vox_config_set Set a preference

vox_stats Usage statistics

Install in seconds

Rust-powered. No runtime dependencies for the native backend.

Quick Install

macOS, Linux & WSL

curl -fsSL https://raw.githubusercontent.com/rtk-ai/vox/main/install.sh | sh

Via Cargo

From source

cargo install --git https://github.com/rtk-ai/vox

With GPU

Metal or CUDA acceleration

cargo install --git https://github.com/rtk-ai/vox --features metal

Then connect to Claude Code