Claude PTT - Push-to-Talk Voice Input for Claude Code

A Claude Code plugin that enables voice input via push-to-talk. Hold a hotkey to record your voice, then release it to transcribe the recording and insert the text into Claude Code.

Features

Push-to-talk: Hold Ctrl+Space to record, release to transcribe
Dual transcription backends: OpenAI Whisper API and local whisper.cpp
Cross-platform: Windows, macOS, Linux (X11 and Wayland)
Automatic fallback: Falls back to the other backend if the preferred one fails
Visual feedback: Shows recording/transcribing status

Installation

From Marketplace

/plugin marketplace add aaddrick/claude-ptt
/plugin install ptt@claude-ptt-marketplace

Manual Installation

git clone https://github.com/aaddrick/claude-ptt.git
cd claude-ptt
npm install
npm run build

Configuration

Configuration is stored in ~/.claude/ptt-config.json:

{
  "hotkey": "Ctrl+Space",
  "whisper": {
    "openaiApiKey": null,
    "localModelPath": null,
    "preferredMode": "api",
    "enableFallback": true,
    "language": "en"
  },
  "audio": {
    "sampleRate": 16000,
    "silenceThreshold": 0.5
  },
  "keystroke": {
    "waylandBackend": "wtype"
  },
  "feedback": {
    "showRecordingIndicator": true
  }
}
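As a rough illustration of how a partial config file could be combined with the defaults shown above, here is a minimal sketch. The type names (`PttConfig`, `ConfigOverrides`) and the `mergeConfig` helper are hypothetical and not taken from the plugin's source; the default values are copied from the example JSON.

```typescript
// Hypothetical types mirroring the example JSON; the plugin's real types may differ.
type WhisperMode = "api" | "local";

interface PttConfig {
  hotkey: string;
  whisper: {
    openaiApiKey: string | null;
    localModelPath: string | null;
    preferredMode: WhisperMode;
    enableFallback: boolean;
    language: string;
  };
  audio: { sampleRate: number; silenceThreshold: number };
  keystroke: { waylandBackend: string };
  feedback: { showRecordingIndicator: boolean };
}

// Each section may be given partially; missing keys fall back to defaults.
type ConfigOverrides = {
  hotkey?: string;
  whisper?: Partial<PttConfig["whisper"]>;
  audio?: Partial<PttConfig["audio"]>;
  keystroke?: Partial<PttConfig["keystroke"]>;
  feedback?: Partial<PttConfig["feedback"]>;
};

// Values copied from the example configuration in this README.
const DEFAULT_CONFIG: PttConfig = {
  hotkey: "Ctrl+Space",
  whisper: {
    openaiApiKey: null,
    localModelPath: null,
    preferredMode: "api",
    enableFallback: true,
    language: "en",
  },
  audio: { sampleRate: 16000, silenceThreshold: 0.5 },
  keystroke: { waylandBackend: "wtype" },
  feedback: { showRecordingIndicator: true },
};

// Merge section by section so that overriding one key keeps its siblings.
function mergeConfig(overrides: ConfigOverrides): PttConfig {
  return {
    hotkey: overrides.hotkey ?? DEFAULT_CONFIG.hotkey,
    whisper: { ...DEFAULT_CONFIG.whisper, ...overrides.whisper },
    audio: { ...DEFAULT_CONFIG.audio, ...overrides.audio },
    keystroke: { ...DEFAULT_CONFIG.keystroke, ...overrides.keystroke },
    feedback: { ...DEFAULT_CONFIG.feedback, ...overrides.feedback },
  };
}
```

With this sketch, `mergeConfig({ whisper: { preferredMode: "local" } })` would switch the preferred backend while leaving `language`, `sampleRate`, and the other defaults untouched.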

Whisper Configuration

API Mode (Recommended for ease of use):

Set the OPENAI_API_KEY environment variable, or
Set openaiApiKey in the config file

Local Mode (Recommended for privacy):

Install whisper.cpp
Set localModelPath to your model file
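To make the two modes concrete, the sketch below shows one way a program could decide which backends are configured. The helper names are hypothetical, and the precedence (environment variable before config file) is an assumption; this README does not state which one wins.

```typescript
// Environment is passed in explicitly so the logic is easy to test;
// in Node you would typically pass process.env.
type Env = Record<string, string | undefined>;

interface WhisperSettings {
  openaiApiKey: string | null;
  localModelPath: string | null;
}

// Assumption: OPENAI_API_KEY from the environment wins over the config value.
function resolveApiKey(settings: WhisperSettings, env: Env): string | null {
  const fromEnv = env["OPENAI_API_KEY"];
  if (fromEnv !== undefined && fromEnv.trim() !== "") {
    return fromEnv;
  }
  return settings.openaiApiKey;
}

// Report which backends have the settings they need.
// This only checks that values are present, not that the key or model is valid.
function configuredBackends(
  settings: WhisperSettings,
  env: Env,
): Array<"api" | "local"> {
  const backends: Array<"api" | "local"> = [];
  if (resolveApiKey(settings, env) !== null) {
    backends.push("api");
  }
  if (settings.localModelPath !== null) {
    backends.push("local");
  }
  return backends;
}
```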

Setting Preferred Mode

Use preferredMode to choose which backend to try first:

"api": Try OpenAI API first
"local": Try local whisper.cpp first

Enable enableFallback to automatically try the other backend if the preferred one fails.
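The interaction between preferredMode and enableFallback can be sketched as follows. This is an illustration of the behaviour described above, not the plugin's actual code; `backendOrder`, `transcribeWithFallback`, and the `Transcriber` type are hypothetical names.

```typescript
type Mode = "api" | "local";

// A transcriber takes raw audio bytes and resolves to text.
type Transcriber = (audio: Uint8Array) => Promise<string>;

// The preferred backend comes first; the other one is added only if fallback is on.
function backendOrder(preferredMode: Mode, enableFallback: boolean): Mode[] {
  const other: Mode = preferredMode === "api" ? "local" : "api";
  return enableFallback ? [preferredMode, other] : [preferredMode];
}

// Try each backend in order and return the first successful transcription.
// If every attempt fails, the last error is rethrown.
async function transcribeWithFallback(
  audio: Uint8Array,
  backends: Record<Mode, Transcriber>,
  preferredMode: Mode,
  enableFallback: boolean,
): Promise<{ text: string; usedMode: Mode }> {
  let lastError: unknown = new Error("no backend attempted");
  for (const mode of backendOrder(preferredMode, enableFallback)) {
    try {
      return { text: await backends[mode](audio), usedMode: mode };
    } catch (err) {
      lastError = err;
    }
  }
  throw lastError;
}
```

With enableFallback set to false, a failure in the preferred backend surfaces immediately instead of silently switching backends, which may be what you want in Local mode if audio must never leave the machine.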

Platform Setup

Windows

No additional setup is required for keystroke simulation; the plugin uses nut.js. Audio recording still needs sox (see Audio Recording).

macOS

Grant accessibility permissions to your terminal application:

Open System Preferences > Security & Privacy > Privacy > Accessibility (on macOS 13 and later: System Settings > Privacy & Security > Accessibility)
Add your terminal app (Terminal.app, iTerm2, etc.)

Linux (X11)

Install libxtst for keystroke simulation:

sudo apt install libxtst-dev

Linux (Wayland)

Install one of the following for keystroke simulation:

# Option 1: wtype (recommended, no daemon required)
sudo apt install wtype

# Option 2: ydotool (requires daemon)
sudo apt install ydotool
sudo systemctl enable --now ydotool

# Option 3: dotool
# Build from source: https://sr.ht/~geb/dotool/
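The three tools are invoked differently, which is why the config has a waylandBackend setting. The sketch below shows how a typing command could be assembled for each one; it is not the plugin's implementation, and `buildTypeCommand` and `KeystrokeCommand` are hypothetical names. It relies on the usual command-line forms: wtype takes the text as an argument, ydotool uses its `type` subcommand, and dotool reads `type ...` actions from standard input.

```typescript
type WaylandBackend = "wtype" | "ydotool" | "dotool";

interface KeystrokeCommand {
  command: string;
  args: string[];
  // Text to write to the process's standard input, if the tool needs it.
  stdin?: string;
}

// Build the command that types `text` into the focused window.
// Arguments are kept as an array so they can be passed to a process spawner
// without shell quoting. Text that starts with "-" is not handled here.
function buildTypeCommand(backend: WaylandBackend, text: string): KeystrokeCommand {
  switch (backend) {
    case "wtype":
      return { command: "wtype", args: [text] };
    case "ydotool":
      return { command: "ydotool", args: ["type", text] };
    case "dotool": {
      // dotool reads one action per line, so collapse newlines into spaces.
      const singleLine = text.replace(/\r?\n/g, " ");
      return { command: "dotool", args: [], stdin: `type ${singleLine}\n` };
    }
  }
}
```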

Audio Recording

The plugin uses system audio tools:

Linux: arecord (ALSA, usually pre-installed)
macOS: sox (install via brew install sox)
Windows: sox (download from http://sox.sourceforge.net/)
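For orientation, here is a sketch of how the recording command might be chosen per platform. The exact flags the plugin passes are not documented here, so the ones below are assumptions: mono 16-bit WAV at the configured sample rate, using standard arecord and sox options. `buildRecordCommand` is a hypothetical name.

```typescript
// Values match Node's process.platform for the supported systems.
type Platform = "linux" | "darwin" | "win32";

interface RecordCommand {
  command: string;
  args: string[];
}

// Build a command that records mono 16-bit audio to `outputPath`.
function buildRecordCommand(
  platform: Platform,
  sampleRate: number,
  outputPath: string,
): RecordCommand {
  const rate = String(sampleRate);
  if (platform === "linux") {
    // arecord: quiet, signed 16-bit little-endian, given rate, 1 channel, WAV container.
    return {
      command: "arecord",
      args: ["-q", "-f", "S16_LE", "-r", rate, "-c", "1", "-t", "wav", outputPath],
    };
  }
  // sox: -d reads from the default input device; the options that follow
  // describe the output file.
  return {
    command: "sox",
    args: ["-d", "-r", rate, "-c", "1", "-b", "16", outputPath],
  };
}
```

The 16000 Hz default for sampleRate matches the rate Whisper models work with, so recording at that rate avoids a resampling step.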

Usage

Starting the Daemon

# Via npm
npm start

# Or directly
node dist/daemon.js

Using with Claude Code

Start the daemon in a separate terminal
In Claude Code, hold Ctrl+Space to record
Speak your message
Release Ctrl+Space to transcribe
Text appears in your input for review
Press Enter to submit
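The steps above amount to a small state machine: idle, recording while the hotkey is held, transcribing after release, then idle again. A minimal sketch follows, with hypothetical state and event names; the daemon's real structure may differ.

```typescript
type PttState = "idle" | "recording" | "transcribing";

type PttEvent =
  | "hotkeyDown"
  | "hotkeyUp"
  | "transcriptionDone"
  | "transcriptionFailed";

// Return the next state for a given event. Events that do not apply to the
// current state (for example key auto-repeat while already recording) are ignored.
function nextState(state: PttState, event: PttEvent): PttState {
  if (state === "idle" && event === "hotkeyDown") {
    return "recording";
  }
  if (state === "recording" && event === "hotkeyUp") {
    return "transcribing";
  }
  if (
    state === "transcribing" &&
    (event === "transcriptionDone" || event === "transcriptionFailed")
  ) {
    return "idle";
  }
  return state;
}
```

Ignoring a second hotkeyDown while recording matters in practice because many systems send repeated key-down events while a key is held.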

MCP Tools

The plugin provides MCP tools for configuration:

ptt_get_config: Get current configuration
ptt_set_config: Update configuration
ptt_get_status: Get daemon status
ptt_get_platform_info: Get platform info and setup instructions
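Claude Code normally calls these tools for you, but it can help to see what a call looks like on the wire. MCP is built on JSON-RPC 2.0, and tools are invoked with the `tools/call` method. The sketch below builds such a request; `buildToolCall` is a hypothetical helper, and because the argument schema of ptt_set_config is not documented here, the example passes no arguments.

```typescript
// Tool names as listed in this README.
const PTT_TOOLS = [
  "ptt_get_config",
  "ptt_set_config",
  "ptt_get_status",
  "ptt_get_platform_info",
] as const;

type PttToolName = (typeof PTT_TOOLS)[number];

// Shape of an MCP tool invocation (a JSON-RPC 2.0 request).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: PttToolName; arguments: Record<string, unknown> };
}

// Build a request for one of the plugin's tools.
function buildToolCall(
  id: number,
  name: PttToolName,
  args: Record<string, unknown> = {},
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// Example: ask the daemon for its status.
const statusRequest = buildToolCall(1, "ptt_get_status");
```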

Troubleshooting

Hotkey not detected

Linux: May need to run as root for global key capture
macOS: Ensure accessibility permissions are granted
All platforms: Check for conflicts with other applications

Keystroke simulation not working

macOS: Check accessibility permissions
Linux Wayland: Ensure wtype, ydotool, or dotool is installed and matches waylandBackend in your config
Linux X11: Ensure libxtst-dev is installed

Transcription fails

API mode: Verify your OpenAI API key
Local mode: Verify whisper.cpp is installed and model path is correct
All modes: Check microphone permissions and audio recording

No audio recorded

Check microphone permissions in system settings
Verify the audio recording tool is installed:
- Linux: which arecord
- macOS: which sox
- Windows: where sox

Development

# Install dependencies
npm install

# Build TypeScript
npm run build

# Watch mode
npm run watch

# Run daemon
npm start

# Run MCP server
npm run mcp-server

License

MIT

Author

Aaddrick Williams <aaddrick@gmail.com>
