Currently at v2.3.2

What's new

The stuff that ships. Features, fixes, and the occasional breakthrough.

v2.3.0 January 12, 2026 LATEST

Local Whisper: transcription without API keys

No API key? No problem. Download Whisper.cpp models directly from Settings and run transcription entirely offline. Plus, quick model cycling in the voice overlay and a new console error viewer for debugging.

Local models
Whisper tiny/base/small with one-click download
Model cycling
Click model name in overlay to switch providers
Error viewer
Console errors captured in status bar
Offline transcription Whisper.cpp ElevenLabs WebSocket fix Resizable directory picker
v2.3.2 January 12, 2026

Browse files from your phone

The remote client now includes a full file browser. Navigate your project, read files, edit code, create new files — all from your phone. Combined with the new audiobash.app domain and fixed audio notifications, this release turns the remote client into a proper mobile IDE companion.

File browser
Navigate, read, edit, create, delete
audiobash.app
Custom domain on Cloudflare Pages
Audio restored
Vite build fix for sound files
Remote file browser Code editor Custom domain Audio notifications
v2.1.1 January 12, 2026

Voice is back

Turns out, the Content Security Policy was silently blocking all your transcription API calls. ElevenLabs, OpenAI, Anthropic - all of them. The fix was one line in the CSP header, but finding it required diving into the browser console during a remote debugging session from my phone.

// The culprit
connect-src 'self' https://api.elevenlabs.io wss://api.elevenlabs.io...
CSP Fix ElevenLabs OpenAI Anthropic
v2.1.0 January 10, 2026 FEATURE

Real-time transcription

ElevenLabs Scribe v2 brought WebSocket streaming, and it changes everything. Instead of waiting 2-3 seconds after you stop talking, your words appear in ~150ms. It feels like the terminal is reading your mind.

~150ms
Transcription latency
VAD
Auto-commit on silence
WebSocket PCM 16kHz Voice Activity Detection
v2.1.x January 2026 EXPERIMENTAL

Control from your phone

Your couch is 20 feet from your desk. AudioBash now includes a browser-based remote client. Scan a QR code, type on your phone's keyboard, and watch commands execute on your desktop. The phone's native voice keyboard means you can dictate without even touching the screen.

Scan QR
Enter code
Type anywhere
WebSocket Cloudflare Tunnel Pairing codes
v2.0.0 January 2, 2026 MAJOR

Hello, macOS

AudioBash went cross-platform. Native Apple Silicon builds (M1/M2/M3), Intel support for older Macs, and automatic shell detection so it opens zsh instead of trying to spawn PowerShell on your Mac. The stress test suite expanded to 120+ tests to catch platform-specific edge cases.

Apple Silicon
Windows
120+ tests
Read the macOS announcement
v1.1.0 December 11, 2025

Make it yours

Custom instructions let you tell the AI how to format your commands. Custom vocabulary maps your weird pronunciations to the right words. Audio chimes when Claude Code needs input. 14 keyboard shortcuts. The whole package.

// Vocabulary example
"lawn mow" → "npm"
// Shortcuts
Alt+A, Alt+M, Alt+R...
Custom instructions Vocabulary mapping Audio chimes

Want the full technical changelog?

View release notes