What's new // AUDIOBASH

v2.3.0 January 12, 2026 LATEST

Local Whisper: transcription without API keys

No API key? No problem. Download Whisper.cpp models directly from Settings and run transcription entirely offline. Plus, quick model cycling in the voice overlay and a new console error viewer for debugging.

Local models

Whisper tiny/base/small with one-click download

Model cycling

Click model name in overlay to switch providers

Error viewer

Console errors captured in status bar

Offline transcription, Whisper.cpp, ElevenLabs WebSocket fix, Resizable directory picker
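Model cycling in the overlay boils down to "go to the next entry, wrap at the end". Here is a minimal sketch of that idea. The `nextModel` helper and the model identifiers are hypothetical and only for illustration; the real names and list in AudioBash may differ.

```typescript
// Hypothetical model identifiers; the actual list in AudioBash may differ.
type ModelId = string;

/** Return the entry after `current`, wrapping back to the first one at the end. */
function nextModel(models: readonly ModelId[], current: ModelId): ModelId {
  if (models.length === 0) {
    throw new Error("no models configured");
  }
  const index = models.indexOf(current);
  // indexOf returns -1 for an unknown model, so (-1 + 1) % n falls back to the first entry.
  return models[(index + 1) % models.length];
}

const models: ModelId[] = ["whisper-tiny", "whisper-base", "whisper-small"];
```

Each click on the model name would call `nextModel(models, current)` and store the result as the new selection.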

v2.3.2 January 12, 2026

Browse files from your phone

The remote client now includes a full file browser. Navigate your project, read files, edit code, and create new files, all from your phone. Combined with the new audiobash.app domain and fixed audio notifications, this release turns the remote client into a proper mobile IDE companion.

File browser

Navigate, read, edit, create, delete

audiobash.app

Custom domain on Cloudflare Pages

Audio restored

Vite build fix for sound files

Remote file browser, Code editor, Custom domain, Audio notifications

v2.1.1 January 12, 2026

Voice is back

Turns out, the Content Security Policy was silently blocking all your transcription API calls. ElevenLabs, OpenAI, Anthropic: all of them. The fix was one line in the CSP header, but finding it required diving into the browser console during a remote debugging session from my phone.

// The culprit

connect-src 'self' https://api.elevenlabs.io wss://api.elevenlabs.io...
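One way to keep a line like this from drifting is to build the directive from a single list of origins. The sketch below is an assumption about how that could look, not how AudioBash does it: `buildConnectSrc` is a hypothetical helper, and the list only contains the two origins visible in the snippet above.

```typescript
/** Build a connect-src directive from a list of allowed origins (hypothetical helper). */
function buildConnectSrc(origins: readonly string[]): string {
  // Deduplicate while preserving order; 'self' always comes first.
  const unique = Array.from(new Set(origins));
  return ["connect-src", "'self'", ...unique].join(" ");
}

// Only the origins shown in the changelog snippet; anything else would be an assumption.
const allowedOrigins = ["https://api.elevenlabs.io", "wss://api.elevenlabs.io"];

const directive = buildConnectSrc(allowedOrigins);
```

Any provider whose origin is missing from the list gets blocked by the browser, which is the failure mode described above.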

CSP Fix, ElevenLabs, OpenAI, Anthropic

v2.1.0 January 10, 2026 FEATURE

Real-time transcription

ElevenLabs Scribe v2 brought WebSocket streaming, and it changes everything. Instead of waiting 2-3 seconds after you stop talking, you see your words appear in ~150ms. It feels like the terminal is reading your mind.

~150ms

Transcription latency

VAD

Auto-commit on silence

WebSocket, PCM 16kHz, Voice Activity Detection
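To make "PCM 16kHz" and "auto-commit on silence" a bit more concrete, here is a rough sketch of the client-side pieces: converting float samples to 16-bit PCM, and a simple energy-based silence check. This is not ElevenLabs' VAD or AudioBash's actual code. The `SilenceDetector` class, the RMS threshold, and the hold time are all assumptions chosen for illustration.

```typescript
/** Convert float samples in [-1, 1] to 16-bit signed PCM. */
function floatTo16BitPCM(samples: Float32Array): Int16Array {
  const pcm = new Int16Array(samples.length);
  for (let i = 0; i < samples.length; i++) {
    const s = Math.max(-1, Math.min(1, samples[i])); // clamp out-of-range input
    pcm[i] = s < 0 ? s * 0x8000 : s * 0x7fff;
  }
  return pcm;
}

/** Root-mean-square energy of one frame. */
function rms(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  let sum = 0;
  for (let i = 0; i < frame.length; i++) sum += frame[i] * frame[i];
  return Math.sqrt(sum / frame.length);
}

/** Energy-based stand-in for VAD: signals a commit after `holdMs` of silence following speech. */
class SilenceDetector {
  private readonly threshold: number;
  private readonly holdMs: number;
  private silentMs = 0;
  private heardSpeech = false;

  constructor(threshold = 0.01, holdMs = 600) {
    this.threshold = threshold; // assumed RMS level below which a frame counts as silent
    this.holdMs = holdMs;       // assumed silence duration before committing
  }

  /** Feed one frame; returns true when the utterance should be committed. */
  push(frame: Float32Array, sampleRate = 16000): boolean {
    if (rms(frame) >= this.threshold) {
      this.heardSpeech = true;
      this.silentMs = 0;
      return false;
    }
    if (!this.heardSpeech) return false; // ignore silence before any speech
    this.silentMs += (frame.length / sampleRate) * 1000;
    if (this.silentMs < this.holdMs) return false;
    this.heardSpeech = false;
    this.silentMs = 0;
    return true;
  }
}
```

In a streaming setup, each frame would be converted with `floatTo16BitPCM` and sent over the WebSocket, and a `true` from `push` would trigger the commit.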

v2.1.x January 2026 EXPERIMENTAL

Control from your phone

Your couch is 20 feet from your desk. AudioBash now includes a browser-based remote client. Scan a QR code, type on your phone's keyboard, and watch commands execute on your desktop. The phone's native voice keyboard means you can dictate without even touching the screen.

Scan QR

Enter code

Type anywhere

WebSocket, Cloudflare Tunnel, Pairing codes

v2.0.0 January 2, 2026 MAJOR

Hello, macOS

AudioBash went cross-platform. Native Apple Silicon builds (M1/M2/M3), Intel support for older Macs, and automatic shell detection so it opens zsh instead of trying to spawn PowerShell on your Mac. The stress test suite expanded to 120+ tests to catch platform-specific edge cases.

Apple Silicon

Windows

120+ tests

Read the macOS announcement
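The shell detection mentioned above can be sketched as a small pure function. This is an illustration under assumptions, not the app's real logic: `defaultShell` is a hypothetical helper, and the fallback paths are common defaults rather than anything confirmed by AudioBash.

```typescript
/** Pick a default shell for a platform string such as the value of process.platform. */
function defaultShell(
  platform: string,
  env: Record<string, string | undefined>,
): string {
  if (platform === "win32") {
    return "powershell.exe";
  }
  // On Unix-like systems, honor the user's configured shell when it is set.
  const userShell = env["SHELL"];
  if (userShell) return userShell;
  // Assumed fallbacks: zsh on macOS, bash elsewhere.
  return platform === "darwin" ? "/bin/zsh" : "/bin/bash";
}
```

Passing the platform and environment in as arguments keeps the function easy to test on any machine, which fits a suite that targets platform-specific edge cases.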

v1.1.0 December 11, 2025

Make it yours

Custom instructions let you tell the AI how to format your commands. Custom vocabulary maps your weird pronunciations to the right words. Audio chimes when Claude Code needs input. 14 keyboard shortcuts. The whole package.

// Vocabulary example

"lawn mow" → "npm"

// Shortcuts

Alt+A, Alt+M, Alt+R...

Custom instructions, Vocabulary mapping, Audio chimes
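The vocabulary example above ("lawn mow" → "npm") amounts to a phrase replacement pass over the transcript. A minimal sketch, assuming case-insensitive whole-phrase matching; `applyVocabulary` is a hypothetical name, and the real feature may match differently.

```typescript
/** Escape characters that have special meaning in a regular expression. */
function escapeRegExp(text: string): string {
  return text.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

/** Replace each misheard phrase in the transcript with its mapped word. */
function applyVocabulary(
  transcript: string,
  vocabulary: Record<string, string>,
): string {
  // Longest phrases first, so a longer entry wins over a shorter one it contains.
  const phrases = Object.keys(vocabulary).sort((a, b) => b.length - a.length);
  let result = transcript;
  for (const phrase of phrases) {
    const pattern = new RegExp(`\\b${escapeRegExp(phrase)}\\b`, "gi");
    // A replacer function avoids "$" sequences in the replacement being interpreted.
    result = result.replace(pattern, () => vocabulary[phrase]);
  }
  return result;
}

const vocabulary = { "lawn mow": "npm" };
const fixed = applyVocabulary("Lawn mow install", vocabulary);
```

The word-boundary check keeps "lawn mower" untouched, so only the exact misheard phrase is rewritten.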