VerbalCoding
Talk to your CLI coding agents through Discord voice — like a phone call for software work.
한국어 · 日本語 · 中文 · Español · Français · Русский
Why
VerbalCoding turns a Discord voice channel into a hands-free control surface for coding agents. Speak a request, let your CLI agent work, and hear a concise answer back — with text transcripts, progress events, and guardrails for noisy code/log output.
Highlights
| What you get | Why it feels good |
|---|---|
| Voice-first agent control | Talk to Hermes Agent, Claude Code, Codex, Gemini CLI, OpenCode, OpenClaw, or any custom CLI harness. |
| On-device speech loop | Discord voice capture → local whisper-cli transcription → agent → chunked TTS playback. |
| Shared voice + text context | Voice turns and !ask text commands can reuse the same supported agent session. |
| Barge-in and sensitivity modes | Interrupt playback naturally and switch between normal and conservative/noisy environments. |
| Multilingual voice presets | Switch STT, progress language, and TTS voice together with vc language ko/en/auto. |
| Multi-room project isolation | Run one bot per project room with isolated Hermes profiles, sessions, memory, and logs. |
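The "chunked TTS playback" step in the speech loop can be pictured as splitting the agent's reply into short pieces, so that speech can start before the whole answer is synthesized. The following is a minimal sketch of that idea only, not VerbalCoding's actual implementation: the function name `chunkForTts`, the sentence-boundary rule, and the 200-character default are all assumptions made for illustration.

```javascript
// Hypothetical helper: split an agent reply into short chunks for TTS playback.
// The splitting rule and the default limit are assumptions, not project behavior.
function chunkForTts(text, maxLen = 200) {
  // Split after ".", "!" or "?" when followed by whitespace.
  const sentences = text
    .split(/(?<=[.!?])\s+/)
    .map((s) => s.trim())
    .filter(Boolean);

  const chunks = [];
  let current = "";
  for (const sentence of sentences) {
    const candidate = current ? `${current} ${sentence}` : sentence;
    if (candidate.length <= maxLen) {
      // Still fits: keep growing the current chunk.
      current = candidate;
    } else {
      if (current) chunks.push(current);
      // A single over-long sentence becomes its own chunk, unsplit.
      current = sentence;
    }
  }
  if (current) chunks.push(current);
  return chunks;
}
```

For example, `chunkForTts("One. Two. Three.", 9)` groups the first two sentences into one chunk and leaves the third on its own. A real bridge would probably also need to handle text without sentence punctuation, which this sketch passes through as a single chunk.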
Quick Start
Fastest path with npm:
npm install -g verbalcoding
vc setup --yes
vc doctor
vc start

Or run directly without a permanent global install:
npx verbalcoding setup --yes
vc doctor
vc start

GitHub clone path for contributors:
git clone https://github.com/ca1773130n/VerbalCoding.git
cd VerbalCoding
./scripts/install.sh --yes
vc doctor
./run.sh

vc setup --yes bootstraps local prerequisites from the npm package. ./scripts/install.sh --yes does the same for GitHub clone installs. Both cover Node/npm dependencies, ffmpeg, whisper-cli, the default whisper.cpp model, a local .venv-tts Edge TTS helper, and setup wizard configuration where possible. They support macOS/Homebrew plus common Linux package managers (apt, dnf, pacman). Rerun with --no-wizard for dependency-only setup, or with --skip-system if you want to install OS packages yourself.
Need a clean install walkthrough? Start with Fresh Install.
Supported Agent Backends
| Backend | Default command | Session support |
|---|---|---|
| Hermes Agent | hermes chat -Q -q | Resume, verbose progress, cancellation, final-answer recovery |
| Claude Code | claude -p | CLI session file support through adapter defaults |
| Codex CLI | codex exec | CLI session file support through adapter defaults |
| Gemini CLI | gemini -p | CLI session file support through adapter defaults |
| OpenCode | opencode run | CLI session file support through adapter defaults |
| OpenClaw | openclaw run | CLI session file support through adapter defaults |
| Custom | AGENT_COMMAND | Bring your own non-interactive command |
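To make the table concrete, here is a rough sketch of how a bridge might turn a backend choice and a prompt into a command line. The default commands come from the table above; everything else is an assumption for illustration, including the helper name `buildAgentInvocation`, the short backend keys, and the idea that the prompt is appended as the final argument. The real adapters may pass prompts differently.

```javascript
// Default commands copied from the backend table.
// The short keys ("hermes", "claude", ...) are hypothetical.
const DEFAULT_COMMANDS = {
  hermes: "hermes chat -Q -q",
  claude: "claude -p",
  codex: "codex exec",
  gemini: "gemini -p",
  opencode: "opencode run",
  openclaw: "openclaw run",
};

// Hypothetical helper: build { command, args } for a backend and a prompt.
function buildAgentInvocation(backend, prompt, env = {}) {
  // "custom" reads the command from AGENT_COMMAND, as the table describes.
  const base = backend === "custom" ? env.AGENT_COMMAND : DEFAULT_COMMANDS[backend];
  if (!base) throw new Error(`No command configured for backend: ${backend}`);

  // Naive whitespace split: quoted arguments in AGENT_COMMAND are not handled.
  const [command, ...args] = base.trim().split(/\s+/);

  // Assumption: the prompt goes last, as a single argument.
  return { command, args: [...args, prompt] };
}
```

The result has the shape Node's `child_process.spawn(command, args)` expects. Keeping the prompt as one array element rather than concatenating it into a shell string avoids quoting problems with spoken text.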
Learn More
| Guide | What you get |
|---|---|
| Fresh Install | Clean clone setup, model download, first run |
| Usage Guide | CLI commands, Discord commands, progress mode, latency metrics |
| Configuration | .env, agent backends, MCP, TTS backends, operational notes |
| Multi-Instance | One permanent Discord voice room per project |
| Release Notes | Current capabilities and pre-release checklist |
Tiny Command Map
vc status # current language, TTS, and bridge settings
vc language ko|en|auto # switch STT/progress/TTS language preset
vc bot invite CLIENT_ID # generate the Discord bot invite URL
vc instance setup NAME # create an isolated project voice bot
vc instance start NAME # run that bot in the background
vc doctor # redacted health check
vc start # start the default bridge

In Discord:
| Command | What it does |
|---|---|
| !join | Join your current voice channel. |
| !ask <prompt> | Send text to the same agent backend. |
| !verbose on\|off | Show/speak short progress updates. |
| !latency | Summarize recent voice/STT/agent/TTS latency. |
| !sensitivity normal | Use normal indoor barge-in sensitivity. |
| !sensitivity conservative | Use stricter noisy/outdoor sensitivity. |
| !session new <name> <workdir> [context] --voice <voice-channel> | Bind a project session to a voice room. |
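As an illustration of how these text commands could be recognized, the sketch below parses a chat message into a small command object. It is not the bot's actual parser: the function name `parseCommand` and the returned object shapes are assumptions, and `!session new` is left out to keep the example short.

```javascript
// Hypothetical parser for a subset of the "!" commands in the table above.
// Returns null for anything it does not recognize.
function parseCommand(message) {
  const text = message.trim();
  if (!text.startsWith("!")) return null;

  const [name, ...rest] = text.slice(1).split(/\s+/);
  switch (name) {
    case "join":
    case "latency":
      return { name };
    case "ask":
      // Slice from the raw text so spacing inside the prompt is preserved.
      return { name, prompt: text.slice("!ask".length).trim() };
    case "verbose":
      return rest[0] === "on" || rest[0] === "off"
        ? { name, enabled: rest[0] === "on" }
        : null;
    case "sensitivity":
      return ["normal", "conservative"].includes(rest[0])
        ? { name, mode: rest[0] }
        : null;
    default:
      return null;
  }
}
```

Returning `null` for unknown or malformed input lets the caller ignore ordinary chat messages without special cases.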
Requirements
| Layer | Default |
|---|---|
| Runtime | Node.js 20+, npm; install script can install via Homebrew/apt/dnf/pacman |
| Audio | ffmpeg; install script can install it |
| Speech recognition | Local whisper-cli from whisper.cpp; install script uses Homebrew on macOS or local Linux build fallback |
| TTS | Edge TTS CLI; install script creates .venv-tts if needed |
| Discord | Bot token, Message Content intent, voice permissions |
| Agent | At least one authenticated CLI harness, Hermes Agent by default |
| Platform focus | macOS / Apple Silicon most tested; Linux bootstrap is best-effort and documented |
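If you want to verify the runtime requirement from your own tooling, a check along these lines is one option. This is a small sketch, not what vc doctor does internally; the helper name `meetsNodeRequirement` is hypothetical.

```javascript
// Hypothetical check: does a Node.js version string satisfy the "20+" requirement?
function meetsNodeRequirement(version, minMajor = 20) {
  // Accepts both "v20.11.1" and "20.11.1".
  const major = Number.parseInt(version.replace(/^v/, "").split(".")[0], 10);
  // parseInt yields NaN for unparseable input, which fails the integer check.
  return Number.isInteger(major) && major >= minMajor;
}
```

In a script it could be called as `meetsNodeRequirement(process.version)`.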
Contributing
Run the lightweight checks before sending changes:
node --check app-node/main.mjs
npm test
bash -n run.sh scripts/install.sh
npm pack --dry-run
vc doctor

Status
VerbalCoding is public-release oriented but still early. Demo video/GIF, broader Linux validation, CI, and deeper security review are still TODOs.