Package Exports

This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (verbalcoding) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.

Readme

VerbalCoding

Talk to your CLI coding agents through Discord voice — like a phone call for software work.

한국어 · 日本語 · 中文 · Español · Français · Русский

VerbalCoding voice-to-agent flow

Why

VerbalCoding turns a Discord voice channel into a hands-free control surface for coding agents. Speak a request, let your CLI agent work, and hear a concise answer back — with text transcripts, progress events, and guardrails for noisy code/log output.

Highlights

What you get	Why it feels good
Voice-first agent control	Talk to Hermes Agent, Claude Code, Codex, Gemini CLI, OpenCode, OpenClaw, or any custom CLI harness.
On-device speech loop	Discord voice capture → local `whisper-cli` transcription → agent → chunked TTS playback.
Shared voice + text context	Voice turns and `!ask` text commands can reuse the same supported agent session.
Barge-in and sensitivity modes	Interrupt playback naturally and switch between normal and conservative/noisy environments.
Multilingual voice presets	Switch STT, progress language, and TTS voice together with `vc language ko/en/auto`.
Multi-room project isolation	Run one bot per project room with isolated Hermes profiles, sessions, memory, and logs.

Quick Start

Fastest path with npm:

npm install -g verbalcoding
vc setup --yes
vc doctor
vc start

Or run directly without a permanent global install:

npx verbalcoding setup --yes
vc doctor
vc start

GitHub clone path for contributors:

git clone https://github.com/ca1773130n/VerbalCoding.git
cd VerbalCoding
./scripts/install.sh --yes
vc doctor
./run.sh

vc setup --yes bootstraps local prerequisites from the npm package. ./scripts/install.sh --yes does the same for GitHub clone installs. Both cover Node/npm dependencies, ffmpeg, whisper-cli, the default whisper.cpp model, a local .venv-tts Edge TTS helper, and setup wizard configuration where possible. They support macOS/Homebrew plus common Linux package managers (apt, dnf, pacman); rerun with --no-wizard for dependency-only setup or --skip-system if you want to install OS packages yourself.

Need a clean install walkthrough? Start with Fresh Install.

Supported Agent Backends

Backend	Default command	Session support
Hermes Agent	`hermes chat -Q -q`	Resume, verbose progress, cancellation, final-answer recovery
Claude Code	`claude -p`	CLI session file support through adapter defaults
Codex CLI	`codex exec`	CLI session file support through adapter defaults
Gemini CLI	`gemini -p`	CLI session file support through adapter defaults
OpenCode	`opencode run`	CLI session file support through adapter defaults
OpenClaw	`openclaw run`	CLI session file support through adapter defaults
Custom	`AGENT_COMMAND`	Bring your own non-interactive command

Learn More

Guide	What you get
Fresh Install	Clean clone setup, model download, first run
Usage Guide	CLI commands, Discord commands, progress mode, latency metrics
Configuration	`.env`, agent backends, MCP, TTS backends, operational notes
Multi-Instance	One permanent Discord voice room per project
Release Notes	Current capabilities and pre-release checklist

Tiny Command Map

vc status                 # current language, TTS, and bridge settings
vc language ko|en|auto    # switch STT/progress/TTS language preset
vc bot invite CLIENT_ID   # generate the Discord bot invite URL
vc instance setup NAME    # create an isolated project voice bot
vc instance start NAME    # run that bot in the background
vc doctor                 # redacted health check
vc start                  # start the default bridge

In Discord:

Command	What it does
`!join`	Join your current voice channel.
`!ask <prompt>`	Send text to the same agent backend.
`!verbose on\|off`	Show/speak short progress updates.
`!latency`	Summarize recent voice/STT/agent/TTS latency.
`!sensitivity normal`	Use normal indoor barge-in sensitivity.
`!sensitivity conservative`	Use stricter noisy/outdoor sensitivity.
`!session new <name> <workdir> [context] --voice <voice-channel>`	Bind a project session to a voice room.

Requirements

Layer	Default
Runtime	Node.js 20+, npm; install script can install via Homebrew/apt/dnf/pacman
Audio	`ffmpeg`; install script can install it
Speech recognition	Local `whisper-cli` from whisper.cpp; install script uses Homebrew on macOS or local Linux build fallback
TTS	Edge TTS CLI; install script creates `.venv-tts` if needed
Discord	Bot token, Message Content intent, voice permissions
Agent	At least one authenticated CLI harness, Hermes Agent by default
Platform focus	macOS / Apple Silicon most tested; Linux bootstrap is best-effort and documented

Contributing

Run the lightweight checks before sending changes:

node --check app-node/main.mjs
npm test
bash -n run.sh scripts/install.sh
npm pack --dry-run
vc doctor

Status

VerbalCoding is public-release oriented but still early. Demo video/GIF, broader Linux validation, CI, and deeper security review are still TODOs.