  • License MIT

Discord voice bridge for CLI coding agents.

    VerbalCoding

    Talk to your CLI coding agents through Discord voice — like a phone call for software work.

    [Figure: VerbalCoding voice-to-agent flow]

    Why

    VerbalCoding turns a Discord voice channel into a hands-free control surface for coding agents. Speak a request, let your CLI agent work, and hear a concise answer back — with text transcripts, progress events, and guardrails for noisy code/log output.

    Highlights

    What you get                    Why it feels good
    Voice-first agent control       Talk to Hermes Agent, Claude Code, Codex, Gemini CLI, OpenCode, OpenClaw, or any custom CLI harness.
    On-device speech loop           Discord voice capture → local whisper-cli transcription → agent → chunked TTS playback.
    Shared voice + text context     Voice turns and !ask text commands can reuse the same supported agent session.
    Barge-in and sensitivity modes  Interrupt playback naturally and switch between normal and conservative/noisy environments.
    Multilingual voice presets      Switch STT, progress language, and TTS voice together with vc language ko/en/auto.
    Multi-room project isolation    Run one bot per project room with isolated Hermes profiles, sessions, memory, and logs.
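
    The last stage of the speech loop above, chunked TTS playback, can be illustrated with a small sketch. This is not VerbalCoding's actual implementation: the function name chunkForSpeech, the 200-character default, and the sentence-splitting rule are all assumptions chosen for illustration. The idea is to group whole sentences into short chunks so playback can start early and be interrupted between chunks.

```javascript
// Hypothetical helper (not from the VerbalCoding source): group whole
// sentences into chunks of at most maxLen characters for TTS playback.
function chunkForSpeech(text, maxLen = 200) {
  // Split after sentence-ending punctuation that is followed by whitespace.
  const sentences = text
    .split(/(?<=[.!?])\s+/)
    .map((s) => s.trim())
    .filter(Boolean);

  const chunks = [];
  let current = "";
  for (const sentence of sentences) {
    const candidate = current ? `${current} ${sentence}` : sentence;
    if (candidate.length <= maxLen) {
      current = candidate; // still fits, keep accumulating
    } else {
      if (current) chunks.push(current);
      current = sentence; // a single over-long sentence is kept whole
    }
  }
  if (current) chunks.push(current);
  return chunks;
}
```

    A smaller maxLen gives more frequent interruption points at the cost of more TTS requests; the right value depends on the TTS backend and is not specified here.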

    Quick Start

    Fastest path with npm:

    npm install -g verbalcoding
    vc setup --yes
    vc doctor
    vc start

    Or run directly without a permanent global install:

    npx verbalcoding setup --yes
    npx verbalcoding doctor
    npx verbalcoding start

    GitHub clone path for contributors:

    git clone https://github.com/ca1773130n/VerbalCoding.git
    cd VerbalCoding
    ./scripts/install.sh --yes
    vc doctor
    ./run.sh

    vc setup --yes bootstraps local prerequisites from the npm package. ./scripts/install.sh --yes does the same for GitHub clone installs. Both cover Node/npm dependencies, ffmpeg, whisper-cli, the default whisper.cpp model, a local .venv-tts Edge TTS helper, and setup wizard configuration where possible. They support macOS/Homebrew plus common Linux package managers (apt, dnf, pacman); rerun with --no-wizard for dependency-only setup or --skip-system if you want to install OS packages yourself.

    Need a clean install walkthrough? Start with Fresh Install.

    Supported Agent Backends

    Backend       Default command    Session support
    Hermes Agent  hermes chat -Q -q  Resume, verbose progress, cancellation, final-answer recovery
    Claude Code   claude -p          CLI session file support through adapter defaults
    Codex CLI     codex exec         CLI session file support through adapter defaults
    Gemini CLI    gemini -p          CLI session file support through adapter defaults
    OpenCode      opencode run       CLI session file support through adapter defaults
    OpenClaw      openclaw run       CLI session file support through adapter defaults
    Custom        AGENT_COMMAND      Bring your own non-interactive command
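
    To make the backend table concrete, here is a minimal sketch of how a bridge could map a backend name to its default command. The command strings are copied from the table; the backend keys (hermes, claude, and so on), the resolveAgentCommand function, and the error handling are hypothetical and may not match how VerbalCoding selects a backend internally.

```javascript
// Default commands copied from the table above. The keys are hypothetical
// identifiers invented for this sketch.
const DEFAULT_COMMANDS = {
  hermes: "hermes chat -Q -q",
  claude: "claude -p",
  codex: "codex exec",
  gemini: "gemini -p",
  opencode: "opencode run",
  openclaw: "openclaw run",
};

// Hypothetical resolver: the "custom" backend reads AGENT_COMMAND from the
// given environment object; every other backend uses its table default.
function resolveAgentCommand(backend, env = {}) {
  if (backend === "custom") {
    const custom = (env.AGENT_COMMAND || "").trim();
    if (!custom) {
      throw new Error("AGENT_COMMAND must be set for the custom backend");
    }
    return custom;
  }
  if (!Object.hasOwn(DEFAULT_COMMANDS, backend)) {
    throw new Error(`Unknown backend: ${backend}`);
  }
  return DEFAULT_COMMANDS[backend];
}
```

    For example, resolveAgentCommand("custom", { AGENT_COMMAND: "my-agent --batch" }) returns the custom command unchanged, where my-agent is a placeholder for any non-interactive CLI.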

    Learn More

    Guide           What you get
    Fresh Install   Clean clone setup, model download, first run
    Usage Guide     CLI commands, Discord commands, progress mode, latency metrics
    Configuration   .env, agent backends, MCP, TTS backends, operational notes
    Multi-Instance  One permanent Discord voice room per project
    Release Notes   Current capabilities and pre-release checklist

    Tiny Command Map

    vc status                 # current language, TTS, and bridge settings
    vc language ko|en|auto    # switch STT/progress/TTS language preset
    vc bot invite CLIENT_ID   # generate the Discord bot invite URL
    vc instance setup NAME    # create an isolated project voice bot
    vc instance start NAME    # run that bot in the background
    vc doctor                 # redacted health check
    vc start                  # start the default bridge

    In Discord:

    Command                                                          What it does
    !join                                                            Join your current voice channel.
    !ask <prompt>                                                    Send text to the same agent backend.
    !verbose on|off                                                  Show/speak short progress updates.
    !latency                                                         Summarize recent voice/STT/agent/TTS latency.
    !sensitivity normal                                              Use normal indoor barge-in sensitivity.
    !sensitivity conservative                                        Use stricter noisy/outdoor sensitivity.
    !session new <name> <workdir> [context] --voice <voice-channel>  Bind a project session to a voice room.
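
    The !session new syntax has the most moving parts of the Discord commands, so a small parsing sketch may help when reading it. The function parseSessionNew and the shape of its return value are assumptions for illustration, not the bot's real parser. It also assumes the name and workdir contain no spaces, that all words between workdir and --voice form the optional context, and that everything after --voice is the voice channel name.

```javascript
// Hypothetical parser for:
//   !session new <name> <workdir> [context] --voice <voice-channel>
// Returns null when the message does not match that shape.
function parseSessionNew(message) {
  const tokens = message.trim().split(/\s+/);
  if (tokens[0] !== "!session" || tokens[1] !== "new") return null;

  // --voice is required and must be followed by a channel name.
  const voiceIndex = tokens.indexOf("--voice");
  if (voiceIndex === -1 || voiceIndex === tokens.length - 1) return null;

  // name and workdir are required positional arguments.
  const positional = tokens.slice(2, voiceIndex);
  if (positional.length < 2) return null;

  const [name, workdir, ...contextWords] = positional;
  return {
    name,
    workdir,
    context: contextWords.join(" ") || null, // optional free-text context
    voiceChannel: tokens.slice(voiceIndex + 1).join(" "),
  };
}
```

    A working directory containing spaces would need quoting support, which this sketch deliberately leaves out.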

    Requirements

    Layer               Default
    Runtime             Node.js 20+, npm; install script can install via Homebrew/apt/dnf/pacman
    Audio               ffmpeg; install script can install it
    Speech recognition  Local whisper-cli from whisper.cpp; install script uses Homebrew on macOS or local Linux build fallback
    TTS                 Edge TTS CLI; install script creates .venv-tts if needed
    Discord             Bot token, Message Content intent, voice permissions
    Agent               At least one authenticated CLI harness, Hermes Agent by default
    Platform focus      macOS / Apple Silicon most tested; Linux bootstrap is best-effort and documented
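
    The Node.js 20+ requirement in the table can be checked from a script before starting the bridge. The sketch below is an illustration only: meetsNodeRequirement is a hypothetical helper, and vc doctor remains the supported health check. It relies on process.versions.node, which Node.js exposes as a version string without a leading "v".

```javascript
// Hypothetical pre-flight helper: true when the given Node.js version string
// has a major version of at least minMajor (20 per the requirements table).
function meetsNodeRequirement(version = process.versions.node, minMajor = 20) {
  // Accept both "20.11.0" and "v20.11.0".
  const majorText = String(version).replace(/^v/, "").split(".")[0];
  const major = Number.parseInt(majorText, 10);
  return Number.isInteger(major) && major >= minMajor;
}

if (!meetsNodeRequirement()) {
  console.warn(`Node.js 20+ is required, found ${process.versions.node}`);
}
```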

    Contributing

    Run the lightweight checks before sending changes:

    node --check app-node/main.mjs
    npm test
    bash -n run.sh scripts/install.sh
    npm pack --dry-run
    vc doctor

    Status

    VerbalCoding is public-release oriented but still early. Demo video/GIF, broader Linux validation, CI, and deeper security review are still TODOs.