Package Exports
This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (@artale/pi-voice) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.
Readme
pi-voice
Voice input for Pi. Multi-provider STT with Deepgram streaming, Groq Whisper, and OpenAI Whisper. 56+ languages.
Install
pi install npm:@artale/pi-voiceSetup
Set at least one API key:
# Best quality — Deepgram Nova 3 with streaming ($200 free credit)
export DEEPGRAM_API_KEY="your-key"
# Fast and free — Groq Whisper
export GROQ_API_KEY="your-key"
# Reliable — OpenAI Whisper
export OPENAI_API_KEY="your-key"Install an audio capture tool:
brew install sox # macOS/Linux (recommended)
brew install ffmpeg # alternative
# arecord is pre-installed on most Linux distrosVerify: /voice test
Commands
/voice [seconds] — record and transcribe (default 10s)
/voice dictate — continuous dictation (30s)
/voice test — check audio + provider setup
/voice config [prov] — set provider (auto/deepgram/groq/openai)
/voice lang [code] — set language (en, es, fr, de, ja, zh, ko, ...)
/voice history — recent transcriptions
/voice stop — stop active recordingProvider fallback chain
- Deepgram Nova 3 — streaming via WebSocket, best quality, 56+ languages
- Groq Whisper Large v3 Turbo — fast, free tier
- OpenAI Whisper-1 — reliable, paid
Auto-detection: uses the first available provider. Override with /voice config deepgram.
56+ Languages
/voice lang es — Spanish
/voice lang ja — Japanese
/voice lang zh — Chinese (auto-switches to Nova 2)
/voice lang de — German
/voice lang fr — FrenchFull list: /voice lang
What's new in v2.0
- Deepgram streaming via WebSocket + REST fallback
- Multi-provider fallback chain (Deepgram → Groq → OpenAI)
- 56+ languages with auto model selection
- Continuous dictation mode (
/voice dictate) - Provider diagnostics (
/voice test) - Better Windows support (ffmpeg dshow)
- History with provider and language tracking
License
MIT