Found 1794 results for voice activity detection

@mjyc/voice-activity-detection

Mic input activity detection

vad-audio-worklet

Voice activity detection (VAD) AudioWorklet.

@ariontalk/plugin-silero-vad

Silero VAD barge-in plugin for ArionTalk — AI-powered voice activity detection

audio-sentence-detector

Advanced audio sentence detection using signal processing and voice activity detection

@picovoice/cobra-web

Cobra VAD engine for web browsers (via WebAssembly)

@discordjs/voice

Implementation of the Discord Voice API for Node.js

@soniox/speech-to-text-web

Javascript client library for Soniox Speech-to-Text websocket API

react-dictate-button

A button to start dictation using Web Speech API, with an easy to understand event lifecycle.

@restnpeacepk/worker-vad

Universal Voice Activity Detection SDK for WebAssembly - supports multiple VAD engines with a unified API

simple-peer

Simple one-to-one WebRTC video/voice and data channels

auralwise_cli

CLI for AuralWise audio intelligence API - transcription, speaker diarization, audio event detection

@vonage/voice

The Voice API lets you create outbound calls, control in-progress calls and get information about historical calls.

@mclean-capital/neura

Neura — CLI for installing and managing the Neura AI assistant core service. Includes text chat and voice listen clients.

react-voice-visualizer

React library for audio recording and visualization using Web Audio API

@voicepilot/sdk

Official VoicePilot JavaScript SDK — TTS, STT, Agents, and real-time conversations.

react-native-tts

React Native Text-To-Speech module for Android and iOS

react-audio-voice-recorder

An audio recording helper for React. Provides a component and a hook to help with audio recording.

@inworld/runtime

`@inworld/runtime` is a Node.js SDK for building AI applications with LLM inference, graph orchestration, speech pipelines, retrieval, tool use, and telemetry.

open-agents-ai

AI coding agent powered by open-source models (Ollama/vLLM) — interactive TUI with agentic tool-calling loop

voice-mcp-server

An MCP server to allow LLMs to speak and listen via bidirectional voice loops

@twilio/rtc-diagnostics

Various diagnostics functions to help analyze connections to Twilio

messagebird

A node.js wrapper for the MessageBird REST API

@4players/odin

A cross-platform SDK enabling developers to integrate real-time VoIP chat technology into their projects

africastalking

Official AfricasTalking node.js API wrapper

@diegoaltoworks/talker

Telephony plugin for Chatter — adds voice call and SMS support via Twilio

@thaunknown/simple-peer

Simple one-to-one WebRTC video/voice and data channels

@telnyx/react-native-voice-sdk

Telnyx React Native Voice SDK - A complete WebRTC voice calling solution

@stella_project/stellalib

StellaLib — A powerful Lavalink v3+v4 client for TypeScript with auto version detection, session persistence, smart autoplay, and graceful shutdown

@cometchat/chat-uikit-react

Ready-to-use Chat UI Components for React(Javascript/Web)

edge-tts-universal

Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

@elevenlabs/react-native

ElevenLabs React Native SDK for the Agents Platform

use-ear

React hooks for wake word detection using Web Speech API

pi-smart-voice-notify

Windows-optimized smart voice, sound, and desktop notifications for Pi coding agent.

mellon

Offline, in-browser voice commands powered by EfficientWord-Net (ResNet-50 ArcFace).

react-native-live-audio-stream

Get live audio stream data for React Native

simple-peer-light

Simple, light-weight WebRTC video/voice and data channels

@rapidaai/react

An easy to use react client for building generative ai application using Rapida platform.

@picovoice/porcupine-web

Porcupine wake word engine for web browsers (via WebAssembly)

voipi

aetherlight

Voice-to-intelligence platform for developers. Voice capture, sprint planning with AI, bug/feature forms, pattern matching to prevent AI hallucinations.

elevenlabs-node

This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

annyang

A JavaScript library for adding voice commands to your site, using speech recognition

@capgo/capacitor-speech-recognition

Capacitor plugin for comprehensive on-device speech recognition with live partial results.

@capgo/capacitor-twilio-voice

Integrates the Twilio Voice SDK into Capacitor

@micdrop/client

🖐️🎤 Micdrop: Real-Time Voice Conversations with AI

voicesmith-mcp

Local AI voice for coding assistants — TTS & STT via MCP. Kokoro ONNX + faster-whisper, fully offline.

voxglide

Embeddable voice AI SDK for web pages — form filling, navigation, Q&A via speech recognition and server proxy

A powerful React hook for real-time voice streaming, designed for AI-powered applications. Perfect for real-time transcription, voice assistants, and audio processing with features like silence detection and configurable audio processing.

@picovoice/porcupine-node

Picovoice Porcupine Node.js binding

@mhpdev/react-native-speech

A high-performance React Native library for text-to-speech on iOS and Android

react-speech-to-text-gk

Advanced React speech-to-text library with real-time audio analysis and comprehensive speech metrics

@aituber-onair/voice

Voice synthesis library for AITuber OnAir

telesignsdk

Official TeleSign SDK for Rest APIs including Messaging (SMS), Intelligence Cloud, PhoneID, Voice, and AppVerify

holostaff-widget

Holostaff AI avatar widget — embeddable voice assistant for any webpage

@andypai/orb

Voice-driven code explorer for your terminal

@inworld/web-core

ssml-check

Check for valid SSML

libp2p-webrtc-peer

Simple one-to-one WebRTC video/voice and data channels

agentvibes

Now your AI Agents can finally talk back! Professional TTS voice for Claude Code, Claude Desktop (via MCP), and Clawdbot with multi-provider support.

vent-hq

Vent CLI — CI/CD for voice AI agents

react-ai-voice-visualizer

A collection of React components for building AI voice interfaces with real-time audio visualization

@cmdotcom/text-sdk

Package to make it very easy to send text messages with CM.com

@steelbrain/media-speech-detection-web

Production-ready speech detection using Silero VAD ONNX model for web browsers

retext-passive

retext plugin to check for passive voice

audio-react-recorder

Audio / Voice Recorder for React

react-siriwave

React version of siriwave.js

react-audio-player-component

A mobile-friendly audio player for React with a modern look and convenient usage.

@chengsokdara/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

@cloudflare/voice

Voice pipeline for Cloudflare Agents — STT, TTS, VAD, streaming, and SFU utilities

@fugood/react-native-audio-pcm-stream

Get audio PCM stream data for React Native

voice-tool-call

Voice-to-tool-call browser library: wake word detection, speech-to-text, LLM intent interpretation, tool execution, and text-to-speech

@contractspec/lib.voice

Voice capabilities: TTS, STT, and conversational AI

@umituz/web-cloudflare

Comprehensive Cloudflare Workers & Pages integration with config-based patterns, middleware, router, workflows, AI (with audio/music generation, TTS, ASR), React hooks, and multi-tenant support

@termii/node

Nodejs SDK wrapper for Termii API written with Typescript support

@speechly/browser-client

JavaScript client for Speechly Streaming API

vosk

Node binding for continuous offline voice recoginition with Vosk library.

samvyo-js-sdk

This is the client js sdk for cutting-edge Samvyo real-time voice/video cloud.

@clawvoice/clawvoice

Voice calling plugin for OpenClaw — give your AI agent a phone number

@confiture-ai/gradium-sdk-js

Unofficial TypeScript SDK for the Gradium API

@independo/capacitor-voice-recorder

Capacitor plugin for voice recording

obi-sdk

JavaScript SDK for Obi

@picovoice/porcupine-react

React component for Porcupine Web SDK

@hazeljs/realtime

Real-time voice AI for HazelJS - OpenAI Realtime API & Gemini Live integration for low-latency speech-to-speech

@wave-av/sdk

Official WAVE SDK for TypeScript and Node.js — 34 API modules for live video streaming, production, analytics, voice, captions, and more

@dev-amirzubair/react-native-voice

React Native Voice library for iOS and Android - Fork with New Architecture, Bridgeless mode, and React Native 0.76+ support

@workadventure/simple-peer

Simple one-to-one WebRTC video/voice and data channels

bland-cli

The official Bland AI command-line interface

expo-speech-transcriber

An iOS only on-device transcription library for React Native and Expo apps.

@picovoice/rhino-web

Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

@synervoz/edgespeech

React Native library for on-device voice processing with Switchboard SDK

@unboundcx/sdk

Official JavaScript SDK for the Unbound API - A comprehensive toolkit for integrating with Unbound's communication, AI, and data management services

@stimulus-components/speech-recognition

A Stimulus controller that uses the Web Speech API to capture speech and fill an input or element.

copilot-plus

Voice + screenshots + model hotkeys + live agent monitor — drop-in wrapper for GitHub Copilot CLI

@ai-coustics/aic-sdk

Node.js package of ai-coustics SDK

superturtle

Code from anywhere with your voice. Autonomous coding system controlled from Telegram.

speech-to-text

A speech to text module.

oneai

Make your app understand language. Summarize conversations, categorize articles, and more.

openhome-cli

CLI for managing OpenHome voice AI abilities

@exreve/exk

exk - Control Claude CLI with voice and programmable interfaces

talking-head-studio

Cross-platform 3D avatar component for React Native & web — lip-sync, gestures, accessories, and LLM integration. Powered by TalkingHead + Three.js.

@telnyx/react-voice-commons-sdk

A high-level, state-agnostic, drop-in module for the Telnyx React Native SDK that simplifies WebRTC voice calling integration

@docucare/react-voice-visualizer

React library for audio recording and visualization using Web Audio API

@react-native-community/voice

React Native Native Voice library for iOS and Android

artyom.js

Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

@cometchat/chat-uikit-vue

Ready-to-use Chat UI Components for Vue(Javascript/Web)

discoclaw

Personal AI orchestrator that turns Discord into a persistent workspace

@ariontalk/core

Headless voice AI engine with page understanding — services, types, and session logic

@speechly/speech-recognition-polyfill

Polyfill for the Speech Recognition API using Speechly

react-native-jitsi-meet

Jitsi Meet SDK wrapper for React Native.

voice-page-agent

Voice wake plugin for page-agent with Vue2/Vue3 compatibility.

@onereach/step-voice

Onereach.ai Voice Steps

openclaw-mydazy-mcp

OpenClaw plugin — connect agents to MyDazy voice devices via MCP relay with TTS push

@jambonz/sdk

jambonz SDK for building voice applications — optimized for AI agents

osborn

Voice AI coding assistant - local agent that connects to Osborn frontend

@inworld/nodejs-sdk

The **Inworld AI Node.js SDK** enables Developers to easily integrate AI characters into your Node.js environment.

react-use-audio-recorder

React component and hook for audio recording in your React applications

typelessform-widget

Voice input widget for HTML forms. Users speak once — AI fills all fields at once. Drop-in for React, Vue, Angular, Next.js, WordPress. 25+ languages, 96% accuracy.

react-voice-visualizer-react19

A React 19 compatible fork of react-voice-visualizer by Yurii Zarytskyi. It's a React library for audio recording and visualization using Web Audio API.

opencode-voice2text

Streaming Volcengine speech-to-text plugin for the OpenCode TUI

cookiy-mcp

One-command bootstrap for Cookiy local skills and MCP connections in your AI coding clients

@picovoice/koala-web

Koala Noise Suppression engine for web browsers (via WebAssembly)

parrot-messenger

Unified messaging library for Email, SMS, and Voice with multiple provider support

@superdots/airtype

Speak and it types. Hands-free voice transcription CLI for macOS.

primvoices-react

React client for the PrimVoices Agents API

zyka-sdk

Programmatic AI media generation SDK for Zyka

yet-another-react-native-voice

React Native Native Voice library for iOS and Android

@mychatbot/client

Voice calling SDK for MyChatBot Sales Platform agents

claudetalk-bridge

Connect your phone to Claude Code. Voice control Claude Code from anywhere.

@onmars/lunar-voice

Voice synthesis adapter for Lunar (ElevenLabs TTS)

@speechmatics/flow-client

Javascript client for the Speechmatics Flow API

openclaw-command-center

超哥办公室 — OpenClaw AI 多部门指挥中心：像素办公室、会议室(跨部门顺序讨论+协商投票+行动项)、信任评分、子代理(sessions_spawn委派)、实时流式响应、定时任务、工作流、公告板、记忆系统、仪表盘、Gmail/Drive/Sheets集成、语音输入、Webhook、PWA、移动端适配、命令面板(Cmd+K)、中英双语 | Pixel-art virtual office with multi-agent chat, meeting room (sequential discuss

notification-catcher

Web Interface for reading and testing notifications during development

spoken-token

TOTP but you say it out loud. Derive time-rotating, human-speakable verification tokens from a shared secret.

@grunnverk/audio-tools

Audio recording tools for voice-driven development workflows

univoice

Unified Voice SDK for TTS and ASR

ambient-alfred

OpenClaw plugin for Omi ambient transcript processing — always-on AI listening and command detection

@chamade/mcp-server

MCP server for Chamade — voice gateway for AI agents. v3 is a thin stdio shim around the hosted HTTP MCP at mcp.chamade.io, so every MCP client (stdio and HTTP) talks to the same hosted surface. Supports Claude Code channel mode for push events.

@alan-ai/alan-sdk-web

Alan Web SDK: a lightweight JavaScript library for adding a voice experience to your website or web application

@sinch/functions-runtime

Development runtime for Sinch Functions - serverless voice applications

voicerun-react

React client for the VoiceRun Agents API

@speechly/react-client

React client for Speechly Streaming API

@aituber-onair/core

Core library for AITuber OnAir providing voice synthesis and chat processing

piopiyjs

Official PIOPIY WebRTC SDK for high-quality voice communication and telephony integration in the browser.

dump-ai

Instant thought capture CLI with voice and on-demand AI analysis. Think now, organize later.

@grunnverk/commands-audio

Audio transcription and voice commands for kodrdriv (transcribe, voice-note)

@dank074/discord-video-stream

Experiment for making video streaming work for discord selfbots

@trtc/calls-uikit-react

An Open-source Voice & Video Calling UI Component Based on Tencent Cloud Service.

@justwkendpkg/justwkendai-assistance-widget

AI-powered chatbot widget for Next.js and React.js — answers site questions, web search fallback, appointment scheduling, navigation, voice support, and tutorials.

@framers/agentos-ext-voice-synthesis

Voice synthesis and transcription tools for AgentOS via OpenAI, ElevenLabs, Deepgram, and local Ollama/Whisper-compatible runtimes

clawtalk

Voice calls, SMS, missions, and approvals via ClawTalk — OpenClaw plugin

@artale/pi-voice

Voice input for Pi. Multi-provider STT with Deepgram streaming, Groq Whisper, OpenAI Whisper. 56+ languages.

@caspingus/ssml-check

Check for valid SSML

@360labs/live-transcribe

Professional live speech transcription library for TypeScript/JavaScript with multi-provider support

vocallabsai-sdk

React Native SDK for VocalLabs audio calls with direct WebSocket connection

node-red-contrib-tts-ultimate

Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Google (without credentials as well), Google TTS, ElevenLabs.io TTS, Voice.ai TTS or your own voice. You can

@shenmue_dsc/shenmue.js-stream

@4players/odin-tokens

A lightweight token generator for 4Players ODIN

use-audio-capture

🎙️ A lightweight React hook for audio recording using native Web APIs (MediaRecorder, getUserMedia). Start, stop, pause, resume audio recordings with customizable callbacks. Perfect for voice notes, interviews, podcasts, and real-time audio processing in