JSPM

Found 1794 results for voice activity detection

vad-audio-worklet

Voice activity detection (VAD) AudioWorklet.

  • v0.1.4
  • 250.50
  • Published

audio-sentence-detector

Advanced audio sentence detection using signal processing and voice activity detection

  • v1.0.5
  • 122.10
  • Published

@picovoice/cobra-web

Cobra VAD engine for web browsers (via WebAssembly)

    • v3.0.0
    • 103.63
    • Published

    @discordjs/voice

    Implementation of the Discord Voice API for Node.js

    • v0.19.2
    • 67.50
    • Published

    react-dictate-button

    A button to start dictation using Web Speech API, with an easy to understand event lifecycle.

    • v4.0.1
    • 64.73
    • Published

    @restnpeacepk/worker-vad

    Universal Voice Activity Detection SDK for WebAssembly - supports multiple VAD engines with a unified API

      • v1.0.5
      • 61.42
      • Published

      simple-peer

      Simple one-to-one WebRTC video/voice and data channels

      • v9.11.1
      • 59.83
      • Published

      expo-speech

      Provides text-to-speech functionality.

      • v55.0.13
      • 56.89
      • Published

      auralwise_cli

      CLI for AuralWise audio intelligence API - transcription, speaker diarization, audio event detection

      • v1.0.8
      • 55.28
      • Published

      @vonage/voice

      The Voice API lets you create outbound calls, control in-progress calls and get information about historical calls.

      • v1.21.0
      • 53.45
      • Published

      @mclean-capital/neura

      Neura — CLI for installing and managing the Neura AI assistant core service. Includes text chat and voice listen clients.

      • v3.4.1
      • 53.43
      • Published

      react-voice-visualizer

      React library for audio recording and visualization using Web Audio API

      • v2.1.0
      • 53.09
      • Published

      @voicepilot/sdk

      Official VoicePilot JavaScript SDK — TTS, STT, Agents, and real-time conversations.

      • v0.1.19
      • 52.72
      • Published

      react-native-tts

      React Native Text-To-Speech module for Android and iOS

      • v4.1.1
      • 52.34
      • Published

      react-audio-voice-recorder

      An audio recording helper for React. Provides a component and a hook to help with audio recording.

      • v2.2.0
      • 50.40
      • Published

      @inworld/runtime

      `@inworld/runtime` is a Node.js SDK for building AI applications with LLM inference, graph orchestration, speech pipelines, retrieval, tool use, and telemetry.

      • v1.0.6
      • 49.39
      • Published

      open-agents-ai

      AI coding agent powered by open-source models (Ollama/vLLM) — interactive TUI with agentic tool-calling loop

        • v0.187.400
        • 48.73
        • Published

        vmsg

        Library for creating voice messages

        • v0.4.0
        • 48.61
        • Published

        voice-mcp-server

        An MCP server to allow LLMs to speak and listen via bidirectional voice loops

        • v0.3.1
        • 48.24
        • Published

        @twilio/rtc-diagnostics

        Various diagnostics functions to help analyze connections to Twilio

          • v1.0.1
          • 48.24
          • Published

          messagebird

          A node.js wrapper for the MessageBird REST API

          • v4.0.1
          • 47.56
          • Published

          @4players/odin

          A cross-platform SDK enabling developers to integrate real-time VoIP chat technology into their projects

          • v1.6.2
          • 47.46
          • Published

          africastalking

          Official AfricasTalking node.js API wrapper

          • v0.8.0
          • 46.86
          • Published

          @diegoaltoworks/talker

          Telephony plugin for Chatter — adds voice call and SMS support via Twilio

          • v0.15.0
          • 46.12
          • Published

          @stella_project/stellalib

          StellaLib — A powerful Lavalink v3+v4 client for TypeScript with auto version detection, session persistence, smart autoplay, and graceful shutdown

          • v1.3.0
          • 44.84
          • Published

          edge-tts-universal

          Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

          • v1.4.0
          • 44.24
          • Published

          use-ear

          React hooks for wake word detection using Web Speech API

          • v0.1.3
          • 43.63
          • Published

          pi-smart-voice-notify

          Windows-optimized smart voice, sound, and desktop notifications for Pi coding agent.

          • v0.3.2
          • 43.60
          • Published

          mellon

          Offline, in-browser voice commands powered by EfficientWord-Net (ResNet-50 ArcFace).

            • v0.0.26
            • 43.42
            • Published

            simple-peer-light

            Simple, light-weight WebRTC video/voice and data channels

            • v9.10.0
            • 42.91
            • Published

            @rapidaai/react

            An easy to use react client for building generative ai application using Rapida platform.

            • v1.1.67
            • 42.37
            • Published

            voipi

            <p align="center"> <a href="https://voipi.vercel.app/"><img src="logo.svg" alt="voipi" width="128" height="128"></a> </p>

            • v0.0.10
            • 41.50
            • Published

            aetherlight

            Voice-to-intelligence platform for developers. Voice capture, sprint planning with AI, bug/feature forms, pattern matching to prevent AI hallucinations.

            • v0.18.15
            • 41.38
            • Published

            elevenlabs-node

            This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

            • v2.0.3
            • 41.36
            • Published

            annyang

            A JavaScript library for adding voice commands to your site, using speech recognition

            • v3.0.0
            • 41.25
            • Published

            sendbird-calls

            SendBird Calls JavaScript SDK

            • v1.10.21
            • 40.70
            • Published

            ssml-check-core

            Core library to check for valid SSML

            • v0.3.9
            • 40.64
            • Published

            tnzapi

            Node.js Library for TNZ Group REST API

            • v2.4.2
            • 40.58
            • Published

            @micdrop/client

            🖐️🎤 Micdrop: Real-Time Voice Conversations with AI

            • v2.2.7
            • 40.22
            • Published

            voicesmith-mcp

            Local AI voice for coding assistants — TTS & STT via MCP. Kokoro ONNX + faster-whisper, fully offline.

            • v1.0.19
            • 39.92
            • Published

            voxglide

            Embeddable voice AI SDK for web pages — form filling, navigation, Q&A via speech recognition and server proxy

            • v1.1.2
            • 39.35
            • Published

            voice-stream

            A powerful React hook for real-time voice streaming, designed for AI-powered applications. Perfect for real-time transcription, voice assistants, and audio processing with features like silence detection and configurable audio processing.

            • v1.0.1
            • 39.35
            • Published

            react-speech-to-text-gk

            Advanced React speech-to-text library with real-time audio analysis and comprehensive speech metrics

            • v1.1.9
            • 38.99
            • Published

            telesignsdk

            Official TeleSign SDK for Rest APIs including Messaging (SMS), Intelligence Cloud, PhoneID, Voice, and AppVerify

            • v5.0.0
            • 38.48
            • Published

            holostaff-widget

            Holostaff AI avatar widget — embeddable voice assistant for any webpage

              • v3.0.11
              • 38.45
              • Published

              @andypai/orb

              Voice-driven code explorer for your terminal

              • v0.2.0
              • 38.41
              • Published

              ssml-check

              Check for valid SSML

              • v0.4.6
              • 38.19
              • Published

              libp2p-webrtc-peer

              Simple one-to-one WebRTC video/voice and data channels

              • v10.0.1
              • 38.14
              • Published

              agentvibes

              Now your AI Agents can finally talk back! Professional TTS voice for Claude Code, Claude Desktop (via MCP), and Clawdbot with multi-provider support.

              • v5.2.1
              • 37.76
              • Published

              vent-hq

              Vent CLI — CI/CD for voice AI agents

              • v0.9.22
              • 37.76
              • Published

              react-ai-voice-visualizer

              A collection of React components for building AI voice interfaces with real-time audio visualization

              • v0.1.6
              • 37.43
              • Published

              @cmdotcom/text-sdk

              Package to make it very easy to send text messages with CM.com

              • v2.1.0
              • 37.41
              • Published

              retext-passive

              retext plugin to check for passive voice

              • v5.0.0
              • 37.15
              • Published

              react-siriwave

              React version of siriwave.js

              • v3.1.0
              • 37.04
              • Published

              @chengsokdara/use-whisper

              React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

              • v0.2.0
              • 36.70
              • Published

              @cloudflare/voice

              Voice pipeline for Cloudflare Agents — STT, TTS, VAD, streaming, and SFU utilities

              • v0.1.2
              • 36.45
              • Published

              voice-tool-call

              Voice-to-tool-call browser library: wake word detection, speech-to-text, LLM intent interpretation, tool execution, and text-to-speech

              • v0.2.5
              • 36.08
              • Published

              @umituz/web-cloudflare

              Comprehensive Cloudflare Workers & Pages integration with config-based patterns, middleware, router, workflows, AI (with audio/music generation, TTS, ASR), React hooks, and multi-tenant support

              • v1.7.8
              • 35.41
              • Published

              @termii/node

              Nodejs SDK wrapper for Termii API written with Typescript support

              • v0.3.0
              • 35.39
              • Published

              vosk

              Node binding for continuous offline voice recoginition with Vosk library.

              • v0.3.39
              • 35.36
              • Published

              samvyo-js-sdk

              This is the client js sdk for cutting-edge Samvyo real-time voice/video cloud.

              • v2.0.30
              • 34.25
              • Published

              @clawvoice/clawvoice

              Voice calling plugin for OpenClaw — give your AI agent a phone number

              • v1.0.4
              • 34.09
              • Published

              obi-sdk

              JavaScript SDK for Obi

              • v0.19.77
              • 33.88
              • Published

              @hazeljs/realtime

              Real-time voice AI for HazelJS - OpenAI Realtime API & Gemini Live integration for low-latency speech-to-speech

              • v0.8.2
              • 33.86
              • Published

              @wave-av/sdk

              Official WAVE SDK for TypeScript and Node.js — 34 API modules for live video streaming, production, analytics, voice, captions, and more

              • v2.0.14
              • 33.83
              • Published

              @dev-amirzubair/react-native-voice

              React Native Voice library for iOS and Android - Fork with New Architecture, Bridgeless mode, and React Native 0.76+ support

              • v1.0.4
              • 33.82
              • Published

              bland-cli

              The official Bland AI command-line interface

                • v0.2.29
                • 33.67
                • Published

                expo-speech-transcriber

                An iOS only on-device transcription library for React Native and Expo apps.

                • v0.1.9
                • 33.58
                • Published

                @picovoice/rhino-web

                Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

                  • v4.0.0
                  • 33.47
                  • Published

                  @synervoz/edgespeech

                  React Native library for on-device voice processing with Switchboard SDK

                  • v0.1.0
                  • 33.47
                  • Published

                  @unboundcx/sdk

                  Official JavaScript SDK for the Unbound API - A comprehensive toolkit for integrating with Unbound's communication, AI, and data management services

                  • v4.0.0
                  • 33.46
                  • Published

                  copilot-plus

                  Voice + screenshots + model hotkeys + live agent monitor — drop-in wrapper for GitHub Copilot CLI

                  • v1.0.27
                  • 33.40
                  • Published

                  superturtle

                  Code from anywhere with your voice. Autonomous coding system controlled from Telegram.

                  • v0.2.9
                  • 33.28
                  • Published

                  oneai

                  Make your app understand language. Summarize conversations, categorize articles, and more.

                  • v0.8.4
                  • 33.15
                  • Published

                  openhome-cli

                  CLI for managing OpenHome voice AI abilities

                  • v0.1.40
                  • 32.96
                  • Published

                  @exreve/exk

                  exk - Control Claude CLI with voice and programmable interfaces

                    • v1.0.26
                    • 32.95
                    • Published

                    talking-head-studio

                    Cross-platform 3D avatar component for React Native & web — lip-sync, gestures, accessories, and LLM integration. Powered by TalkingHead + Three.js.

                    • v0.4.11
                    • 32.86
                    • Published

                    @telnyx/react-voice-commons-sdk

                    A high-level, state-agnostic, drop-in module for the Telnyx React Native SDK that simplifies WebRTC voice calling integration

                    • v0.3.1
                    • 32.74
                    • Published

                    artyom.js

                    Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

                    • v1.0.6
                    • 32.36
                    • Published

                    discoclaw

                    Personal AI orchestrator that turns Discord into a persistent workspace

                    • v2.0.0
                    • 32.33
                    • Published

                    @ariontalk/core

                    Headless voice AI engine with page understanding — services, types, and session logic

                    • v0.2.0
                    • 32.18
                    • Published

                    voice-page-agent

                    Voice wake plugin for page-agent with Vue2/Vue3 compatibility.

                      • v3.0.6
                      • 31.95
                      • Published

                      openclaw-mydazy-mcp

                      OpenClaw plugin — connect agents to MyDazy voice devices via MCP relay with TTS push

                      • v0.7.0
                      • 31.80
                      • Published

                      @jambonz/sdk

                      jambonz SDK for building voice applications — optimized for AI agents

                      • v0.3.1
                      • 31.80
                      • Published

                      osborn

                      Voice AI coding assistant - local agent that connects to Osborn frontend

                      • v0.8.18
                      • 31.74
                      • Published

                      @inworld/nodejs-sdk

                      The **Inworld AI Node.js SDK** enables Developers to easily integrate AI characters into your Node.js environment.

                      • v1.17.0
                      • 31.56
                      • Published

                      react-use-audio-recorder

                      React component and hook for audio recording in your React applications

                      • v0.4.2
                      • 31.52
                      • Published

                      typelessform-widget

                      Voice input widget for HTML forms. Users speak once — AI fills all fields at once. Drop-in for React, Vue, Angular, Next.js, WordPress. 25+ languages, 96% accuracy.

                      • v1.0.6
                      • 31.47
                      • Published

                      react-voice-visualizer-react19

                      A React 19 compatible fork of react-voice-visualizer by Yurii Zarytskyi. It's a React library for audio recording and visualization using Web Audio API.

                      • v1.0.0
                      • 31.47
                      • Published

                      opencode-voice2text

                      Streaming Volcengine speech-to-text plugin for the OpenCode TUI

                      • v0.1.17
                      • 31.41
                      • Published

                      cookiy-mcp

                      One-command bootstrap for Cookiy local skills and MCP connections in your AI coding clients

                      • v1.9.1
                      • 31.40
                      • Published

                      @picovoice/koala-web

                      Koala Noise Suppression engine for web browsers (via WebAssembly)

                        • v3.0.0
                        • 31.36
                        • Published

                        parrot-messenger

                        Unified messaging library for Email, SMS, and Voice with multiple provider support

                        • v2.3.2
                        • 31.30
                        • Published

                        @superdots/airtype

                        Speak and it types. Hands-free voice transcription CLI for macOS.

                        • v0.5.5
                        • 31.25
                        • Published

                        primvoices-react

                        React client for the PrimVoices Agents API

                        • v0.2.13
                        • 31.23
                        • Published

                        zyka-sdk

                        Programmatic AI media generation SDK for Zyka

                        • v0.4.4
                        • 31.09
                        • Published

                        @mychatbot/client

                        Voice calling SDK for MyChatBot Sales Platform agents

                        • v0.4.8
                        • 30.72
                        • Published

                        claudetalk-bridge

                        Connect your phone to Claude Code. Voice control Claude Code from anywhere.

                        • v5.0.0
                        • 30.69
                        • Published

                        @onmars/lunar-voice

                        Voice synthesis adapter for Lunar (ElevenLabs TTS)

                        • v0.7.0
                        • 30.65
                        • Published

                        openclaw-command-center

                        超哥办公室 — OpenClaw AI 多部门指挥中心:像素办公室、会议室(跨部门顺序讨论+协商投票+行动项)、信任评分、子代理(sessions_spawn委派)、实时流式响应、定时任务、工作流、公告板、记忆系统、仪表盘、Gmail/Drive/Sheets集成、语音输入、Webhook、PWA、移动端适配、命令面板(Cmd+K)、中英双语 | Pixel-art virtual office with multi-agent chat, meeting room (sequential discuss

                        • v1.8.0
                        • 30.58
                        • Published

                        notification-catcher

                        Web Interface for reading and testing notifications during development

                        • v1.2.1
                        • 30.52
                        • Published

                        spoken-token

                        TOTP but you say it out loud. Derive time-rotating, human-speakable verification tokens from a shared secret.

                        • v2.0.4
                        • 30.33
                        • Published

                        @grunnverk/audio-tools

                        Audio recording tools for voice-driven development workflows

                        • v1.5.13
                        • 30.28
                        • Published

                        univoice

                        Unified Voice SDK for TTS and ASR

                        • v0.10.0
                        • 30.24
                        • Published

                        ambient-alfred

                        OpenClaw plugin for Omi ambient transcript processing — always-on AI listening and command detection

                        • v1.6.0
                        • 30.16
                        • Published

                        @chamade/mcp-server

                        MCP server for Chamade — voice gateway for AI agents. v3 is a thin stdio shim around the hosted HTTP MCP at mcp.chamade.io, so every MCP client (stdio and HTTP) talks to the same hosted surface. Supports Claude Code channel mode for push events.

                        • v3.0.2
                        • 30.14
                        • Published

                        @alan-ai/alan-sdk-web

                        Alan Web SDK: a lightweight JavaScript library for adding a voice experience to your website or web application

                        • v1.8.119
                        • 30.13
                        • Published

                        @sinch/functions-runtime

                        Development runtime for Sinch Functions - serverless voice applications

                        • v0.4.13
                        • 30.11
                        • Published

                        voicerun-react

                        React client for the VoiceRun Agents API

                        • v0.2.0
                        • 30.10
                        • Published

                        @aituber-onair/core

                        Core library for AITuber OnAir providing voice synthesis and chat processing

                          • v0.25.3
                          • 29.94
                          • Published

                          piopiyjs

                          Official PIOPIY WebRTC SDK for high-quality voice communication and telephony integration in the browser.

                          • v0.15.0
                          • 29.90
                          • Published

                          dump-ai

                          Instant thought capture CLI with voice and on-demand AI analysis. Think now, organize later.

                          • v1.5.2
                          • 29.82
                          • Published

                          @grunnverk/commands-audio

                          Audio transcription and voice commands for kodrdriv (transcribe, voice-note)

                          • v1.5.14
                          • 29.67
                          • Published

                          @trtc/calls-uikit-react

                          An Open-source Voice & Video Calling UI Component Based on Tencent Cloud Service.

                          • v4.4.9
                          • 29.58
                          • Published

                          @justwkendpkg/justwkendai-assistance-widget

                          AI-powered chatbot widget for Next.js and React.js — answers site questions, web search fallback, appointment scheduling, navigation, voice support, and tutorials.

                            • v1.1.0
                            • 29.56
                            • Published

                            @framers/agentos-ext-voice-synthesis

                            Voice synthesis and transcription tools for AgentOS via OpenAI, ElevenLabs, Deepgram, and local Ollama/Whisper-compatible runtimes

                            • v2.0.1
                            • 29.56
                            • Published

                            clawtalk

                            Voice calls, SMS, missions, and approvals via ClawTalk — OpenClaw plugin

                            • v0.2.3
                            • 29.56
                            • Published

                            @artale/pi-voice

                            Voice input for Pi. Multi-provider STT with Deepgram streaming, Groq Whisper, OpenAI Whisper. 56+ languages.

                            • v2.0.0
                            • 29.54
                            • Published

                            @360labs/live-transcribe

                            Professional live speech transcription library for TypeScript/JavaScript with multi-provider support

                            • v0.2.2
                            • 29.44
                            • Published

                            vocallabsai-sdk

                            React Native SDK for VocalLabs audio calls with direct WebSocket connection

                            • v1.1.4
                            • 29.41
                            • Published

                            node-red-contrib-tts-ultimate

                            Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Google (without credentials as well), Google TTS, ElevenLabs.io TTS, Voice.ai TTS or your own voice. You can

                            • v3.0.7
                            • 29.40
                            • Published

                            use-audio-capture

                            🎙️ A lightweight React hook for audio recording using native Web APIs (MediaRecorder, getUserMedia). Start, stop, pause, resume audio recordings with customizable callbacks. Perfect for voice notes, interviews, podcasts, and real-time audio processing in

                            • v1.0.1
                            • 29.17
                            • Published

                            discordjs-nextgen-voice

                            Native voice plugin for discordjs-nextgen (without @discordjs/voice)

                            • v0.2.0
                            • 29.07
                            • Published

                            @telitask/mcp-server

                            TeliTask MCP server — manage contacts, tasks, and calls from AI assistants

                            • v0.3.1
                            • 29.05
                            • Published

                            clawvoice

                            Voice calling plugin for OpenClaw — give your AI agent a phone number

                            • v1.1.3
                            • 28.97
                            • Published

                            @theyahia/voximplant-mcp

                            MCP server for Voximplant — cloud telephony, call history, SMS (Russia)

                            • v1.2.3
                            • 28.93
                            • Published

                            responsivevoice

                            npm wrapper for responsivevoice.js obtained from dataplusscience.com

                            • v0.4.1
                            • 28.80
                            • Published

                            dropin-feedback-widget

                            Drop-in feedback widget for React — text + voice recording with Whisper transcription

                            • v0.1.1
                            • 28.79
                            • Published

                            @neosapience/typecast-js

                            The official Node.js library for the Typecast API. Text-to-Speech with AI voices. TypeScript support included.

                            • v0.3.0
                            • 28.77
                            • Published

                            xfyun-sdk

                            科大讯飞语音识别 SDK,支持浏览器中实时语音听写功能

                            • v1.3.3
                            • 28.72
                            • Published

                            animalese-tts

                            Animalese TTS is an Animal Crossing style Voice Synthesis (TTS) engine.

                            • v1.1.2
                            • 28.67
                            • Published

                            @estuary-ai/sdk

                            Web SDK for the Estuary real-time AI conversation platform

                            • v0.2.1
                            • 28.65
                            • Published

                            @codexstar/pi-listen

                            Hold-to-talk voice input for Pi CLI — cloud streaming via Deepgram or fully offline with 19 local models

                            • v5.0.7
                            • 28.64
                            • Published

                            @picovoice/eagle-web

                            Eagle Speaker Recognition engine for web browsers (via WebAssembly)

                              • v3.0.0
                              • 28.61
                              • Published

                              oomi-ai

                              Managed Oomi chat, voice bridge, and XR-first persona scaffolding for OpenClaw

                              • v0.2.50
                              • 28.61
                              • Published

                              agentdial

                              Dial your AI agent into every platform. One identity. Every channel.

                              • v1.2.0
                              • 28.58
                              • Published

                              deepgram

                              NodeJS wrapper for Deepgram

                              • v1.0.3
                              • 28.55
                              • Published

                              cursor-buddy

                              AI-powered cursor companion for web apps

                              • v0.0.10
                              • 28.53
                              • Published

                              @imcooder/opuslib

                              Opus 1.6 audio encoding for React Native and Expo with audio level metering and lifecycle events. Forked from Scdales/opuslib.

                              • v2.4.6
                              • 28.45
                              • Published

                              @picovoice/cheetah-web

                              Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

                                • v4.0.1
                                • 28.36
                                • Published

                                @voxglide/react

                                React wrapper for VoxGlide — loads SDK at runtime from proxy server, zero bundled SDK code

                                • v1.1.2
                                • 28.18
                                • Published

                                @drakulavich/kesha-voice-kit

                                Open-source voice toolkit for Apple Silicon. Speech-to-text, language detection. 25 languages.

                                • v1.0.9
                                • 28.08
                                • Published

                                vosk-koffi

                                Vosk node API based on Koffi.

                                • v1.1.1
                                • 27.87
                                • Published

                                @voice-ai-labs/web-sdk

                                Web SDK for Voice.ai - Easy integration of voice agents into JavaScript applications

                                • v1.0.2
                                • 27.84
                                • Published

                                xphone

                                The WebSocket/WebRTC library by lirax.ua (PBX Cloud Platform)

                                • v1.0.5
                                • 27.84
                                • Published

                                murmuraba

                                Real-time audio noise reduction with advanced chunked processing for web applications

                                • v3.0.3
                                • 27.81
                                • Published

                                pope-test-callkit-vue2

                                An Open-source Voice & Video Calling UI Component Based on Tencent Cloud Service.

                                • v0.0.61
                                • 27.80
                                • Published

                                spoken

                                JavaScript Web API for Text-to-Speech and Speech-to-Text.

                                • v1.1.17
                                • 27.55
                                • Published

                                @sarafhbk/react-audio-recorder

                                This is a simple audio recorder package for react application using the javascript Web Audio API.

                                • v1.1.2
                                • 27.55
                                • Published

                                sarvam-conv-ai-sdk

                                TypeScript SDK for Sarvam Conversational AI

                                  • v0.0.42
                                  • 27.52
                                  • Published

                                  infobip

                                  Infobip Node Client

                                  • v0.1.0
                                  • 27.47
                                  • Published

                                  @sonnetics/js

                                  Wake-word inference for JavaScript and TypeScript. High-level API over @sonnetics/core.

                                  • v0.0.1-beta.2
                                  • 27.43
                                  • Published

                                  heydad

                                  Your terminal has feelings now

                                    • v0.5.0
                                    • 27.41
                                    • Published

                                    stormee-websocket

                                    Framework-agnostic WebSocket library for real-time audio streaming with MessagePack and Opus encoding

                                      • v1.0.5
                                      • 27.29
                                      • Published

                                      voxflow

                                      AI audio content creation CLI — stories, podcasts, narration, dubbing, transcription, translation, and video translation with TTS

                                      • v1.7.1
                                      • 27.23
                                      • Published

                                      capacitor-audio-engine

                                      High-quality audio recording Capacitor plugin with native iOS & Android support. Features pause/resume, microphone management, real-time monitoring, audio trimming, and comprehensive mobile audio recording capabilities.

                                      • v2.0.31
                                      • 27.16
                                      • Published

                                      @bowbee/peer-lite

                                      Lightweight WebRTC browser library that supports video, audio and data channels

                                        • v2.2.0
                                        • 27.10
                                        • Published

                                        africastalking-ts

                                        Unofficial Typescript version of the Africa's Talking SDK

                                        • v0.0.3
                                        • 27.06
                                        • Published

                                        solana-clawd

                                        $CLAWD — Solana x xAI agentic engine powered by Grok. Multi-agent research (16 agents), vision, image gen, voice, function calling, X search, and 31 MCP tools. CLAWD Cloud OS bootstrap for E2B/Docker/any terminal.

                                        • v1.7.0
                                        • 27.02
                                        • Published

                                        voice-router-dev

                                        Universal speech-to-text router for Gladia, AssemblyAI, Deepgram, Azure, OpenAI Whisper, Speechmatics, Soniox, and ElevenLabs

                                        • v0.8.6
                                        • 27.01
                                        • Published

                                        @teamlearners/clawops

                                        Official Node.js/TypeScript SDK for the ClawOps Voice API

                                        • v0.13.1
                                        • 26.93
                                        • Published

                                        @oneshot-agent/sdk

                                        Autonomous Agent SDK for executing real-world commercial transactions with automatic x402 payments

                                        • v0.15.0
                                        • 26.91
                                        • Published

                                        revoice.js

                                        A voice module for Stoat

                                        • v0.2.1695
                                        • 26.67
                                        • Published

                                        ttp-agent-sdk

                                        Comprehensive Voice Agent SDK with Customizable Widget - Real-time audio, WebSocket communication, React components, and extensive customization options

                                        • v2.43.0
                                        • 26.60
                                        • Published

                                        @inworld/tts

                                        Inworld TTS SDK – generate, stream, and voice management

                                        • v1.0.1
                                        • 26.55
                                        • Published

                                        @picovoice/leopard-web

                                        Leopard Speech-to-Text engine for web browsers (via WebAssembly)

                                          • v3.0.0
                                          • 26.52
                                          • Published

                                          opencode-smart-voice-notify

                                          Smart voice notification plugin for OpenCode with multiple TTS engines (ElevenLabs, Edge TTS, Windows SAPI), AI-generated dynamic messages, and intelligent reminder system

                                          • v1.3.3
                                          • 26.52
                                          • Published

                                          opencode-baseline-hooks

                                          Security validation, logging, context monitoring, and Kokoro TTS voice notifications for OpenCode

                                          • v0.9.5
                                          • 26.49
                                          • Published

                                          @mac20777/vibecoding-voice

                                          ESP32 LAN voice coding bridge with inject, Codex, and Claude modes; inject is the recommended default.

                                          • v0.2.3
                                          • 26.49
                                          • Published

                                          koishi-plugin-minimax-vits

                                          自用,使用 minimax 国际版生成语音,不注册chatluna只截取chatluna输出LLM。

                                          • v1.7.4
                                          • 26.49
                                          • Published

                                          @picovoice/orca-web

                                          Orca Text-to-Speech engine for web browsers (via WebAssembly)

                                            • v3.0.0
                                            • 26.45
                                            • Published

                                            expo-realtime-audio

                                            Real-time bidirectional audio streaming for Expo and React Native. Record microphone input and play audio chunks with low latency using native AVAudioEngine.

                                            • v1.0.0
                                            • 26.36
                                            • Published

                                            ssml-builder

                                            This package creates Speech Synthesis Markup Language (SSML) using the builder pattern.

                                            • v0.4.3
                                            • 26.32
                                            • Published

                                            @depup/africastalking

                                            Official AfricasTalking node.js API wrapper (with updated dependencies)

                                            • v0.7.9-depup.5
                                            • 26.30
                                            • Published

                                            @serviceagent/nextjs

                                            Next.js SDK and components for ServiceAgent chat, low-latency voice agents, dialer workflows, booking, webhooks, and server-side API access

                                            • v1.2.0
                                            • 26.23
                                            • Published

                                            agentphone-mcp

                                            MCP server for AgentPhone — give AI agents phone numbers, SMS, and voice calls

                                            • v0.5.0
                                            • 26.21
                                            • Published

                                            @theyahia/mts-exolve-mcp

                                            MCP server for MTS Exolve — SMS, calls, recordings, Viber messaging (Russia)

                                            • v3.0.1
                                            • 26.20
                                            • Published

                                            @trtc/calls-uikit-wx

                                            An Open-source Voice & Video Calling UI Component Based on Tencent Cloud Service.

                                            • v4.2.11
                                            • 26.12
                                            • Published

                                            @serviceagent/sdk

                                            TypeScript and Node.js SDK for ServiceAgent APIs, AI agents, knowledge base search, CRM sync, analytics, billing, and workflow automation

                                            • v1.2.0
                                            • 26.02
                                            • Published