JSPM

Found 735 results for speech recognition

annyang

A javascript library for adding voice commands to your site, using speech recognition

  • v2.6.1
  • 215.61
  • Published

sonix-speech-recognition

A library that produces audio transcriptions and translations using the Sonix.AI service.

  • v2.1.1
  • 177.29
  • Published

echogarden

An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

  • v2.10.0
  • 176.73
  • Published

speechkitt

A flexible GUI for interacting with Speech Recognition

  • v1.0.0
  • 139.51
  • Published

@picovoice/eagle-web

Eagle Speaker Recognition engine for web browsers (via WebAssembly)

    • v1.0.0
    • 133.49
    • Published

    @sign-speak/react-sdk

    Unlock Sign Language Recognition, Avatar, and Speech Recognition.

    • v0.7.3
    • 125.80
    • Published

    spremic

    A simple JavaScript speech recognition library.

    • v0.0.48
    • 115.50
    • Published

    sherpa-ncnn

    Real-time speech recognition with Next-gen Kaldi

    • v2.1.12
    • 115.25
    • Published

    react-native-voicebox-speech-rec

    A powerful speech recognition library for React Native applications, enabling real-time speech-to-text transcription.

    • v1.0.4
    • 94.39
    • Published

    electron-speech

    speech recognition cli and api for node using electron

    • v1.0.7
    • 88.69
    • Published

    react-voice-search

    React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.

    • v1.1.1
    • 82.27
    • Published

    sonus

    Open source cross platform decentralized always-on speech recognition framework

    • v1.0.3
    • 81.41
    • Published

    vosk

    Node binding for continuous offline voice recoginition with Vosk library.

    • v0.3.39
    • 80.49
    • Published

    vosk-koffi

    Vosk node API based on Koffi.

    • v1.1.1
    • 79.69
    • Published

    @mastashake08/speech-kit

    Package for simplifying the Speech Recognition and Speech Utterence process.

    • v2.0.8
    • 77.42
    • Published

    voice-speech-recognition

    Simple wrapper extended functionalities of Speech Recognition embedded in browsers.

    • v1.1.2
    • 75.00
    • Published

    speech-js

    lib for recognition and synthesis of speech

    • v0.1.1
    • 73.36
    • Published

    artyom.js

    Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

    • v1.0.6
    • 72.25
    • Published

    @aurally/fancy-search

    A lib to improve your apps search functionality by adding the opt-outable speech recognition, multiple parallel searches and automatic result matching

    • v1.0.9
    • 70.64
    • Published

    whisper-onnx-speech-to-text

    Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.

    • v1.0.1
    • 68.56
    • Published

    @ng-web-apis/speech

    A library for using Web Speech API with Angular

    • v4.12.0
    • 67.31
    • Published

    cordova-plugin-speech

    This is cordova plugin for Speech Recognition and Text to Speech.

    • v0.0.4
    • 64.96
    • Published

    speech-recognition-react

    A react library that encapsulates the native browser speech recognition api

    • v2.0.0
    • 64.19
    • Published

    @deepgram/sdk

    Isomorphic Javascript client for Deepgram

    • v4.11.2
    • 62.54
    • Published

    @deepgram/captions

    Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

    • v1.2.0
    • 60.96
    • Published

    speech-to-element

    Add real-time speech to text functionality into your website with no effort

    • v1.0.4
    • 60.14
    • Published

    corti

    Replace window.SpeechRecognition with a mock object and automate your tests

    • v1.0.0
    • 59.47
    • Published

    ng-speech-recognition

    AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.

    • v2.0.1
    • 57.81
    • Published

    parakeet.js

    NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

    • v0.0.3
    • 57.65
    • Published

    revai-node-sdk

    Rev AI makes speech applications easy to build!

    • v3.8.5
    • 56.34
    • Published

    expo-speech

    Provides text-to-speech functionality.

    • v13.1.7
    • 56.31
    • Published

    electron-vosk-speech

    Lightweight Speech Recognition Library for Electron. Based on [nodejs-speech-kiosk-usercase](https://www.npmjs.com/package/nodejs-speech-kiosk-usercase) and [vosk-api](https://github.com/alphacep/vosk-api).

    • v0.2.1
    • 54.49
    • Published

    react-native-tts

    React Native Text-To-Speech module for Android and iOS

    • v4.1.1
    • 53.95
    • Published

    speech-recog-stream

    A module to stream audio to a speech recognition server and get back the STT result"

    • v1.0.8
    • 52.74
    • Published

    react-speech

    React component for the web speech synthesis api

    • v1.0.2
    • 52.66
    • Published

    sherpa-onnx-node

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 48.05
    • Published

    yanyu

    A Chinese speech synthesis and recognition library toolkit

    • v0.1.4
    • 48.04
    • Published

    fft-js

    Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.

    • v0.0.12
    • 47.64
    • Published

    en-pos

    A better English POS tagger written in JavaScript

    • v1.0.16
    • 47.36
    • Published

    speech-into-text

    SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

    • v4.0.2
    • 46.87
    • Published

    elevenlabs-node

    This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

    • v2.0.3
    • 44.94
    • Published

    mic-to-speech

    Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.

    • v1.0.1
    • 44.43
    • Published

    react-text-to-speech

    An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.

    • v2.1.2
    • 44.05
    • Published

    speechflow

    Speech Processing Flow Graph

    • v1.5.0
    • 43.74
    • Published

    sherpa-onnx-linux-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 43.72
    • Published

    sherpa-onnx

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 43.42
    • Published

    retext-pos

    retext plugin to add part-of-speech (POS) tags

    • v5.0.0
    • 43.36
    • Published

    ssml-check-core

    Core library to check for valid SSML

    • v0.3.9
    • 43.22
    • Published

    @chengsokdara/use-whisper

    React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

    • v0.2.0
    • 41.34
    • Published

    @wdragon/react-native-voice

    React Native Native Voice library for iOS and Android, folk from @react-native-voice/voice

    • v3.3.11
    • 41.30
    • Published

    sherpa-onnx-darwin-arm64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 41.18
    • Published

    web-wake-word

    A web package for keyword detection

      • v2.0.10
      • 39.94
      • Published

      babbler

      Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.

      • v1.0.0
      • 39.69
      • Published

      sam-js

      SAM - The Software Automatic Mouth

      • v0.3.1
      • 39.49
      • Published

      houndify-react-native

      Allows react-native apps to connect to Houndify for speech recognition.

        • v0.2.0
        • 39.23
        • Published

        koi-koi

        Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

        • v0.1.0
        • 39.17
        • Published

        ssml-check

        Check for valid SSML

        • v0.4.6
        • 39.05
        • Published

        sherpa-onnx-win-x64

        Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

        • v1.12.10
        • 38.90
        • Published

        sherpa-onnx-win-ia32

        Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

        • v1.12.10
        • 38.49
        • Published

        vocalize.ts

        A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.

        • v1.2.2
        • 38.40
        • Published

        aws-transcribe

        A client for Amazon Transcribe using the websocket interface

        • v1.1.1
        • 38.13
        • Published

        soundswallower

        An even smaller speech recognizer

        • v0.6.3
        • 36.80
        • Published

        espeak-ng

        eSpeak-NG speech synthesizer, compiled to JavasScript + WASM

        • v1.0.2
        • 36.61
        • Published

        @picovoice/cobra-web

        Cobra VAD engine for web browsers (via WebAssembly)

          • v2.0.3
          • 36.03
          • Published

          @picovoice/cheetah-web

          Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

            • v2.3.0
            • 35.21
            • Published

            @aurally/speech-control

            A class to handle microphone permissions, start and observe speech input

            • v1.1.2
            • 34.46
            • Published

            audio-to-text-node

            Backend audio file to text transcription using Web Speech API with Puppeteer

            • v0.1.2
            • 34.30
            • Published

            sherpa-onnx-linux-arm64

            Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

            • v1.12.10
            • 33.50
            • Published

            piper-announce

            AI-powered announcement generator using Piper TTS and OpenAI GPT models

            • v1.2.10
            • 32.96
            • Published

            @picovoice/rhino-web

            Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

              • v3.0.3
              • 32.71
              • Published

              text2wav

              Self-contained multilingual TTS speech synthesizer for Node.js in pure js

              • v0.0.14
              • 32.36
              • Published

              speech-ui-kitt

              A flexible GUI for interacting with Speech Recognition

              • v0.1.0
              • 32.28
              • Published

              pmacom-react-transcript-editor

              A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.

              • v2.4.0
              • 32.06
              • Published

              yandex-speech

              node.js module for Yandex speech systems (ASR & TTS)

              • v0.0.14
              • 31.92
              • Published

              node-witai-speech

              This is an API wrapper for witai speech for nodejs

              • v1.0.2
              • 31.81
              • Published

              avr-vad

              A Node.js library for Voice Activity Detection using Silero VAD

              • v1.0.9
              • 31.65
              • Published

              @larriereguichet/vosk

              Node binding for continuous offline voice recoginition with Vosk library.

              • v0.4.4
              • 31.12
              • Published

              mumble-js

              A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.

              • v1.0.1
              • 30.88
              • Published

              edge-tts-universal

              Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

              • v1.3.0
              • 30.71
              • Published

              brill

              Part-of-speech tags from the Brill-tagger

              • v3.1.0
              • 30.65
              • Published

              sherpa-onnx-darwin-x64

              Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

              • v1.12.10
              • 30.52
              • Published

              mmir-lib

              MMIR (Mobile Multimodal Interaction and Relay) library

              • v7.0.1
              • 30.46
              • Published

              vosk-lib

              Vosk library for node, with type defenitions and multi-arch support.

              • v0.1.3
              • 30.30
              • Published

              @pr0gramm/fluester

              Node.js bindings for OpenAI's Whisper. Optimized for CPU.

              • v0.9.15
              • 29.78
              • Published

              react-native-voice-hold

              React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes

              • v1.0.7
              • 29.47
              • Published

              @picovoice/koala-web

              Koala Noise Suppression engine for web browsers (via WebAssembly)

                • v2.0.0
                • 28.64
                • Published

                postcss-speech-bubble

                PostCSS plugin creates speech bubbles with just 1-2 lines of CSS

                • v1.0.12
                • 28.54
                • Published

                mespeak

                Text to speech synthesizer

                • v2.0.2
                • 28.39
                • Published

                node-red-contrib-tts-ultimate

                Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You

                • v3.0.1
                • 28.26
                • Published

                text-to-speech-js

                A small JavaScript library that provides a text to speech conversion using tts-api.com service.

                • v1.1.11
                • 28.09
                • Published

                mbz-voice-sdk

                🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.

                • v1.0.21
                • 27.92
                • Published

                vue-webapi-speech-recognition

                Microphone icon as a single component (black as default and red when it's recording) to interact with the Web Speech Recognition Api

                • v1.0.1
                • 27.83
                • Published

                azure-speech-utilities

                Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil

                • v1.0.0
                • 27.68
                • Published

                n8n-nodes-groq-speech

                N8N Community Node for Groq Text-to-Speech API integration

                • v1.1.2
                • 27.43
                • Published

                @arach/speakeasy

                SpeakEasy - Unified text-to-speech service with provider abstraction

                  • v0.2.4
                  • 27.34
                  • Published

                  node-mfcc

                  Node.js implementation of the MFCC audio speech analysis algorithm.

                  • v0.0.2
                  • 27.23
                  • Published

                  speech-command-engine

                  A package for handling voice commands and speech recognition.

                    • v1.0.5
                    • 26.76
                    • Published

                    @picovoice/leopard-web

                    Leopard Speech-to-Text engine for web browsers (via WebAssembly)

                      • v2.0.1
                      • 26.46
                      • Published

                      @mirawision/reactive-hooks

                      A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.

                      • v1.1.0
                      • 25.81
                      • Published

                      xfyun-sdk

                      科大讯飞语音识别 SDK,支持浏览器中实时语音听写功能

                      • v1.0.2
                      • 25.57
                      • Published

                      koi-app

                      Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

                      • v0.1.2
                      • 25.47
                      • Published

                      whisper-speech-to-text

                      A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

                      • v1.0.3
                      • 25.39
                      • Published

                      @moonshine-ai/moonshine-js

                      On-device speech-to-text and voice control for web applications with Moonshine.

                      • v0.1.29
                      • 25.11
                      • Published

                      iobroker.sonus

                      With this adapter you can control ioBroker with voice in many different languages

                      • v0.1.1
                      • 25.02
                      • Published

                      speechjs

                      Chrome speech recognition API wrapper

                      • v0.0.1
                      • 24.95
                      • Published

                      @picovoice/orca-web

                      Orca Text-to-Speech engine for web browsers (via WebAssembly)

                        • v1.2.1
                        • 24.71
                        • Published

                        node-droid-language

                        A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online

                        • v1.0.2
                        • 24.42
                        • Published

                        mac-say

                        The macOS built-in `say` CLI for JavaScript

                        • v0.3.3
                        • 24.42
                        • Published

                        text-to-speech

                        Java application that allows to transform a text to speech using [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).

                        • v1.0.11
                        • 24.36
                        • Published

                        @untemps/react-vocal

                        React component and hook to initiate a SpeechRecognition session

                        • v1.7.28
                        • 23.83
                        • Published

                        pocketsphinx

                        Node binding for continuous voice recoginition through pocketsphinx.

                        • v5.0.7
                        • 23.75
                        • Published

                        primvoices-react

                        React client for the PrimVoices Agents API

                        • v0.2.2
                        • 23.59
                        • Published

                        @bluefly/apple-fm

                        Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing

                        • v0.2.7
                        • 23.36
                        • Published

                        tiktok-tts

                        Use TikTok TTS from node.js

                        • v1.1.17
                        • 23.34
                        • Published

                        @logikron/talk-widget-embed

                        Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website

                        • v1.0.2
                        • 23.32
                        • Published

                        buzzphrase

                        Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections

                        • v3.2.1
                        • 23.15
                        • Published

                        ispikit

                        ispikit

                        • v1.0.3
                        • 23.13
                        • Published

                        node-mic-record

                        Record microphone sond using nodejs

                        • v0.0.1
                        • 23.10
                        • Published

                        cybertyper

                        ReactJS component for automatically typing text synchronized with speech synthesis & recognition

                        • v0.0.3
                        • 22.97
                        • Published

                        discord-tts

                        Node.js module to make your discord bot talk

                        • v1.2.2
                        • 22.97
                        • Published

                        ttsmaker

                        Text-to-Speech API wrapper for ttsmp3.com

                        • v1.0.3
                        • 22.78
                        • Published

                        browser-speech

                        🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech

                          • v1.1.1
                          • 22.77
                          • Published

                          google-cloud-speech-webaudio

                          Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.

                          • v0.1.4
                          • 22.72
                          • Published

                          tts-cli

                          Command-line tool to convert text to speech

                          • v5.4.1
                          • 22.36
                          • Published

                          aixblock-voice-ai-deepgram

                          A React component for real-time transcription and voice agent interactions using Deepgram APIs

                            • v0.0.7
                            • 22.30
                            • Published

                            @albertsyh/use-whisper

                            React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                            • v0.2.17
                            • 21.87
                            • Published

                            pronounceability

                            Calculate pronounceability for a given word.

                            • v0.0.3
                            • 21.66
                            • Published

                            ugai

                            A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

                            • v1.1.0
                            • 21.61
                            • Published

                            n8n-nodes-groq

                            N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

                            • v0.2.0
                            • 21.60
                            • Published

                            speechify

                            Easily add speech to text functionality into your website

                            • v0.1.0
                            • 21.31
                            • Published

                            espeak

                            text-to-speech using espeak cli program

                            • v0.0.3
                            • 21.07
                            • Published

                            @squirrelsoft/dev-say

                            MCP server for macOS text-to-speech using the say command

                            • v1.0.1
                            • 21.05
                            • Published

                            @jcbyte/tts-queue

                            A lightweight wrapper for the Web Speech API's SpeechSynthesis, enabling easy queuing and management of text-to-speech utterances.

                            • v1.0.1
                            • 21.03
                            • Published

                            @cloudraker/use-whisper

                            React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                            • v0.3.0
                            • 20.59
                            • Published

                            @lipsurf/plugins

                            Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

                            • v4.10.0
                            • 20.55
                            • Published

                            spoken

                            JavaScript Web API for Text-to-Speech and Speech-to-Text.

                            • v1.1.17
                            • 20.39
                            • Published

                            speechrecognizer

                            Cordova plugin which provides a speech recognition service

                            • v0.0.2
                            • 20.22
                            • Published

                            linear16

                            Converts an audio file to LINEAR16 Google-speech compatible file.

                            • v1.2.1
                            • 20.21
                            • Published

                            echogarden-migaku

                            An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

                            • v2.5.2
                            • 20.09
                            • Published

                            @swankylegg/voice-io

                            A browser-based speech recognition and synthesis assistant

                            • v1.0.11
                            • 19.93
                            • Published

                            @qubby/use-whisper-beta

                            React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                            • v0.0.27
                            • 19.77
                            • Published

                            klatt-syn

                            Klatt formant synthesizer

                            • v1.0.7
                            • 19.68
                            • Published

                            simpletts

                            A basic TTS manager

                            • v2.6.0
                            • 19.64
                            • Published

                            @yayaadev/sermo-models

                            TypeScript models for Sermo API (HTTP and WebSocket) generated from OpenAPI specifications

                            • v1.0.1
                            • 19.25
                            • Published

                            speakie

                            speakie is a professional voice recognition package designed to empower your applications with seamless voice command integration. Easily create functions that respond to specific voice commands, making user interactions more natural and intuitive.

                            • v1.0.0
                            • 19.14
                            • Published

                            mmi-js

                            Multi-Modal Input Library for voice, gesture, and traditional inputs.

                              • v1.0.0
                              • 19.02
                              • Published

                              react-transcript-editor

                              A React component to make transcribing audio and video easier and faster.

                              • v1.3.1-alpha.4
                              • 18.90
                              • Published

                              web-speech-profanity

                              Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.

                              • v7.1.2-0
                              • 18.34
                              • Published

                              angular2-speech-engine

                              A set of Angular2 services and components to add speech recognition and synthesis to Angular2 applications

                              • v0.0.2
                              • 18.34
                              • Published

                              realtime-ten-vad

                              Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.

                              • v1.0.0
                              • 18.29
                              • Published

                              react-native-deepgram

                              React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

                              • v0.1.21
                              • 18.14
                              • Published

                              say2

                              Interactive text-to-speech CLI with multiple voices using ElevenLabs API

                              • v1.1.0
                              • 18.13
                              • Published