JSPM

Found 737 results for speech recognition

annyang

A javascript library for adding voice commands to your site, using speech recognition

  • v2.6.1
  • 218.41
  • Published

sonix-speech-recognition

A library that produces audio transcriptions and translations using the Sonix.AI service.

  • v2.1.1
  • 177.24
  • Published

echogarden

An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

  • v2.10.1
  • 176.93
  • Published

@picovoice/eagle-web

Eagle Speaker Recognition engine for web browsers (via WebAssembly)

    • v1.0.0
    • 136.65
    • Published

    speechkitt

    A flexible GUI for interacting with Speech Recognition

    • v1.0.0
    • 134.28
    • Published

    @sign-speak/react-sdk

    Unlock Sign Language Recognition, Avatar, and Speech Recognition.

    • v0.7.3
    • 126.77
    • Published

    sherpa-ncnn

    Real-time speech recognition with Next-gen Kaldi

    • v2.1.12
    • 114.87
    • Published

    speech-js

    lib for recognition and synthesis of speech

    • v0.1.1
    • 97.97
    • Published

    voice-speech-recognition

    Simple wrapper that extends the Speech Recognition functionality built into browsers.

    • v1.1.2
    • 91.37
    • Published

    react-native-voicebox-speech-rec

    A powerful speech recognition library for React Native applications, enabling real-time speech-to-text transcription.

    • v1.0.4
    • 91.17
    • Published

    electron-speech

    speech recognition cli and api for node using electron

    • v1.0.7
    • 86.67
    • Published

    spremic

    A simple JavaScript speech recognition library.

    • v0.0.48
    • 86.39
    • Published

    @mastashake08/speech-kit

    Package for simplifying the Speech Recognition and Speech Utterance process.

    • v2.0.8
    • 85.04
    • Published

    react-voice-search

    React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.

    • v1.1.1
    • 83.01
    • Published

    sonus

    Open source cross platform decentralized always-on speech recognition framework

    • v1.0.3
    • 82.55
    • Published

    vosk

    Node binding for continuous offline voice recognition with Vosk library.

    • v0.3.39
    • 82.20
    • Published

    vosk-koffi

    Vosk node API based on Koffi.

    • v1.1.1
    • 79.63
    • Published

    @aurally/fancy-search

    A lib to improve your app's search functionality by adding opt-outable speech recognition, multiple parallel searches and automatic result matching

    • v1.0.9
    • 77.85
    • Published

    react-native-speech-engine

    React Native Speech Recognition and Text-to-Speech with new architecture support

    • v0.0.1
    • 77.48
    • Published

    artyom.js

    Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistant

    • v1.0.6
    • 72.31
    • Published

    whisper-onnx-speech-to-text

    Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.

    • v1.0.1
    • 69.56
    • Published

    @ng-web-apis/speech

    A library for using Web Speech API with Angular

    • v4.12.0
    • 68.95
    • Published

    electron-vosk-speech

    Lightweight Speech Recognition Library for Electron. Based on [nodejs-speech-kiosk-usercase](https://www.npmjs.com/package/nodejs-speech-kiosk-usercase) and [vosk-api](https://github.com/alphacep/vosk-api).

    • v0.2.1
    • 67.13
    • Published

    cordova-plugin-speech

    This is cordova plugin for Speech Recognition and Text to Speech.

    • v0.0.4
    • 66.62
    • Published

    speech-recognition-react

    A react library that encapsulates the native browser speech recognition api

    • v2.0.0
    • 64.28
    • Published

    @deepgram/sdk

    Isomorphic Javascript client for Deepgram

    • v4.11.2
    • 62.55
    • Published

    @deepgram/captions

    Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

    • v1.2.0
    • 61.66
    • Published

    speech-to-element

    Add real-time speech to text functionality into your website with no effort

    • v1.0.4
    • 60.52
    • Published

    corti

    Replace window.SpeechRecognition with a mock object and automate your tests

    • v1.0.0
    • 60.40
    • Published

    parakeet.js

    NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

    • v0.0.3
    • 58.78
    • Published

    expo-speech

    Provides text-to-speech functionality.

    • v13.1.7
    • 56.44
    • Published

    revai-node-sdk

    Rev AI makes speech applications easy to build!

    • v3.8.5
    • 56.32
    • Published

    houndify-react-native

    Allows react-native apps to connect to Houndify for speech recognition.

      • v0.2.0
      • 56.17
      • Published

      react-native-tts

      React Native Text-To-Speech module for Android and iOS

      • v4.1.1
      • 54.43
      • Published

      speaktome-api

      JavaScript modules for Mozilla's cloud speech recognition API

      • v0.2.1
      • 54.34
      • Published

      yanyu

      A Chinese speech synthesis and recognition library toolkit

      • v0.1.4
      • 54.09
      • Published

      speech-into-text

      SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

      • v4.0.2
      • 53.52
      • Published

      react-speech

      React component for the web speech synthesis api

      • v1.0.2
      • 53.07
      • Published

      sherpa-onnx-node

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.11
      • 48.26
      • Published

      koi-koi

      Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

      • v0.1.0
      • 48.19
      • Published

      fft-js

      Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.

      • v0.0.12
      • 48.09
      • Published

      en-pos

      A better English POS tagger written in JavaScript

      • v1.0.16
      • 47.82
      • Published

      speech-recog-stream

      A module to stream audio to a speech recognition server and get back the STT result

      • v1.0.8
      • 47.71
      • Published

      elevenlabs-node

      This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

      • v2.0.3
      • 45.70
      • Published

      speechflow

      Speech Processing Flow Graph

      • v1.5.1
      • 44.58
      • Published

      react-text-to-speech

      An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.

      • v2.1.2
      • 44.40
      • Published

      sherpa-onnx-linux-x64

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.11
      • 43.88
      • Published

      mic-to-speech

      Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.

      • v1.0.1
      • 43.82
      • Published

      retext-pos

      retext plugin to add part-of-speech (POS) tags

      • v5.0.0
      • 43.72
      • Published

      ssml-check-core

      Core library to check for valid SSML

      • v0.3.9
      • 43.58
      • Published

      sherpa-onnx

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.11
      • 43.44
      • Published

      google-cloud-speech-webaudio

      Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.

      • v0.1.4
      • 42.30
      • Published

      sherpa-onnx-darwin-arm64

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.11
      • 41.13
      • Published

      speech-ui-kitt

      A flexible GUI for interacting with Speech Recognition

      • v0.1.0
      • 40.80
      • Published

      @chengsokdara/use-whisper

      React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

      • v0.2.0
      • 40.78
      • Published

      @wdragon/react-native-voice

      React Native Native Voice library for iOS and Android, forked from @react-native-voice/voice

      • v3.3.11
      • 40.20
      • Published

      @swankylegg/voice-io

      A browser-based speech recognition and synthesis assistant

      • v1.0.11
      • 39.74
      • Published

      babbler

      Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.

      • v1.0.0
      • 39.69
      • Published

      web-wake-word

      A web package for keyword detection

        • v2.0.10
        • 39.58
        • Published

        ssml-check

        Check for valid SSML

        • v0.4.6
        • 38.96
        • Published

        sam-js

        SAM - The Software Automatic Mouth

        • v0.3.1
        • 38.91
        • Published

        sherpa-onnx-win-x64

        Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

        • v1.12.11
        • 38.91
        • Published

        sherpa-onnx-win-ia32

        Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

        • v1.12.11
        • 38.56
        • Published

        aws-transcribe

        A client for Amazon Transcribe using the websocket interface

        • v1.1.1
        • 38.43
        • Published

        @aurally/speech-control

        A class to handle microphone permissions, start and observe speech input

        • v1.1.2
        • 37.89
        • Published

        espeak-ng

        eSpeak-NG speech synthesizer, compiled to JavaScript + WASM

        • v1.0.2
        • 37.57
        • Published

        soundswallower

        An even smaller speech recognizer

        • v0.6.3
        • 37.30
        • Published

        vocalize.ts

        A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.

        • v1.2.2
        • 36.65
        • Published

        @picovoice/cheetah-web

        Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

          • v2.3.0
          • 35.15
          • Published

          mumble-js

          A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.

          • v1.0.1
          • 34.87
          • Published

          @picovoice/cobra-web

          Cobra VAD engine for web browsers (via WebAssembly)

            • v2.0.3
            • 34.69
            • Published

            audio-to-text-node

            Backend audio file to text transcription using Web Speech API with Puppeteer

            • v0.1.2
            • 34.48
            • Published

            text2wav

            Self-contained multilingual TTS speech synthesizer for Node.js in pure js

            • v0.0.14
            • 34.14
            • Published

            piper-announce

            AI-powered announcement generator using Piper TTS and OpenAI GPT models

            • v1.2.10
            • 33.93
            • Published

            sherpa-onnx-linux-arm64

            Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

            • v1.12.11
            • 33.33
            • Published

            ng-speech-recognition

            AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.

            • v2.0.1
            • 33.01
            • Published

            @picovoice/rhino-web

            Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

              • v3.0.3
              • 32.83
              • Published

              mmir-lib

              MMIR (Mobile Multimodal Interaction and Relay) library

              • v7.0.1
              • 32.17
              • Published

              yandex-speech

              node.js module for Yandex speech systems (ASR & TTS)

              • v0.0.14
              • 32.04
              • Published

              cybertyper

              ReactJS component for automatically typing text synchronized with speech synthesis & recognition

              • v0.0.3
              • 31.51
              • Published

              node-witai-speech

              This is an API wrapper for witai speech for nodejs

              • v1.0.2
              • 31.40
              • Published

              azure-speech-utilities

              Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil

              • v1.0.0
              • 31.09
              • Published

              brill

              Part-of-speech tags from the Brill-tagger

              • v3.1.0
              • 30.80
              • Published

              edge-tts-universal

              Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

              • v1.3.0
              • 30.79
              • Published

              avr-vad

              A Node.js library for Voice Activity Detection using Silero VAD

              • v1.0.9
              • 30.24
              • Published

              vosk-lib

              Vosk library for node, with type definitions and multi-arch support.

              • v0.1.3
              • 30.22
              • Published

              sherpa-onnx-darwin-x64

              Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

              • v1.12.11
              • 30.22
              • Published

              @pr0gramm/fluester

              Node.js bindings for OpenAI's Whisper. Optimized for CPU.

              • v0.9.15
              • 30.06
              • Published

              whisper-speech-to-text

              A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

              • v1.0.3
              • 29.67
              • Published

              mespeak

              Text to speech synthesizer

              • v2.0.2
              • 29.18
              • Published

              pmacom-react-transcript-editor

              A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.

              • v2.4.0
              • 28.65
              • Published

              node-red-contrib-tts-ultimate

              Transforms text into speech and plays it on a Sonos player, or generates an audio file for use with third-party nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You

              • v3.0.1
              • 28.60
              • Published

              postcss-speech-bubble

              PostCSS plugin creates speech bubbles with just 1-2 lines of CSS

              • v1.0.12
              • 28.48
              • Published

              mbz-voice-sdk

              🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.

              • v1.0.21
              • 28.23
              • Published

              text-to-speech-js

              A small JavaScript library that provides a text to speech conversion using tts-api.com service.

              • v1.1.11
              • 28.17
              • Published

              vue-webapi-speech-recognition

              Microphone icon as a single component (black as default and red when it's recording) to interact with the Web Speech Recognition Api

              • v1.0.1
              • 27.87
              • Published

              echogarden-migaku

              An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

              • v2.5.2
              • 27.70
              • Published

              @picovoice/koala-web

              Koala Noise Suppression engine for web browsers (via WebAssembly)

                • v2.0.0
                • 27.67
                • Published

                @arach/speakeasy

                SpeakEasy - Unified text-to-speech service with provider abstraction

                  • v0.2.4
                  • 27.32
                  • Published

                  ispikit

                  ispikit

                  • v1.0.3
                  • 27.20
                  • Published

                  @larriereguichet/vosk

                  Node binding for continuous offline voice recognition with Vosk library.

                  • v0.4.4
                  • 26.96
                  • Published

                  @picovoice/leopard-web

                  Leopard Speech-to-Text engine for web browsers (via WebAssembly)

                    • v2.0.1
                    • 26.61
                    • Published

                    node-mfcc

                    Node.js implementation of the MFCC audio speech analysis algorithm.

                    • v0.0.2
                    • 26.59
                    • Published

                    iobroker.sonus

                    With this adapter you can control ioBroker with voice in many different languages

                    • v0.1.1
                    • 26.14
                    • Published

                    @mirawision/reactive-hooks

                    A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.

                    • v1.1.0
                    • 26.12
                    • Published

                    @arifdroid/enhanced-chat

                    Enhanced chat widget combining modern n8n styling with advanced voice features and alert mode

                    • v1.3.1
                    • 26.07
                    • Published

                    pocketsphinx

                    Node binding for continuous voice recognition through pocketsphinx.

                    • v5.0.7
                    • 25.84
                    • Published

                    xfyun-sdk

                    iFlytek speech recognition SDK, supporting real-time voice dictation in the browser

                    • v1.0.2
                    • 25.56
                    • Published

                    @picovoice/orca-web

                    Orca Text-to-Speech engine for web browsers (via WebAssembly)

                      • v1.2.1
                      • 25.40
                      • Published

                      @moonshine-ai/moonshine-js

                      On-device speech-to-text and voice control for web applications with Moonshine.

                      • v0.1.29
                      • 25.31
                      • Published

                      mac-say

                      The macOS built-in `say` CLI for JavaScript

                      • v0.3.3
                      • 25.22
                      • Published

                      @albertsyh/use-whisper

                      React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                      • v0.2.17
                      • 25.12
                      • Published

                      speechjs

                      Chrome speech recognition API wrapper

                      • v0.0.1
                      • 25.00
                      • Published

                      primvoices-react

                      React client for the PrimVoices Agents API

                      • v0.2.2
                      • 24.97
                      • Published

                      react-native-voice-hold

                      React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes

                      • v1.0.7
                      • 24.85
                      • Published

                      node-droid-language

                      A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online

                      • v1.0.2
                      • 24.63
                      • Published

                      text-to-speech

                      Java application that converts text to speech using the [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).

                      • v1.0.11
                      • 24.54
                      • Published

                      discord-tts

                      Node.js module to make your discord bot talk

                      • v1.2.2
                      • 24.53
                      • Published

                      node-mic-record

                      Record microphone sound using Node.js

                      • v0.0.1
                      • 23.92
                      • Published

                      @untemps/react-vocal

                      React component and hook to initiate a SpeechRecognition session

                      • v1.7.28
                      • 23.83
                      • Published

                      tiktok-tts

                      Use TikTok TTS from node.js

                      • v1.1.17
                      • 23.76
                      • Published

                      @logikron/talk-widget-embed

                      Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website

                      • v1.0.2
                      • 23.65
                      • Published

                      buzzphrase

                      Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections

                      • v3.2.1
                      • 23.35
                      • Published

                      @bluefly/apple-fm

                      Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing

                      • v0.2.7
                      • 23.33
                      • Published

                      @lipsurf/plugins

                      Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

                      • v4.10.0
                      • 23.14
                      • Published

                      ttsmaker

                      Text-to-Speech API wrapper for ttsmp3.com

                      • v1.0.3
                      • 22.73
                      • Published

                      aixblock-voice-ai-deepgram

                      A React component for real-time transcription and voice agent interactions using Deepgram APIs

                        • v0.0.7
                        • 22.25
                        • Published

                        pronounceability

                        Calculate pronounceability for a given word.

                        • v0.0.3
                        • 21.79
                        • Published

                        n8n-nodes-groq

                        N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

                        • v0.2.0
                        • 21.77
                        • Published

                        whatsapp-claude-gpt

                        WhatsApp-Claude-GPT is a WhatsApp chatbot that supports multiple AI providers for chat, optional image generation/editing, and voice (speech-to-text and text-to-speech). It’s built for natural, contextual conversations and can now also handle reminders an

                        • v1.4.0
                        • 21.63
                        • Published

                        ugai

                        A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

                        • v1.1.0
                        • 21.56
                        • Published

                        espeak

                        text-to-speech using espeak cli program

                        • v0.0.3
                        • 21.56
                        • Published

                        browser-speech

                        🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech

                          • v1.1.1
                          • 21.45
                          • Published

                          @squirrelsoft/dev-say

                          MCP server for macOS text-to-speech using the say command

                          • v1.0.1
                          • 21.32
                          • Published

                          koi-app

                          Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

                          • v0.1.2
                          • 21.29
                          • Published

                          @qubby/use-whisper-beta

                          React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                          • v0.0.27
                          • 21.02
                          • Published

                          spoken

                          JavaScript Web API for Text-to-Speech and Speech-to-Text.

                          • v1.1.17
                          • 20.86
                          • Published

                          linear16

                          Converts an audio file to LINEAR16 Google-speech compatible file.

                          • v1.2.1
                          • 20.62
                          • Published

                          klatt-syn

                          Klatt formant synthesizer

                          • v1.0.7
                          • 20.57
                          • Published

                          react-assistant

                          Web Speech Recognition API turned into a React component

                          • v0.0.1
                          • 20.50
                          • Published

                          speechrecognizer

                          Cordova plugin which provides a speech recognition service

                          • v0.0.2
                          • 20.25
                          • Published

                          n8n-nodes-groq-speech

                          N8N Community Node for Groq Text-to-Speech API integration

                          • v1.1.2
                          • 20.19
                          • Published

                          @qubby/use-whisper

                          React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                          • v0.0.42
                          • 20.16
                          • Published

                          editorjs-speech

                          Speech Block Tool for Editor.js

                          • v1.6.1
                          • 20.14
                          • Published

                          @cloudraker/use-whisper

                          React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                          • v0.3.0
                          • 19.70
                          • Published

                          extra-googletts

                          Generate speech audio from super long text through machine, via Google TTS, ffmpeg.

                          • v1.6.31
                          • 19.33
                          • Published

                          simpletts

                          A basic TTS manager

                          • v2.6.0
                          • 19.18
                          • Published

                          speakie

                          speakie is a professional voice recognition package designed to empower your applications with seamless voice command integration. Easily create functions that respond to specific voice commands, making user interactions more natural and intuitive.

                          • v1.0.0
                          • 19.18
                          • Published

                          texttospeech

                          Text to Speech (Pure Client Side)

                          • v0.2.0
                          • 19.00
                          • Published