JSPM

Found 736 results for speech recognition

electron-speech

speech recognition cli and api for node using electron

  • v1.0.7
  • 86.98
  • Published

spremic

A simple JavaScript speech recognition library.

  • v0.0.48
  • 86.69
  • Published

@mastashake08/speech-kit

Package for simplifying the Speech Recognition and Speech Utterence process.

  • v2.0.8
  • 85.41
  • Published

sonus

Open source cross platform decentralized always-on speech recognition framework

  • v1.0.3
  • 82.85
  • Published

react-voice-search

React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.

  • v1.1.1
  • 82.78
  • Published

vosk

Node binding for continuous offline voice recoginition with Vosk library.

  • v0.3.39
  • 80.08
  • Published

vosk-koffi

Vosk node API based on Koffi.

  • v1.1.1
  • 79.66
  • Published

@aurally/fancy-search

A lib to improve your apps search functionality by adding the opt-outable speech recognition, multiple parallel searches and automatic result matching

  • v1.0.9
  • 78.16
  • Published

speech-js

lib for recognition and synthesis of speech

  • v0.1.1
  • 73.04
  • Published

artyom.js

Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

  • v1.0.6
  • 72.38
  • Published

@ng-web-apis/speech

A library for using Web Speech API with Angular

  • v4.12.0
  • 68.64
  • Published

whisper-onnx-speech-to-text

Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.

  • v1.0.1
  • 68.30
  • Published

electron-vosk-speech

Lightweight Speech Recognition Library for Electron. Based on [nodejs-speech-kiosk-usercase](https://www.npmjs.com/package/nodejs-speech-kiosk-usercase) and [vosk-api](https://github.com/alphacep/vosk-api).

  • v0.2.1
  • 67.39
  • Published

cordova-plugin-speech

This is cordova plugin for Speech Recognition and Text to Speech.

  • v0.0.4
  • 64.68
  • Published

speech-recognition-react

A react library that encapsulates the native browser speech recognition api

  • v2.0.0
  • 64.67
  • Published

@deepgram/sdk

Isomorphic Javascript client for Deepgram

  • v4.11.2
  • 62.12
  • Published

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

  • v1.2.0
  • 60.95
  • Published

corti

Replace window.SpeechRecognition with a mock object and automate your tests

  • v1.0.0
  • 60.36
  • Published

speech-to-element

Add real-time speech to text functionality into your website with no effort

  • v1.0.4
  • 59.80
  • Published

parakeet.js

NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

  • v0.0.3
  • 59.25
  • Published

houndify-react-native

Allows react-native apps to connect to Houndify for speech recognition.

    • v0.2.0
    • 56.62
    • Published

    revai-node-sdk

    Rev AI makes speech applications easy to build!

    • v3.8.5
    • 56.18
    • Published

    expo-speech

    Provides text-to-speech functionality.

    • v13.1.7
    • 56.15
    • Published

    speaktome-api

    JavaScript modules for Mozilla's cloud speech recognition API

    • v0.2.1
    • 54.53
    • Published

    react-native-tts

    React Native Text-To-Speech module for Android and iOS

    • v4.1.1
    • 53.62
    • Published

    speech-into-text

    SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

    • v4.0.2
    • 53.22
    • Published

    react-speech

    React component for the web speech synthesis api

    • v1.0.2
    • 52.46
    • Published

    speech-recog-stream

    A module to stream audio to a speech recognition server and get back the STT result"

    • v1.0.8
    • 47.90
    • Published

    sherpa-onnx-node

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 47.89
    • Published

    yanyu

    A Chinese speech synthesis and recognition library toolkit

    • v0.1.4
    • 47.84
    • Published

    fft-js

    Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.

    • v0.0.12
    • 47.49
    • Published

    en-pos

    A better English POS tagger written in JavaScript

    • v1.0.16
    • 47.39
    • Published

    elevenlabs-node

    This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

    • v2.0.3
    • 45.44
    • Published

    react-text-to-speech

    An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.

    • v2.1.2
    • 44.68
    • Published

    speechflow

    Speech Processing Flow Graph

    • v1.5.1
    • 44.46
    • Published

    mic-to-speech

    Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.

    • v1.0.1
    • 44.00
    • Published

    sherpa-onnx-linux-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 43.66
    • Published

    retext-pos

    retext plugin to add part-of-speech (POS) tags

    • v5.0.0
    • 43.59
    • Published

    ssml-check-core

    Core library to check for valid SSML

    • v0.3.9
    • 43.45
    • Published

    sherpa-onnx

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 43.38
    • Published

    google-cloud-speech-webaudio

    Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.

    • v0.1.4
    • 42.57
    • Published

    @wdragon/react-native-voice

    React Native Native Voice library for iOS and Android, folk from @react-native-voice/voice

    • v3.3.11
    • 41.14
    • Published

    sherpa-onnx-darwin-arm64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 40.82
    • Published

    @chengsokdara/use-whisper

    React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

    • v0.2.0
    • 40.68
    • Published

    web-wake-word

    A web package for keyword detection

      • v2.0.10
      • 40.11
      • Published

      babbler

      Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.

      • v1.0.0
      • 39.96
      • Published

      @swankylegg/voice-io

      A browser-based speech recognition and synthesis assistant

      • v1.0.11
      • 39.86
      • Published

      koi-koi

      Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

      • v0.1.0
      • 39.44
      • Published

      ssml-check

      Check for valid SSML

      • v0.4.6
      • 38.90
      • Published

      sam-js

      SAM - The Software Automatic Mouth

      • v0.3.1
      • 38.82
      • Published

      sherpa-onnx-win-x64

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.11
      • 38.81
      • Published

      sherpa-onnx-win-ia32

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.11
      • 38.76
      • Published

      aws-transcribe

      A client for Amazon Transcribe using the websocket interface

      • v1.1.1
      • 38.08
      • Published

      @aurally/speech-control

      A class to handle microphone permissions, start and observe speech input

      • v1.1.2
      • 37.91
      • Published

      espeak-ng

      eSpeak-NG speech synthesizer, compiled to JavasScript + WASM

      • v1.0.2
      • 37.26
      • Published

      soundswallower

      An even smaller speech recognizer

      • v0.6.3
      • 37.03
      • Published

      vocalize.ts

      A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.

      • v1.2.2
      • 36.84
      • Published

      @picovoice/cheetah-web

      Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

        • v2.3.0
        • 35.08
        • Published

        @picovoice/cobra-web

        Cobra VAD engine for web browsers (via WebAssembly)

          • v2.0.3
          • 34.60
          • Published

          audio-to-text-node

          Backend audio file to text transcription using Web Speech API with Puppeteer

          • v0.1.2
          • 34.27
          • Published

          piper-announce

          AI-powered announcement generator using Piper TTS and OpenAI GPT models

          • v1.2.10
          • 33.42
          • Published

          ng-speech-recognition

          AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.

          • v2.0.1
          • 33.15
          • Published

          sherpa-onnx-linux-arm64

          Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

          • v1.12.11
          • 33.12
          • Published

          @picovoice/rhino-web

          Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

            • v3.0.3
            • 32.76
            • Published

            yandex-speech

            node.js module for Yandex speech systems (ASR & TTS)

            • v0.0.14
            • 32.18
            • Published

            speech-ui-kitt

            A flexible GUI for interacting with Speech Recognition

            • v0.1.0
            • 32.15
            • Published

            text2wav

            Self-contained multilingual TTS speech synthesizer for Node.js in pure js

            • v0.0.14
            • 32.10
            • Published

            mmir-lib

            MMIR (Mobile Multimodal Interaction and Relay) library

            • v7.0.1
            • 32.08
            • Published

            cybertyper

            ReactJS component for automatically typing text synchronized with speech synthesis & recognition

            • v0.0.3
            • 31.68
            • Published

            avr-vad

            A Node.js library for Voice Activity Detection using Silero VAD

            • v1.0.9
            • 31.39
            • Published

            node-witai-speech

            This is an API wrapper for witai speech for nodejs

            • v1.0.2
            • 31.33
            • Published

            azure-speech-utilities

            Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil

            • v1.0.0
            • 31.02
            • Published

            @larriereguichet/vosk

            Node binding for continuous offline voice recoginition with Vosk library.

            • v0.4.4
            • 31.02
            • Published

            mumble-js

            A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.

            • v1.0.1
            • 30.69
            • Published

            edge-tts-universal

            Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

            • v1.3.0
            • 30.62
            • Published

            brill

            Part-of-speech tags from the Brill-tagger

            • v3.1.0
            • 30.55
            • Published

            sherpa-onnx-darwin-x64

            Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

            • v1.12.11
            • 30.38
            • Published

            vosk-lib

            Vosk library for node, with type defenitions and multi-arch support.

            • v0.1.3
            • 30.21
            • Published

            @pr0gramm/fluester

            Node.js bindings for OpenAI's Whisper. Optimized for CPU.

            • v0.9.15
            • 29.79
            • Published

            react-native-voice-hold

            React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes

            • v1.0.7
            • 29.59
            • Published

            whisper-speech-to-text

            A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

            • v1.0.3
            • 29.58
            • Published

            mespeak

            Text to speech synthesizer

            • v2.0.2
            • 29.13
            • Published

            pmacom-react-transcript-editor

            A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.

            • v2.4.0
            • 28.60
            • Published

            @kajidog/aivis-cloud-cli

            Aivis Cloud CLI - Text-to-speech synthesis and model management

              • v0.5.1
              • 28.48
              • Published

              postcss-speech-bubble

              PostCSS plugin creates speech bubbles with just 1-2 lines of CSS

              • v1.0.12
              • 28.41
              • Published

              node-red-contrib-tts-ultimate

              Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You

              • v3.0.1
              • 28.38
              • Published

              mbz-voice-sdk

              🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.

              • v1.0.21
              • 28.35
              • Published

              vue-webapi-speech-recognition

              Microphone icon as a single component (black as default and red when it's recording) to interact with the Web Speech Recognition Api

              • v1.0.1
              • 28.01
              • Published

              text-to-speech-js

              A small JavaScript library that provides a text to speech conversion using tts-api.com service.

              • v1.1.11
              • 28.01
              • Published

              echogarden-migaku

              An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

              • v2.5.2
              • 27.65
              • Published

              @picovoice/koala-web

              Koala Noise Suppression engine for web browsers (via WebAssembly)

                • v2.0.0
                • 27.62
                • Published

                n8n-nodes-groq-speech

                N8N Community Node for Groq Text-to-Speech API integration

                • v1.1.2
                • 27.35
                • Published

                ispikit

                ispikit

                • v1.0.3
                • 27.30
                • Published

                @arach/speakeasy

                SpeakEasy - Unified text-to-speech service with provider abstraction

                  • v0.2.4
                  • 27.12
                  • Published

                  @picovoice/leopard-web

                  Leopard Speech-to-Text engine for web browsers (via WebAssembly)

                    • v2.0.1
                    • 26.55
                    • Published

                    node-mfcc

                    Node.js implementation of the MFCC audio speech analysis algorithm.

                    • v0.0.2
                    • 26.55
                    • Published

                    iobroker.sonus

                    With this adapter you can control ioBroker with voice in many different languages

                    • v0.1.1
                    • 26.07
                    • Published

                    @mirawision/reactive-hooks

                    A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.

                    • v1.1.0
                    • 25.94
                    • Published

                    pocketsphinx

                    Node binding for continuous voice recoginition through pocketsphinx.

                    • v5.0.7
                    • 25.89
                    • Published

                    xfyun-sdk

                    科大讯飞语音识别 SDK,支持浏览器中实时语音听写功能

                    • v1.0.2
                    • 25.39
                    • Published

                    @picovoice/orca-web

                    Orca Text-to-Speech engine for web browsers (via WebAssembly)

                      • v1.2.1
                      • 25.19
                      • Published

                      @moonshine-ai/moonshine-js

                      On-device speech-to-text and voice control for web applications with Moonshine.

                      • v0.1.29
                      • 25.12
                      • Published

                      @albertsyh/use-whisper

                      React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                      • v0.2.17
                      • 25.05
                      • Published

                      speechjs

                      Chrome speech recognition API wrapper

                      • v0.0.1
                      • 24.85
                      • Published

                      primvoices-react

                      React client for the PrimVoices Agents API

                      • v0.2.2
                      • 24.78
                      • Published

                      text-to-speech

                      Java application that allows to transform a text to speech using [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).

                      • v1.0.11
                      • 24.69
                      • Published

                      mac-say

                      The macOS built-in `say` CLI for JavaScript

                      • v0.3.3
                      • 24.55
                      • Published

                      node-droid-language

                      A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online

                      • v1.0.2
                      • 24.55
                      • Published

                      discord-tts

                      Node.js module to make your discord bot talk

                      • v1.2.2
                      • 24.31
                      • Published

                      node-mic-record

                      Record microphone sond using nodejs

                      • v0.0.1
                      • 23.84
                      • Published

                      @untemps/react-vocal

                      React component and hook to initiate a SpeechRecognition session

                      • v1.7.28
                      • 23.72
                      • Published

                      @logikron/talk-widget-embed

                      Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website

                      • v1.0.2
                      • 23.33
                      • Published

                      buzzphrase

                      Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections

                      • v3.2.1
                      • 23.27
                      • Published

                      tiktok-tts

                      Use TikTok TTS from node.js

                      • v1.1.17
                      • 23.15
                      • Published

                      @bluefly/apple-fm

                      Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing

                      • v0.2.7
                      • 23.15
                      • Published

                      @lipsurf/plugins

                      Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

                      • v4.10.0
                      • 23.11
                      • Published

                      ttsmaker

                      Text-to-Speech API wrapper for ttsmp3.com

                      • v1.0.3
                      • 22.68
                      • Published

                      aixblock-voice-ai-deepgram

                      A React component for real-time transcription and voice agent interactions using Deepgram APIs

                        • v0.0.7
                        • 22.22
                        • Published

                        n8n-nodes-groq

                        N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

                        • v0.2.0
                        • 21.90
                        • Published

                        pronounceability

                        Calculate pronounceability for a given word.

                        • v0.0.3
                        • 21.75
                        • Published

                        browser-speech

                        🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech

                          • v1.1.1
                          • 21.59
                          • Published

                          ugai

                          A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

                          • v1.1.0
                          • 21.51
                          • Published

                          espeak

                          text-to-speech using espeak cli program

                          • v0.0.3
                          • 21.51
                          • Published

                          koi-app

                          Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

                          • v0.1.2
                          • 21.37
                          • Published

                          @squirrelsoft/dev-say

                          MCP server for macOS text-to-speech using the say command

                          • v1.0.1
                          • 21.13
                          • Published

                          @qubby/use-whisper-beta

                          React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                          • v0.0.27
                          • 20.90
                          • Published

                          spoken

                          JavaScript Web API for Text-to-Speech and Speech-to-Text.

                          • v1.1.17
                          • 20.79
                          • Published

                          klatt-syn

                          Klatt formant synthesizer

                          • v1.0.7
                          • 20.38
                          • Published

                          speechrecognizer

                          Cordova plugin which provides a speech recognition service

                          • v0.0.2
                          • 20.14
                          • Published

                          @qubby/use-whisper

                          React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                          • v0.0.42
                          • 20.13
                          • Published

                          linear16

                          Converts an audio file to LINEAR16 Google-speech compatible file.

                          • v1.2.1
                          • 20.04
                          • Published

                          @cloudraker/use-whisper

                          React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                          • v0.3.0
                          • 19.65
                          • Published

                          simpletts

                          A basic TTS manager

                          • v2.6.0
                          • 19.13
                          • Published

                          speakie

                          speakie is a professional voice recognition package designed to empower your applications with seamless voice command integration. Easily create functions that respond to specific voice commands, making user interactions more natural and intuitive.

                          • v1.0.0
                          • 19.05
                          • Published

                          tts-cli

                          Command-line tool to convert text to speech

                          • v5.4.1
                          • 18.95
                          • Published

                          react-transcript-editor

                          A React component to make transcribing audio and video easier and faster.

                          • v1.3.1-alpha.4
                          • 18.75
                          • Published

                          speechify

                          Easily add speech to text functionality into your website

                          • v0.1.0
                          • 18.66
                          • Published

                          say2

                          Interactive text-to-speech CLI with multiple voices using ElevenLabs API

                          • v1.1.0
                          • 18.56
                          • Published

                          web-speech-profanity

                          Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.

                          • v7.1.2-0
                          • 18.28
                          • Published

                          angular2-speech-engine

                          A set of Angular2 services and components to add speech recognition and synthesis to Angular2 applications

                          • v0.0.2
                          • 18.27
                          • Published

                          realtime-ten-vad

                          Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.

                          • v1.0.0
                          • 18.14
                          • Published

                          react-native-deepgram

                          React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

                          • v0.1.21
                          • 18.08
                          • Published

                          @revrag-ai/embed-react-native

                          A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation

                          • v1.0.15
                          • 18.02
                          • Published

                          real-time-speech-analyzer

                          Real-time speech analysis with local LLM using multiple concurrent analysis instructions

                          • v1.0.0
                          • 17.91
                          • Published

                          austack

                          TypeScript/JavaScript client SDK for Austack conversational AI

                          • v0.1.0
                          • 17.87
                          • Published

                          mfcc

                          Node.js implementation of the MFCC audio speech analysis algorithm.

                          • v0.0.3
                          • 17.82
                          • Published

                          alexa-ssml

                          JSX for Alexa Skills Kit SSML

                          • v0.5.0
                          • 17.65
                          • Published

                          transpeech

                          TranSpeech is a small voice and text library. It allows you to recognize and synthesize speech using a browser, and translate text.

                          • v1.1.0
                          • 17.58
                          • Published

                          alexa-speech-utils

                          Helper functions for building speech responses

                            • v0.2.0
                            • 17.57
                            • Published

                            extra-amazontts

                            Generate speech audio from super long text, via Amazon Polly and ffmpeg.

                            • v1.1.18
                            • 17.41
                            • Published

                            texttospeech

                            Text to Speech (Pure Client Side)

                            • v0.2.0
                            • 17.30
                            • Published

                            @freddydrodev/artyom

                            Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

                            • v0.0.1
                            • 17.13
                            • Published

                            falexa

                            Create your own verbal commands that map to custom Javascript functions

                            • v2.0.3
                            • 17.10
                            • Published