JSPM

Found 736 results for speech recognition

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.11
  • 43.66
  • Published

retext-pos

retext plugin to add part-of-speech (POS) tags

  • v5.0.0
  • 43.59
  • Published

ssml-check-core

Core library to check for valid SSML

  • v0.3.9
  • 43.45
  • Published

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.11
  • 43.38
  • Published

google-cloud-speech-webaudio

Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.

  • v0.1.4
  • 42.57
  • Published

@wdragon/react-native-voice

React Native Voice library for iOS and Android, forked from @react-native-voice/voice

  • v3.3.11
  • 41.14
  • Published

sherpa-onnx-darwin-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.11
  • 40.82
  • Published

@chengsokdara/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

  • v0.2.0
  • 40.68
  • Published

web-wake-word

A web package for keyword detection

    • v2.0.10
    • 40.11
    • Published

    babbler

    Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.

    • v1.0.0
    • 39.96
    • Published

    @swankylegg/voice-io

    A browser-based speech recognition and synthesis assistant

    • v1.0.11
    • 39.86
    • Published

    koi-koi

    Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

    • v0.1.0
    • 39.44
    • Published

    ssml-check

    Check for valid SSML

    • v0.4.6
    • 38.90
    • Published

    sam-js

    SAM - The Software Automatic Mouth

    • v0.3.1
    • 38.82
    • Published

    sherpa-onnx-win-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 38.81
    • Published

    sherpa-onnx-win-ia32

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 38.76
    • Published

    aws-transcribe

    A client for Amazon Transcribe using the websocket interface

    • v1.1.1
    • 38.08
    • Published

    @aurally/speech-control

    A class to handle microphone permissions, start and observe speech input

    • v1.1.2
    • 37.91
    • Published

    espeak-ng

    eSpeak-NG speech synthesizer, compiled to JavaScript + WASM

    • v1.0.2
    • 37.26
    • Published

    soundswallower

    An even smaller speech recognizer

    • v0.6.3
    • 37.03
    • Published

    vocalize.ts

    A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.

    • v1.2.2
    • 36.84
    • Published

    @picovoice/cheetah-web

    Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

      • v2.3.0
      • 35.08
      • Published

      @picovoice/cobra-web

      Cobra VAD engine for web browsers (via WebAssembly)

        • v2.0.3
        • 34.60
        • Published

        audio-to-text-node

        Backend audio file to text transcription using Web Speech API with Puppeteer

        • v0.1.2
        • 34.27
        • Published

        piper-announce

        AI-powered announcement generator using Piper TTS and OpenAI GPT models

        • v1.2.10
        • 33.42
        • Published

        ng-speech-recognition

        AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.

        • v2.0.1
        • 33.15
        • Published

        sherpa-onnx-linux-arm64

        Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

        • v1.12.11
        • 33.12
        • Published

        @picovoice/rhino-web

        Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

          • v3.0.3
          • 32.76
          • Published

          yandex-speech

          node.js module for Yandex speech systems (ASR & TTS)

          • v0.0.14
          • 32.18
          • Published

          speech-ui-kitt

          A flexible GUI for interacting with Speech Recognition

          • v0.1.0
          • 32.15
          • Published

          text2wav

          Self-contained multilingual TTS speech synthesizer for Node.js in pure js

          • v0.0.14
          • 32.10
          • Published

          mmir-lib

          MMIR (Mobile Multimodal Interaction and Relay) library

          • v7.0.1
          • 32.08
          • Published

          cybertyper

          ReactJS component for automatically typing text synchronized with speech synthesis & recognition

          • v0.0.3
          • 31.68
          • Published

          avr-vad

          A Node.js library for Voice Activity Detection using Silero VAD

          • v1.0.9
          • 31.39
          • Published

          node-witai-speech

          An API wrapper for Wit.ai speech for Node.js

          • v1.0.2
          • 31.33
          • Published

          azure-speech-utilities

          Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil

          • v1.0.0
          • 31.02
          • Published

          @larriereguichet/vosk

          Node binding for continuous offline voice recognition with Vosk library.

          • v0.4.4
          • 31.02
          • Published

          mumble-js

          A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.

          • v1.0.1
          • 30.69
          • Published

          edge-tts-universal

          Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

          • v1.3.0
          • 30.62
          • Published

          brill

          Part-of-speech tags from the Brill-tagger

          • v3.1.0
          • 30.55
          • Published

          sherpa-onnx-darwin-x64

          Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

          • v1.12.11
          • 30.38
          • Published

          vosk-lib

          Vosk library for node, with type definitions and multi-arch support.

          • v0.1.3
          • 30.21
          • Published

          @pr0gramm/fluester

          Node.js bindings for OpenAI's Whisper. Optimized for CPU.

          • v0.9.15
          • 29.79
          • Published

          react-native-voice-hold

          React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes

          • v1.0.7
          • 29.59
          • Published

          whisper-speech-to-text

          A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

          • v1.0.3
          • 29.58
          • Published

          mespeak

          Text to speech synthesizer

          • v2.0.2
          • 29.13
          • Published

          pmacom-react-transcript-editor

          A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.

          • v2.4.0
          • 28.60
          • Published

          @kajidog/aivis-cloud-cli

          Aivis Cloud CLI - Text-to-speech synthesis and model management

            • v0.5.1
            • 28.48
            • Published

            postcss-speech-bubble

            PostCSS plugin creates speech bubbles with just 1-2 lines of CSS

            • v1.0.12
            • 28.41
            • Published

            node-red-contrib-tts-ultimate

            Transforms text into speech and plays it through a Sonos player, or generates an audio file for use with third-party nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You

            • v3.0.1
            • 28.38
            • Published

            mbz-voice-sdk

            🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.

            • v1.0.21
            • 28.35
            • Published

            vue-webapi-speech-recognition

            Microphone icon as a single component (black by default and red when recording) to interact with the Web Speech Recognition API

            • v1.0.1
            • 28.01
            • Published

            text-to-speech-js

            A small JavaScript library that provides a text to speech conversion using tts-api.com service.

            • v1.1.11
            • 28.01
            • Published

            echogarden-migaku

            An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

            • v2.5.2
            • 27.65
            • Published

            @picovoice/koala-web

            Koala Noise Suppression engine for web browsers (via WebAssembly)

              • v2.0.0
              • 27.62
              • Published

              n8n-nodes-groq-speech

              N8N Community Node for Groq Text-to-Speech API integration

              • v1.1.2
              • 27.35
              • Published

              ispikit

              ispikit

              • v1.0.3
              • 27.30
              • Published

              @arach/speakeasy

              SpeakEasy - Unified text-to-speech service with provider abstraction

                • v0.2.4
                • 27.12
                • Published

                @picovoice/leopard-web

                Leopard Speech-to-Text engine for web browsers (via WebAssembly)

                  • v2.0.1
                  • 26.55
                  • Published

                  node-mfcc

                  Node.js implementation of the MFCC audio speech analysis algorithm.

                  • v0.0.2
                  • 26.55
                  • Published

                  iobroker.sonus

                  With this adapter you can control ioBroker with voice in many different languages

                  • v0.1.1
                  • 26.07
                  • Published

                  @mirawision/reactive-hooks

                  A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.

                  • v1.1.0
                  • 25.94
                  • Published

                  pocketsphinx

                  Node binding for continuous voice recognition through pocketsphinx.

                  • v5.0.7
                  • 25.89
                  • Published

                  xfyun-sdk

                  iFlytek speech recognition SDK, supporting real-time speech dictation in the browser

                  • v1.0.2
                  • 25.39
                  • Published

                  @picovoice/orca-web

                  Orca Text-to-Speech engine for web browsers (via WebAssembly)

                    • v1.2.1
                    • 25.19
                    • Published

                    @moonshine-ai/moonshine-js

                    On-device speech-to-text and voice control for web applications with Moonshine.

                    • v0.1.29
                    • 25.12
                    • Published

                    @albertsyh/use-whisper

                    React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                    • v0.2.17
                    • 25.05
                    • Published

                    speechjs

                    Chrome speech recognition API wrapper

                    • v0.0.1
                    • 24.85
                    • Published

                    primvoices-react

                    React client for the PrimVoices Agents API

                    • v0.2.2
                    • 24.78
                    • Published

                    text-to-speech

                    Java application that converts text to speech using the [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).

                    • v1.0.11
                    • 24.69
                    • Published

                    mac-say

                    The macOS built-in `say` CLI for JavaScript

                    • v0.3.3
                    • 24.55
                    • Published

                    node-droid-language

                    A Node.js droid language engine built using the sound library inside the Python library ttastromech, as well as other sound sources found online

                    • v1.0.2
                    • 24.55
                    • Published

                    discord-tts

                    Node.js module to make your discord bot talk

                    • v1.2.2
                    • 24.31
                    • Published

                    node-mic-record

                    Record microphone sound using Node.js

                    • v0.0.1
                    • 23.84
                    • Published

                    @untemps/react-vocal

                    React component and hook to initiate a SpeechRecognition session

                    • v1.7.28
                    • 23.72
                    • Published

                    @logikron/talk-widget-embed

                    Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website

                    • v1.0.2
                    • 23.33
                    • Published

                    buzzphrase

                    Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections

                    • v3.2.1
                    • 23.27
                    • Published

                    tiktok-tts

                    Use TikTok TTS from node.js

                    • v1.1.17
                    • 23.15
                    • Published

                    @bluefly/apple-fm

                    Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing

                    • v0.2.7
                    • 23.15
                    • Published

                    @lipsurf/plugins

                    Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

                    • v4.10.0
                    • 23.11
                    • Published

                    ttsmaker

                    Text-to-Speech API wrapper for ttsmp3.com

                    • v1.0.3
                    • 22.68
                    • Published

                    aixblock-voice-ai-deepgram

                    A React component for real-time transcription and voice agent interactions using Deepgram APIs

                      • v0.0.7
                      • 22.22
                      • Published

                      n8n-nodes-groq

                      N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

                      • v0.2.0
                      • 21.90
                      • Published

                      pronounceability

                      Calculate pronounceability for a given word.

                      • v0.0.3
                      • 21.75
                      • Published

                      browser-speech

                      🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech

                        • v1.1.1
                        • 21.59
                        • Published

                        ugai

                        A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

                        • v1.1.0
                        • 21.51
                        • Published

                        espeak

                        text-to-speech using espeak cli program

                        • v0.0.3
                        • 21.51
                        • Published

                        koi-app

                        Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

                        • v0.1.2
                        • 21.37
                        • Published

                        @squirrelsoft/dev-say

                        MCP server for macOS text-to-speech using the say command

                        • v1.0.1
                        • 21.13
                        • Published

                        @qubby/use-whisper-beta

                        React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                        • v0.0.27
                        • 20.90
                        • Published

                        spoken

                        JavaScript Web API for Text-to-Speech and Speech-to-Text.

                        • v1.1.17
                        • 20.79
                        • Published

                        klatt-syn

                        Klatt formant synthesizer

                        • v1.0.7
                        • 20.38
                        • Published

                        speechrecognizer

                        Cordova plugin which provides a speech recognition service

                        • v0.0.2
                        • 20.14
                        • Published

                        @qubby/use-whisper

                        React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                        • v0.0.42
                        • 20.13
                        • Published

                        linear16

                        Converts an audio file to LINEAR16 Google-speech compatible file.

                        • v1.2.1
                        • 20.04
                        • Published

                        @cloudraker/use-whisper

                        React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                        • v0.3.0
                        • 19.65
                        • Published

                        simpletts

                        A basic TTS manager

                        • v2.6.0
                        • 19.13
                        • Published

                        speakie

                        speakie is a professional voice recognition package designed to empower your applications with seamless voice command integration. Easily create functions that respond to specific voice commands, making user interactions more natural and intuitive.

                        • v1.0.0
                        • 19.05
                        • Published

                        tts-cli

                        Command-line tool to convert text to speech

                        • v5.4.1
                        • 18.95
                        • Published

                        react-transcript-editor

                        A React component to make transcribing audio and video easier and faster.

                        • v1.3.1-alpha.4
                        • 18.75
                        • Published

                        speechify

                        Easily add speech to text functionality into your website

                        • v0.1.0
                        • 18.66
                        • Published

                        say2

                        Interactive text-to-speech CLI with multiple voices using ElevenLabs API

                        • v1.1.0
                        • 18.56
                        • Published

                        web-speech-profanity

                        Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.

                        • v7.1.2-0
                        • 18.28
                        • Published

                        angular2-speech-engine

                        A set of Angular2 services and components to add speech recognition and synthesis to Angular2 applications

                        • v0.0.2
                        • 18.27
                        • Published

                        realtime-ten-vad

                        Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.

                        • v1.0.0
                        • 18.14
                        • Published

                        react-native-deepgram

                        React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

                        • v0.1.21
                        • 18.08
                        • Published

                        @revrag-ai/embed-react-native

                        A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation

                        • v1.0.15
                        • 18.02
                        • Published

                        real-time-speech-analyzer

                        Real-time speech analysis with local LLM using multiple concurrent analysis instructions

                        • v1.0.0
                        • 17.91
                        • Published

                        austack

                        TypeScript/JavaScript client SDK for Austack conversational AI

                        • v0.1.0
                        • 17.87
                        • Published

                        mfcc

                        Node.js implementation of the MFCC audio speech analysis algorithm.

                        • v0.0.3
                        • 17.82
                        • Published

                        alexa-ssml

                        JSX for Alexa Skills Kit SSML

                        • v0.5.0
                        • 17.65
                        • Published

                        transpeech

                        TranSpeech is a small voice and text library. It allows you to recognize and synthesize speech using a browser, and translate text.

                        • v1.1.0
                        • 17.58
                        • Published

                        alexa-speech-utils

                        Helper functions for building speech responses

                          • v0.2.0
                          • 17.57
                          • Published

                          extra-amazontts

                          Generate speech audio from super long text, via Amazon Polly and ffmpeg.

                          • v1.1.18
                          • 17.41
                          • Published

                          texttospeech

                          Text to Speech (Pure Client Side)

                          • v0.2.0
                          • 17.30
                          • Published

                          @freddydrodev/artyom

                          Artyom is a robust wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition APIs that allows you to create a virtual assistant

                          • v0.0.1
                          • 17.13
                          • Published

                          falexa

                          Create your own verbal commands that map to custom Javascript functions

                          • v2.0.3
                          • 17.10
                          • Published

                          salient

                          Salient is a natural language processing and sentiment analysis library

                          • v0.2.1
                          • 17.08
                          • Published

                          vosk-js

                          Node binding for continuous voice recognition through vosk-api.

                          • v0.3.0
                          • 17.03
                          • Published

                          speedyspeech

                          This is a module to quickly use the Web Speech API to recognize keywords as a user speaks.

                          • v0.1.2
                          • 16.89
                          • Published

                          node-speak

                          TTS (Text to Speech) for Node and Browser

                          • v0.0.2
                          • 16.83
                          • Published

                          cordova-plugin-iflyspeech

                          Cordova plugin to support mobile speech recognizer and synthesizer with iFlyTek voice cloud service

                          • v0.9.2
                          • 16.64
                          • Published

                          speech-tree

                          An events tree which lets you define a sequence of voice commands.

                          • v0.0.2
                          • 16.60
                          • Published

                          qt-ai-gateway-npm-sdk

                          A WebSocket-based TTS client with real-time audio streaming and playback

                          • v1.0.5
                          • 16.24
                          • Published

                          voicevox.js

                          A client for the VOICEVOX API, providing text-to-speech capabilities.

                            • v0.8.1
                            • 16.24
                            • Published

                            mmir-plugin-speech-nuance-lang

                            tools for querying supported languages (ASR and TTS) and voices (TTS) by Nuance / Cerence SpeechKit

                            • v1.1.1
                            • 16.24
                            • Published

                            @itslanguage/api

                            The JavaScript API SDK for ITSLanguage.

                            • v5.7.0
                            • 15.85
                            • Published

                            formantanalyzer

                            Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a web browser using WebAudio API.

                            • v1.1.8
                            • 15.78
                            • Published

                            @itslanguage/recorder

                            JavaScript Recorder based on MediaRecorder from ITSLanguage.

                            • v6.0.3
                            • 15.71
                            • Published

                            praatio

                            A javascript library for working with praat, textgrids, time aligned audio transcripts, and audio files.

                            • v2.3.4
                            • 15.67
                            • Published

                            @usefulsensors/moonshine-js

                            On-device speech-to-text and voice control for web applications with Moonshine.

                            • v0.1.21
                            • 15.61
                            • Published

                            whatsapp-claude-gpt

                            WhatsApp-Claude-GPT is a WhatsApp chatbot that supports multiple AI providers for chat, optional image generation/editing, and voice (speech-to-text and text-to-speech). It’s built for natural, contextual conversations and can now also handle reminders an

                            • v1.4.0
                            • 15.56
                            • Published

                            alexa-speechlet

                            Alexa speech synthesis markup generator (SSML), making it easy to do all the things.

                            • v1.3.6
                            • 15.47
                            • Published

                            aws-transcribe-to-vtt

                            Turn JSON from Amazon AWS Transcribe into VTT files for use as subtitles.

                            • v1.0.6
                            • 15.46
                            • Published

                            wikipedia-tts

                            Crawl Wikipedia pages and upload TTS to Youtube.

                            • v1.4.11
                            • 15.21
                            • Published

                            node-red-contrib-wavenet

                            Easily convert text to speech using Google Wavenet voices on Node-RED.

                            • v3.1.2
                            • 14.96
                            • Published

                            voice-node-library

                            Real-time voice bot library with STT, LLM, and TTS capabilities

                            • v1.0.2
                            • 14.92
                            • Published

                            baidu_yuyin

                            Node.js implementation of Baidu Voice (Baidu Yuyin)

                            • v2.3.1
                            • 14.88
                            • Published

                            node-tts-api

                            Simple way to get TTS with node using TTS-API.com

                            • v0.0.5
                            • 14.70
                            • Published

                            @ji8122s/use-whisper-test

                            React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                            • v0.0.62
                            • 14.59
                            • Published

                            mmir-plugin-lang-support

                            tools for querying supported languages (ASR and TTS) and voices (TTS) for mmir speech plugins

                            • v1.5.0
                            • 14.57
                            • Published

                            @venkatesh966/speech-text

                            A React component seamlessly integrating audio-assistant functionality via the Web Speech API and OpenAI GPT. Users can interact naturally with the application through spoken commands, receiving responses as audio.

                              • v1.1.7
                              • 14.50
                              • Published

                              editorjs-speech

                              Speech Block Tool for Editor.js

                              • v1.6.1
                              • 14.40
                              • Published

                              @daitanjs/speech

                              A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

                              • v1.0.6
                              • 14.40
                              • Published

                              tts-api

                              Text to speech REST API for multiple TTS engines

                              • v2.5.1
                              • 14.32
                              • Published

                              speech-code

                              The text generator that uses the soviet speech code. No LLM required!

                              • v2.0.0
                              • 14.21
                              • Published

                              @sharcoux/vosk

                              Node binding for continuous offline voice recognition with Vosk library.

                              • v0.3.24
                              • 13.98
                              • Published

                              yandex-dialogs-client

                              Client for working locally with Alice skills from Yandex.Dialogs

                              • v1.2.0
                              • 13.78
                              • Published

                              cmusphinxdict

                              Wrapper for CMU Sphinx Pronouncing Dictionary

                              • v0.0.9
                              • 13.73
                              • Published

                              gtts.js

                              A Promise based Node.js/TypeScript port of the gTTS python library

                              • v1.0.1
                              • 13.56
                              • Published

                              speechmatch

                              Match words by their pronunciations

                              • v0.5.2
                              • 13.47
                              • Published