JSPM

Found 735 results for speech

@deepgram/sdk

Isomorphic Javascript client for Deepgram

  • v4.11.2
  • 62.34
  • Published

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

  • v1.2.0
  • 60.64
  • Published

expo-speech

Provides text-to-speech functionality.

  • v13.1.7
  • 56.24
  • Published

react-native-tts

React Native Text-To-Speech module for Android and iOS

  • v4.1.1
  • 54.12
  • Published

speech-to-element

Add real-time speech to text functionality into your website with no effort

  • v1.0.4
  • 51.36
  • Published

revai-node-sdk

Rev AI makes speech applications easy to build!

  • v3.8.5
  • 48.11
  • Published

fft-js

Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.

  • v0.0.12
  • 47.79
  • Published

en-pos

A better English POS tagger written in JavaScript

  • v1.0.16
  • 47.21
  • Published

sonix-speech-recognition

A library that produces audio transcriptions and translations using the Sonix.AI service.

  • v2.1.1
  • 45.20
  • Published

elevenlabs-node

This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

  • v2.0.3
  • 44.96
  • Published

annyang

A javascript library for adding voice commands to your site, using speech recognition

  • v2.6.1
  • 44.66
  • Published

react-text-to-speech

An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.

  • v2.1.2
  • 44.00
  • Published

retext-pos

retext plugin to add part-of-speech (POS) tags

  • v5.0.0
  • 43.24
  • Published

ssml-check-core

Core library to check for valid SSML

  • v0.3.9
  • 43.10
  • Published

@chengsokdara/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

  • v0.2.0
  • 41.29
  • Published

echogarden

An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

  • v2.10.0
  • 40.87
  • Published

sherpa-onnx-node

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.10
  • 40.59
  • Published

sam-js

SAM - The Software Automatic Mouth

  • v0.3.1
  • 39.44
  • Published

ssml-check

Check for valid SSML

  • v0.4.6
  • 38.98
  • Published

aws-transcribe

A client for Amazon Transcribe using the websocket interface

  • v1.1.1
  • 38.01
  • Published

react-speech

React component for the web speech synthesis api

  • v1.0.2
  • 37.60
  • Published

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.10
  • 37.27
  • Published

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.10
  • 37.11
  • Published

espeak-ng

eSpeak-NG speech synthesizer, compiled to JavasScript + WASM

  • v1.0.2
  • 36.72
  • Published

vosk-koffi

Vosk node API based on Koffi.

  • v1.1.1
  • 35.87
  • Published

vosk

Node binding for continuous offline voice recoginition with Vosk library.

  • v0.3.39
  • 35.79
  • Published

sherpa-onnx-darwin-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.10
  • 34.59
  • Published

piper-announce

AI-powered announcement generator using Piper TTS and OpenAI GPT models

  • v1.2.10
  • 32.92
  • Published

sherpa-onnx-win-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.10
  • 32.84
  • Published

sherpa-onnx-win-ia32

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.10
  • 32.61
  • Published

text2wav

Self-contained multilingual TTS speech synthesizer for Node.js in pure js

  • v0.0.14
  • 32.26
  • Published

artyom.js

Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

  • v1.0.6
  • 32.07
  • Published

pmacom-react-transcript-editor

A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.

  • v2.4.0
  • 32.01
  • Published

node-witai-speech

This is an API wrapper for witai speech for nodejs

  • v1.0.2
  • 31.77
  • Published

avr-vad

A Node.js library for Voice Activity Detection using Silero VAD

  • v1.0.9
  • 31.55
  • Published

brill

Part-of-speech tags from the Brill-tagger

  • v3.1.0
  • 30.75
  • Published

edge-tts-universal

Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

  • v1.3.0
  • 30.72
  • Published

@ng-web-apis/speech

A library for using Web Speech API with Angular

  • v4.12.0
  • 30.71
  • Published

@picovoice/cobra-web

Cobra VAD engine for web browsers (via WebAssembly)

    • v2.0.3
    • 30.63
    • Published

    corti

    Replace window.SpeechRecognition with a mock object and automate your tests

    • v1.0.0
    • 30.05
    • Published

    @picovoice/cheetah-web

    Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

      • v2.3.0
      • 29.98
      • Published

      @pr0gramm/fluester

      Node.js bindings for OpenAI's Whisper. Optimized for CPU.

      • v0.9.15
      • 29.68
      • Published

      speech-into-text

      SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

      • v4.0.2
      • 28.72
      • Published

      @picovoice/koala-web

      Koala Noise Suppression engine for web browsers (via WebAssembly)

        • v2.0.0
        • 28.59
        • Published

        node-red-contrib-tts-ultimate

        Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You

        • v3.0.1
        • 28.52
        • Published

        speechflow

        Speech Processing Flow Graph

        • v1.5.0
        • 28.51
        • Published

        postcss-speech-bubble

        PostCSS plugin creates speech bubbles with just 1-2 lines of CSS

        • v1.0.12
        • 28.51
        • Published

        mespeak

        Text to speech synthesizer

        • v2.0.2
        • 28.35
        • Published

        sherpa-onnx-linux-arm64

        Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

        • v1.12.10
        • 28.33
        • Published

        @wdragon/react-native-voice

        React Native Native Voice library for iOS and Android, folk from @react-native-voice/voice

        • v3.3.11
        • 28.26
        • Published

        text-to-speech-js

        A small JavaScript library that provides a text to speech conversion using tts-api.com service.

        • v1.1.11
        • 28.10
        • Published

        @picovoice/rhino-web

        Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

          • v3.0.3
          • 27.85
          • Published

          n8n-nodes-groq-speech

          N8N Community Node for Groq Text-to-Speech API integration

          • v1.1.2
          • 27.44
          • Published

          @arach/speakeasy

          SpeakEasy - Unified text-to-speech service with provider abstraction

            • v0.2.4
            • 27.25
            • Published

            node-mfcc

            Node.js implementation of the MFCC audio speech analysis algorithm.

            • v0.0.2
            • 27.19
            • Published

            vosk-lib

            Vosk library for node, with type defenitions and multi-arch support.

            • v0.1.3
            • 25.87
            • Published

            sherpa-onnx-darwin-x64

            Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

            • v1.12.10
            • 25.56
            • Published

            whisper-speech-to-text

            A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

            • v1.0.3
            • 25.32
            • Published

            @picovoice/orca-web

            Orca Text-to-Speech engine for web browsers (via WebAssembly)

              • v1.2.1
              • 24.78
              • Published

              web-wake-word

              A web package for keyword detection

                • v2.0.10
                • 24.48
                • Published

                node-droid-language

                A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online

                • v1.0.2
                • 24.36
                • Published

                mac-say

                The macOS built-in `say` CLI for JavaScript

                • v0.3.3
                • 24.36
                • Published

                text-to-speech

                Java application that allows to transform a text to speech using [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).

                • v1.0.11
                • 24.33
                • Published

                audio-to-text-node

                Backend audio file to text transcription using Web Speech API with Puppeteer

                • v0.1.2
                • 23.54
                • Published

                primvoices-react

                React client for the PrimVoices Agents API

                • v0.2.2
                • 23.52
                • Published

                tiktok-tts

                Use TikTok TTS from node.js

                • v1.1.17
                • 23.26
                • Published

                @bluefly/apple-fm

                Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing

                • v0.2.7
                • 23.26
                • Published

                @logikron/talk-widget-embed

                Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website

                • v1.0.2
                • 23.24
                • Published

                buzzphrase

                Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections

                • v3.2.1
                • 23.09
                • Published

                node-mic-record

                Record microphone sond using nodejs

                • v0.0.1
                • 23.04
                • Published

                speechkitt

                A flexible GUI for interacting with Speech Recognition

                • v1.0.0
                • 22.99
                • Published

                discord-tts

                Node.js module to make your discord bot talk

                • v1.2.2
                • 22.90
                • Published

                ttsmaker

                Text-to-Speech API wrapper for ttsmp3.com

                • v1.0.3
                • 22.75
                • Published

                browser-speech

                🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech

                  • v1.1.1
                  • 22.74
                  • Published

                  @picovoice/leopard-web

                  Leopard Speech-to-Text engine for web browsers (via WebAssembly)

                    • v2.0.1
                    • 22.53
                    • Published

                    sherpa-ncnn

                    Real-time speech recognition with Next-gen Kaldi

                    • v2.1.12
                    • 22.41
                    • Published

                    tts-cli

                    Command-line tool to convert text to speech

                    • v5.4.1
                    • 22.32
                    • Published

                    aixblock-voice-ai-deepgram

                    A React component for real-time transcription and voice agent interactions using Deepgram APIs

                      • v0.0.7
                      • 22.26
                      • Published

                      @picovoice/eagle-web

                      Eagle Speaker Recognition engine for web browsers (via WebAssembly)

                        • v1.0.0
                        • 22.17
                        • Published

                        soundswallower

                        An even smaller speech recognizer

                        • v0.6.3
                        • 22.05
                        • Published

                        @mirawision/reactive-hooks

                        A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.

                        • v1.1.0
                        • 22.04
                        • Published

                        @albertsyh/use-whisper

                        React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                        • v0.2.17
                        • 21.84
                        • Published

                        xfyun-sdk

                        科大讯飞语音识别 SDK,支持浏览器中实时语音听写功能

                        • v1.0.2
                        • 21.72
                        • Published

                        pronounceability

                        Calculate pronounceability for a given word.

                        • v0.0.3
                        • 21.62
                        • Published

                        ugai

                        A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

                        • v1.1.0
                        • 21.58
                        • Published

                        n8n-nodes-groq

                        N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

                        • v0.2.0
                        • 21.58
                        • Published

                        @moonshine-ai/moonshine-js

                        On-device speech-to-text and voice control for web applications with Moonshine.

                        • v0.1.29
                        • 21.29
                        • Published

                        espeak

                        text-to-speech using espeak cli program

                        • v0.0.3
                        • 21.04
                        • Published

                        @jcbyte/tts-queue

                        A lightweight wrapper for the Web Speech API's SpeechSynthesis, enabling easy queuing and management of text-to-speech utterances.

                        • v1.0.1
                        • 21.01
                        • Published

                        @squirrelsoft/dev-say

                        MCP server for macOS text-to-speech using the say command

                        • v1.0.1
                        • 20.98
                        • Published

                        mmir-lib

                        MMIR (Mobile Multimodal Interaction and Relay) library

                        • v7.0.1
                        • 20.36
                        • Published

                        spoken

                        JavaScript Web API for Text-to-Speech and Speech-to-Text.

                        • v1.1.17
                        • 20.33
                        • Published

                        klatt-syn

                        Klatt formant synthesizer

                        • v1.0.7
                        • 20.31
                        • Published

                        linear16

                        Converts an audio file to LINEAR16 Google-speech compatible file.

                        • v1.2.1
                        • 20.15
                        • Published

                        react-voice-search

                        React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.

                        • v1.1.1
                        • 19.98
                        • Published

                        @qubby/use-whisper-beta

                        React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                        • v0.0.27
                        • 19.78
                        • Published

                        @cloudraker/use-whisper

                        React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

                        • v0.3.0
                        • 19.71
                        • Published

                        simpletts

                        A basic TTS manager

                        • v2.6.0
                        • 19.62
                        • Published

                        @yayaadev/sermo-models

                        TypeScript models for Sermo API (HTTP and WebSocket) generated from OpenAPI specifications

                        • v1.0.1
                        • 19.19
                        • Published

                        mmi-js

                        Multi-Modal Input Library for voice, gesture, and traditional inputs.

                          • v1.0.0
                          • 18.99
                          • Published

                          react-transcript-editor

                          A React component to make transcribing audio and video easier and faster.

                          • v1.3.1-alpha.4
                          • 18.84
                          • Published

                          say2

                          Interactive text-to-speech CLI with multiple voices using ElevenLabs API

                          • v1.1.0
                          • 18.41
                          • Published

                          react-native-voicebox-speech-rec

                          A powerful speech recognition library for React Native applications, enabling real-time speech-to-text transcription.

                          • v1.0.4
                          • 18.33
                          • Published

                          realtime-ten-vad

                          Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.

                          • v1.0.0
                          • 18.23
                          • Published

                          speechify

                          Easily add speech to text functionality into your website

                          • v0.1.0
                          • 18.14
                          • Published

                          real-time-speech-analyzer

                          Real-time speech analysis with local LLM using multiple concurrent analysis instructions

                          • v1.0.0
                          • 17.97
                          • Published

                          voicescribe

                          Live speech transcription library with multi-language support.

                            • v0.1.0
                            • 17.93
                            • Published

                            austack

                            TypeScript/JavaScript client SDK for Austack conversational AI

                            • v0.1.0
                            • 17.73
                            • Published

                            spremic

                            A simple JavaScript speech recognition library.

                            • v0.0.48
                            • 17.73
                            • Published

                            iobroker.sonus

                            With this adapter you can control ioBroker with voice in many different languages

                            • v0.1.1
                            • 17.66
                            • Published

                            yandex-speech

                            node.js module for Yandex speech systems (ASR & TTS)

                            • v0.0.14
                            • 17.65
                            • Published

                            alexa-ssml

                            JSX for Alexa Skills Kit SSML

                            • v0.5.0
                            • 17.58
                            • Published

                            @lipsurf/plugins

                            Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

                            • v4.10.0
                            • 17.53
                            • Published

                            texttospeech

                            Text to Speech (Pure Client Side)

                            • v0.2.0
                            • 17.38
                            • Published

                            mfcc

                            Node.js implementation of the MFCC audio speech analysis algorithm.

                            • v0.0.3
                            • 17.38
                            • Published

                            @revrag-ai/embed-react-native

                            A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation

                            • v1.0.15
                            • 17.04
                            • Published

                            @sign-speak/react-sdk

                            Unlock Sign Language Recognition, Avatar, and Speech Recognition.

                            • v0.7.3
                            • 16.69
                            • Published

                            electron-speech

                            speech recognition cli and api for node using electron

                            • v1.0.7
                            • 16.68
                            • Published

                            node-speak

                            TTS (Text to Speech) for Node and Browser

                            • v0.0.2
                            • 16.53
                            • Published

                            @itslanguage/api

                            The JavaScript API SDK for ITSLanguage.

                            • v5.7.0
                            • 16.47
                            • Published

                            qt-ai-gateway-npm-sdk

                            A WebSocket-based TTS client with real-time audio streaming and playback

                            • v1.0.5
                            • 16.29
                            • Published

                            voicevox.js

                            A client for the VOICEVOX API, providing text-to-speech capabilities.

                              • v0.8.1
                              • 16.29
                              • Published

                              salient

                              Salient is a natural language processing and sentiment analysis library

                              • v0.2.1
                              • 16.02
                              • Published

                              whisper-onnx-speech-to-text

                              Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.

                              • v1.0.1
                              • 15.85
                              • Published

                              @qubby/use-whisper

                              React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                              • v0.0.42
                              • 15.82
                              • Published

                              whatsapp-claude-gpt

                              WhatsApp-Claude-GPT is a WhatsApp chatbot that supports multiple AI providers for chat, optional image generation/editing, and voice (speech-to-text and text-to-speech). It’s built for natural, contextual conversations and can now also handle reminders an

                              • v1.4.0
                              • 15.61
                              • Published

                              alexa-speechlet

                              Alexa speech synthesis markup generator (SSML), making it easy to do all the things.

                              • v1.3.6
                              • 15.55
                              • Published

                              react-native-deepgram

                              React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

                              • v0.1.21
                              • 15.47
                              • Published

                              wikipedia-tts

                              Crawl Wikipedia pages and upload TTS to Youtube.

                              • v1.4.11
                              • 15.38
                              • Published

                              mmir-plugin-speech-nuance-lang

                              tools for querying supported languages (ASR and TTS) and voices (TTS) by Nuance / Cerence SpeechKit

                              • v1.1.1
                              • 15.13
                              • Published

                              @aurally/speech-control

                              A class to handle microphone permissions, start and observe speech input

                              • v1.1.2
                              • 15.02
                              • Published

                              @aurally/fancy-search

                              A lib to improve your apps search functionality by adding the opt-outable speech recognition, multiple parallel searches and automatic result matching

                              • v1.0.9
                              • 15.00
                              • Published

                              sonus

                              Open source cross platform decentralized always-on speech recognition framework

                              • v1.0.3
                              • 15.00
                              • Published

                              baidu_yuyin

                              百度语音的Nodejs实现

                              • v2.3.1
                              • 14.96
                              • Published

                              pocketsphinx

                              Node binding for continuous voice recoginition through pocketsphinx.

                              • v5.0.7
                              • 14.89
                              • Published

                              node-red-contrib-wavenet

                              Easily convert text to speech using Google Wavenet voices on Node-RED.

                              • v3.1.2
                              • 14.76
                              • Published

                              node-tts-api

                              Simple way to get TTS with node using TTS-API.com

                              • v0.0.5
                              • 14.65
                              • Published

                              @ji8122s/use-whisper-test

                              React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

                              • v0.0.62
                              • 14.63
                              • Published

                              vosk-js

                              Node binding for continuous voice recoginition through vosk-api.

                              • v0.3.0
                              • 14.57
                              • Published

                              @daitanjs/speech

                              A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

                              • v1.0.6
                              • 14.47
                              • Published

                              parakeet.js

                              NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

                              • v0.0.3
                              • 14.39
                              • Published

                              electron-vosk-speech

                              Lightweight Speech Recognition Library for Electron. Based on [nodejs-speech-kiosk-usercase](https://www.npmjs.com/package/nodejs-speech-kiosk-usercase) and [vosk-api](https://github.com/alphacep/vosk-api).

                              • v0.2.1
                              • 14.33
                              • Published

                              formantanalyzer

                              Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a web browser using WebAudio API.

                              • v1.1.8
                              • 14.21
                              • Published

                              editorjs-speech

                              Speech Block Tool for Editor.js

                              • v1.6.1
                              • 14.18
                              • Published

                              mic-to-speech

                              Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.

                              • v1.0.1
                              • 14.03
                              • Published

                              speech-code

                              The text generator that uses the soviet speech code. No LLM required!

                              • v2.0.0
                              • 13.92
                              • Published

                              @itslanguage/recorder

                              JavaScript Recorder based on MediaRecorder from ITSLanguage.

                              • v6.0.3
                              • 13.88
                              • Published

                              yandex-dialogs-client

                              Клиент для работы с навыками Яндекс.Диалогов Алисы локально

                              • v1.2.0
                              • 13.85
                              • Published

                              @larriereguichet/vosk

                              Node binding for continuous offline voice recoginition with Vosk library.

                              • v0.4.4
                              • 13.85
                              • Published

                              react-native-voice-hold

                              React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes

                              • v1.0.7
                              • 13.73
                              • Published

                              tts-api

                              Text to speech REST API for multiple TTS engines

                              • v2.5.1
                              • 13.62
                              • Published

                              @untemps/react-vocal

                              React component and hook to initiate a SpeechRecognition session

                              • v1.7.28
                              • 13.62
                              • Published

                              gtts.js

                              A Promise based Node.js/TypeScript port of the gTTS python library

                              • v1.0.1
                              • 13.51
                              • Published

                              @mastashake08/speech-kit

                              Package for simplifying the Speech Recognition and Speech Utterence process.

                              • v2.0.8
                              • 13.46
                              • Published