Found 737 results for speech recognition

@speechly/speech-recognition-polyfill

Polyfill for the Speech Recognition API using Speechly

cordova-plugin-speechrecognition

Cordova Plugin for Speech Recognition

annyang

A javascript library for adding voice commands to your site, using speech recognition

sonix-speech-recognition

A library that produces audio transcriptions and translations using the Sonix.AI service.

echogarden

An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

@spoonconsulting/cordova-plugin-speechrecognition

Cordova Plugin for Speech Recognition

react-speech-recognition-mutation

💬Speech recognition for your React app

@picovoice/eagle-web

Eagle Speaker Recognition engine for web browsers (via WebAssembly)

speechkitt

A flexible GUI for interacting with Speech Recognition

@afsalk/use-speech-recognition

A custom React hook for speech recognition built using SpeechRecognition API

speech-recognition-mock

Mock SpeechRecognition for headless unit tests

@sign-speak/react-sdk

Unlock Sign Language Recognition, Avatar, and Speech Recognition.

sherpa-ncnn

Real-time speech recognition with Next-gen Kaldi

speech-js

lib for recognition and synthesis of speech

react-native-speech-to-text-ios

React Native speech recognition component for iOS 10+

voice-speech-recognition

Simple wrapper extended functionalities of Speech Recognition embedded in browsers.

sber-salute-speech-recognition

A library that produces audio transcriptions using the SBER Salute Speech service.

react-native-voicebox-speech-rec

A powerful speech recognition library for React Native applications, enabling real-time speech-to-text transcription.

@red-mobile/cordova-plugin-speech-recognition

Cordova Plugin for Speech Recognition

electron-speech

speech recognition cli and api for node using electron

spremic

A simple JavaScript speech recognition library.

@mastashake08/speech-kit

Package for simplifying the Speech Recognition and Speech Utterence process.

react-voice-search

React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.

sonus

Open source cross platform decentralized always-on speech recognition framework

vosk

Node binding for continuous offline voice recoginition with Vosk library.

@aurally/fancy-search

A lib to improve your apps search functionality by adding the opt-outable speech recognition, multiple parallel searches and automatic result matching

speech-recognition-evaluation

Command line utility to evaluate Automated Speech Recognition (ASR) systems

react-native-speech-engine

React Native Speech Recognition and Text-to-Speech with new architecture support

artyom.js

Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

whisper-onnx-speech-to-text

Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.

@ng-web-apis/speech

A library for using Web Speech API with Angular

simple-speech-recognition

A simple wrapper for Speech Recognition APIs in the browser

Lightweight Speech Recognition Library for Electron. Based on [nodejs-speech-kiosk-usercase](https://www.npmjs.com/package/nodejs-speech-kiosk-usercase) and [vosk-api](https://github.com/alphacep/vosk-api).

cordova-plugin-speech

This is cordova plugin for Speech Recognition and Text to Speech.

@antonyxuan/media-processor

processing audio media used for speech recognition

speech-recognition-react

A react library that encapsulates the native browser speech recognition api

@deepgram/sdk

Isomorphic Javascript client for Deepgram

microsoft-cognitiveservices-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

@soniox/speech-to-text-web

Javascript client library for Soniox Speech-to-Text websocket API

@wasm-audio-decoders/opus-ml

Web Assembly streaming Opus decoder with Machine Learning enhancements

cordova-plugin-speechrecognition-prakash

Cordova Plugin for Speech Recognition

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

speech-to-element

Add real-time speech to text functionality into your website with no effort

corti

Replace window.SpeechRecognition with a mock object and automate your tests

@google-cloud/speech

Cloud Speech Client Library for Node.js

parakeet.js

NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

microsoft-speech-browser-sdk

Microsoft Speech SDK for browsers

expo-speech

Provides text-to-speech functionality.

revai-node-sdk

Rev AI makes speech applications easy to build!

houndify-react-native

Allows react-native apps to connect to Houndify for speech recognition.

@blrrt/cordova-plugin-speech-recognition-ios

Cordova plugin exposing the iOS Speech Recognition API

cordova-plugin-speechrecognition-edited-wai

Cordova Plugin for Speech Recognition ios, Speech Recognition Extension

react-native-tts

React Native Text-To-Speech module for Android and iOS

speaktome-api

JavaScript modules for Mozilla's cloud speech recognition API

speech-to-text

A speech to text module.

yanyu

A Chinese speech synthesis and recognition library toolkit

@react-native-voice/voice

React Native Native Voice library for iOS and Android

speech-into-text

SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

cordova-plugin-speechrecognition-lbp

Cordova Plugin for Speech Recognition

@speechmatics/batch-client

Javascript client for the Speechmatics batch jobs API

react-speech

React component for the web speech synthesis api

oxford-speech-wrapper

simple bing voice recognition wrapper

@blrrt/cordova-plugin-speech-recognition-ios-browser-polyfill

Cordova iOS polyfill for the Speech Recognition API

react-native-kie-android-voice

A React Native Voice Recognition Module

@speechly/browser-client

JavaScript client for Speechly Streaming API

sherpa-onnx-node

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

koi-koi

Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

fft-js

Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.

en-pos

A better English POS tagger written in JavaScript

speech-recog-stream

A module to stream audio to a speech recognition server and get back the STT result"

@speechmatics/real-time-client

Client for the Speechmatics real-time API

node-record-lpcm16

Record a microphone input stream

elevenlabs-node

This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

react-native-android-voice-persian

A React Native Voice Recognition Module With Persian Locale support

@picovoice/porcupine-web

Porcupine wake word engine for web browsers (via WebAssembly)

@react-native-community/voice

React Native Native Voice library for iOS and Android

speechflow

Speech Processing Flow Graph

react-text-to-speech

An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

mic-to-speech

Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.

retext-pos

retext plugin to add part-of-speech (POS) tags

ssml-check-core

Core library to check for valid SSML

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

react-hook-speech-to-text

Simple cross-browser speech to text using react hooks.

google-cloud-speech-webaudio

Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.

sherpa-onnx-darwin-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@speechly/react-client

React client for Speechly Streaming API

speech-ui-kitt

A flexible GUI for interacting with Speech Recognition

@chengsokdara/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

watson-html5-speech-recognition

Watson HTML5 Speech to Text

@wdragon/react-native-voice

React Native Native Voice library for iOS and Android, folk from @react-native-voice/voice

@swankylegg/voice-io

A browser-based speech recognition and synthesis assistant

babbler

Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.

web-wake-word

A web package for keyword detection

@picovoice/porcupine-react

React component for Porcupine Web SDK

@corti/dictation-web

Web component for Corti Dictation

ssml-check

Check for valid SSML

sam-js

SAM - The Software Automatic Mouth

sherpa-onnx-win-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

sherpa-onnx-win-ia32

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

aws-transcribe

A client for Amazon Transcribe using the websocket interface

@speechmatics/browser-audio-input-react

React hooks for managing audio inputs and permissions across browsers

aws-transcription-to-vtt

convert an AWS transcribe JSON body into a .vtt file

@aurally/speech-control

A class to handle microphone permissions, start and observe speech input

espeak-ng

eSpeak-NG speech synthesizer, compiled to JavasScript + WASM

cordova-plugin-speech-edited-wai

Cordova Plugin for Speech Recognition ios, Speech Recognition Extension

soundswallower

An even smaller speech recognizer

@speechmatics/real-time-client-react

React hooks for interacting with the Speechmatics Real-Time API

vocalize.ts

A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.

@speechmatics/flow-client

Javascript client for the Speechmatics Flow API

@speechmatics/flow-client-react

React hooks for interacting with the Speechmatics Flow API

node-red-contrib-google-cloud

Node-RED nodes for Google Cloud Platform

@picovoice/cheetah-web

Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

mumble-js

A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.

@picovoice/cobra-web

Cobra VAD engine for web browsers (via WebAssembly)

audio-to-text-node

Backend audio file to text transcription using Web Speech API with Puppeteer

@hahnpro/ms-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

text2wav

Self-contained multilingual TTS speech synthesizer for Node.js in pure js

@picovoice/cheetah-react

React hook for Cheetah Web SDK

piper-announce

AI-powered announcement generator using Piper TTS and OpenAI GPT models

cordova-plugin-speechtotext-activity

Cordova Plugin for Speech Recognition

sherpa-onnx-linux-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

ng-speech-recognition

AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.

speech-recognition-android

Cordova Speech Recognition Plugin for Android

@picovoice/rhino-web

Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

@bbc/react-transcript-editor

A React component to make transcribing audio and video easier and faster.

mmir-lib

MMIR (Mobile Multimodal Interaction and Relay) library

yandex-speech

node.js module for Yandex speech systems (ASR & TTS)

cybertyper

ReactJS component for automatically typing text synchronized with speech synthesis & recognition

node-witai-speech

This is an API wrapper for witai speech for nodejs

azure-speech-utilities

Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil

react-native-spokestack

React Native plugin for adding voice using Spokestack

brill

Part-of-speech tags from the Brill-tagger

yet-another-react-native-voice

React Native Native Voice library for iOS and Android

edge-tts-universal

Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

@picovoice/rhino-react

React component for Rhino Web SDK

react-native-baidu-asr

提供百度语音 React Native 接口

avr-vad

A Node.js library for Voice Activity Detection using Silero VAD

vosk-lib

Vosk library for node, with type defenitions and multi-arch support.

sherpa-onnx-darwin-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@nextevi/voice-react

React SDK for NextEVI Voice AI Platform

@pr0gramm/fluester

Node.js bindings for OpenAI's Whisper. Optimized for CPU.

react-audio-transcriber-hook

A custom React hook for voice recording with speech recognition

@voice-ping/cognitive-services-speech

VoicePing Cognitive Services Speech SDK for JavaScript forked from Microsoft

whisper-speech-to-text

A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

@novo-learning/novo-sdk

SDK for the Novolanguage Speech Analysis API

@qdacity/react-transcript-editor

A React component to make transcribing audio and video easier and faster.

mespeak

Text to speech synthesizer

pmacom-react-transcript-editor

A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.

@ssml-utilities/highlighter

SSML syntax highlighter for the SSML Utilities toolkit

node-red-contrib-tts-ultimate

Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You

@euirim/microsoft-cognitiveservices-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

postcss-speech-bubble

PostCSS plugin creates speech bubbles with just 1-2 lines of CSS

cordova-plugin-speechrecognition-updated

Updated cordova-plugin-speechrecognition to remove onfulfilled() errors

bingspeechrecognition-api

Module to use bing speech recognition api to convert speech to text

@mhpdev/react-native-speech

A high-performance React Native library for text-to-speech on iOS and Android

@kajidog/aivis-cloud-cli

Aivis Cloud CLI - Text-to-speech synthesis and model management

mbz-voice-sdk

🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.

text-to-speech-js

A small JavaScript library that provides a text to speech conversion using tts-api.com service.

bw-speech-recognition

💬Speech recognition for your React app

vue-webapi-speech-recognition

Microphone icon as a single component (black as default and red when it's recording) to interact with the Web Speech Recognition Api

echogarden-migaku

An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

@picovoice/koala-web

Koala Noise Suppression engine for web browsers (via WebAssembly)

react-speech-recognition-es

💬Speech recognition for your React app

@arach/speakeasy

SpeakEasy - Unified text-to-speech service with provider abstraction

ispikit

react-native-text-to-speech

React Native Text-To-Speech module for Android and iOS

@larriereguichet/vosk

Node binding for continuous offline voice recoginition with Vosk library.

@picovoice/leopard-web

Leopard Speech-to-Text engine for web browsers (via WebAssembly)

node-mfcc

Node.js implementation of the MFCC audio speech analysis algorithm.

@cephable/cephable-web

Add Cephable controls to your web-based apps

iobroker.sonus

With this adapter you can control ioBroker with voice in many different languages

@mirawision/reactive-hooks

A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.

@arifdroid/enhanced-chat

Enhanced chat widget combining modern n8n styling with advanced voice features and alert mode

pocketsphinx

Node binding for continuous voice recoginition through pocketsphinx.

yandex-speech-promise

Promise based implementation of Yandex Speech Kit API

microsoft-speech-browser-sdk-cris

Microsoft Speech SDK for browsers (using CRIS endpoint)

xfyun-sdk

科大讯飞语音识别 SDK，支持浏览器中实时语音听写功能

microsoft-speech-browser-sdk-legacy

Microsoft Speech SDK for browsers

js-speech-rekognition

### Usage

@picovoice/orca-web

Orca Text-to-Speech engine for web browsers (via WebAssembly)

@moonshine-ai/moonshine-js

On-device speech-to-text and voice control for web applications with Moonshine.

mac-say

The macOS built-in `say` CLI for JavaScript

@albertsyh/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

speechjs

Chrome speech recognition API wrapper

primvoices-react

React client for the PrimVoices Agents API

react-native-voice-hold

React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes

@transcribe/shout

Wasm build based on whisper.cpp.

@transcribe/transcriber

Transcribe speech to text in the browser.

@react-native-oh-tpl/react-native-tts

React Native Text-To-Speech module for Android and iOS

node-droid-language

A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online

text-to-speech

Java application that allows to transform a text to speech using [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).

discord-tts

Node.js module to make your discord bot talk

@vocantai/n8n-nodes-translation-vocantai

Audio file transcription services. Your speech. Private.

@picovoice/leopard-react

React hook for Leopard Web SDK

@caspingus/ssml-check-core

Core library to check for valid SSML

node-mic-record

Record microphone sond using nodejs

@untemps/react-vocal

React component and hook to initiate a SpeechRecognition session

tiktok-tts

Use TikTok TTS from node.js

@caspingus/ssml-check

Check for valid SSML

@logikron/talk-widget-embed

Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website

@picovoice/orca-node

Picovoice Orca Node.js binding

@steelbrain/media-speech-detection-web

Production-ready speech detection using Silero VAD ONNX model for web browsers

buzzphrase

Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections

@bluefly/apple-fm

Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing

@kkaczynski/use-whisper

React Hook for OpenAI Whisper API with speech recorder.

@lipsurf/plugins

Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

@arellak/elevenlabs-wrapper

Wrapper for the ElevenLabs API

speech-to-text-recognition

moved to speechless

ttsmaker

Text-to-Speech API wrapper for ttsmp3.com

aixblock-voice-ai-deepgram

A React component for real-time transcription and voice agent interactions using Deepgram APIs

@andresaya/ssml-builder

TypeScript library for building SSML documents

react-native-sfspeechrecognizer

iOS SFSpeechRecognizer bridge module for React Native

pronounceability

Calculate pronounceability for a given word.

n8n-nodes-groq

N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

whatsapp-claude-gpt

WhatsApp-Claude-GPT is a WhatsApp chatbot that supports multiple AI providers for chat, optional image generation/editing, and voice (speech-to-text and text-to-speech). It’s built for natural, contextual conversations and can now also handle reminders an