Found 736 results for speech recognition

electron-speech

speech recognition cli and api for node using electron

spremic

A simple JavaScript speech recognition library.

@mastashake08/speech-kit

Package for simplifying the Speech Recognition and Speech Utterance process.

sonus

Open source cross platform decentralized always-on speech recognition framework

react-voice-search

React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.

vosk

Node binding for continuous offline voice recognition with Vosk library.

speech-recognition-evaluation

Command line utility to evaluate Automated Speech Recognition (ASR) systems

@aurally/fancy-search

A library that improves your app's search functionality by adding opt-out speech recognition, multiple parallel searches, and automatic result matching

speech-js

lib for recognition and synthesis of speech

artyom.js

Artyom is a robust wrapper for the Google Chrome SpeechSynthesis and SpeechRecognition APIs that lets you create a virtual assistant

@ng-web-apis/speech

A library for using Web Speech API with Angular

simple-speech-recognition

A simple wrapper for Speech Recognition APIs in the browser

whisper-onnx-speech-to-text

Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.

Lightweight Speech Recognition Library for Electron. Based on [nodejs-speech-kiosk-usercase](https://www.npmjs.com/package/nodejs-speech-kiosk-usercase) and [vosk-api](https://github.com/alphacep/vosk-api).

@antonyxuan/media-processor

processing audio media used for speech recognition

cordova-plugin-speech

This is cordova plugin for Speech Recognition and Text to Speech.

speech-recognition-react

A react library that encapsulates the native browser speech recognition api

@deepgram/sdk

Isomorphic Javascript client for Deepgram

cordova-plugin-speechrecognition-prakash

Cordova Plugin for Speech Recognition

@wasm-audio-decoders/opus-ml

Web Assembly streaming Opus decoder with Machine Learning enhancements

cordova-plugin-speechrecognition-lbp

Cordova Plugin for Speech Recognition

microsoft-cognitiveservices-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

corti

Replace window.SpeechRecognition with a mock object and automate your tests

@soniox/speech-to-text-web

Javascript client library for Soniox Speech-to-Text websocket API

speech-to-element

Add real-time speech to text functionality into your website with no effort

parakeet.js

NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

@google-cloud/speech

Cloud Speech Client Library for Node.js

microsoft-speech-browser-sdk

Microsoft Speech SDK for browsers

houndify-react-native

Allows react-native apps to connect to Houndify for speech recognition.

revai-node-sdk

Rev AI makes speech applications easy to build!

expo-speech

Provides text-to-speech functionality.

speaktome-api

JavaScript modules for Mozilla's cloud speech recognition API

speech-to-text

A speech to text module.

react-native-tts

React Native Text-To-Speech module for Android and iOS

speech-into-text

SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

@react-native-voice/voice

React Native Native Voice library for iOS and Android

@speechmatics/batch-client

Javascript client for the Speechmatics batch jobs API

oxford-speech-wrapper

simple bing voice recognition wrapper

react-speech

React component for the web speech synthesis api

@blrrt/cordova-plugin-speech-recognition-ios-browser-polyfill

Cordova iOS polyfill for the Speech Recognition API

react-native-kie-android-voice

A React Native Voice Recognition Module

@speechly/browser-client

JavaScript client for Speechly Streaming API

speech-recog-stream

A module to stream audio to a speech recognition server and get back the STT result

sherpa-onnx-node

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

yanyu

A Chinese speech synthesis and recognition library toolkit

fft-js

Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.

en-pos

A better English POS tagger written in JavaScript

@blrrt/cordova-plugin-speech-recognition-ios

Cordova plugin exposing the iOS Speech Recognition API

@speechmatics/real-time-client

Client for the Speechmatics real-time API

node-record-lpcm16

Record a microphone input stream

elevenlabs-node

This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

@picovoice/porcupine-web

Porcupine wake word engine for web browsers (via WebAssembly)

@react-native-community/voice

React Native Native Voice library for iOS and Android

react-native-android-voice-persian

A React Native Voice Recognition Module With Persian Locale support

react-text-to-speech

An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.

speechflow

Speech Processing Flow Graph

mic-to-speech

Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

retext-pos

retext plugin to add part-of-speech (POS) tags

ssml-check-core

Core library to check for valid SSML

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

google-cloud-speech-webaudio

Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.

react-hook-speech-to-text

Simple cross-browser speech to text using react hooks.

@wdragon/react-native-voice

React Native Native Voice library for iOS and Android, forked from @react-native-voice/voice

sherpa-onnx-darwin-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@speechly/react-client

React client for Speechly Streaming API

@chengsokdara/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

web-wake-word

A web package for keyword detection

babbler

Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.

@swankylegg/voice-io

A browser-based speech recognition and synthesis assistant

@picovoice/porcupine-react

React component for Porcupine Web SDK

koi-koi

Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

@corti/dictation-web

Web component for Corti Dictation

ssml-check

Check for valid SSML

sam-js

SAM - The Software Automatic Mouth

sherpa-onnx-win-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

sherpa-onnx-win-ia32

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

aws-transcription-to-vtt

convert an AWS transcribe JSON body into a .vtt file

@speechmatics/browser-audio-input-react

React hooks for managing audio inputs and permissions across browsers

aws-transcribe

A client for Amazon Transcribe using the websocket interface

@aurally/speech-control

A class to handle microphone permissions, start and observe speech input

espeak-ng

eSpeak-NG speech synthesizer, compiled to JavaScript + WASM

soundswallower

An even smaller speech recognizer

@speechmatics/real-time-client-react

React hooks for interacting with the Speechmatics Real-Time API

vocalize.ts

A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.

watson-html5-speech-recognition

Watson HTML5 Speech to Text

@speechmatics/flow-client

Javascript client for the Speechmatics Flow API

node-red-contrib-google-cloud

Node-RED nodes for Google Cloud Platform

@speechmatics/flow-client-react

React hooks for interacting with the Speechmatics Flow API

@picovoice/cheetah-web

Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

@picovoice/cobra-web

Cobra VAD engine for web browsers (via WebAssembly)

audio-to-text-node

Backend audio file to text transcription using Web Speech API with Puppeteer

@hahnpro/ms-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

@picovoice/cheetah-react

React hook for Cheetah Web SDK

cordova-plugin-speechtotext-activity

Cordova Plugin for Speech Recognition

piper-announce

AI-powered announcement generator using Piper TTS and OpenAI GPT models

ng-speech-recognition

AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.

sherpa-onnx-linux-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

speech-recognition-android

Cordova Speech Recognition Plugin for Android

@picovoice/rhino-web

Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

@bbc/react-transcript-editor

A React component to make transcribing audio and video easier and faster.

yandex-speech

node.js module for Yandex speech systems (ASR & TTS)

speech-ui-kitt

A flexible GUI for interacting with Speech Recognition

text2wav

Self-contained multilingual TTS speech synthesizer for Node.js in pure js

mmir-lib

MMIR (Mobile Multimodal Interaction and Relay) library

cybertyper

ReactJS component for automatically typing text synchronized with speech synthesis & recognition

avr-vad

A Node.js library for Voice Activity Detection using Silero VAD

node-witai-speech

This is an API wrapper for witai speech for nodejs

azure-speech-utilities

Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil

@larriereguichet/vosk

Node binding for continuous offline voice recognition with Vosk library.

react-native-spokestack

React Native plugin for adding voice using Spokestack

mumble-js

A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.

@picovoice/rhino-react

React component for Rhino Web SDK

edge-tts-universal

Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

brill

Part-of-speech tags from the Brill-tagger

sherpa-onnx-darwin-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

react-native-baidu-asr

Provides a React Native interface for Baidu Speech

vosk-lib

Vosk library for node, with type definitions and multi-arch support.

@nextevi/voice-react

React SDK for NextEVI Voice AI Platform

@pr0gramm/fluester

Node.js bindings for OpenAI's Whisper. Optimized for CPU.

react-native-voice-hold

React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes

whisper-speech-to-text

A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

@ssml-utilities/highlighter

SSML syntax highlighter for the SSML Utilities toolkit

@voice-ping/cognitive-services-speech

VoicePing Cognitive Services Speech SDK for JavaScript forked from Microsoft

@qdacity/react-transcript-editor

A React component to make transcribing audio and video easier and faster.

mespeak

Text to speech synthesizer

@novo-learning/novo-sdk

SDK for the Novolanguage Speech Analysis API

pmacom-react-transcript-editor

A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.

@euirim/microsoft-cognitiveservices-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

bingspeechrecognition-api

Module to use bing speech recognition api to convert speech to text

@kajidog/aivis-cloud-cli

Aivis Cloud CLI - Text-to-speech synthesis and model management

postcss-speech-bubble

PostCSS plugin creates speech bubbles with just 1-2 lines of CSS

node-red-contrib-tts-ultimate

Converts text to speech and plays it on a Sonos player, or generates an audio file for use with third-party nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You

cordova-plugin-speechrecognition-updated

Updated cordova-plugin-speechrecognition to remove onfulfilled() errors

mbz-voice-sdk

🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.

yet-another-react-native-voice

React Native Native Voice library for iOS and Android

bw-speech-recognition

💬Speech recognition for your React app

@mhpdev/react-native-speech

A high-performance React Native library for text-to-speech on iOS and Android

vue-webapi-speech-recognition

Microphone icon as a single component (black as default and red when it's recording) to interact with the Web Speech Recognition Api

text-to-speech-js

A small JavaScript library that provides a text to speech conversion using tts-api.com service.

react-speech-recognition-es

💬Speech recognition for your React app

echogarden-migaku

An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

@picovoice/koala-web

Koala Noise Suppression engine for web browsers (via WebAssembly)

n8n-nodes-groq-speech

N8N Community Node for Groq Text-to-Speech API integration

ispikit

react-native-text-to-speech

React Native Text-To-Speech module for Android and iOS

@arach/speakeasy

SpeakEasy - Unified text-to-speech service with provider abstraction

@picovoice/leopard-web

Leopard Speech-to-Text engine for web browsers (via WebAssembly)

node-mfcc

Node.js implementation of the MFCC audio speech analysis algorithm.

@cephable/cephable-web

Add Cephable controls to your web-based apps

iobroker.sonus

With this adapter you can control ioBroker with voice in many different languages

@mirawision/reactive-hooks

A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.

pocketsphinx

Node binding for continuous voice recognition through pocketsphinx.

microsoft-speech-browser-sdk-legacy

Microsoft Speech SDK for browsers

yandex-speech-promise

Promise based implementation of Yandex Speech Kit API

microsoft-speech-browser-sdk-cris

Microsoft Speech SDK for browsers (using CRIS endpoint)

xfyun-sdk

iFlytek speech recognition SDK; supports real-time speech dictation in the browser

@picovoice/orca-web

Orca Text-to-Speech engine for web browsers (via WebAssembly)

@moonshine-ai/moonshine-js

On-device speech-to-text and voice control for web applications with Moonshine.

@albertsyh/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

speechjs

Chrome speech recognition API wrapper

@transcribe/shout

Wasm build based on whisper.cpp.

primvoices-react

React client for the PrimVoices Agents API

text-to-speech

Java application that converts text to speech using the [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).

@transcribe/transcriber

Transcribe speech to text in the browser.

mac-say

The macOS built-in `say` CLI for JavaScript

node-droid-language

A Node.js droid language engine built using the sound library inside the Python library ttastromech, as well as other sound sources found online

discord-tts

Node.js module to make your discord bot talk

@vocantai/n8n-nodes-translation-vocantai

Audio file transcription services. Your speech. Private.

@caspingus/ssml-check-core

Core library to check for valid SSML

@picovoice/leopard-react

React hook for Leopard Web SDK

node-mic-record

Record microphone sound using nodejs

@untemps/react-vocal

React component and hook to initiate a SpeechRecognition session

@picovoice/orca-node

Picovoice Orca Node.js binding

cordova-plugin-speech-edited-wai

Cordova Plugin for Speech Recognition on iOS, Speech Recognition Extension

@caspingus/ssml-check

Check for valid SSML

@steelbrain/media-speech-detection-web

Production-ready speech detection using Silero VAD ONNX model for web browsers

@logikron/talk-widget-embed

Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website

buzzphrase

Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections

@kkaczynski/use-whisper

React Hook for OpenAI Whisper API with speech recorder.

@arellak/elevenlabs-wrapper

Wrapper for the ElevenLabs API

tiktok-tts

Use TikTok TTS from node.js

@bluefly/apple-fm

Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing

@lipsurf/plugins

Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

@react-native-oh-tpl/react-native-tts

React Native Text-To-Speech module for Android and iOS

ttsmaker

Text-to-Speech API wrapper for ttsmp3.com

aixblock-voice-ai-deepgram

A React component for real-time transcription and voice agent interactions using Deepgram APIs

react-native-sfspeechrecognizer

iOS SFSpeechRecognizer bridge module for React Native

@andresaya/ssml-builder

TypeScript library for building SSML documents

n8n-nodes-groq

N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

pronounceability

Calculate pronounceability for a given word.

browser-speech

🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech

ugai

A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

espeak

text-to-speech using espeak cli program

koi-app

Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

cordova-plugin-speechrecognition-edited-wai

Cordova Plugin for Speech Recognition on iOS, Speech Recognition Extension

@iteleport/speechly-browser-client

Browser client for Speechly API

koishi-plugin-fishaudio-vits

Text-to-speech via Fish Audio API

@squirrelsoft/dev-say

MCP server for macOS text-to-speech using the say command

node-red-contrib-google-cloud-ubos

Node-RED nodes for Google Cloud Platform

@qubby/use-whisper-beta

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

node-red-contrib-google-tts

A node-red node for translating text to speech using Google's TTS service.

spoken

JavaScript Web API for Text-to-Speech and Speech-to-Text.

react-native-azure-speech-to-text

A React Native package for Azure Speech to Text

klatt-syn

Klatt formant synthesizer

koishi-plugin-fish-audio-tts

Text-to-speech via Fish Audio API

react-native-speech

A text-to-speech library for React Native.

speechrecognizer

Cordova plugin which provides a speech recognition service

@qubby/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

react-audio-spectrogram-player

An audioplayer written in React that shows a spectrogram along with the audio.

@cloudraker/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

alexa-voice-service

Alexa Voice Service wrapper for the browser.

react-native-speech-iflytek

Speech module based on iflytekSpeech for react native

@steelbrain/media-buffer-speech

Speech buffering that accumulates audio chunks and releases them after natural pause periods

simpletts

A basic TTS manager

speech-to-text-recognition

moved to speechless

speakie

speakie is a professional voice recognition package designed to empower your applications with seamless voice command integration. Easily create functions that respond to specific voice commands, making user interactions more natural and intuitive.

red-contrib-google-stt-long

Google STT

tts-cli

Command-line tool to convert text to speech

react-transcript-editor

A React component to make transcribing audio and video easier and faster.

speechify

Easily add speech to text functionality into your website

say2

Interactive text-to-speech CLI with multiple voices using ElevenLabs API

bumblebee-hotword-node

Bumblebee Hotword for NodeJS

web-speech-profanity

Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.

angular2-speech-engine

A set of Angular2 services and components to add speech recognition and synthesis to Angular2 applications

@kmoz000/react-transcript-editor

A React component to make transcribing audio and video easier and faster.

realtime-ten-vad

Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.

react-native-deepgram

React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

@revrag-ai/embed-react-native

A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation