Found 736 results for speech recognition

react-speech

React component for the web speech synthesis api

@blrrt/cordova-plugin-speech-recognition-ios-browser-polyfill

Cordova iOS polyfill for the Speech Recognition API

react-native-kie-android-voice

A React Native Voice Recognition Module

@speechly/browser-client

JavaScript client for Speechly Streaming API

speech-recog-stream

A module to stream audio to a speech recognition server and get back the STT result

sherpa-onnx-node

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

yanyu

A Chinese speech synthesis and recognition library toolkit

fft-js

Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.

en-pos

A better English POS tagger written in JavaScript

@blrrt/cordova-plugin-speech-recognition-ios

Cordova plugin exposing the iOS Speech Recognition API

@speechmatics/real-time-client

Client for the Speechmatics real-time API

node-record-lpcm16

Record a microphone input stream

elevenlabs-node

This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API

@picovoice/porcupine-web

Porcupine wake word engine for web browsers (via WebAssembly)

@react-native-community/voice

React Native Native Voice library for iOS and Android

react-native-android-voice-persian

A React Native Voice Recognition Module With Persian Locale support

react-text-to-speech

An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.

speechflow

Speech Processing Flow Graph

mic-to-speech

Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

retext-pos

retext plugin to add part-of-speech (POS) tags

ssml-check-core

Core library to check for valid SSML

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

google-cloud-speech-webaudio

Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.

react-hook-speech-to-text

Simple cross-browser speech to text using react hooks.

@wdragon/react-native-voice

React Native Native Voice library for iOS and Android, forked from @react-native-voice/voice

sherpa-onnx-darwin-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@speechly/react-client

React client for Speechly Streaming API

@chengsokdara/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

web-wake-word

A web package for keyword detection

babbler

Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.

@swankylegg/voice-io

A browser-based speech recognition and synthesis assistant

@picovoice/porcupine-react

React component for Porcupine Web SDK

koi-koi

Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

@corti/dictation-web

Web component for Corti Dictation

ssml-check

Check for valid SSML

sam-js

SAM - The Software Automatic Mouth

sherpa-onnx-win-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

sherpa-onnx-win-ia32

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

aws-transcription-to-vtt

convert an AWS transcribe JSON body into a .vtt file

@speechmatics/browser-audio-input-react

React hooks for managing audio inputs and permissions across browsers

aws-transcribe

A client for Amazon Transcribe using the websocket interface

@aurally/speech-control

A class to handle microphone permissions, start and observe speech input

espeak-ng

eSpeak-NG speech synthesizer, compiled to JavaScript + WASM

soundswallower

An even smaller speech recognizer

@speechmatics/real-time-client-react

React hooks for interacting with the Speechmatics Real-Time API

vocalize.ts

A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.

watson-html5-speech-recognition

Watson HTML5 Speech to Text

@speechmatics/flow-client

Javascript client for the Speechmatics Flow API

node-red-contrib-google-cloud

Node-RED nodes for Google Cloud Platform

@speechmatics/flow-client-react

React hooks for interacting with the Speechmatics Flow API

@picovoice/cheetah-web

Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

@picovoice/cobra-web

Cobra VAD engine for web browsers (via WebAssembly)

audio-to-text-node

Backend audio file to text transcription using Web Speech API with Puppeteer

@hahnpro/ms-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

@picovoice/cheetah-react

React hook for Cheetah Web SDK

cordova-plugin-speechtotext-activity

Cordova Plugin for Speech Recognition

piper-announce

AI-powered announcement generator using Piper TTS and OpenAI GPT models

ng-speech-recognition

AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.

sherpa-onnx-linux-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

speech-recognition-android

Cordova Speech Recognition Plugin for Android

@picovoice/rhino-web

Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

@bbc/react-transcript-editor

A React component to make transcribing audio and video easier and faster.

yandex-speech

node.js module for Yandex speech systems (ASR & TTS)

speech-ui-kitt

A flexible GUI for interacting with Speech Recognition

text2wav

Self-contained multilingual TTS speech synthesizer for Node.js in pure js

mmir-lib

MMIR (Mobile Multimodal Interaction and Relay) library

cybertyper

ReactJS component for automatically typing text synchronized with speech synthesis & recognition

avr-vad

A Node.js library for Voice Activity Detection using Silero VAD

node-witai-speech

This is an API wrapper for witai speech for nodejs

azure-speech-utilities

Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil

@larriereguichet/vosk

Node binding for continuous offline voice recognition with Vosk library.

react-native-spokestack

React Native plugin for adding voice using Spokestack

mumble-js

A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.

@picovoice/rhino-react

React component for Rhino Web SDK

edge-tts-universal

Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key

brill

Part-of-speech tags from the Brill-tagger

sherpa-onnx-darwin-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

react-native-baidu-asr

Provides a React Native interface for Baidu Speech

vosk-lib

Vosk library for node, with type definitions and multi-arch support.

@nextevi/voice-react

React SDK for NextEVI Voice AI Platform

@pr0gramm/fluester

Node.js bindings for OpenAI's Whisper. Optimized for CPU.

react-native-voice-hold

React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes

whisper-speech-to-text

A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

@ssml-utilities/highlighter

SSML syntax highlighter for the SSML Utilities toolkit

@voice-ping/cognitive-services-speech

VoicePing Cognitive Services Speech SDK for JavaScript forked from Microsoft

@qdacity/react-transcript-editor

A React component to make transcribing audio and video easier and faster.

mespeak

Text to speech synthesizer

@novo-learning/novo-sdk

SDK for the Novolanguage Speech Analysis API

pmacom-react-transcript-editor

A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.

@euirim/microsoft-cognitiveservices-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

bingspeechrecognition-api

Module to use bing speech recognition api to convert speech to text

@kajidog/aivis-cloud-cli

Aivis Cloud CLI - Text-to-speech synthesis and model management

postcss-speech-bubble

PostCSS plugin creates speech bubbles with just 1-2 lines of CSS

node-red-contrib-tts-ultimate

Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You

cordova-plugin-speechrecognition-updated

Updated cordova-plugin-speechrecognition to remove onfulfilled() errors

mbz-voice-sdk

🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.

yet-another-react-native-voice

React Native Native Voice library for iOS and Android

bw-speech-recognition

💬Speech recognition for your React app

@mhpdev/react-native-speech

A high-performance React Native library for text-to-speech on iOS and Android

vue-webapi-speech-recognition

Microphone icon as a single component (black as default and red when it's recording) to interact with the Web Speech Recognition Api

text-to-speech-js

A small JavaScript library that provides a text to speech conversion using tts-api.com service.

react-speech-recognition-es

💬Speech recognition for your React app

echogarden-migaku

An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.

@picovoice/koala-web

Koala Noise Suppression engine for web browsers (via WebAssembly)

n8n-nodes-groq-speech

N8N Community Node for Groq Text-to-Speech API integration

ispikit

react-native-text-to-speech

React Native Text-To-Speech module for Android and iOS

@arach/speakeasy

SpeakEasy - Unified text-to-speech service with provider abstraction

@picovoice/leopard-web

Leopard Speech-to-Text engine for web browsers (via WebAssembly)

node-mfcc

Node.js implementation of the MFCC audio speech analysis algorithm.

@cephable/cephable-web

Add Cephable controls to your web-based apps

iobroker.sonus

With this adapter you can control ioBroker with voice in many different languages

@mirawision/reactive-hooks

A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.

pocketsphinx

Node binding for continuous voice recognition through pocketsphinx.

microsoft-speech-browser-sdk-legacy

Microsoft Speech SDK for browsers

yandex-speech-promise

Promise based implementation of Yandex Speech Kit API

microsoft-speech-browser-sdk-cris

Microsoft Speech SDK for browsers (using CRIS endpoint)

xfyun-sdk

iFlytek speech recognition SDK, supporting real-time voice dictation in the browser

@picovoice/orca-web

Orca Text-to-Speech engine for web browsers (via WebAssembly)

@moonshine-ai/moonshine-js

On-device speech-to-text and voice control for web applications with Moonshine.

@albertsyh/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

speechjs

Chrome speech recognition API wrapper

@transcribe/shout

Wasm build based on whisper.cpp.

primvoices-react

React client for the PrimVoices Agents API

text-to-speech

Java application that converts text to speech using the [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).

@transcribe/transcriber

Transcribe speech to text in the browser.

mac-say

The macOS built-in `say` CLI for JavaScript

node-droid-language

A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online

discord-tts

Node.js module to make your discord bot talk

@vocantai/n8n-nodes-translation-vocantai

Audio file transcription services. Your speech. Private.

@caspingus/ssml-check-core

Core library to check for valid SSML

@picovoice/leopard-react

React hook for Leopard Web SDK

node-mic-record

Record microphone sound using nodejs

@untemps/react-vocal

React component and hook to initiate a SpeechRecognition session

@picovoice/orca-node

Picovoice Orca Node.js binding

cordova-plugin-speech-edited-wai

Cordova Plugin for Speech Recognition ios, Speech Recognition Extension

@caspingus/ssml-check

Check for valid SSML

@steelbrain/media-speech-detection-web

Production-ready speech detection using Silero VAD ONNX model for web browsers

@logikron/talk-widget-embed

Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website

buzzphrase

Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections

@kkaczynski/use-whisper

React Hook for OpenAI Whisper API with speech recorder.

@arellak/elevenlabs-wrapper

Wrapper for the ElevenLabs API

tiktok-tts

Use TikTok TTS from node.js

@bluefly/apple-fm

Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing

@lipsurf/plugins

Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

@react-native-oh-tpl/react-native-tts

React Native Text-To-Speech module for Android and iOS

ttsmaker

Text-to-Speech API wrapper for ttsmp3.com

aixblock-voice-ai-deepgram

A React component for real-time transcription and voice agent interactions using Deepgram APIs

react-native-sfspeechrecognizer

iOS SFSpeechRecognizer bridge module for React Native

@andresaya/ssml-builder

TypeScript library for building SSML documents

n8n-nodes-groq

N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

pronounceability

Calculate pronounceability for a given word.

browser-speech

🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech

ugai

A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

espeak

text-to-speech using espeak cli program

koi-app

Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

cordova-plugin-speechrecognition-edited-wai

Cordova Plugin for Speech Recognition ios, Speech Recognition Extension

@iteleport/speechly-browser-client

Browser client for Speechly API

koishi-plugin-fishaudio-vits

Text-to-speech via Fish Audio API

@squirrelsoft/dev-say

MCP server for macOS text-to-speech using the say command

node-red-contrib-google-cloud-ubos

Node-RED nodes for Google Cloud Platform

@qubby/use-whisper-beta

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

node-red-contrib-google-tts

A node-red node for translating text to speech using Google's TTS service.

spoken

JavaScript Web API for Text-to-Speech and Speech-to-Text.

react-native-azure-speech-to-text

A React Native package for Azure Speech to Text

klatt-syn

Klatt formant synthesizer

koishi-plugin-fish-audio-tts

Text-to-speech via Fish Audio API

react-native-speech

A text-to-speech library for React Native.

speechrecognizer

Cordova plugin which provides a speech recognition service

@qubby/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

react-audio-spectrogram-player

An audioplayer written in React that shows a spectrogram along with the audio.

@cloudraker/use-whisper

React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.

alexa-voice-service

Alexa Voice Service wrapper for the browser.

react-native-speech-iflytek

Speech module based on iflytekSpeech for react native

@steelbrain/media-buffer-speech

Speech buffering that accumulates audio chunks and releases them after natural pause periods

simpletts

A basic TTS manager

speech-to-text-recognition

moved to speechless

speakie

speakie is a professional voice recognition package designed to empower your applications with seamless voice command integration. Easily create functions that respond to specific voice commands, making user interactions more natural and intuitive.

red-contrib-google-stt-long

Google STT

tts-cli

Command-line tool to convert text to speech

react-transcript-editor

A React component to make transcribing audio and video easier and faster.

speechify

Easily add speech to text functionality into your website

say2

Interactive text-to-speech CLI with multiple voices using ElevenLabs API

bumblebee-hotword-node

Bumblebee Hotword for NodeJS

web-speech-profanity

Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.

angular2-speech-engine

A set of Angular2 services and components to add speech recognition and synthesis to Angular2 applications

@kmoz000/react-transcript-editor

A React component to make transcribing audio and video easier and faster.

realtime-ten-vad

Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.

react-native-deepgram

React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

@revrag-ai/embed-react-native

A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation

react-native-voice-enhanced

A maintained, enhanced fork of react-native-voice

real-time-speech-analyzer

Real-time speech analysis with local LLM using multiple concurrent analysis instructions

austack

TypeScript/JavaScript client SDK for Austack conversational AI

mfcc

Node.js implementation of the MFCC audio speech analysis algorithm.

n8n-nodes-ai-coustics-enhance

n8n community node for ai-coustics speech enhancement API

react-native-rhspeech

Speech synthesis component for Ruihao RN projects

alexa-ssml

JSX for Alexa Skills Kit SSML

transpeech

TranSpeech is a small voice and text library. It allows you to recognize and synthesize speech using a browser, and translate text.

alexa-speech-utils

Helper functions for building speech responses

react-native-slang

React native interface for Slang

@stdlib/datasets-cmudict

The Carnegie Mellon Pronouncing Dictionary (CMUdict).

extra-amazontts

Generate speech audio from super long text, via Amazon Polly and ffmpeg.

@diego.mendez.ov/microsoft-cognitiveservices-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

texttospeech

Text to Speech (Pure Client Side)

@iris-family/speech-sdk

Speech SDK for Iris Family frontend projects

@freddydrodev/artyom

Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistant

falexa

Create your own verbal commands that map to custom Javascript functions

@varunmhajan/hom-i-voice-ai

Voice AI utilities for home loan assistance with India-specific formatting

salient

Salient is a natural language processing and sentiment analysis library

vosk-js

Node binding for continuous voice recognition through vosk-api.

node-red-gcp-myproject

Node-RED nodes for Google Cloud Platform

speedyspeech

This is a module to quickly use the Web Speech API to recognize keywords as a user speaks.

react-native-bluemix

React Native module for IBM Bluemix services

node-speak

TTS (Text to Speech) for Node and Browser

cordova-plugin-iflyspeech

Cordova plugin to support mobile speech recognizer and synthesizer with iFlyTek voice cloud service

speech-tree

An events tree which lets you define a sequence of voice commands.

text-to-speech-offline

Simple Text to Speech Offline Using API Browser

ac-microsoft-cognitiveservices-speech-sdk

Microsoft Cognitive Services Speech SDK for JavaScript

@soundhound/houndify-react-native

Allows react-native apps to connect to Houndify for speech recognition.

react-native-voice-lp

React Native Native Voice library for iOS and Android

qt-ai-gateway-npm-sdk

A WebSocket-based TTS client with real-time audio streaming and playback

voicevox.js

A client for the VOICEVOX API, providing text-to-speech capabilities.

mmir-plugin-speech-nuance-lang

tools for querying supported languages (ASR and TTS) and voices (TTS) by Nuance / Cerence SpeechKit

javascript-speech-recognizer-library

A library for easily transcribing speech. Convert speech to text in JavaScript

better-speech-recognition

An improved speech recognition library with TypeScript support

react-native-dtneon-speech-iflytek

Speech module based on iflytekSpeech for react native

@itslanguage/api

The JavaScript API SDK for ITSLanguage.

com.adrenak.rnnoise4unity

RNNoise for Unity

formantanalyzer

Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a web browser using WebAudio API.

@itslanguage/recorder

JavaScript Recorder based on MediaRecorder from ITSLanguage.

praatio

A javascript library for working with praat, textgrids, time aligned audio transcripts, and audio files.

@usefulsensors/moonshine-js

On-device speech-to-text and voice control for web applications with Moonshine.

whatsapp-claude-gpt

WhatsApp-Claude-GPT is a WhatsApp chatbot that supports multiple AI providers for chat, optional image generation/editing, and voice (speech-to-text and text-to-speech). It’s built for natural, contextual conversations and can now also handle reminders an