electron-speech
speech recognition cli and api for node using electron
Found 736 results for speech recognition
speech recognition cli and api for node using electron
A simple JavaScript speech recognition library.
Package for simplifying the Speech Recognition and Speech Utterance process.
Open source cross platform decentralized always-on speech recognition framework
React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.
Node binding for continuous offline voice recognition with Vosk library.
Vosk node API based on Koffi.
Command line utility to evaluate Automated Speech Recognition (ASR) systems
A lib to improve your app's search functionality by adding opt-outable speech recognition, multiple parallel searches and automatic result matching
💬Speech recognition for your React app
lib for recognition and synthesis of speech
Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistant
A library for using Web Speech API with Angular
A simple wrapper for Speech Recognition APIs in the browser
Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.
Lightweight Speech Recognition Library for Electron. Based on [nodejs-speech-kiosk-usercase](https://www.npmjs.com/package/nodejs-speech-kiosk-usercase) and [vosk-api](https://github.com/alphacep/vosk-api).
processing audio media used for speech recognition
This is a Cordova plugin for Speech Recognition and Text to Speech.
A react library that encapsulates the native browser speech recognition api
Isomorphic Javascript client for Deepgram
Cordova Plugin for Speech Recognition
Web Assembly streaming Opus decoder with Machine Learning enhancements
Cordova Plugin for Speech Recognition
Microsoft Cognitive Services Speech SDK for JavaScript
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
Replace window.SpeechRecognition with a mock object and automate your tests
Javascript client library for Soniox Speech-to-Text websocket API
Add real-time speech to text functionality into your website with no effort
NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.
Cloud Speech Client Library for Node.js
Microsoft Speech SDK for browsers
Allows react-native apps to connect to Houndify for speech recognition.
Rev AI makes speech applications easy to build!
Provides text-to-speech functionality.
JavaScript modules for Mozilla's cloud speech recognition API
A speech to text module.
React Native Text-To-Speech module for Android and iOS
SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.
React Native Native Voice library for iOS and Android
Javascript client for the Speechmatics batch jobs API
simple bing voice recognition wrapper
React component for the web speech synthesis api
Cordova iOS polyfill for the Speech Recognition API
A React Native Voice Recognition Module
JavaScript client for Speechly Streaming API
A module to stream audio to a speech recognition server and get back the STT result
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A Chinese speech synthesis and recognition library toolkit
Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.
A better English POS tagger written in JavaScript
Cordova plugin exposing the iOS Speech Recognition API
Client for the Speechmatics real-time API
Record a microphone input stream
This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API
Porcupine wake word engine for web browsers (via WebAssembly)
React Native Native Voice library for iOS and Android
A React Native Voice Recognition Module With Persian Locale support
An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.
Speech Processing Flow Graph
Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
retext plugin to add part-of-speech (POS) tags
Core library to check for valid SSML
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.
Simple cross-browser speech to text using react hooks.
React Native Native Voice library for iOS and Android, forked from @react-native-voice/voice
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React client for Speechly Streaming API
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
A web package for keyword detection
Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.
A browser-based speech recognition and synthesis assistant
React component for Porcupine Web SDK
Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.
Web component for Corti Dictation
Check for valid SSML
SAM - The Software Automatic Mouth
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
convert an AWS transcribe JSON body into a .vtt file
React hooks for managing audio inputs and permissions across browsers
A client for Amazon Transcribe using the websocket interface
A class to handle microphone permissions, start and observe speech input
eSpeak-NG speech synthesizer, compiled to JavaScript + WASM
An even smaller speech recognizer
React hooks for interacting with the Speechmatics Real-Time API
A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.
Watson HTML5 Speech to Text
Javascript client for the Speechmatics Flow API
Node-RED nodes for Google Cloud Platform
React hooks for interacting with the Speechmatics Flow API
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
Cobra VAD engine for web browsers (via WebAssembly)
Backend audio file to text transcription using Web Speech API with Puppeteer
Microsoft Cognitive Services Speech SDK for JavaScript
React hook for Cheetah Web SDK
Cordova Plugin for Speech Recognition
AI-powered announcement generator using Piper TTS and OpenAI GPT models
AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Cordova Speech Recognition Plugin for Android
Rhino Speech-to-Intent engine for web browsers (via WebAssembly)
A React component to make transcribing audio and video easier and faster.
node.js module for Yandex speech systems (ASR & TTS)
A flexible GUI for interacting with Speech Recognition
Self-contained multilingual TTS speech synthesizer for Node.js in pure js
MMIR (Mobile Multimodal Interaction and Relay) library
ReactJS component for automatically typing text synchronized with speech synthesis & recognition
A Node.js library for Voice Activity Detection using Silero VAD
This is an API wrapper for witai speech for nodejs
Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil
Node binding for continuous offline voice recognition with Vosk library.
React Native plugin for adding voice using Spokestack
A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.
React component for Rhino Web SDK
Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key
Part-of-speech tags from the Brill-tagger
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Provides a React Native interface for Baidu Speech
Vosk library for node, with type definitions and multi-arch support.
React SDK for NextEVI Voice AI Platform
Node.js bindings for OpenAI's Whisper. Optimized for CPU.
React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes
A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text
SSML syntax highlighter for the SSML Utilities toolkit
VoicePing Cognitive Services Speech SDK for JavaScript forked from Microsoft
A React component to make transcribing audio and video easier and faster.
Text to speech synthesizer
SDK for the Novolanguage Speech Analysis API
A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.
Microsoft Cognitive Services Speech SDK for JavaScript
Module to use bing speech recognition api to convert speech to text
Aivis Cloud CLI - Text-to-speech synthesis and model management
PostCSS plugin creates speech bubbles with just 1-2 lines of CSS
Transforms text into speech and plays it through a Sonos player, or generates an audio file to be used with third-party nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You
Updated cordova-plugin-speechrecognition to remove onfulfilled() errors
🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.
React Native Native Voice library for iOS and Android
💬Speech recognition for your React app
A high-performance React Native library for text-to-speech on iOS and Android
Microphone icon as a single component (black as default and red when it's recording) to interact with the Web Speech Recognition Api
A small JavaScript library that provides a text to speech conversion using tts-api.com service.
💬Speech recognition for your React app
An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Koala Noise Suppression engine for web browsers (via WebAssembly)
N8N Community Node for Groq Text-to-Speech API integration
ispikit
React Native Text-To-Speech module for Android and iOS
SpeakEasy - Unified text-to-speech service with provider abstraction
Leopard Speech-to-Text engine for web browsers (via WebAssembly)
Node.js implementation of the MFCC audio speech analysis algorithm.
Add Cephable controls to your web-based apps
With this adapter you can control ioBroker with voice in many different languages
A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.
Node binding for continuous voice recognition through pocketsphinx.
Microsoft Speech SDK for browsers
Promise based implementation of Yandex Speech Kit API
Microsoft Speech SDK for browsers (using CRIS endpoint)
iFLYTEK speech recognition SDK, with support for real-time voice dictation in the browser
Orca Text-to-Speech engine for web browsers (via WebAssembly)
On-device speech-to-text and voice control for web applications with Moonshine.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
Chrome speech recognition API wrapper
Wasm build based on whisper.cpp.
React client for the PrimVoices Agents API
Java application that transforms text to speech using [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).
Transcribe speech to text in the browser.
The macOS built-in `say` CLI for JavaScript
A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online
Node.js module to make your discord bot talk
Audio file transcription services. Your speech. Private.
Core library to check for valid SSML
React hook for Leopard Web SDK
Record microphone sound using nodejs
React component and hook to initiate a SpeechRecognition session
Picovoice Orca Node.js binding
Cordova Plugin for Speech Recognition ios, Speech Recognition Extension
Check for valid SSML
Production-ready speech detection using Silero VAD ONNX model for web browsers
Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website
Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections
React Hook for OpenAI Whisper API with speech recorder.
Wrapper for the ElevenLabs API
Use TikTok TTS from node.js
Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing
Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.
React Native Text-To-Speech module for Android and iOS
Text-to-Speech API wrapper for ttsmp3.com
A React component for real-time transcription and voice agent interactions using Deepgram APIs
iOS SFSpeechRecognizer bridge module for React Native
TypeScript library for building SSML documents
N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.
Calculate pronounceability for a given word.
🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech
A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services
text-to-speech using espeak cli program
Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.
Cordova Plugin for Speech Recognition ios, Speech Recognition Extension
Browser client for Speechly API
Text-to-speech via Fish Audio API
MCP server for macOS text-to-speech using the say command
Node-RED nodes for Google Cloud Platform
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby
A node-red node for translating text to speech using Google's TTS service.
JavaScript Web API for Text-to-Speech and Speech-to-Text.
A React Native package for Azure Speech to Text
Klatt formant synthesizer
Text-to-speech via Fish Audio API
A text-to-speech library for React Native.
Cordova plugin which provides a speech recognition service
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby
Converts an audio file to LINEAR16 Google-speech compatible file.
An audioplayer written in React that shows a spectrogram along with the audio.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
Alexa Voice Service wrapper for the browser.
Speech module based on iflytekSpeech for react native
Speech buffering that accumulates audio chunks and releases them after natural pause periods
A basic TTS manager
moved to speechless
speakie is a professional voice recognition package designed to empower your applications with seamless voice command integration. Easily create functions that respond to specific voice commands, making user interactions more natural and intuitive.
Google STT
Command-line tool to convert text to speech
A React component to make transcribing audio and video easier and faster.
Easily add speech to text functionality into your website
Interactive text-to-speech CLI with multiple voices using ElevenLabs API
Bumblebee Hotword for NodeJS
Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.
A set of Angular2 services and components to add speech recognition and synthesis to Angular2 applications
A React component to make transcribing audio and video easier and faster.
Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.
React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.
A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation
A maintained, enhanced fork of react-native-voice
Real-time speech analysis with local LLM using multiple concurrent analysis instructions
TypeScript/JavaScript client SDK for Austack conversational AI
Node.js implementation of the MFCC audio speech analysis algorithm.
n8n community node for ai-coustics speech enhancement API
Speech synthesis component for the 瑞昊 RN project
JSX for Alexa Skills Kit SSML
TranSpeech is a small voice and text library. It allows you to recognize and synthesize speech using a browser, and translate text.
Helper functions for building speech responses
React native interface for Slang
The Carnegie Mellon Pronouncing Dictionary (CMUdict).
Generate speech audio from super long text, via Amazon Polly and ffmpeg.
Microsoft Cognitive Services Speech SDK for JavaScript
Text to Speech (Pure Client Side)
Speech SDK for Iris Family frontend projects
Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistant
Create your own verbal commands that map to custom Javascript functions