oxford-speech-wrapper
simple bing voice recognition wrapper
Found 736 results for speech recognition
simple bing voice recognition wrapper
React component for the web speech synthesis api
Cordova iOS polyfill for the Speech Recognition API
A React Native Voice Recognition Module
JavaScript client for Speechly Streaming API
A module to stream audio to a speech recognition server and get back the STT result
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A Chinese speech synthesis and recognition library toolkit
Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.
A better English POS tagger written in JavaScript
Cordova plugin exposing the iOS Speech Recognition API
Client for the Speechmatics real-time API
Record a microphone input stream
This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API
Porcupine wake word engine for web browsers (via WebAssembly)
React Native Native Voice library for iOS and Android
A React Native Voice Recognition Module With Persian Locale support
An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.
Speech Processing Flow Graph
Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
retext plugin to add part-of-speech (POS) tags
Core library to check for valid SSML
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Google Cloud speech recognition and synthesis integrated with WebAudio, fully functional in the browser.
Simple cross-browser speech to text using react hooks.
React Native Native Voice library for iOS and Android, forked from @react-native-voice/voice
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React client for Speechly Streaming API
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
A web package for keyword detection
Wrapper for the Google Chrome speech synthesis and web speech recognition APIs.
A browser-based speech recognition and synthesis assistant
React component for Porcupine Web SDK
Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.
Web component for Corti Dictation
Check for valid SSML
SAM - The Software Automatic Mouth
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
convert an AWS transcribe JSON body into a .vtt file
React hooks for managing audio inputs and permissions across browsers
A client for Amazon Transcribe using the websocket interface
A class to handle microphone permissions, start and observe speech input
eSpeak-NG speech synthesizer, compiled to JavaScript + WASM
An even smaller speech recognizer
React hooks for interacting with the Speechmatics Real-Time API
A TypeScript library for integrating voice commands and speech synthesis into web applications. Easily set up voice interactions with custom text-to-speech options and speech recognition.
Watson HTML5 Speech to Text
Javascript client for the Speechmatics Flow API
Node-RED nodes for Google Cloud Platform
React hooks for interacting with the Speechmatics Flow API
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
Cobra VAD engine for web browsers (via WebAssembly)
Backend audio file to text transcription using Web Speech API with Puppeteer
Microsoft Cognitive Services Speech SDK for JavaScript
React hook for Cheetah Web SDK
Cordova Plugin for Speech Recognition
AI-powered announcement generator using Piper TTS and OpenAI GPT models
AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Cordova Speech Recognition Plugin for Android
Rhino Speech-to-Intent engine for web browsers (via WebAssembly)
A React component to make transcribing audio and video easier and faster.
node.js module for Yandex speech systems (ASR & TTS)
A flexible GUI for interacting with Speech Recognition
Self-contained multilingual TTS speech synthesizer for Node.js in pure js
MMIR (Mobile Multimodal Interaction and Relay) library
ReactJS component for automatically typing text synchronized with speech synthesis & recognition
A Node.js library for Voice Activity Detection using Silero VAD
This is an API wrapper for witai speech for nodejs
Provides a convenient abstraction layer over the Microsoft Cognitive Services Speech SDK, simplifying the integration of speech-to-text functionality into client applications. Using this npm package, developers can quickly integrate speech-to-text capabil
Node binding for continuous offline voice recognition with Vosk library.
React Native plugin for adding voice using Spokestack
A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.
React component for Rhino Web SDK
Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key
Part-of-speech tags from the Brill-tagger
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Provides a React Native interface for Baidu Speech
Vosk library for node, with type definitions and multi-arch support.
React SDK for NextEVI Voice AI Platform
Node.js bindings for OpenAI's Whisper. Optimized for CPU.
React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes
A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text
SSML syntax highlighter for the SSML Utilities toolkit
VoicePing Cognitive Services Speech SDK for JavaScript forked from Microsoft
A React component to make transcribing audio and video easier and faster.
Text to speech synthesizer
SDK for the Novolanguage Speech Analysis API
A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.
Microsoft Cognitive Services Speech SDK for JavaScript
Module to use bing speech recognition api to convert speech to text
Aivis Cloud CLI - Text-to-speech synthesis and model management
PostCSS plugin creates speech bubbles with just 1-2 lines of CSS
Transforms text into speech and plays it on a Sonos player, or generates an audio file to be used with third-party nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You
Updated cordova-plugin-speechrecognition to remove onfulfilled() errors
🎙️ MBZ Voice SDK: Easily add voice recognition, Gemini-based AI replies, and TTS to any web app.
React Native Native Voice library for iOS and Android
💬Speech recognition for your React app
A high-performance React Native library for text-to-speech on iOS and Android
Microphone icon as a single component (black as default and red when it's recording) to interact with the Web Speech Recognition Api
A small JavaScript library that provides a text to speech conversion using tts-api.com service.
💬Speech recognition for your React app
An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Koala Noise Suppression engine for web browsers (via WebAssembly)
N8N Community Node for Groq Text-to-Speech API integration
ispikit
React Native Text-To-Speech module for Android and iOS
SpeakEasy - Unified text-to-speech service with provider abstraction
Leopard Speech-to-Text engine for web browsers (via WebAssembly)
Node.js implementation of the MFCC audio speech analysis algorithm.
Add Cephable controls to your web-based apps
With this adapter you can control ioBroker with voice in many different languages
A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.
Node binding for continuous voice recognition through pocketsphinx.
Microsoft Speech SDK for browsers
Promise based implementation of Yandex Speech Kit API
Microsoft Speech SDK for browsers (using CRIS endpoint)
iFlytek speech recognition SDK with support for real-time voice dictation in the browser
Orca Text-to-Speech engine for web browsers (via WebAssembly)
On-device speech-to-text and voice control for web applications with Moonshine.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
Chrome speech recognition API wrapper
Wasm build based on whisper.cpp.
React client for the PrimVoices Agents API
Java application that converts text to speech using the [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).
Transcribe speech to text in the browser.
The macOS built-in `say` CLI for JavaScript
A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online
Node.js module to make your discord bot talk
Audio file transcription services. Your speech. Private.
Core library to check for valid SSML
React hook for Leopard Web SDK
Record microphone sound using nodejs
React component and hook to initiate a SpeechRecognition session
Picovoice Orca Node.js binding
Cordova Plugin for Speech Recognition ios, Speech Recognition Extension
Check for valid SSML
Production-ready speech detection using Silero VAD ONNX model for web browsers
Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website
Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections
React Hook for OpenAI Whisper API with speech recorder.
Wrapper for the ElevenLabs API
Use TikTok TTS from node.js
Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing
Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.
React Native Text-To-Speech module for Android and iOS
Text-to-Speech API wrapper for ttsmp3.com
A React component for real-time transcription and voice agent interactions using Deepgram APIs
iOS SFSpeechRecognizer bridge module for React Native
TypeScript library for building SSML documents
N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.
Calculate pronounceability for a given word.
🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech
A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services
text-to-speech using espeak cli program
Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.
Cordova Plugin for Speech Recognition ios, Speech Recognition Extension
Browser client for Speechly API
Text-to-speech via Fish Audio API
MCP server for macOS text-to-speech using the say command
Node-RED nodes for Google Cloud Platform
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby
A node-red node for translating text to speech using Google's TTS service.
JavaScript Web API for Text-to-Speech and Speech-to-Text.
A React Native package for Azure Speech to Text
Klatt formant synthesizer
Text-to-speech via Fish Audio API
A text-to-speech library for React Native.
Cordova plugin which provides a speech recognition service
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby
Converts an audio file to LINEAR16 Google-speech compatible file.
An audioplayer written in React that shows a spectrogram along with the audio.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
Alexa Voice Service wrapper for the browser.
Speech module based on iflytekSpeech for react native
Speech buffering that accumulates audio chunks and releases them after natural pause periods
A basic TTS manager
moved to speechless
speakie is a professional voice recognition package designed to empower your applications with seamless voice command integration. Easily create functions that respond to specific voice commands, making user interactions more natural and intuitive.
Google STT
Command-line tool to convert text to speech
A React component to make transcribing audio and video easier and faster.
Easily add speech to text functionality into your website
Interactive text-to-speech CLI with multiple voices using ElevenLabs API
Bumblebee Hotword for NodeJS
Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.
A set of Angular2 services and components to add speech recognition and synthesis to Angular2 applications
A React component to make transcribing audio and video easier and faster.
Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.
React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.
A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation
A maintained, enhanced fork of react-native-voice
Real-time speech analysis with local LLM using multiple concurrent analysis instructions
TypeScript/JavaScript client SDK for Austack conversational AI
Node.js implementation of the MFCC audio speech analysis algorithm.
n8n community node for ai-coustics speech enhancement API
Speech synthesis component for the 瑞昊 (Ruihao) RN project
JSX for Alexa Skills Kit SSML
TranSpeech is a small voice and text library. It allows you to recognize and synthesize speech using a browser, and translate text.
Helper functions for building speech responses
React native interface for Slang
The Carnegie Mellon Pronouncing Dictionary (CMUdict).
Generate speech audio from super long text, via Amazon Polly and ffmpeg.
Microsoft Cognitive Services Speech SDK for JavaScript
Text to Speech (Pure Client Side)
Speech SDK for Iris Family frontend projects
Artyom is a robust wrapper for the Google Chrome SpeechSynthesis and SpeechRecognition APIs that lets you create a virtual assistant
Create your own verbal commands that map to custom Javascript functions
Voice AI utilities for home loan assistance with India-specific formatting
Salient is a natural language processing and sentiment analysis library
Node binding for continuous voice recognition through vosk-api.
Node-RED nodes for Google Cloud Platform
This is a module to quickly use the Web Speech API to recognize keywords as a user speaks.
React Native module for IBM Bluemix services
TTS (Text to Speech) for Node and Browser
Cordova plugin to support mobile speech recognizer and synthesizer with iFlyTek voice cloud service
An events tree which lets you define a sequence of voice commands.
Simple Text to Speech Offline Using API Browser
Microsoft Cognitive Services Speech SDK for JavaScript
Allows react-native apps to connect to Houndify for speech recognition.
React Native Native Voice library for iOS and Android
A WebSocket-based TTS client with real-time audio streaming and playback
A client for the VOICEVOX API, providing text-to-speech capabilities.
tools for querying supported languages (ASR and TTS) and voices (TTS) by Nuance / Cerence SpeechKit
A library for easily transcribing speech. Convert speech to text in JavaScript
An improved speech recognition library with TypeScript support
Speech module based on iflytekSpeech for react native
The JavaScript API SDK for ITSLanguage.
RNNoise for Unity
Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a web browser using WebAudio API.
JavaScript Recorder based on MediaRecorder from ITSLanguage.
A javascript library for working with praat, textgrids, time aligned audio transcripts, and audio files.
On-device speech-to-text and voice control for web applications with Moonshine.
WhatsApp-Claude-GPT is a WhatsApp chatbot that supports multiple AI providers for chat, optional image generation/editing, and voice (speech-to-text and text-to-speech). It’s built for natural, contextual conversations and can now also handle reminders an
Alexa speech synthesis markup generator (SSML), making it easy to do all the things.
Turn JSON from Amazon AWS Transcribe into VTT files for use as subtitles.
Calculation of sound parameters
Crawl Wikipedia pages and upload TTS to Youtube.
A speech bubble dialog component for React Native.
Speech to Text node for n8n
Easily convert text to speech using Google Wavenet voices on Node-RED.
n8n community node for Wiro AI's Generative AI APIs.
Real-time voice bot library with STT, LLM, and TTS capabilities
Node.js implementation of Baidu Speech
React Native wrapper for Spokestack
Simple way to get TTS with node using TTS-API.com
Record a microphone input stream... in Typescript
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby