@deepgram/sdk
Isomorphic Javascript client for Deepgram
Found 766 results for speech
Isomorphic Javascript client for Deepgram
Microsoft Cognitive Services Speech SDK for JavaScript
Speech recognition for your React app
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
Web Assembly streaming Opus decoder with Machine Learning enhancements
Cloud Speech Client Library for Node.js
Provides text-to-speech functionality.
React Native Text-To-Speech module for Android and iOS
React Native Native Voice library for iOS and Android
Add real-time speech to text functionality into your website with no effort
Rev AI makes speech applications easy to build!
Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.
A better English POS tagger written in JavaScript
Client for the Speechmatics real-time API
Javascript client for the Speechmatics batch jobs API
Record a microphone input stream
This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API
A javascript library for adding voice commands to your site, using speech recognition
A library that produces audio transcriptions and translations using the Sonix.AI service.
retext plugin to add part-of-speech (POS) tags
An easy-to-use React.js library that leverages the Web Speech API to convert text to speech.
React Native Native Voice library for iOS and Android
JavaScript client for Speechly Streaming API
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
Core library to check for valid SSML
Simple cross-browser speech to text using react hooks.
Javascript client library for Soniox Speech-to-Text websocket API
React hooks for managing audio inputs and permissions across browsers
Check for valid SSML
React component for the web speech synthesis api
React hooks for interacting with the Speechmatics Real-Time API
An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Cordova Plugin for Speech Recognition
Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections
SAM - The Software Automatic Mouth
Porcupine wake word engine for web browsers (via WebAssembly)
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A client for Amazon Transcribe using the websocket interface
A React component to make transcribing audio and video easier and faster.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
eSpeak-NG speech synthesizer, compiled to JavasScript + WASM
Javascript client for the Speechmatics Flow API
React hooks for interacting with the Speechmatics Flow API
Vosk node API based on Koffi.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Polyfill for the Speech Recognition API using Speechly
Node binding for continuous offline voice recoginition with Vosk library.
A library for using Web Speech API with Angular
Mock SpeechRecognition for headless unit tests
React client for Speechly Streaming API
convert an AWS transcribe JSON body into a .vtt file
Web component for Corti Dictation
Microsoft Cognitive Services Speech SDK for JavaScript
A speech to text module.
Speech Synthesized for node js
Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent
A React component library for speech-to-text conversion with dual-mode support (API mode + native browser Speech Recognition)
React component for Porcupine Web SDK
VBEE SDK for TypeScript/JavaScript - Text-to-Speech and Voice AI Services
A Vue.js voice agent plugin for real-time voice communication via WebSocket
SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Microsoft Speech SDK for browsers
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
Self-contained multilingual TTS speech synthesizer for Node.js in pure js
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
Cobra VAD engine for web browsers (via WebAssembly)
Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You
Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key
Node-RED nodes for Google Cloud Platform
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React hook for Cheetah Web SDK
Replace window.SpeechRecognition with a mock object and automate your tests
Part-of-speech tags from the Brill-tagger
React Native Native Voice library for iOS and Android
This is an API wrapper for witai speech for nodejs
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text
A Node.js library for Voice Activity Detection using Silero VAD
A React component to make transcribing audio and video easier and faster.
MMIR (Mobile Multimodal Interaction and Relay) library
Text to speech synthesizer
SDK for the Novolanguage Speech Analysis API
Microsoft Cognitive Services Speech SDK for JavaScript
Rhino Speech-to-Intent engine for web browsers (via WebAssembly)
PostCSS plugin creates speech bubbles with just 1-2 lines of CSS
科大讯飞语音识别和语音合成SDK,支持Vue2/3和React
React Native Native Voice library for iOS and Android, folk from @react-native-voice/voice
Node.js implementation of the MFCC audio speech analysis algorithm.
Narratify - Transform your web content into captivating audio experiences. A React TTS narration system for Next.js applications.
Speech Processing Flow Graph
Real-time speech recognition with Next-gen Kaldi
SSML tag remover for the SSML Utilities toolkit
A high-performance React Native library for text-to-speech on iOS and Android
Node.js bindings for OpenAI's Whisper. Optimized for CPU.
Leopard Speech-to-Text engine for web browsers (via WebAssembly)
🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech
TypeScript client SDK for TTC Speech Service - Real-time Speech-to-Text and Text-to-Speech with WebSocket streaming
Speech module based on iflytekSpeech for react native
Eagle Speaker Recognition engine for web browsers (via WebAssembly)
React client for the PrimVoices Agents API
A React text-to-speech component
Koala Noise Suppression engine for web browsers (via WebAssembly)
Orca Text-to-Speech engine for web browsers (via WebAssembly)
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Node.js module to make your discord bot talk
A small JavaScript library that provides a text to speech conversion using tts-api.com service.
React component for Rhino Web SDK
React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.
SSML syntax highlighter for the SSML Utilities toolkit
React hook for Leopard Web SDK
React Native Text-To-Speech module for Android and iOS
Lightweight DOM utility for creating gentle, accessible UI components for families and children.
🎤 The friendly TTS CLI - Just run 'koko' for instant text-to-speech magic!
Java application that allows to transform a text to speech using [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).
Use TikTok TTS from node.js
Microsoft Cognitive Services Speech SDK for JavaScript
JavaScript modules for Mozilla's cloud speech recognition API
A React component to make transcribing audio and video easier and faster.
A React Native library for converting speech to text.
百度语音的Nodejs实现
TypeScript client library for VOICEVOX ENGINE OSS. Auto-generated from OpenAPI schema using openapi-fetch and openapi-typescript.
Cordova Plugin for Speech Recognition
Core library to check for valid SSML
Vosk library for node, with type defenitions and multi-arch support.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A utility module for easy integration with Carter
Check for valid SSML
Wrapper for the ElevenLabs API
Node.js implementation of the MFCC audio speech analysis algorithm.
A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation
Add Cephable controls to your web-based apps
A basic TTS manager
A React Native Voice Recognition Module
Picovoice Orca Node.js binding
A library that produces audio transcriptions using the SBER Salute Speech service.
React Native Text-To-Speech module for Android and iOS
A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online
[](https://www.npmjs.com/package/react-deepspeech)
A flexible React component for conversational AI input with voice-to-text, file upload, and AI processing capabilities
A powerful speech recognition library for React Native applications, enabling real-time speech-to-text transcription.
Text-to-Speech API wrapper for ttsmp3.com
提供百度语音 React Native 接口
A web package for keyword detection
lib for recognition and synthesis of speech
Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a web browser using WebAudio API.
React Native Native Voice library for iOS and Android
Generate speech audio from super long text, via Amazon Polly and ffmpeg.
On-device speech-to-text and voice control for web applications with Moonshine.
Production-ready speech detection using Silero VAD ONNX model for web browsers
JavaScript Web API for Text-to-Speech and Speech-to-Text.
Microsoft Bing Speech API client
A custom React hook for speech recognition built using SpeechRecognition API
JavaScript Recorder based on MediaRecorder from ITSLanguage.
VoicePing Cognitive Services Speech SDK for JavaScript forked from Microsoft
A React component for real-time transcription and voice agent interactions using Deepgram APIs
Speech to text with timestamps and speaker diarization
A node-red node for translating text to speech using Google's TTS service.
A flexible GUI for interacting with Speech Recognition
Promise based implementation of Yandex Speech Kit API
The JavaScript API SDK for ITSLanguage.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
text-to-speech using espeak cli program
Alexa Voice Service wrapper for the browser.
A lightweight JavaScript library for adding voice command and synthesis capabilities to web applications using the Web Speech API.
React Native plugin for adding voice using Spokestack
RNNoise for Unity
Transcribe speech to text in the browser.
Клиент для работы с навыками Яндекс.Диалогов Алисы локально
Wasm build based on whisper.cpp.
Calculation of sound parameters
Speech Block Tool for Editor.js
A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.
Open source cross platform decentralized always-on speech recognition framework
React Hook for OpenAI Whisper API with speech recorder.
Simple way to get TTS with node using TTS-API.com
Speech module based on iflytekSpeech for react native
Calculate pronounceability for a given word.
The Carnegie Mellon Pronouncing Dictionary (CMUdict).
Fork of React Native Native Voice library for iOS and Android
Node binding for continuous voice recoginition through pocketsphinx.
Updated cordova-plugin-speechrecognition to remove onfulfilled() errors
WhatsApp-Claude-GPT is a WhatsApp chatbot that supports multiple AI providers for chat, optional image generation/editing, and voice (speech-to-text and text-to-speech). It’s built for natural, contextual conversations and can now also handle reminders an
React native interface for Slang
JSX for Alexa Skills Kit SSML
Generate speech audio from super long text through machine, via Google TTS, ffmpeg.
A text-to-speech library for React Native.
Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.
N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.
科大讯飞语音识别 SDK,支持浏览器中实时语音听写功能
Microsoft Cognitive Services Speech SDK for JavaScript
Test a word for pronounceability
Simple cross-browser speech to text using react hooks.
React Native Native Voice library for iOS and Android
AI-powered announcement generator using Piper TTS and OpenAI GPT models
Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.
Salient is a natural language processing and sentiment analysis library
SpeakEasy - Unified text-to-speech service with provider abstraction
Web Speech API adapter to use Cognitive Services Speech Services for both speech-to-text and text-to-speech service.
Jarvis MCP – Browser-based voice input/output for AI Assistant conversations via MCP (Model Context Protocol)
processing audio media used for speech recognition
An even smaller speech recognizer
Alexa speech synthesis markup generator (SSML), making it easy to do all the things.
React wrapper for @speechmatics/diarized-transcription
Browser-based voice input/output for AI Assistant conversations via MCP (Model Context Protocol)
With this adapter you can control ioBroker with voice in many different languages
Node-RED nodes for Google Cloud Platform
React Native module for IBM Bluemix services
A simple Speech-to-Speech library using Pollinations AI for Text-to-Speech and Speech-to-Text in the browser
Create your own verbal commands that map to custom Javascript functions
A speech bubble dialog component for React Native.
Speech module based on iflytekSpeech for react native
Node.js bindings for ai-coustics speech enhancement SDK
A React component to make transcribing audio and video easier and faster.
React Native Native Voice library for iOS and Android
React Native Native Voice library for iOS and Android
n8n node for Berget AI speech-to-text models
### Usage
Node binding for continuous offline voice recoginition with Vosk library.
React Native Text-To-Speech module for Android and iOS
A natural language generator (NLG) that articulates concepts as words, phrases, and sentences.
speech recognition cli and api for node using electron
React Native Native Voice library for iOS and Android
Class wrapped around the SpeechRecognition Web API
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
**A speech synthesizer plugin for [loco-tts](https://gitlab.com/loco-tts/core).**
A class to handle microphone permissions, start and observe speech input
A speech act module for the gebo agent
The macOS built-in `say` CLI for JavaScript
Components for AI, that I constantly use
Official Node.js SDK for the NeoSpeech Text-to-Speech API
Helper functions for building speech responses
React component and hook to initiate a SpeechRecognition session
Expand custom utterance slots of phrases, to use with Alexa Skills Kit Sample Utterances
Google Speech to Text plugin using websockets to communicate with backend
Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.
Crawl Wikipedia pages and upload TTS to Youtube.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
A WebSocket-based TTS client with real-time audio streaming and playback
A lib to improve your apps search functionality by adding the opt-outable speech recognition, multiple parallel searches and automatic result matching
Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.
An audioplayer written in React that shows a spectrogram along with the audio.
Enhanced chat widget combining modern n8n styling with advanced voice features and alert mode
A JavaScript library for Ten VAD (Voice Activity Detection) based on WebAssembly
TTS (Text to Speech) for Node and Browser
Updated Node binding for continuous offline voice recoginition with Vosk library.
N8N Community Node for Groq Text-to-Speech API integration
react-native message bubble, both ios and android
NodeJS service + client package for using Microsoft Translator in front end applications
Record microphone sond using nodejs