@deepgram/sdk
Isomorphic Javascript client for Deepgram
Found 735 results for speech
Isomorphic Javascript client for Deepgram
Web Assembly streaming Opus decoder with Machine Learning enhancements
Speech recognition for your React app
Microsoft Cognitive Services Speech SDK for JavaScript
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
Cloud Speech Client Library for Node.js
Provides text-to-speech functionality.
React Native Text-To-Speech module for Android and iOS
React Native Native Voice library for iOS and Android
Javascript client for the Speechmatics batch jobs API
Add real-time speech to text functionality into your website with no effort
Rev AI makes speech applications easy to build!
Simple pure Javascript implementation of the Cooley-Tukey algorithm. Note: fft-js was chosen as the name since a lot of the FFT implementations on NPM at the time this was published were wrappers for Ruby or C.
A better English POS tagger written in JavaScript
Client for the Speechmatics real-time API
Record a microphone input stream
A library that produces audio transcriptions and translations using the Sonix.AI service.
This is an open source Eleven Labs NodeJS package for converting text to speech using the Eleven Labs API
A javascript library for adding voice commands to your site, using speech recognition
React Native Native Voice library for iOS and Android
An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.
retext plugin to add part-of-speech (POS) tags
Core library to check for valid SSML
Simple cross-browser speech to text using react hooks.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
JavaScript client for Speechly Streaming API
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
SAM - The Software Automatic Mouth
Cordova Plugin for Speech Recognition
Check for valid SSML
React hooks for managing audio inputs and permissions across browsers
A client for Amazon Transcribe using the websocket interface
Porcupine wake word engine for web browsers (via WebAssembly)
React component for the web speech synthesis api
Javascript client library for Soniox Speech-to-Text websocket API
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React hooks for interacting with the Speechmatics Real-Time API
eSpeak-NG speech synthesizer, compiled to JavasScript + WASM
Javascript client for the Speechmatics Flow API
React hooks for interacting with the Speechmatics Flow API
Vosk node API based on Koffi.
Node binding for continuous offline voice recoginition with Vosk library.
React client for Speechly Streaming API
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Polyfill for the Speech Recognition API using Speechly
Microsoft Cognitive Services Speech SDK for JavaScript
Mock SpeechRecognition for headless unit tests
convert an AWS transcribe JSON body into a .vtt file
Node-RED nodes for Google Cloud Platform
Microsoft Speech SDK for browsers
React component for Porcupine Web SDK
Web component for Corti Dictation
AI-powered announcement generator using Piper TTS and OpenAI GPT models
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A speech to text module.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Self-contained multilingual TTS speech synthesizer for Node.js in pure js
Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent
A React component to make transcribing audio and video easier and faster. Forked from @bbc/react-transcript-editor with security updates, modern dependency fixes, and full React 18/19 compatibility.
A React component to make transcribing audio and video easier and faster.
This is an API wrapper for witai speech for nodejs
A Node.js library for Voice Activity Detection using Silero VAD
Part-of-speech tags from the Brill-tagger
Universal text-to-speech library using Microsoft Edge's online TTS service. Works in Node.js and browsers WITHOUT needing Microsoft Edge, Windows, or an API key
A library for using Web Speech API with Angular
Cobra VAD engine for web browsers (via WebAssembly)
Replace window.SpeechRecognition with a mock object and automate your tests
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
A React component to make transcribing audio and video easier and faster.
Node.js bindings for OpenAI's Whisper. Optimized for CPU.
SSML syntax highlighter for the SSML Utilities toolkit
VoicePing Cognitive Services Speech SDK for JavaScript forked from Microsoft
React hook for Cheetah Web SDK
SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.
A high-performance React Native library for text-to-speech on iOS and Android
Koala Noise Suppression engine for web browsers (via WebAssembly)
Transforms the text in speech and hear it using Sonos player or generate an audio file to be used with third parties nodes. Works with voices from Amazon, Google (without credentials as well), Microsoft TTS Azure, ElevenLabs.io TTS or your own voice. You
Speech Processing Flow Graph
PostCSS plugin creates speech bubbles with just 1-2 lines of CSS
React Native Native Voice library for iOS and Android
Text to speech synthesizer
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React Native Native Voice library for iOS and Android, folk from @react-native-voice/voice
SDK for the Novolanguage Speech Analysis API
A small JavaScript library that provides a text to speech conversion using tts-api.com service.
Rhino Speech-to-Intent engine for web browsers (via WebAssembly)
N8N Community Node for Groq Text-to-Speech API integration
SpeakEasy - Unified text-to-speech service with provider abstraction
Node.js implementation of the MFCC audio speech analysis algorithm.
Microsoft Cognitive Services Speech SDK for JavaScript
React Native Text-To-Speech module for Android and iOS
Aivis Cloud CLI - Text-to-speech synthesis and model management
Vosk library for node, with type defenitions and multi-arch support.
React component for Rhino Web SDK
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text
Wasm build based on whisper.cpp.
Orca Text-to-Speech engine for web browsers (via WebAssembly)
Transcribe speech to text in the browser.
A web package for keyword detection
Audio file transcription services. Your speech. Private.
A node js droid language engine built using the sound library inside python library ttastromech, as well as other sound sources found online
The macOS built-in `say` CLI for JavaScript
Java application that allows to transform a text to speech using [Google Translate unofficial Java API](http://code.google.com/p/java-google-translate-text-to-speech/).
Cordova Plugin for Speech Recognition
Picovoice Orca Node.js binding
Backend audio file to text transcription using Web Speech API with Puppeteer
React client for the PrimVoices Agents API
Core library to check for valid SSML
A library that produces audio transcriptions using the SBER Salute Speech service.
Use TikTok TTS from node.js
Apple Foundation Models Framework integration with TypeScript support, Swift bridge, and privacy-first on-device AI processing
Embeddable voice chat widget for Logikron Talk - enables real-time voice conversations with AI agents on any website
Get a Buzzword Phrase. Because sometimes you need enhanced didactic mobility, liberating syndicated transitional projections
Record microphone sond using nodejs
A flexible GUI for interacting with Speech Recognition
Check for valid SSML
React Hook for OpenAI Whisper API with speech recorder.
Node.js module to make your discord bot talk
Text-to-Speech API wrapper for ttsmp3.com
🎤 Make websites that talk. Demo: https://computer_programmer.neocities.org/browser-speech
React Native Text-To-Speech module for Android and iOS
Production-ready speech detection using Silero VAD ONNX model for web browsers
Add Cephable controls to your web-based apps
Leopard Speech-to-Text engine for web browsers (via WebAssembly)
A React Native Voice Recognition Module
Real-time speech recognition with Next-gen Kaldi
Command-line tool to convert text to speech
A React component for real-time transcription and voice agent interactions using Deepgram APIs
Eagle Speaker Recognition engine for web browsers (via WebAssembly)
TypeScript library for building SSML documents
An even smaller speech recognizer
A comprehensive collection of 50+ React hooks for state management, UI interactions, device APIs, async operations, drag & drop, audio/speech, and more. Full TypeScript support with SSR safety.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
Promise based implementation of Yandex Speech Kit API
科大讯飞语音识别 SDK,支持浏览器中实时语音听写功能
Calculate pronounceability for a given word.
A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services
N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.
On-device speech-to-text and voice control for web applications with Moonshine.
Wrapper for the ElevenLabs API
An audioplayer written in React that shows a spectrogram along with the audio.
text-to-speech using espeak cli program
React hook for Leopard Web SDK
A lightweight wrapper for the Web Speech API's SpeechSynthesis, enabling easy queuing and management of text-to-speech utterances.
MCP server for macOS text-to-speech using the say command
A node-red node for translating text to speech using Google's TTS service.
A text-to-speech library for React Native.
Text-to-speech via Fish Audio API
MMIR (Mobile Multimodal Interaction and Relay) library
JavaScript Web API for Text-to-Speech and Speech-to-Text.
Klatt formant synthesizer
提供百度语音 React Native 接口
Converts an audio file to LINEAR16 Google-speech compatible file.
React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.
💬Speech recognition for your React app
Node-RED nodes for Google Cloud Platform
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby
React Native plugin for adding voice using Spokestack
Alexa Voice Service wrapper for the browser.
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in.
A basic TTS manager
TypeScript models for Sermo API (HTTP and WebSocket) generated from OpenAPI specifications
Google STT
Speech buffering that accumulates audio chunks and releases them after natural pause periods
Multi-Modal Input Library for voice, gesture, and traditional inputs.
A React component to make transcribing audio and video easier and faster.
Speech module based on iflytekSpeech for react native
Interactive text-to-speech CLI with multiple voices using ElevenLabs API
A custom React hook for speech recognition built using SpeechRecognition API
A powerful speech recognition library for React Native applications, enabling real-time speech-to-text transcription.
Realtime voice-activity detection (VAD) for Node.js, powered by TEN-VAD WebAssembly backend.
Easily add speech to text functionality into your website
A React component to make transcribing audio and video easier and faster.
Real-time speech analysis with local LLM using multiple concurrent analysis instructions
Live speech transcription library with multi-language support.
n8n community node for ai-coustics speech enhancement API
TypeScript/JavaScript client SDK for Austack conversational AI
A simple JavaScript speech recognition library.
With this adapter you can control ioBroker with voice in many different languages
node.js module for Yandex speech systems (ASR & TTS)
JSX for Alexa Skills Kit SSML
The Carnegie Mellon Pronouncing Dictionary (CMUdict).
Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.
Text to Speech (Pure Client Side)
Node.js implementation of the MFCC audio speech analysis algorithm.
Text-to-speech via Fish Audio API
Voice AI utilities for home loan assistance with India-specific formatting
Microsoft Cognitive Services Speech SDK for JavaScript
A powerful React Native library for integrating AI-powered voice agents into mobile applications. Features real-time voice communication, intelligent speech processing, customizable UI components, and comprehensive event handling for building conversation
Node-RED nodes for Google Cloud Platform
Unlock Sign Language Recognition, Avatar, and Speech Recognition.
speech recognition cli and api for node using electron
Microsoft Cognitive Services Speech SDK for JavaScript
TTS (Text to Speech) for Node and Browser
The JavaScript API SDK for ITSLanguage.
React Native Native Voice library for iOS and Android
Simple Text to Speech Offline Using API Browser
A WebSocket-based TTS client with real-time audio streaming and playback
A client for the VOICEVOX API, providing text-to-speech capabilities.
React Native speech recognition component for iOS 10+
Salient is a natural language processing and sentiment analysis library
RNNoise for Unity
Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.
React Native module for IBM Watson services
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby
Updated cordova-plugin-speechrecognition to remove onfulfilled() errors
WhatsApp-Claude-GPT is a WhatsApp chatbot that supports multiple AI providers for chat, optional image generation/editing, and voice (speech-to-text and text-to-speech). It’s built for natural, contextual conversations and can now also handle reminders an
React Native module for IBM Bluemix services
Alexa speech synthesis markup generator (SSML), making it easy to do all the things.
Calculation of sound parameters
React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.
Crawl Wikipedia pages and upload TTS to Youtube.
tools for querying supported languages (ASR and TTS) and voices (TTS) by Nuance / Cerence SpeechKit
Speech to Text node for n8n
A class to handle microphone permissions, start and observe speech input
React native interface for Slang
A lib to improve your apps search functionality by adding the opt-outable speech recognition, multiple parallel searches and automatic result matching
React Native Native Voice library for iOS and Android
Open source cross platform decentralized always-on speech recognition framework
百度语音的Nodejs实现
n8n community node for Wiro AI's Generative AI APIs.
Node binding for continuous voice recoginition through pocketsphinx.
Easily convert text to speech using Google Wavenet voices on Node-RED.
Record a microphone input stream... in Typescript
Simple way to get TTS with node using TTS-API.com
React Hook for OpenAI Whisper API with speech recorder and silence removal built-in in Qubby
Node binding for continuous voice recoginition through vosk-api.
Microsoft Speech SDK for browsers (using CRIS endpoint)
A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.
NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.
Lightweight Speech Recognition Library for Electron. Based on [nodejs-speech-kiosk-usercase](https://www.npmjs.com/package/nodejs-speech-kiosk-usercase) and [vosk-api](https://github.com/alphacep/vosk-api).
This is simple module for React Native for Android Text to Speech Engine
Extract formant features such as frequency, power, energy, and bandwidth of formants at syllable or word level from audio sources in a web browser using WebAudio API.
Speech Block Tool for Editor.js
Bumblebee Hotword for NodeJS
Watches your microphone stream to pull out speech segments that you can save to a file, or send to an endpoint for speech recognition. Ideal for saving audio for conversation monitoring and assistant apps that work like Google Home or Amazon Alexa.
A speech bubble dialog component for React Native.
The text generator that uses the soviet speech code. No LLM required!
JavaScript Recorder based on MediaRecorder from ITSLanguage.
Клиент для работы с навыками Яндекс.Диалогов Алисы локально
Node binding for continuous offline voice recoginition with Vosk library.
React Native Voice library with enhanced hold recording functionality and React Native 0.80+ compatibility fixes
React Native Native Voice library for iOS and Android
Text to speech REST API for multiple TTS engines
React component and hook to initiate a SpeechRecognition session
A Promise based Node.js/TypeScript port of the gTTS python library
Package for simplifying the Speech Recognition and Speech Utterence process.