@deepgram/captions
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
Found 199 results for transcription
Library for fetching temporary keys for Speechmatics APIs
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A client for Amazon Transcribe using the websocket interface
React hooks for managing audio inputs and permissions across browsers
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React hooks for interacting with the Speechmatics Real-Time API
JavaScript client library for the Soniox Speech-to-Text websocket API
The 134,000+ words and their pronunciations in the CMU pronouncing dictionary
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React hooks for interacting with the Speechmatics Flow API
Voice transcription at your fingertips - Instantly convert speech to text with a simple keyboard shortcut
NodeJS wrapper for Deepgram
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Audio file transcription services. Your speech. Private.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Make your app understand language. Summarize conversations, categorize articles, and more.
A speech to text module.
Web component for Corti Dictation
Official SDK for Meeting BaaS API - https://meetingbaas.com
Voice-To-Text recorder with sound notifications - optimized for macOS
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.
Can be used to convert voice to text in real time on the device
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
React hook for Cheetah Web SDK
SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.
Advanced n8n node for Puter.js AI with RAG agentic capabilities, document processing, audio transcription, Supabase integration, and cost-optimized model priorities
n8n node for integration with the ElevenLabs API, including Speech-to-Text, Text-to-Speech, and Conversational AI
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Convert AWS transcription JSON to srt
Dictate Button (Web Component)
Unofficial Node.js API client for the Caret HTTP API
Unlimited AI models and Canvas library. Supports ts & js (supports front/back end).
A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text
Node.js SDK for Fireflies.ai API
CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama
Native module for another Node binding of whisper.cpp (win32-x64-cuda)
Korean transliteration tool for JavaScript
Native module for another Node binding of whisper.cpp (darwin-arm64)
Native module for another Node binding of whisper.cpp (win32-x64-vulkan)
🎙️ Real-time conversational audio with AI transcription. Build ChatGPT-style voice interfaces in minutes with <300ms latency
A TypeScript library for extracting and working with YouTube video transcripts.
n8n node for GetTranscribe API integration - transcribe videos from Instagram, TikTok, YouTube and more
Google Meet integration plugin for ElizaOS - manage meetings, get participant info, and access meeting artifacts via Google Meet REST API
Native module for another Node binding of whisper.cpp (linux-x64-cuda)
Another Node binding of whisper.cpp that keeps its API as close to whisper.rn as possible.
Audio and video transcription using ElevenLabs Scribe
Native module for another Node binding of whisper.cpp (linux-x64-vulkan)
Phonetic transcription tools with React for input, output, etc.
AudioPod SDK for Node.js and React - Professional Audio Processing powered by AI
Native module for another Node binding of whisper.cpp (linux-x64)
Native module for another Node binding of whisper.cpp (linux-arm64-vulkan)
Backend audio file to text transcription using Web Speech API with Puppeteer
Native module for another Node binding of whisper.cpp (win32-arm64)
Perform speech-to-text on audio files within your n8n workflows. This node provides local audio transcription; no internet or third-party APIs are required for processing.
Native module for another Node binding of whisper.cpp (linux-arm64-cuda)
Native module for another Node binding of whisper.cpp (linux-arm64)
Native module for another Node binding of whisper.cpp (win32-arm64-vulkan)
Native module for another Node binding of whisper.cpp (darwin-x64)
Native module for another Node binding of whisper.cpp (win32-x64)
Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly
Mirador 3 plugin to render a hidden (but selectable) or visible text overlay
Leopard Speech-to-Text engine for web browsers (via WebAssembly)
Official JavaScript SDK for the VidNavigator Developer API
A React component for real-time transcription and voice agent interactions using Deepgram APIs
Djelia JavaScript SDK - Advanced AI for African Languages
JavaScript SDK for interacting with CastleGuard APIs
A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services
Model Context Protocol server for AssemblyAI transcription services
Picovoice Leopard React Native binding
Picovoice Cheetah React Native binding
A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.
A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models
A simple universal transcriber for languages with unicode characters.
pre-compiled builds of liblouis for js
Live SDK for Maestra AI transcription services
Transposer connector is a PeerTube language tool plugin to transcribe and translate with Whisper
A simple phonological transcriber for the Spanish language.
Live speech transcription library with multi-language support.
Model Context Protocol server for AssemblyAI transcription services
An AI code writer application using OpenAI APIs for audio transcription and chat completion.
Real-time speech analysis with local LLM using multiple concurrent analysis instructions
CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama
A command-line utility to generate transcripts from a Discord channel
Generate subtitles for your videos via Automatic Speech Recognition.
A NodeJS library for transcribing audio/video to text.
javascript bindings for liblouis
👂 Realtime speech-to-text (S2T) transcription with RxJS
React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.
A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.
Event-driven MCP server for Recall.ai meeting transcription with enhanced speaker identification and local storage
React Native Pitch Tracker implemented with Tensorflow Lite Model
NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.
A beautiful, production-ready voice transcription package for React applications using the Web Speech API
accentuation, syllabification and transcription utilities for Modern Greek
Can be used to convert voice to text in real time on the device
A custom Annotorious editor/view plugin
AI-Powered Audio Transcription Desktop Application
🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.
n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription
StrangeText Transcription
base for twitter reply bot using autohook
Modern voice feedback SDK with beautiful UI components and AI-powered analysis
Multi-tenant ElevenLabs MCP server for advanced speech-to-text transcription with speaker diarization
React Native Pitch Tracker implemented with Tensorflow Lite Model
A simple tool to merge multiple WebVTT (.vtt) files into a single file.
openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription
iOS SFSpeechRecognizer bridge module for React Native
Mirador 3 plugin which renders a separate window, with OCR text
Automatically generate and overlay subtitles for any video.
👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speech-to-text
Official TypeScript SDK for Gladia - State-of-the-art Speech to Text API
Documentation generator for ES6.
A system to download the most recent episode of a plugin, transcribe it with the OpenAI Whisper API and then use GPT to determine one takeaway to apply from the episode.
A Node.js module for transcribing WAV files using the Olaris v2 real-time transcription service
Transform kana to en|ru language or vice versa, using specific transliteration system; convert one kana to the other syllabary
Creates Live Transcription of a media input stream in multiple languages
An application for getting audio files with pronunciation from public dictionaries
Common JS version of the 134,000+ words and their pronunciations in the CMU pronouncing dictionary
Strange Text Transliterator (GOTO: spongescribe)
An easy, crystal-clear API for transcribing words.
A CLI tool to generate karaoke-style subtitles on videos using Whisper and FFmpeg.
A client for Amazon Transcribe using the websocket interface
React component for speech-to-text transcription with silence detection
Node and Express backend for easy MongoDB storage of Polyanno annotations
Strange Text Transliterator (GOTO: spongescribe)
PeerTube plugin transcribe and translate
Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look
TypeScript SDK for the Speechall API - A powerful and flexible speech-to-text service
Generate subtitles for your videos via Automatic Speech Recognition.
A universal Text-to-Speech (TTS) and Speech-to-Text (STT) SDK supporting multiple providers (OpenAI, Google Gemini, Deepgram, Groq PlayAI, Cartesia, AssemblyAI) with audio merging capabilities
Generate subtitles for your videos via Automatic Speech Recognition.
Strange Text Transliterator (GOTO: spongescribe)
The Palladius system for transcribing Chinese characters into the Cyrillic alphabet
Common functions for Sentira AI
A simple recording and transcription module.
simple bing voice recognition wrapper
Voicely - A CLI tool to transcribe audio files and summarize the transcriptions using OpenAI's APIs.
Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.
React Native module for transcribing WAV files using WhisperKit
👂 RxJS operator for realtime speech-to-text (STT/S2T) using Google speech-to-text
A library for easily transcribing speech. Convert speech to text in JavaScript
javascript bindings for liblouis
A command-line tool to create video presentations with title cards and transcriptions
Strangetext Transcription - Use: 'spongescribe'
Native Node.js bindings for OpenAI's Whisper using whisper.cpp. High-performance local speech-to-text with custom model support.
Generate subtitles for your videos via Automatic Speech Recognition.
Node wrapper for Deepgram
Glǽmscribe (also written Glaemscribe) is software dedicated to the transcription of texts between writing systems, and more specifically to the transcription of J.R.R. Tolkien's invented languages into some of his devised writing systems.
React hook for real-time audio transcription using Gladia API
Speech recognition library that uses web-based services to convert speech to text in multiple languages
AI-powered CLI tool that transforms video content into intelligent, structured notes and scripts
CLI tool that transcribes audio/video files using AssemblyAI's API and outputs formatted markdown transcripts
👂 RxJS operator for realtime speech-to-text (STT/S2T) using AWS Transcribe
TypeScript client library for Realtime Speech-to-Text server
Real-time speech transcription and translation SDK
N8N node for processing audio files via an ASR service
Make friendly URLs by stripping out non-Latin characters and converting other characters to their Latin counterparts
A CLI tool to transcribe voice to text with interactive UI
A React component for editing SRT and VTT subtitles directly in a textarea, styled with TailwindCSS.
Real-time speech-to-text CLI tool using OpenAI Realtime API
React wrapper for @speechmatics/diarized-transcription
Embeddable vocal intelligence
Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data
A library for AI-powered audio transcription with local and remote server fallback.
Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly
Steal money from big companies
A package to transcribe media files using Deepgram with speaker-labeled subtitles (SRT/VTT).
React component for transcription with the ability to edit the words. Words are kept synchronized with the waveforms.
Utility for grouping transcribed text according to diarization
Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data
React components for transcribing with easyscribe.org
A simple module that simulates the life cycle of an amoeba at the molecular level by using a (semi) TRNG.
CLI for using the tafrigh library.
Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data
A CLI tool for transcribing audio files to subtitles
Generate subtitles for your videos via Automatic Speech Recognition.
A hyphenation library for JavaScript