@deepgram/captions
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
Found 200 results for transcription
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
Library for fetching temporary keys for Speechmatics APIs
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React hooks for managing audio inputs and permissions across browsers
A client for Amazon Transcribe using the websocket interface
Javascript client library for Soniox Speech-to-Text websocket API
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React hooks for interacting with the Speechmatics Real-Time API
Voice transcription at your fingertips - Instantly convert speech to text with a simple keyboard shortcut
The 134,000+ words and their pronunciations in the CMU pronouncing dictionary
React hooks for interacting with the Speechmatics Flow API
NodeJS wrapper for Deepgram
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
A speech to text module.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Web component for Corti Dictation
Make your app understand language. Summarize conversations, categorize articles, and more.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Official SDK for Meeting BaaS API - https://meetingbaas.com
SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.
Voice-To-Text recorder with sound notifications - optimized for macOS
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data
This can use to convert voice to text real time in device
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
React hook for Cheetah Web SDK
Advanced n8n node for Puter.js AI with RAG agentic capabilities, document processing, audio transcription, Supabase integration, and cost-optimized model priorities
Nó n8n para integração com a API da ElevenLabs incluindo Speech-to-Text, Text-to-Speech e Conversational AI
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Convert AWS transcription JSON to srt
Audio file transcription services. Your speech. Private.
Unlimited AI models and Canvas library. Supports ts & js (supports front/back end).
Node.js SDK for Fireflies.ai API
Unofficial Node.js API client for the Caret HTTP API
A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text
Dictate Button (Web Component)
Native module for An another Node binding of whisper.cpp (win32-x64-cuda)
Korean transliteration tool for JavaScript
Native module for An another Node binding of whisper.cpp (darwin-arm64)
Native module for An another Node binding of whisper.cpp (win32-x64-vulkan)
🎙️ Real-time conversational audio with AI transcription. Build ChatGPT-style voice interfaces in minutes with <300ms latency
A TypeScript library for extracting and working with YouTube video transcripts.
An another Node binding of whisper.cpp to make same API with whisper.rn as much as possible.
Google Meet integration plugin for ElizaOS - manage meetings, get participant info, and access meeting artifacts via Google Meet REST API
Steal money from big companies
n8n node for GetTranscribe API integration - transcribe videos from Instagram, TikTok, YouTube and more
Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data
Native module for An another Node binding of whisper.cpp (linux-x64-cuda)
Native module for An another Node binding of whisper.cpp (linux-x64-vulkan)
Native module for An another Node binding of whisper.cpp (linux-x64)
Native module for An another Node binding of whisper.cpp (linux-arm64-vulkan)
Native module for An another Node binding of whisper.cpp (linux-arm64-cuda)
AudioPod SDK for Node.js and React - Professional Audio Processing powered by AI
Phonetic transcription tools with react js for input, outputing, etc
Native module for An another Node binding of whisper.cpp (win32-x64)
Perform speech-to-text on audio files within your n8n workflows.This node provides local audio transcription, no internet or third-party APIs required for processing.
Native module for An another Node binding of whisper.cpp (win32-arm64)
Native module for An another Node binding of whisper.cpp (darwin-x64)
Backend audio file to text transcription using Web Speech API with Puppeteer
Native module for An another Node binding of whisper.cpp (linux-arm64)
Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly
Native module for An another Node binding of whisper.cpp (win32-arm64-vulkan)
N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.
CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama
Leopard Speech-to-Text engine for web browsers (via WebAssembly)
This can use to convert voice to text real time in device
Mirador 3 plugin to render a hidden (but selectable) or visible text overlay
A React component for real-time transcription and voice agent interactions using Deepgram APIs
Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data
Picovoice Leopard React Native binding
A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services
JavaScript SDK for interacting with CastleGuard APIs
Model Context Protocol server for AssemblyAI transcription services
Picovoice Cheetah React Native binding
A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.
A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models
A simple universal transcriber for languages with unicode characters.
Real-time speech-to-text CLI tool using OpenAI Realtime API
pre-compiled builds of liblouis for js
Un simple transcriptor fonológico para la lengua española.
Live SDK for Maestra AI transcription services
Model Context Protocol server for AssemblyAI transcription services
An AI code writer application using OpenAI APIs for audio transcription and chat completion.
Real-time speech analysis with local LLM using multiple concurrent analysis instructions
Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly
Own your transcription workflow. Press Cmd+Shift+X, speak, get text in clipboard instantly.
CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama
A command-line utility to generate transcripts from a Discord channel
A NodeJS library for transcribing audio/video to text.
Generate subtitles for your videos via Automatic Speech Recognition.
Transposer connector is a PeerTube language tool plugin to transcribe and translate with Whisper
javascript bindings for liblouis
👂 Realtime speech-to-text (S2T) transcription with RxJS
A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.
React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.
Official JavaScript SDK for the VidNavigator Developer API
Event-driven MCP server for Recall.ai meeting transcription with enhanced speaker identification and local storage
NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.
A beautiful, production-ready voice transcription package for React applications using the Web Speech API
React Native Pitch Tracker implemented with Tensorflow Lite Model
accentuation, syllabification and transcription utilities for Modern Greek
Official TypeScript SDK for Gladia - State-of-the-art Speech to Text API
A custom Annotorious editor/view plugin
AI-Powered Audio Transcription Desktop Application
🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.
StrangeText Transcription
base for twitter reply bot using autohook
Modern voice feedback SDK with beautiful UI components and AI-powered analysis
Djelia JavaScript SDK - Advanced AI for African Languages
React Native Pitch Tracker implemented with Tensorflow Lite Model
n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription
openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription
A simple tool to merge multiple WebVTT (.vtt) files into a single file.
Multi-tenant ElevenLabs MCP server for advanced speech-to-text transcription with speaker diarization
iOS SFSpeechRecognizer bridge module for React Native
Easy and crystal-clear API for transcription words.
👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speeh-to-text
Mirador 3 plugin which renders a separate window, with OCR text
Automatically generate and overlay subtitles for any video.
Audio and video transcription using ElevenLabs Scribe
Documentation generator for ES6.
A system to download the most recent episode of a plugin, transcribe it with the OpenAI Whisper API and then use GPT to determine one takeaway to apply from the episode.
An application for getting audio files with pronunciation from public dictionaries
Common JS version of the 134,000+ words and their pronunciations in the CMU pronouncing dictionary
Transform kana to en|ru language or vice versa, using specific transliteration system; convert one kana to the other syllabary
Creates Live Transcription of a media input stream in multiple languages
Strange Text Transliterator (GOTO: spongescribe)
Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.
Live speech transcription library with multi-language support.
PeerTube plugin transcribe and translate
React hook for real-time audio transcription using Gladia API
This is a node.js module used to transcribe wav files using Olaris v2 realtime transcription service
A client for Amazon Transcribe using the websocket interface
Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look
Node and Express backend for easy MongoDB storage of Polyanno annotations
Strange Text Transliterator (GOTO: spongescribe)
Generate subtitles for your videos via Automatic Speech Recognition.
[](https://www.npmjs.com/package/@voxextractlabs/vox-whisper) [](./LICENSE) [
Generate subtitles for your videos via Automatic Speech Recognition.
simple bing voice recognition wrapper
Voicely - A CLI tool to transcribe audio files and summarize the transcriptions using OpenAI's APIs.
A library for easily transcribing speech. Convert speech to text in JavaScript
👂 RxJS operator for realtime speech-to-text (STT/S2T) using Google speeh-to-text
React Native module for transcribing WAV files using WhisperKit
React component for speech-to-text transcription with silence detection
Native Node.js bindings for OpenAI's Whisper using whisper.cpp. High-performance local speech-to-text with custom model support.
AI-powered CLI tool that transforms video content into intelligent, structured notes and scripts
javascript bindings for liblouis
A simple recording and transcription module.
Strangetext Transcription - Use: 'spongescribe'
A command-line tool to create video presentations with title cards and transcriptions
Glǽmscribe (also written Glaemscribe) is a software dedicated to the transcription of texts between writing systems, and more specifically dedicated to the transcription of J.R.R. Tolkien's invented languages to some of his devised writing systems.
Generate subtitles for your videos via Automatic Speech Recognition.
CLI tool that transcribes audio/video files using AssemblyAI's API and outputs formatted markdown transcripts
Node wrapper for Deepgram
A CLI tool to generate karaoke-style subtitles on videos using Whisper and FFmpeg.
A universal Text-to-Speech (TTS) and Speech-to-Text (STT) SDK supporting multiple providers (OpenAI, Google Gemini, Deepgram, Groq PlayAI, Cartesia, AssemblyAI) with audio merging capabilities
TypeScript SDK for the Speechall API - A powerful and flexible speech-to-text service
Speech recognition library that uses web-based services to convert speech to text in multiple languages
make friendly URLs by stripping out non lating chars, and convert other chars to their latin counterparts
A CLI tool to transcribe voice to text with interactive UI
Real-time speech transcription and translation SDK
👂 RxJS operator for realtime speech-to-text (STT/S2T) using AWS Transcribe
TypeScript client library for Realtime Speech-to-Text server
A simple Node.js library to transcribe and summarize meeting recordings using OpenAI's GPT model and Whisper.
A node.js writable audio stream for google Speech-to-Text
Simple, lightweight, and fast Node.js module for enabling DNA sequences.
A package for converting DNA sequences into RNA
Fireflies.ai API wrapper
A React component for recording and transcribing audio using the Web Audio API and OpenAI.
This is the official alinkeo core npm package
Generate subtitles for your videos via Automatic Speech Recognition.
A JavaScript interface to the CMU Pronouncing Dictionary
N8N node for processing audio files via an ASR service
Node-RED node that simulates the life cycle of an amoeba
Official SentiraAI TypeScript SDK
Node.js + TypeScript library for extracting text from Douyin/TikTok videos
Unofficial Rev AI Node.js client
Privacy-first P2P meeting transcription and AI SDK
Mirador 3 plugin to render a hidden (but selectable) or visible text overlay
A React component for editing SRT and VTT subtitles directly in a textarea, styled with TailwindCSS.
Welcome to MeeshX! Choose your favorite provider and transcribe your audio content in less than 5 minutes.
CLI tool for real-time audio transcription using OpenAI's Whisper API
React wrapper for @speechmatics/diarized-transcription
Embedable vocal intelligence
A library for AI-powered audio transcription with local and remote server fallback.
A package to transcribe media files using Deepgram with speaker-labeled subtitles (SRT/VTT).
react component for transcription with ability to edit the words as well as the words. synchronization is done between words and wave-forms.
Utility for grouping transcribed text according to diarization
React components for transcribing with easyscribe.org
A simple module that simulates the life cycle of an amoeba at the molecular level by using a (semi) TRNG.
CLI for using the tafrigh library.
Generate subtitles for your videos via Automatic Speech Recognition.
A hyphenation library for JavaScript
A CLI tool for transcribing audio files to subtitles