Found 200 results for transcription

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

@speechmatics/auth

Library for fetching temporary keys for Speechmatics APIs

sherpa-onnx-node

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@speechmatics/browser-audio-input-react

React hooks for managing audio inputs and permissions across browsers

aws-transcribe

A client for Amazon Transcribe using the websocket interface

@soniox/speech-to-text-web

Javascript client library for Soniox Speech-to-Text websocket API

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@speechmatics/real-time-client-react

React hooks for interacting with the Speechmatics Real-Time API

tap2talk

Voice transcription at your fingertips - Instantly convert speech to text with a simple keyboard shortcut

cmu-pronouncing-dictionary

The 134,000+ words and their pronunciations in the CMU pronouncing dictionary

@speechmatics/flow-client-react

React hooks for interacting with the Speechmatics Flow API

sherpa-onnx-win-ia32

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

sherpa-onnx-darwin-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

oneai

Make your app understand language. Summarize conversations, categorize articles, and more.

sherpa-onnx-win-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@meeting-baas/sdk

Official SDK for Meeting BaaS API - https://meetingbaas.com

speech-into-text

SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

koshi-vox

Voice-To-Text recorder with sound notifications - optimized for macOS

sherpa-onnx-linux-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@liveprompt/mcp-server

Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

@ascendtis/react-native-voice-to-text

This can use to convert voice to text real time in device

@picovoice/cheetah-web

Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

n8n-nodes-puter-ai

Advanced n8n node for Puter.js AI with RAG agentic capabilities, document processing, audio transcription, Supabase integration, and cost-optimized model priorities

@thaleslaray/n8n-nodes-elevenlabs

Nó n8n para integração com a API da ElevenLabs incluindo Speech-to-Text, Text-to-Speech e Conversational AI

sherpa-onnx-darwin-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@vocantai/n8n-nodes-translation-vocantai

Audio file transcription services. Your speech. Private.

apexify.js

Unlimited AI models and Canvas library. Supports ts & js (supports front/back end).

@firefliesai/fireflies-node-sdk

Node.js SDK for Fireflies.ai API

@theventures/caret

Unofficial Node.js API client for the Caret HTTP API

whisper-speech-to-text

A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

@fugood/node-whisper-win32-x64-cuda

Native module for An another Node binding of whisper.cpp (win32-x64-cuda)

aromanize

Korean transliteration tool for JavaScript

@fugood/node-whisper-darwin-arm64

Native module for An another Node binding of whisper.cpp (darwin-arm64)

@fugood/node-whisper-win32-x64-vulkan

Native module for An another Node binding of whisper.cpp (win32-x64-vulkan)

susurro-audio

🎙️ Real-time conversational audio with AI transcription. Build ChatGPT-style voice interfaces in minutes with <300ms latency

talisik-huntress

A TypeScript library for extracting and working with YouTube video transcripts.

@fugood/whisper.node

An another Node binding of whisper.cpp to make same API with whisper.rn as much as possible.

@elizaos/plugin-google-meet-cute

Google Meet integration plugin for ElizaOS - manage meetings, get participant info, and access meeting artifacts via Google Meet REST API

n8n-nodes-get-transcribe

n8n node for GetTranscribe API integration - transcribe videos from Instagram, TikTok, YouTube and more

@bharatgolchha/liveprompt-mcp-server

Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

@fugood/node-whisper-linux-x64-cuda

Native module for An another Node binding of whisper.cpp (linux-x64-cuda)

@fugood/node-whisper-linux-x64-vulkan

Native module for An another Node binding of whisper.cpp (linux-x64-vulkan)

@fugood/node-whisper-linux-x64

Native module for An another Node binding of whisper.cpp (linux-x64)

@fugood/node-whisper-linux-arm64-vulkan

Native module for An another Node binding of whisper.cpp (linux-arm64-vulkan)

@fugood/node-whisper-linux-arm64-cuda

Native module for An another Node binding of whisper.cpp (linux-arm64-cuda)

audiopod-sdk

AudioPod SDK for Node.js and React - Professional Audio Processing powered by AI

react-phonetic-transcription

Phonetic transcription tools with react js for input, outputing, etc

@fugood/node-whisper-win32-x64

Native module for An another Node binding of whisper.cpp (win32-x64)

n8n-nodes-transcribe-audio

Perform speech-to-text on audio files within your n8n workflows.This node provides local audio transcription, no internet or third-party APIs required for processing.

@fugood/node-whisper-win32-arm64

Native module for An another Node binding of whisper.cpp (win32-arm64)

@fugood/node-whisper-darwin-x64

Native module for An another Node binding of whisper.cpp (darwin-x64)

audio-to-text-node

Backend audio file to text transcription using Web Speech API with Puppeteer

@fugood/node-whisper-linux-arm64

Native module for An another Node binding of whisper.cpp (linux-arm64)

whisper-web-transcriber

Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly

@fugood/node-whisper-win32-arm64-vulkan

Native module for An another Node binding of whisper.cpp (win32-arm64-vulkan)

N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

@adamhancock/transcribe-cli

CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

@picovoice/leopard-web

Leopard Speech-to-Text engine for web browsers (via WebAssembly)

@appcitor/react-native-voice-to-text

This can use to convert voice to text real time in device

mirador-textoverlay

Mirador 3 plugin to render a hidden (but selectable) or visible text overlay

aixblock-voice-ai-deepgram

A React component for real-time transcription and voice agent interactions using Deepgram APIs

liveprompt-mcp-server

Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

@picovoice/leopard-react-native

Picovoice Leopard React Native binding

ugai

A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

castleguard-sdk

JavaScript SDK for interacting with CastleGuard APIs

assembly-ai-mcp

Model Context Protocol server for AssemblyAI transcription services

@picovoice/cheetah-react-native

Picovoice Cheetah React Native binding

paragrafs

A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.

audio2textjs

A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models

universal-transcriber

A simple universal transcriber for languages with unicode characters.

@sorenpeng/rtstt

Real-time speech-to-text CLI tool using OpenAI Realtime API

liblouis-build

pre-compiled builds of liblouis for js

transcriptor-fonologico

Un simple transcriptor fonológico para la lengua española.

@maestra-ai/live-sdk

Live SDK for Maestra AI transcription services

assemblyai-mcp-server

Model Context Protocol server for AssemblyAI transcription services

ai-code-writer

An AI code writer application using OpenAI APIs for audio transcription and chat completion.

real-time-speech-analyzer

Real-time speech analysis with local LLM using multiple concurrent analysis instructions

@chand1012/whisper-web-transcriber-desktop

Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly

whisper-clipboard-cli

Own your transcription workflow. Press Cmd+Shift+X, speak, get text in clipboard instantly.

@adamhancock/transcribe

CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

discord-transcript-generator

A command-line utility to generate transcripts from a Discord channel

tafrigh

A NodeJS library for transcribing audio/video to text.

peertube-plugin-transcription

Generate subtitles for your videos via Automatic Speech Recognition.

peertube-plugin-transposer-connector

Transposer connector is a PeerTube language tool plugin to transcribe and translate with Whisper

liblouis

javascript bindings for liblouis

@bottlenose/rxtranscribe

👂 Realtime speech-to-text (S2T) transcription with RxJS

@daitanjs/speech

A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

react-native-deepgram

React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

vidnavigator

Official JavaScript SDK for the VidNavigator Developer API

@chinchillaenterprises/mcp-recall

Event-driven MCP server for Recall.ai meeting transcription with enhanced speaker identification and local storage

parakeet.js

NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

react-speech-recognition-ui

A beautiful, production-ready voice transcription package for React applications using the Web Speech API

react-native-pitch-tracker

React Native Pitch Tracker implemented with Tensorflow Lite Model

modern-greek-accentuation

accentuation, syllabification and transcription utilities for Modern Greek

gladia

Official TypeScript SDK for Gladia - State-of-the-art Speech to Text API

annotorious-tahqiq

A custom Annotorious editor/view plugin

open-transcribe

AI-Powered Audio Transcription Desktop Application

whispermix

🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.

spongescribe

StrangeText Transcription

twitter-reply-bot

base for twitter reply bot using autohook

@voicefeedback/sdk

Modern voice feedback SDK with beautiful UI components and AI-powered analysis

djelia

Djelia JavaScript SDK - Advanced AI for African Languages

react-native-pitch-tracker-extended

React Native Pitch Tracker implemented with Tensorflow Lite Model

n8n-nodes-dudoxx

n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription

openai-whisper-js

openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription

merge-vtt

A simple tool to merge multiple WebVTT (.vtt) files into a single file.

@chinchillaenterprises/mcp-elevenlabs

Multi-tenant ElevenLabs MCP server for advanced speech-to-text transcription with speaker diarization

react-native-sfspeechrecognizer

iOS SFSpeechRecognizer bridge module for React Native

transcription-words

Easy and crystal-clear API for transcription words.

@rxtk/stt-deepgram

👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speeh-to-text

@4eyes/mirador-ocr-helper

Mirador 3 plugin which renders a separate window, with OCR text

autosub

Automatically generate and overlay subtitles for any video.

elevenlabs-scribe-transcriber

Audio and video transcription using ElevenLabs Scribe

transcription

Documentation generator for ES6.

podcast-takeaways

A system to download the most recent episode of a plugin, transcribe it with the OpenAI Whisper API and then use GPT to determine one takeaway to apply from the episode.

pronunciation-finder

An application for getting audio files with pronunciation from public dictionaries

cmu-pronouncing-dictionary-cjs

Common JS version of the 134,000+ words and their pronunciations in the CMU pronouncing dictionary

kana-transformer

Transform kana to en|ru language or vice versa, using specific transliteration system; convert one kana to the other syllabary

transcription-lib-grpc-js

Creates Live Transcription of a media input stream in multiple languages

twitter-reply

Strange Text Transliterator (GOTO: spongescribe)

audio-transcripter

Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.

voicescribe

Live speech transcription library with multi-language support.

peertube-plugin-transcribe-translate

PeerTube plugin transcribe and translate

@apto-space/react-use-transcribe-gladia

React hook for real-time audio transcription using Gladia API

olaris-wav-realtime-transcription

This is a node.js module used to transcribe wav files using Olaris v2 realtime transcription service

@igoratron/aws-transcribe

A client for Amazon Transcribe using the websocket interface

video-summary

Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look

polyanno_storage

Node and Express backend for easy MongoDB storage of Polyanno annotations

twitter-search-bot

Strange Text Transliterator (GOTO: spongescribe)

peertube-plugin-displ-transcription

Generate subtitles for your videos via Automatic Speech Recognition.

@voxextractlabs/vox-whisper

[![NPM](https://img.shields.io/npm/v/@voxextractlabs/vox-whisper?label=npm)](https://www.npmjs.com/package/@voxextractlabs/vox-whisper) [![License](https://img.shields.io/npm/l/@voxextractlabs/vox-whisper)](./LICENSE) [![Build Status](https://github.com/V