Found 199 results for transcription

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

@speechmatics/auth

Library for fetching temporary keys for Speechmatics APIs

sherpa-onnx-node

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

aws-transcribe

A client for Amazon Transcribe using the websocket interface

@speechmatics/browser-audio-input-react

React hooks for managing audio inputs and permissions across browsers

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@speechmatics/real-time-client-react

React hooks for interacting with the Speechmatics Real-Time API

@soniox/speech-to-text-web

JavaScript client library for Soniox Speech-to-Text websocket API

cmu-pronouncing-dictionary

The 134,000+ words and their pronunciations in the CMU pronouncing dictionary

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@speechmatics/flow-client-react

React hooks for interacting with the Speechmatics Flow API

tap2talk

Voice transcription at your fingertips - Instantly convert speech to text with a simple keyboard shortcut

sherpa-onnx-darwin-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

@vocantai/n8n-nodes-translation-vocantai

Audio file transcription services. Your speech. Private.

sherpa-onnx-win-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

sherpa-onnx-win-ia32

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

oneai

Make your app understand language. Summarize conversations, categorize articles, and more.

@meeting-baas/sdk

Official SDK for Meeting BaaS API - https://meetingbaas.com

koshi-vox

Voice-To-Text recorder with sound notifications - optimized for macOS

sherpa-onnx-linux-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

@ascendtis/react-native-voice-to-text

Converts voice to text in real time on the device

@picovoice/cheetah-web

Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

@picovoice/cheetah-react

React hook for Cheetah Web SDK

speech-into-text

SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

n8n-nodes-puter-ai

Advanced n8n node for Puter.js AI with RAG agentic capabilities, document processing, audio transcription, Supabase integration, and cost-optimized model priorities

@thaleslaray/n8n-nodes-elevenlabs

n8n node for integrating with the ElevenLabs API, including Speech-to-Text, Text-to-Speech, and Conversational AI

abjad-convert

sherpa-onnx-darwin-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

aws-transcription-to-srt

Convert AWS transcription JSON to srt

dictate-button

Dictate Button (Web Component)

@theventures/caret

Unofficial Node.js API client for the Caret HTTP API

apexify.js

Unlimited AI models and Canvas library. Supports ts & js (supports front/back end).

whisper-speech-to-text

A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

@firefliesai/fireflies-node-sdk

Node.js SDK for Fireflies.ai API

@adamhancock/transcribe-cli

CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

@fugood/node-whisper-win32-x64-cuda

Native module for another Node binding of whisper.cpp (win32-x64-cuda)

aromanize

Korean transliteration tool for JavaScript

@fugood/node-whisper-darwin-arm64

Native module for another Node binding of whisper.cpp (darwin-arm64)

@fugood/node-whisper-win32-x64-vulkan

Native module for another Node binding of whisper.cpp (win32-x64-vulkan)

susurro-audio

🎙️ Real-time conversational audio with AI transcription. Build ChatGPT-style voice interfaces in minutes with <300ms latency

talisik-huntress

A TypeScript library for extracting and working with YouTube video transcripts.

n8n-nodes-get-transcribe

n8n node for GetTranscribe API integration - transcribe videos from Instagram, TikTok, YouTube and more

@elizaos/plugin-google-meet-cute

Google Meet integration plugin for ElizaOS - manage meetings, get participant info, and access meeting artifacts via Google Meet REST API

@fugood/node-whisper-linux-x64-cuda

Native module for another Node binding of whisper.cpp (linux-x64-cuda)

@fugood/whisper.node

Another Node binding of whisper.cpp that keeps its API as close as possible to whisper.rn.

elevenlabs-scribe-transcriber

Audio and video transcription using ElevenLabs Scribe

@fugood/node-whisper-linux-x64-vulkan

Native module for another Node binding of whisper.cpp (linux-x64-vulkan)

react-phonetic-transcription

Phonetic transcription tools with React.js for input, output, etc.

audiopod-sdk

AudioPod SDK for Node.js and React - Professional Audio Processing powered by AI

@fugood/node-whisper-linux-x64

Native module for another Node binding of whisper.cpp (linux-x64)

@fugood/node-whisper-linux-arm64-vulkan

Native module for another Node binding of whisper.cpp (linux-arm64-vulkan)

audio-to-text-node

Backend audio file to text transcription using Web Speech API with Puppeteer

@fugood/node-whisper-win32-arm64

Native module for another Node binding of whisper.cpp (win32-arm64)

n8n-nodes-transcribe-audio

Perform speech-to-text on audio files within your n8n workflows. This node provides local audio transcription; no internet or third-party APIs are required for processing.

@fugood/node-whisper-linux-arm64-cuda

Native module for another Node binding of whisper.cpp (linux-arm64-cuda)

@fugood/node-whisper-linux-arm64

Native module for another Node binding of whisper.cpp (linux-arm64)

@fugood/node-whisper-win32-arm64-vulkan

Native module for another Node binding of whisper.cpp (win32-arm64-vulkan)

@fugood/node-whisper-darwin-x64

Native module for another Node binding of whisper.cpp (darwin-x64)

@fugood/node-whisper-win32-x64

Native module for another Node binding of whisper.cpp (win32-x64)

whisper-web-transcriber

Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly

mirador-textoverlay

Mirador 3 plugin to render a hidden (but selectable) or visible text overlay

@picovoice/leopard-web

Leopard Speech-to-Text engine for web browsers (via WebAssembly)

vidnavigator

Official JavaScript SDK for the VidNavigator Developer API

aixblock-voice-ai-deepgram

A React component for real-time transcription and voice agent interactions using Deepgram APIs

djelia

Djelia JavaScript SDK - Advanced AI for African Languages

castleguard-sdk

JavaScript SDK for interacting with CastleGuard APIs

ugai

A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

assembly-ai-mcp

Model Context Protocol server for AssemblyAI transcription services

@picovoice/leopard-react-native

Picovoice Leopard React Native binding

@picovoice/cheetah-react-native

Picovoice Cheetah React Native binding

paragrafs

A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.

audio2textjs

A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models

universal-transcriber

A simple universal transcriber for languages with unicode characters.

liblouis-build

pre-compiled builds of liblouis for js

@maestra-ai/live-sdk

Live SDK for Maestra AI transcription services

peertube-plugin-transposer-connector

Transposer connector is a PeerTube language tool plugin to transcribe and translate with Whisper

transcriptor-fonologico

A simple phonological transcriber for the Spanish language.

voicescribe

Live speech transcription library with multi-language support.

assemblyai-mcp-server

Model Context Protocol server for AssemblyAI transcription services

ai-code-writer

An AI code writer application using OpenAI APIs for audio transcription and chat completion.

real-time-speech-analyzer

Real-time speech analysis with local LLM using multiple concurrent analysis instructions

@adamhancock/transcribe

CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

discord-transcript-generator

A command-line utility to generate transcripts from a Discord channel

peertube-plugin-transcription

Generate subtitles for your videos via Automatic Speech Recognition.

tafrigh

A NodeJS library for transcribing audio/video to text.

liblouis

javascript bindings for liblouis

@bottlenose/rxtranscribe

👂 Realtime speech-to-text (S2T) transcription with RxJS

react-native-deepgram

React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

@daitanjs/speech

A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

@chinchillaenterprises/mcp-recall

Event-driven MCP server for Recall.ai meeting transcription with enhanced speaker identification and local storage

react-native-pitch-tracker

React Native Pitch Tracker implemented with Tensorflow Lite Model

parakeet.js

NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

react-speech-recognition-ui

A beautiful, production-ready voice transcription package for React applications using the Web Speech API

modern-greek-accentuation

accentuation, syllabification and transcription utilities for Modern Greek

@appcitor/react-native-voice-to-text

This can use to convert voice to text real time in device

annotorious-tahqiq

A custom Annotorious editor/view plugin

open-transcribe

AI-Powered Audio Transcription Desktop Application

whispermix

🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.

n8n-nodes-dudoxx

n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription

spongescribe

StrangeText Transcription

twitter-reply-bot

base for twitter reply bot using autohook

@voicefeedback/sdk

Modern voice feedback SDK with beautiful UI components and AI-powered analysis

@chinchillaenterprises/mcp-elevenlabs

Multi-tenant ElevenLabs MCP server for advanced speech-to-text transcription with speaker diarization

react-native-pitch-tracker-extended

React Native Pitch Tracker implemented with Tensorflow Lite Model

merge-vtt

A simple tool to merge multiple WebVTT (.vtt) files into a single file.

openai-whisper-js

openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription

react-native-sfspeechrecognizer

iOS SFSpeechRecognizer bridge module for React Native

@4eyes/mirador-ocr-helper

Mirador 3 plugin which renders a separate window, with OCR text

autosub

Automatically generate and overlay subtitles for any video.

@rxtk/stt-deepgram

👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speech-to-text

gladia

Official TypeScript SDK for Gladia - State-of-the-art Speech to Text API

transcription

Documentation generator for ES6.

podcast-takeaways

A system to download the most recent episode of a podcast, transcribe it with the OpenAI Whisper API, and then use GPT to determine one takeaway to apply from the episode.

olaris-wav-realtime-transcription

A Node.js module for transcribing WAV files using the Olaris v2 realtime transcription service

kana-transformer

Transform kana to English or Russian or vice versa, using a specific transliteration system; convert one kana syllabary to the other

transcription-lib-grpc-js

Creates Live Transcription of a media input stream in multiple languages

pronunciation-finder

An application for getting audio files with pronunciation from public dictionaries

cmu-pronouncing-dictionary-cjs

Common JS version of the 134,000+ words and their pronunciations in the CMU pronouncing dictionary

twitter-reply

Strange Text Transliterator (GOTO: spongescribe)

transcription-words

Easy and crystal-clear API for transcribing words.

karaoke-transcriber

A CLI tool to generate karaoke-style subtitles on videos using Whisper and FFmpeg.

@igoratron/aws-transcribe

A client for Amazon Transcribe using the websocket interface

react-transcribe

React component for speech-to-text transcription with silence detection

polyanno_storage

Node and Express backend for easy MongoDB storage of Polyanno annotations

twitter-search-bot

Strange Text Transliterator (GOTO: spongescribe)

peertube-plugin-transcribe-translate

PeerTube plugin transcribe and translate

video-summary

Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look

@speechall/sdk

TypeScript SDK for the Speechall API - A powerful and flexible speech-to-text service

peertube-plugin-displ-transcription

Generate subtitles for your videos via Automatic Speech Recognition.

multi-voice-sdk

A universal Text-to-Speech (TTS) and Speech-to-Text (STT) SDK supporting multiple providers (OpenAI, Google Gemini, Deepgram, Groq PlayAI, Cartesia, AssemblyAI) with audio merging capabilities

peertube-plugin-transcription-fairkom

Generate subtitles for your videos via Automatic Speech Recognition.

image-generation

Strange Text Transliterator (GOTO: spongescribe)

node-palladius

The Palladius system for transcribing Chinese characters into the Cyrillic alphabet

@sentira-ai/common

Common functions for Sentira AI

transcord

A simple recording and transcription module.

oxford-speech-wrapper

simple bing voice recognition wrapper

voicely

Voicely - A CLI tool to transcribe audio files and summarize the transcriptions using OpenAI's APIs.

audio-transcripter

Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.

atc-transcription

React Native module for transcribing WAV files using WhisperKit

@rxtk/stt-gcp

👂 RxJS operator for realtime speech-to-text (STT/S2T) using Google speech-to-text

javascript-speech-recognizer-library

A library for easily transcribing speech. Convert speech to text in JavaScript

liblouis-js

javascript bindings for liblouis

maketalk

A command-line tool to create video presentations with title cards and transcriptions

spongescribebot

Strangetext Transcription - Use: 'spongescribe'

nwhisper

Native Node.js bindings for OpenAI's Whisper using whisper.cpp. High-performance local speech-to-text with custom model support.

peertube-plugin-display-transcription-test

Generate subtitles for your videos via Automatic Speech Recognition.

node-deepgram

Node wrapper for Deepgram

glaemscribe

Glǽmscribe (also written Glaemscribe) is software dedicated to the transcription of texts between writing systems, and more specifically to the transcription of J.R.R. Tolkien's invented languages into some of his devised writing systems.

@apto-space/react-use-transcribe-gladia

React hook for real-time audio transcription using Gladia API

noapi-speech2text

Speech recognition library that uses web-based services to convert speech to text in multiple languages

vidscript

AI-powered CLI tool that transforms video content into intelligent, structured notes and scripts

meeting-whisperer

CLI tool that transcribes audio/video files using AssemblyAI's API and outputs formatted markdown transcripts

@rxtk/stt-aws

👂 RxJS operator for realtime speech-to-text (STT/S2T) using AWS Transcribe

realtime-stt-client

TypeScript client library for Realtime Speech-to-Text server

speak-precisely-sdk

Real-time speech transcription and translation SDK

n8n-nodes-asr

N8N node for processing audio files via an ASR service

friendlyjs

make friendly URLs by stripping out non-Latin chars and converting other chars to their Latin counterparts

voicekey

A CLI tool to transcribe voice to text with interactive UI

@voxextractlabs/vox-whisper
