JSPM

Found 199 results for transcription

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

  • v1.2.0
  • 70.77
  • Published

@speechmatics/auth

Library for fetching temporary keys for Speechmatics APIs

    • v0.1.0
    • 51.39
    • Published

    sherpa-onnx-node

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 47.33
    • Published

    aws-transcribe

    A client for Amazon Transcribe using the websocket interface

    • v1.1.1
    • 44.15
    • Published

    sherpa-onnx-linux-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 43.16
    • Published

    cmu-pronouncing-dictionary

    The 134,000+ words and their pronunciations in the CMU pronouncing dictionary

    • v3.0.0
    • 42.39
    • Published

    sherpa-onnx

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 42.23
    • Published

    tap2talk

    Voice transcription at your fingertips - Instantly convert speech to text with a simple keyboard shortcut

    • v5.1.7
    • 41.44
    • Published

    deepgram

    NodeJS wrapper for Deepgram

    • v1.0.3
    • 41.04
    • Published

    sherpa-onnx-darwin-arm64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 40.56
    • Published

    sherpa-onnx-win-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 39.78
    • Published

    sherpa-onnx-win-ia32

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 39.73
    • Published

    oneai

    Make your app understand language. Summarize conversations, categorize articles, and more.

    • v0.8.4
    • 39.45
    • Published

    @meeting-baas/sdk

    Official SDK for Meeting BaaS API - https://meetingbaas.com

    • v5.0.2
    • 37.02
    • Published

    koshi-vox

    Voice-To-Text recorder with sound notifications - optimized for macOS

    • v1.2.6
    • 36.87
    • Published

    sherpa-onnx-linux-arm64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.10
    • 36.25
    • Published

    n8n-nodes-groq

    N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

    • v0.2.0
    • 35.05
    • Published

    @picovoice/cheetah-web

    Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

      • v2.3.0
      • 34.10
      • Published

      speech-into-text

      SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

      • v4.0.2
      • 33.30
      • Published

      n8n-nodes-puter-ai

      Advanced n8n node for Puter.js AI with RAG agentic capabilities, document processing, audio transcription, Supabase integration, and cost-optimized model priorities

      • v2.0.4
      • 32.53
      • Published

      @thaleslaray/n8n-nodes-elevenlabs

      Nó n8n para integração com a API da ElevenLabs incluindo Speech-to-Text, Text-to-Speech e Conversational AI

      • v0.3.9
      • 32.15
      • Published

      sherpa-onnx-darwin-x64

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.10
      • 31.50
      • Published

      dictate-button

      Dictate Button (Web Component)

      • v1.1.1
      • 30.88
      • Published

      @theventures/caret

      Unofficial Node.js API client for the Caret HTTP API

      • v0.1.1
      • 30.72
      • Published

      apexify.js

      Unlimited AI models and Canvas library. Supports ts & js (supports front/back end).

        • v4.9.25
        • 30.50
        • Published

        whisper-speech-to-text

        A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

        • v1.0.3
        • 30.46
        • Published

        @adamhancock/transcribe-cli

        CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

        • v1.0.4
        • 30.08
        • Published

        aromanize

        Korean transliteration tool for JavaScript

        • v0.1.5
        • 30.04
        • Published

        susurro-audio

        🎙️ Real-time conversational audio with AI transcription. Build ChatGPT-style voice interfaces in minutes with <300ms latency

        • v2.1.1
        • 28.53
        • Published

        talisik-huntress

        A TypeScript library for extracting and working with YouTube video transcripts.

        • v1.1.7
        • 28.43
        • Published

        n8n-nodes-get-transcribe

        n8n node for GetTranscribe API integration - transcribe videos from Instagram, TikTok, YouTube and more

        • v0.1.2
        • 28.34
        • Published

        @elizaos/plugin-google-meet-cute

        Google Meet integration plugin for ElizaOS - manage meetings, get participant info, and access meeting artifacts via Google Meet REST API

        • v1.5.0
        • 28.33
        • Published

        @fugood/whisper.node

        An another Node binding of whisper.cpp to make same API with whisper.rn as much as possible.

        • v1.0.3
        • 28.14
        • Published

        audiopod-sdk

        AudioPod SDK for Node.js and React - Professional Audio Processing powered by AI

        • v1.2.0
        • 27.54
        • Published

        audio-to-text-node

        Backend audio file to text transcription using Web Speech API with Puppeteer

        • v0.1.2
        • 27.29
        • Published

        n8n-nodes-transcribe-audio

        Perform speech-to-text on audio files within your n8n workflows.This node provides local audio transcription, no internet or third-party APIs required for processing.

        • v0.1.23
        • 27.20
        • Published

        whisper-web-transcriber

        Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly

        • v0.2.4
        • 26.76
        • Published

        mirador-textoverlay

        Mirador 3 plugin to render a hidden (but selectable) or visible text overlay

        • v0.3.8
        • 25.83
        • Published

        @picovoice/leopard-web

        Leopard Speech-to-Text engine for web browsers (via WebAssembly)

          • v2.0.1
          • 25.62
          • Published

          vidnavigator

          Official JavaScript SDK for the VidNavigator Developer API

          • v0.1.5
          • 25.62
          • Published

          aixblock-voice-ai-deepgram

          A React component for real-time transcription and voice agent interactions using Deepgram APIs

            • v0.0.7
            • 25.39
            • Published

            djelia

            Djelia JavaScript SDK - Advanced AI for African Languages

            • v2.0.0
            • 25.32
            • Published

            castleguard-sdk

            JavaScript SDK for interacting with CastleGuard APIs

              • v2.0.0
              • 25.02
              • Published

              ugai

              A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

              • v1.1.0
              • 24.46
              • Published

              assembly-ai-mcp

              Model Context Protocol server for AssemblyAI transcription services

                • v0.0.2
                • 24.26
                • Published

                paragrafs

                A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.

                • v1.5.1
                • 22.50
                • Published

                audio2textjs

                A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models

                • v1.0.5
                • 22.43
                • Published

                universal-transcriber

                A simple universal transcriber for languages with unicode characters.

                • v1.0.0
                • 21.91
                • Published

                liblouis-build

                pre-compiled builds of liblouis for js

                • v3.2.0-rc
                • 21.91
                • Published

                voicescribe

                Live speech transcription library with multi-language support.

                  • v0.1.0
                  • 20.83
                  • Published

                  assemblyai-mcp-server

                  Model Context Protocol server for AssemblyAI transcription services

                    • v0.0.1
                    • 20.74
                    • Published

                    ai-code-writer

                    An AI code writer application using OpenAI APIs for audio transcription and chat completion.

                    • v3.1.0
                    • 20.54
                    • Published

                    real-time-speech-analyzer

                    Real-time speech analysis with local LLM using multiple concurrent analysis instructions

                    • v1.0.0
                    • 20.27
                    • Published

                    @adamhancock/transcribe

                    CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

                    • v1.0.5
                    • 20.15
                    • Published

                    tafrigh

                    A NodeJS library for transcribing audio/video to text.

                      • v4.0.2
                      • 19.53
                      • Published

                      liblouis

                      javascript bindings for liblouis

                      • v0.4.0
                      • 18.55
                      • Published

                      react-native-deepgram

                      React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

                      • v0.1.21
                      • 17.64
                      • Published

                      @daitanjs/speech

                      A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

                      • v1.0.6
                      • 17.54
                      • Published

                      @chinchillaenterprises/mcp-recall

                      Event-driven MCP server for Recall.ai meeting transcription with enhanced speaker identification and local storage

                      • v1.1.0
                      • 17.01
                      • Published

                      parakeet.js

                      NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

                      • v0.0.3
                      • 16.42
                      • Published

                      react-speech-recognition-ui

                      A beautiful, production-ready voice transcription package for React applications using the Web Speech API

                      • v0.0.8
                      • 16.23
                      • Published

                      modern-greek-accentuation

                      accentuation, syllabification and transcription utilities for Modern Greek

                      • v1.2.1
                      • 16.11
                      • Published

                      open-transcribe

                      AI-Powered Audio Transcription Desktop Application

                      • v1.2.4
                      • 14.91
                      • Published

                      whispermix

                      🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.

                      • v1.3.6
                      • 14.44
                      • Published

                      n8n-nodes-dudoxx

                      n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription

                      • v0.1.1
                      • 14.40
                      • Published

                      spongescribe

                      StrangeText Transcription

                      • v0.6.5
                      • 14.32
                      • Published

                      twitter-reply-bot

                      base for twitter reply bot using autohook

                      • v1.5.1
                      • 14.23
                      • Published

                      @voicefeedback/sdk

                      Modern voice feedback SDK with beautiful UI components and AI-powered analysis

                      • v1.0.4
                      • 13.89
                      • Published

                      merge-vtt

                      A simple tool to merge multiple WebVTT (.vtt) files into a single file.

                      • v1.0.4
                      • 13.55
                      • Published

                      openai-whisper-js

                      openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription

                      • v1.0.7
                      • 13.49
                      • Published

                      autosub

                      Automatically generate and overlay subtitles for any video.

                      • v1.0.4
                      • 12.61
                      • Published

                      @rxtk/stt-deepgram

                      👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speeh-to-text

                      • v0.0.0
                      • 12.53
                      • Published

                      gladia

                      Official TypeScript SDK for Gladia - State-of-the-art Speech to Text API

                      • v0.1.3
                      • 12.33
                      • Published

                      transcription

                      Documentation generator for ES6.

                      • v0.2.1
                      • 12.05
                      • Published

                      podcast-takeaways

                      A system to download the most recent episode of a plugin, transcribe it with the OpenAI Whisper API and then use GPT to determine one takeaway to apply from the episode.

                        • v0.4.5
                        • 11.40
                        • Published

                        kana-transformer

                        Transform kana to en|ru language or vice versa, using specific transliteration system; convert one kana to the other syllabary

                        • v3.5.0
                        • 11.27
                        • Published

                        transcription-lib-grpc-js

                        Creates Live Transcription of a media input stream in multiple languages

                        • v1.0.2
                        • 11.21
                        • Published

                        pronunciation-finder

                        An application for getting audio files with pronunciation from public dictionaries

                        • v0.8.0
                        • 11.16
                        • Published

                        cmu-pronouncing-dictionary-cjs

                        Common JS version of the 134,000+ words and their pronunciations in the CMU pronouncing dictionary

                        • v3.0.0
                        • 11.16
                        • Published

                        twitter-reply

                        Strange Text Transliterator (GOTO: spongescribe)

                        • v0.0.0
                        • 10.67
                        • Published

                        transcription-words

                        Easy and crystal-clear API for transcription words.

                        • v1.2.1
                        • 10.35
                        • Published

                        karaoke-transcriber

                        A CLI tool to generate karaoke-style subtitles on videos using Whisper and FFmpeg.

                          • v1.0.3
                          • 10.26
                          • Published

                          react-transcribe

                          React component for speech-to-text transcription with silence detection

                          • v0.1.0
                          • 9.82
                          • Published

                          polyanno_storage

                          Node and Express backend for easy MongoDB storage of Polyanno annotations

                          • v0.1.5
                          • 9.82
                          • Published

                          twitter-search-bot

                          Strange Text Transliterator (GOTO: spongescribe)

                          • v0.0.0
                          • 9.82
                          • Published

                          video-summary

                          Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look

                          • v1.0.8
                          • 9.72
                          • Published

                          @speechall/sdk

                          TypeScript SDK for the Speechall API - A powerful and flexible speech-to-text service

                          • v1.0.0
                          • 9.69
                          • Published

                          multi-voice-sdk

                          A universal Text-to-Speech (TTS) and Speech-to-Text (STT) SDK supporting multiple providers (OpenAI, Google Gemini, Deepgram, Groq PlayAI, Cartesia, AssemblyAI) with audio merging capabilities

                            • v1.1.0
                            • 9.06
                            • Published

                            image-generation

                            Strange Text Transliterator (GOTO: spongescribe)

                            • v0.0.0
                            • 8.63
                            • Published

                            node-palladius

                            The Palladius system for transcribing Chinese characters into the Cyrillic alphabet

                            • v0.6.4
                            • 8.58
                            • Published

                            transcord

                            A simple recording and transcription module.

                            • v0.2.0
                            • 8.41
                            • Published

                            voicely

                            Voicely - A CLI tool to transcribe audio files and summarize the transcriptions using OpenAI's APIs.

                            • v1.1.5
                            • 8.03
                            • Published

                            audio-transcripter

                            Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.

                            • v2.0.1
                            • 7.99
                            • Published

                            atc-transcription

                            React Native module for transcribing WAV files using WhisperKit

                              • v1.0.0
                              • 7.99
                              • Published

                              @rxtk/stt-gcp

                              👂 RxJS operator for realtime speech-to-text (STT/S2T) using Google speeh-to-text

                              • v0.0.0
                              • 7.95
                              • Published

                              liblouis-js

                              javascript bindings for liblouis

                              • v0.2.0
                              • 7.60
                              • Published

                              maketalk

                              A command-line tool to create video presentations with title cards and transcriptions

                                • v1.4.0
                                • 7.26
                                • Published

                                spongescribebot

                                Strangetext Transcription - Use: 'spongescribe'

                                • v0.0.0
                                • 7.26
                                • Published

                                nwhisper

                                Native Node.js bindings for OpenAI's Whisper using whisper.cpp. High-performance local speech-to-text with custom model support.

                                • v0.3.0
                                • 7.22
                                • Published

                                node-deepgram

                                Node wrapper for Deepgram

                                • v1.0.10
                                • 7.17
                                • Published

                                glaemscribe

                                Glǽmscribe (also written Glaemscribe) is a software dedicated to the transcription of texts between writing systems, and more specifically dedicated to the transcription of J.R.R. Tolkien's invented languages to some of his devised writing systems.

                                • v1.3.1
                                • 7.14
                                • Published

                                noapi-speech2text

                                Speech recognition library that uses web-based services to convert speech to text in multiple languages

                                • v1.0.3
                                • 6.37
                                • Published

                                vidscript

                                AI-powered CLI tool that transforms video content into intelligent, structured notes and scripts

                                • v1.0.6
                                • 6.33
                                • Published

                                meeting-whisperer

                                CLI tool that transcribes audio/video files using AssemblyAI's API and outputs formatted markdown transcripts

                                  • v0.1.0
                                  • 5.28
                                  • Published

                                  @rxtk/stt-aws

                                  👂 RxJS operator for realtime speech-to-text (STT/S2T) using AWS Transcribe

                                  • v0.0.0
                                  • 5.28
                                  • Published

                                  realtime-stt-client

                                  TypeScript client library for Realtime Speech-to-Text server

                                    • v0.0.1-poc.6
                                    • 5.28
                                    • Published

                                    speak-precisely-sdk

                                    Real-time speech transcription and translation SDK

                                    • v1.0.1
                                    • 5.26
                                    • Published

                                    n8n-nodes-asr

                                    N8N node for processing audio files via an ASR service

                                    • v0.1.1
                                    • 5.26
                                    • Published

                                    friendlyjs

                                    make friendly URLs by stripping out non lating chars, and convert other chars to their latin counterparts

                                    • v0.0.1
                                    • 5.26
                                    • Published

                                    voicekey

                                    A CLI tool to transcribe voice to text with interactive UI

                                    • v1.0.0
                                    • 5.26
                                    • Published

                                    @voxextractlabs/vox-whisper

                                    [![NPM](https://img.shields.io/npm/v/@voxextractlabs/vox-whisper?label=npm)](https://www.npmjs.com/package/@voxextractlabs/vox-whisper) [![License](https://img.shields.io/npm/l/@voxextractlabs/vox-whisper)](./LICENSE) [![Build Status](https://github.com/V

                                    • v1.0.1
                                    • 5.25
                                    • Published

                                    ai-meeting-summarizer

                                    A simple Node.js library to transcribe and summarize meeting recordings using OpenAI's GPT model and Whisper.

                                      • v1.0.1
                                      • 5.25
                                      • Published

                                      dnable

                                      Simple, lightweight, and fast Node.js module for enabling DNA sequences.

                                        • v0.2.1
                                        • 4.41
                                        • Published

                                        fireflies

                                        Fireflies.ai API wrapper

                                        • v0.0.1
                                        • 4.18
                                        • Published

                                        lucidtalk-core

                                        Privacy-first P2P meeting transcription and AI SDK

                                        • v1.0.0
                                        • 4.02
                                        • Published

                                        speechtotext-openaikey

                                        A React component for recording and transcribing audio using the Web Audio API and OpenAI.

                                          • v1.0.2
                                          • 4.02
                                          • Published

                                          alinkeo-core

                                          This is the official alinkeo core npm package

                                          • v1.0.0
                                          • 3.98
                                          • Published

                                          @lunarisapp/cmudict

                                          A JavaScript interface to the CMU Pronouncing Dictionary

                                          • v1.0.0
                                          • 3.98
                                          • Published

                                          douyin-text-extractor

                                          Node.js + TypeScript library for extracting text from Douyin/TikTok videos

                                          • v1.1.2
                                          • 2.44
                                          • Published

                                          rev_ai

                                          Unofficial Rev AI Node.js client

                                          • v0.5.1
                                          • 2.35
                                          • Published

                                          whisper-stream-js

                                          CLI tool for real-time audio transcription using OpenAI's Whisper API

                                          • v1.0.0
                                          • 2.34
                                          • Published

                                          meeshx-sdk

                                          Welcome to MeeshX! Choose your favorite provider and transcribe your audio content in less than 5 minutes.

                                            • v1.0.0
                                            • 2.33
                                            • Published

                                            subtitles-editor

                                            A React component for editing SRT and VTT subtitles directly in a textarea, styled with TailwindCSS.

                                            • v0.0.1
                                            • 2.31
                                            • Published

                                            @sorenpeng/rtstt

                                            Real-time speech-to-text CLI tool using OpenAI Realtime API

                                            • v1.0.0
                                            • 0.00
                                            • Published

                                            ctrl.so

                                            Embedable vocal intelligence

                                            • v0.0.8
                                            • 0.00
                                            • Published

                                            liveprompt-mcp-server

                                            Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

                                            • v1.0.1
                                            • 0.00
                                            • Published

                                            openwhisper

                                            A library for AI-powered audio transcription with local and remote server fallback.

                                              • v1.0.1
                                              • 0.00
                                              • Published

                                              robinwood

                                              Steal money from big companies

                                              • v1.0.1
                                              • 0.00
                                              • Published

                                              deepgram-media-transcriber

                                              A package to transcribe media files using Deepgram with speaker-labeled subtitles (SRT/VTT).

                                                • v1.0.8
                                                • 0.00
                                                • Published

                                                edit-wave-transcript

                                                react component for transcription with ability to edit the words as well as the words. synchronization is done between words and wave-forms.

                                                • v0.2.0
                                                • 0.00
                                                • Published

                                                @liveprompt/mcp-server

                                                Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

                                                • v2.0.13
                                                • 0.00
                                                • Published

                                                @easyscribe/react

                                                React components for transcribing with easyscribe.org

                                                • v1.0.1
                                                • 0.00
                                                • Published

                                                amoeba-life-cycle

                                                A simple module that simulates the life cycle of an amoeba at the molecular level by using a (semi) TRNG.

                                                • v1.0.0
                                                • 0.00
                                                • Published

                                                tafrigh-cli

                                                CLI for using the tafrigh library.

                                                  • v1.4.2
                                                  • 0.00
                                                  • Published

                                                  @lunarity/a2s-cli

                                                  A CLI tool for transcribing audio files to subtitles

                                                  • v1.0.3
                                                  • 0.00
                                                  • Published