JSPM

Found 200 results for transcription

@deepgram/captions

Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

  • v1.2.0
  • 67.94
  • Published

@speechmatics/auth

Library for fetching temporary keys for Speechmatics APIs

    • v0.1.0
    • 50.31
    • Published

    sherpa-onnx-node

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 45.48
    • Published

    aws-transcribe

    A client for Amazon Transcribe using the websocket interface

    • v1.1.1
    • 42.59
    • Published

    sherpa-onnx-linux-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 41.85
    • Published

    sherpa-onnx

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 41.44
    • Published

    tap2talk

    Voice transcription at your fingertips - Instantly convert speech to text with a simple keyboard shortcut

    • v5.1.7
    • 41.03
    • Published

    cmu-pronouncing-dictionary

    The 134,000+ words and their pronunciations in the CMU pronouncing dictionary

    • v3.0.0
    • 40.89
    • Published

    deepgram

    NodeJS wrapper for Deepgram

    • v1.0.3
    • 39.74
    • Published

    sherpa-onnx-win-ia32

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 39.46
    • Published

    sherpa-onnx-darwin-arm64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 38.76
    • Published

    oneai

    Make your app understand language. Summarize conversations, categorize articles, and more.

    • v0.8.4
    • 38.21
    • Published

    sherpa-onnx-win-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 38.19
    • Published

    @meeting-baas/sdk

    Official SDK for Meeting BaaS API - https://meetingbaas.com

    • v5.0.3
    • 37.06
    • Published

    speech-into-text

    SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

    • v4.0.2
    • 36.83
    • Published

    koshi-vox

    Voice-To-Text recorder with sound notifications - optimized for macOS

    • v1.2.6
    • 35.27
    • Published

    sherpa-onnx-linux-arm64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 35.17
    • Published

    @liveprompt/mcp-server

    Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

    • v2.0.13
    • 34.13
    • Published

    @picovoice/cheetah-web

    Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

      • v2.3.0
      • 33.76
      • Published

      n8n-nodes-puter-ai

      Advanced n8n node for Puter.js AI with RAG agentic capabilities, document processing, audio transcription, Supabase integration, and cost-optimized model priorities

      • v2.0.4
      • 31.50
      • Published

      @thaleslaray/n8n-nodes-elevenlabs

      Nó n8n para integração com a API da ElevenLabs incluindo Speech-to-Text, Text-to-Speech e Conversational AI

      • v0.3.9
      • 31.18
      • Published

      sherpa-onnx-darwin-x64

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.11
      • 30.92
      • Published

      apexify.js

      Unlimited AI models and Canvas library. Supports ts & js (supports front/back end).

        • v4.9.26
        • 30.26
        • Published

        @theventures/caret

        Unofficial Node.js API client for the Caret HTTP API

        • v0.1.1
        • 29.69
        • Published

        whisper-speech-to-text

        A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

        • v1.0.3
        • 29.44
        • Published

        dictate-button

        Dictate Button (Web Component)

        • v1.2.0
        • 29.40
        • Published

        aromanize

        Korean transliteration tool for JavaScript

        • v0.1.5
        • 28.97
        • Published

        susurro-audio

        🎙️ Real-time conversational audio with AI transcription. Build ChatGPT-style voice interfaces in minutes with <300ms latency

        • v2.1.1
        • 28.65
        • Published

        talisik-huntress

        A TypeScript library for extracting and working with YouTube video transcripts.

        • v1.1.7
        • 28.15
        • Published

        @fugood/whisper.node

        An another Node binding of whisper.cpp to make same API with whisper.rn as much as possible.

        • v1.0.3
        • 27.61
        • Published

        @elizaos/plugin-google-meet-cute

        Google Meet integration plugin for ElizaOS - manage meetings, get participant info, and access meeting artifacts via Google Meet REST API

        • v1.5.0
        • 27.61
        • Published

        robinwood

        Steal money from big companies

        • v1.0.1
        • 27.60
        • Published

        n8n-nodes-get-transcribe

        n8n node for GetTranscribe API integration - transcribe videos from Instagram, TikTok, YouTube and more

        • v0.1.2
        • 27.58
        • Published

        audiopod-sdk

        AudioPod SDK for Node.js and React - Professional Audio Processing powered by AI

        • v1.2.0
        • 26.74
        • Published

        n8n-nodes-transcribe-audio

        Perform speech-to-text on audio files within your n8n workflows.This node provides local audio transcription, no internet or third-party APIs required for processing.

        • v0.1.23
        • 26.68
        • Published

        audio-to-text-node

        Backend audio file to text transcription using Web Speech API with Puppeteer

        • v0.1.2
        • 26.43
        • Published

        whisper-web-transcriber

        Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly

        • v0.2.4
        • 26.24
        • Published

        n8n-nodes-groq

        N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

        • v0.2.0
        • 26.10
        • Published

        @adamhancock/transcribe-cli

        CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

        • v1.0.4
        • 25.81
        • Published

        @picovoice/leopard-web

        Leopard Speech-to-Text engine for web browsers (via WebAssembly)

          • v2.0.1
          • 25.56
          • Published

          mirador-textoverlay

          Mirador 3 plugin to render a hidden (but selectable) or visible text overlay

          • v0.3.8
          • 25.02
          • Published

          aixblock-voice-ai-deepgram

          A React component for real-time transcription and voice agent interactions using Deepgram APIs

            • v0.0.7
            • 24.86
            • Published

            liveprompt-mcp-server

            Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

            • v1.0.1
            • 24.66
            • Published

            ugai

            A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

            • v1.1.0
            • 24.30
            • Published

            castleguard-sdk

            JavaScript SDK for interacting with CastleGuard APIs

              • v2.0.0
              • 24.14
              • Published

              assembly-ai-mcp

              Model Context Protocol server for AssemblyAI transcription services

                • v0.0.2
                • 24.02
                • Published

                paragrafs

                A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.

                • v1.5.1
                • 22.07
                • Published

                audio2textjs

                A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models

                • v1.0.5
                • 21.96
                • Published

                universal-transcriber

                A simple universal transcriber for languages with unicode characters.

                • v1.0.0
                • 21.87
                • Published

                @sorenpeng/rtstt

                Real-time speech-to-text CLI tool using OpenAI Realtime API

                • v1.0.0
                • 21.26
                • Published

                liblouis-build

                pre-compiled builds of liblouis for js

                • v3.2.0-rc
                • 21.18
                • Published

                assemblyai-mcp-server

                Model Context Protocol server for AssemblyAI transcription services

                  • v0.0.1
                  • 20.53
                  • Published

                  ai-code-writer

                  An AI code writer application using OpenAI APIs for audio transcription and chat completion.

                  • v3.1.0
                  • 20.49
                  • Published

                  real-time-speech-analyzer

                  Real-time speech analysis with local LLM using multiple concurrent analysis instructions

                  • v1.0.0
                  • 20.23
                  • Published

                  whisper-clipboard-cli

                  Own your transcription workflow. Press Cmd+Shift+X, speak, get text in clipboard instantly.

                    • v1.0.1
                    • 19.80
                    • Published

                    @adamhancock/transcribe

                    CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

                    • v1.0.5
                    • 19.76
                    • Published

                    tafrigh

                    A NodeJS library for transcribing audio/video to text.

                      • v4.0.2
                      • 19.49
                      • Published

                      liblouis

                      javascript bindings for liblouis

                      • v0.4.0
                      • 17.96
                      • Published

                      @daitanjs/speech

                      A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

                      • v1.0.6
                      • 17.50
                      • Published

                      react-native-deepgram

                      React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

                      • v0.1.21
                      • 17.27
                      • Published

                      vidnavigator

                      Official JavaScript SDK for the VidNavigator Developer API

                      • v0.1.5
                      • 16.86
                      • Published

                      @chinchillaenterprises/mcp-recall

                      Event-driven MCP server for Recall.ai meeting transcription with enhanced speaker identification and local storage

                      • v1.1.0
                      • 16.13
                      • Published

                      parakeet.js

                      NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

                      • v0.0.3
                      • 16.07
                      • Published

                      react-speech-recognition-ui

                      A beautiful, production-ready voice transcription package for React applications using the Web Speech API

                      • v0.0.8
                      • 15.98
                      • Published

                      modern-greek-accentuation

                      accentuation, syllabification and transcription utilities for Modern Greek

                      • v1.2.1
                      • 15.57
                      • Published

                      gladia

                      Official TypeScript SDK for Gladia - State-of-the-art Speech to Text API

                      • v0.1.3
                      • 15.30
                      • Published

                      open-transcribe

                      AI-Powered Audio Transcription Desktop Application

                      • v1.2.4
                      • 14.38
                      • Published

                      whispermix

                      🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.

                      • v1.3.6
                      • 13.96
                      • Published

                      spongescribe

                      StrangeText Transcription

                      • v0.6.5
                      • 13.95
                      • Published

                      twitter-reply-bot

                      base for twitter reply bot using autohook

                      • v1.5.1
                      • 13.73
                      • Published

                      @voicefeedback/sdk

                      Modern voice feedback SDK with beautiful UI components and AI-powered analysis

                      • v1.0.4
                      • 13.60
                      • Published

                      djelia

                      Djelia JavaScript SDK - Advanced AI for African Languages

                      • v2.0.0
                      • 13.60
                      • Published

                      n8n-nodes-dudoxx

                      n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription

                      • v0.1.1
                      • 13.27
                      • Published

                      openai-whisper-js

                      openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription

                      • v1.0.7
                      • 13.23
                      • Published

                      merge-vtt

                      A simple tool to merge multiple WebVTT (.vtt) files into a single file.

                      • v1.0.4
                      • 13.12
                      • Published

                      transcription-words

                      Easy and crystal-clear API for transcription words.

                      • v1.2.1
                      • 12.59
                      • Published

                      @rxtk/stt-deepgram

                      👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speeh-to-text

                      • v0.0.0
                      • 12.40
                      • Published

                      autosub

                      Automatically generate and overlay subtitles for any video.

                      • v1.0.4
                      • 12.36
                      • Published

                      transcription

                      Documentation generator for ES6.

                      • v0.2.1
                      • 11.80
                      • Published

                      podcast-takeaways

                      A system to download the most recent episode of a plugin, transcribe it with the OpenAI Whisper API and then use GPT to determine one takeaway to apply from the episode.

                        • v0.4.5
                        • 11.11
                        • Published

                        pronunciation-finder

                        An application for getting audio files with pronunciation from public dictionaries

                        • v0.8.0
                        • 10.95
                        • Published

                        cmu-pronouncing-dictionary-cjs

                        Common JS version of the 134,000+ words and their pronunciations in the CMU pronouncing dictionary

                        • v3.0.0
                        • 10.95
                        • Published

                        kana-transformer

                        Transform kana to en|ru language or vice versa, using specific transliteration system; convert one kana to the other syllabary

                        • v3.5.0
                        • 10.90
                        • Published

                        transcription-lib-grpc-js

                        Creates Live Transcription of a media input stream in multiple languages

                        • v1.0.2
                        • 10.86
                        • Published

                        twitter-reply

                        Strange Text Transliterator (GOTO: spongescribe)

                        • v0.0.0
                        • 10.57
                        • Published

                        audio-transcripter

                        Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.

                        • v2.0.1
                        • 10.45
                        • Published

                        voicescribe

                        Live speech transcription library with multi-language support.

                          • v0.1.0
                          • 10.34
                          • Published

                          video-summary

                          Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look

                          • v1.0.8
                          • 9.62
                          • Published

                          polyanno_storage

                          Node and Express backend for easy MongoDB storage of Polyanno annotations

                          • v0.1.5
                          • 9.51
                          • Published

                          twitter-search-bot

                          Strange Text Transliterator (GOTO: spongescribe)

                          • v0.0.0
                          • 9.51
                          • Published

                          @voxextractlabs/vox-whisper

                          [![NPM](https://img.shields.io/npm/v/@voxextractlabs/vox-whisper?label=npm)](https://www.npmjs.com/package/@voxextractlabs/vox-whisper) [![License](https://img.shields.io/npm/l/@voxextractlabs/vox-whisper)](./LICENSE) [![Build Status](https://github.com/V

                          • v1.0.1
                          • 8.89
                          • Published

                          node-palladius

                          The Palladius system for transcribing Chinese characters into the Cyrillic alphabet

                          • v0.6.4
                          • 8.49
                          • Published

                          image-generation

                          Strange Text Transliterator (GOTO: spongescribe)

                          • v0.0.0
                          • 8.46
                          • Published

                          voicely

                          Voicely - A CLI tool to transcribe audio files and summarize the transcriptions using OpenAI's APIs.

                          • v1.1.5
                          • 7.86
                          • Published

                          @rxtk/stt-gcp

                          👂 RxJS operator for realtime speech-to-text (STT/S2T) using Google speeh-to-text

                          • v0.0.0
                          • 7.80
                          • Published

                          atc-transcription

                          React Native module for transcribing WAV files using WhisperKit

                            • v1.0.0
                            • 7.74
                            • Published

                            react-transcribe

                            React component for speech-to-text transcription with silence detection

                            • v0.1.0
                            • 7.74
                            • Published

                            nwhisper

                            Native Node.js bindings for OpenAI's Whisper using whisper.cpp. High-performance local speech-to-text with custom model support.

                            • v0.3.0
                            • 7.74
                            • Published

                            vidscript

                            AI-powered CLI tool that transforms video content into intelligent, structured notes and scripts

                            • v1.0.6
                            • 7.74
                            • Published

                            liblouis-js

                            javascript bindings for liblouis

                            • v0.2.0
                            • 7.59
                            • Published

                            transcord

                            A simple recording and transcription module.

                            • v0.2.0
                            • 7.59
                            • Published

                            spongescribebot

                            Strangetext Transcription - Use: 'spongescribe'

                            • v0.0.0
                            • 7.11
                            • Published

                            maketalk

                            A command-line tool to create video presentations with title cards and transcriptions

                              • v1.4.0
                              • 7.11
                              • Published

                              glaemscribe

                              Glǽmscribe (also written Glaemscribe) is a software dedicated to the transcription of texts between writing systems, and more specifically dedicated to the transcription of J.R.R. Tolkien's invented languages to some of his devised writing systems.

                              • v1.3.1
                              • 7.07
                              • Published

                              meeting-whisperer

                              CLI tool that transcribes audio/video files using AssemblyAI's API and outputs formatted markdown transcripts

                                • v0.1.0
                                • 6.99
                                • Published

                                node-deepgram

                                Node wrapper for Deepgram

                                • v1.0.10
                                • 6.92
                                • Published

                                karaoke-transcriber

                                A CLI tool to generate karaoke-style subtitles on videos using Whisper and FFmpeg.

                                  • v1.0.3
                                  • 6.92
                                  • Published

                                  multi-voice-sdk

                                  A universal Text-to-Speech (TTS) and Speech-to-Text (STT) SDK supporting multiple providers (OpenAI, Google Gemini, Deepgram, Groq PlayAI, Cartesia, AssemblyAI) with audio merging capabilities

                                    • v1.1.0
                                    • 6.40
                                    • Published

                                    @speechall/sdk

                                    TypeScript SDK for the Speechall API - A powerful and flexible speech-to-text service

                                    • v1.0.0
                                    • 6.40
                                    • Published

                                    noapi-speech2text

                                    Speech recognition library that uses web-based services to convert speech to text in multiple languages

                                    • v1.0.3
                                    • 5.20
                                    • Published

                                    friendlyjs

                                    make friendly URLs by stripping out non lating chars, and convert other chars to their latin counterparts

                                    • v0.0.1
                                    • 5.16
                                    • Published

                                    voicekey

                                    A CLI tool to transcribe voice to text with interactive UI

                                    • v1.0.0
                                    • 5.16
                                    • Published

                                    speak-precisely-sdk

                                    Real-time speech transcription and translation SDK

                                    • v1.0.1
                                    • 5.16
                                    • Published

                                    @rxtk/stt-aws

                                    👂 RxJS operator for realtime speech-to-text (STT/S2T) using AWS Transcribe

                                    • v0.0.0
                                    • 5.11
                                    • Published

                                    realtime-stt-client

                                    TypeScript client library for Realtime Speech-to-Text server

                                      • v0.0.1-poc.6
                                      • 5.11
                                      • Published

                                      ai-meeting-summarizer

                                      A simple Node.js library to transcribe and summarize meeting recordings using OpenAI's GPT model and Whisper.

                                        • v1.0.1
                                        • 5.06
                                        • Published

                                        dnable

                                        Simple, lightweight, and fast Node.js module for enabling DNA sequences.

                                          • v0.2.1
                                          • 4.30
                                          • Published

                                          fireflies

                                          Fireflies.ai API wrapper

                                          • v0.0.1
                                          • 4.04
                                          • Published

                                          speechtotext-openaikey

                                          A React component for recording and transcribing audio using the Web Audio API and OpenAI.

                                            • v1.0.2
                                            • 3.93
                                            • Published

                                            alinkeo-core

                                            This is the official alinkeo core npm package

                                            • v1.0.0
                                            • 3.90
                                            • Published

                                            @lunarisapp/cmudict

                                            A JavaScript interface to the CMU Pronouncing Dictionary

                                            • v1.0.0
                                            • 3.90
                                            • Published

                                            n8n-nodes-asr

                                            N8N node for processing audio files via an ASR service

                                            • v0.1.1
                                            • 3.90
                                            • Published

                                            douyin-text-extractor

                                            Node.js + TypeScript library for extracting text from Douyin/TikTok videos

                                            • v1.1.2
                                            • 2.36
                                            • Published

                                            rev_ai

                                            Unofficial Rev AI Node.js client

                                            • v0.5.1
                                            • 2.30
                                            • Published

                                            lucidtalk-core

                                            Privacy-first P2P meeting transcription and AI SDK

                                            • v1.0.0
                                            • 2.30
                                            • Published

                                            subtitles-editor

                                            A React component for editing SRT and VTT subtitles directly in a textarea, styled with TailwindCSS.

                                            • v0.0.1
                                            • 2.29
                                            • Published

                                            meeshx-sdk

                                            Welcome to MeeshX! Choose your favorite provider and transcribe your audio content in less than 5 minutes.

                                              • v1.0.0
                                              • 2.28
                                              • Published

                                              whisper-stream-js

                                              CLI tool for real-time audio transcription using OpenAI's Whisper API

                                              • v1.0.0
                                              • 2.26
                                              • Published

                                              ctrl.so

                                              Embedable vocal intelligence

                                              • v0.0.8
                                              • 0.00
                                              • Published

                                              openwhisper

                                              A library for AI-powered audio transcription with local and remote server fallback.

                                                • v1.0.1
                                                • 0.00
                                                • Published

                                                deepgram-media-transcriber

                                                A package to transcribe media files using Deepgram with speaker-labeled subtitles (SRT/VTT).

                                                  • v1.0.8
                                                  • 0.00
                                                  • Published

                                                  edit-wave-transcript

                                                  react component for transcription with ability to edit the words as well as the words. synchronization is done between words and wave-forms.

                                                  • v0.2.0
                                                  • 0.00
                                                  • Published

                                                  @easyscribe/react

                                                  React components for transcribing with easyscribe.org

                                                  • v1.0.1
                                                  • 0.00
                                                  • Published

                                                  amoeba-life-cycle

                                                  A simple module that simulates the life cycle of an amoeba at the molecular level by using a (semi) TRNG.

                                                  • v1.0.0
                                                  • 0.00
                                                  • Published

                                                  tafrigh-cli

                                                  CLI for using the tafrigh library.

                                                    • v1.4.2
                                                    • 0.00
                                                    • Published

                                                    @lunarity/a2s-cli

                                                    A CLI tool for transcribing audio files to subtitles

                                                    • v1.0.3
                                                    • 0.00
                                                    • Published