JSPM

Found 200 results for transcription

audio-to-text-node

Backend audio file to text transcription using Web Speech API with Puppeteer

  • v0.1.2
  • 26.56
  • Published

whisper-web-transcriber

Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly

  • v0.2.4
  • 26.26
  • Published

@adamhancock/transcribe-cli

CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

  • v1.0.4
  • 26.13
  • Published

n8n-nodes-groq

N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

  • v0.2.0
  • 25.58
  • Published

@picovoice/leopard-web

Leopard Speech-to-Text engine for web browsers (via WebAssembly)

    • v2.0.1
    • 25.39
    • Published

    mirador-textoverlay

    Mirador 3 plugin to render a hidden (but selectable) or visible text overlay

    • v0.3.8
    • 25.14
    • Published

    aixblock-voice-ai-deepgram

    A React component for real-time transcription and voice agent interactions using Deepgram APIs

      • v0.0.7
      • 25.02
      • Published

      liveprompt-mcp-server

      Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

      • v1.0.1
      • 24.69
      • Published

      castleguard-sdk

      JavaScript SDK for interacting with CastleGuard APIs

        • v2.0.0
        • 24.43
        • Published

        ugai

        A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

        • v1.1.0
        • 24.32
        • Published

        assembly-ai-mcp

        Model Context Protocol server for AssemblyAI transcription services

          • v0.0.2
          • 24.04
          • Published

          audio2textjs

          A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models

          • v1.0.5
          • 22.10
          • Published

          paragrafs

          A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.

          • v1.5.1
          • 22.09
          • Published

          liblouis-build

          pre-compiled builds of liblouis for js

          • v3.2.0-rc
          • 21.52
          • Published

          universal-transcriber

          A simple universal transcriber for languages with unicode characters.

          • v1.0.0
          • 21.43
          • Published

          @sorenpeng/rtstt

          Real-time speech-to-text CLI tool using OpenAI Realtime API

          • v1.0.0
          • 20.84
          • Published

          assemblyai-mcp-server

          Model Context Protocol server for AssemblyAI transcription services

            • v0.0.1
            • 20.55
            • Published

            voicescribe

            Live speech transcription library with multi-language support.

              • v0.1.0
              • 20.34
              • Published

              real-time-speech-analyzer

              Real-time speech analysis with local LLM using multiple concurrent analysis instructions

              • v1.0.0
              • 20.25
              • Published

              ai-code-writer

              An AI code writer application using OpenAI APIs for audio transcription and chat completion.

              • v3.1.0
              • 20.09
              • Published

              @adamhancock/transcribe

              CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

              • v1.0.5
              • 19.78
              • Published

              tafrigh

              A NodeJS library for transcribing audio/video to text.

                • v4.0.2
                • 19.10
                • Published

                liblouis

                javascript bindings for liblouis

                • v0.4.0
                • 18.05
                • Published

                react-native-deepgram

                React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

                • v0.1.21
                • 17.38
                • Published

                @daitanjs/speech

                A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

                • v1.0.6
                • 17.15
                • Published

                vidnavigator

                Official JavaScript SDK for the VidNavigator Developer API

                • v0.1.5
                • 16.87
                • Published

                parakeet.js

                NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

                • v0.0.3
                • 16.17
                • Published

                @chinchillaenterprises/mcp-recall

                Event-driven MCP server for Recall.ai meeting transcription with enhanced speaker identification and local storage

                • v1.1.0
                • 16.11
                • Published

                modern-greek-accentuation

                accentuation, syllabification and transcription utilities for Modern Greek

                • v1.2.1
                • 15.82
                • Published

                react-speech-recognition-ui

                A beautiful, production-ready voice transcription package for React applications using the Web Speech API

                • v0.0.8
                • 15.66
                • Published

                open-transcribe

                AI-Powered Audio Transcription Desktop Application

                • v1.2.4
                • 14.55
                • Published

                whispermix

                🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.

                • v1.3.6
                • 14.19
                • Published

                spongescribe

                StrangeText Transcription

                • v0.6.5
                • 13.94
                • Published

                twitter-reply-bot

                base for twitter reply bot using autohook

                • v1.5.1
                • 13.89
                • Published

                @voicefeedback/sdk

                Modern voice feedback SDK with beautiful UI components and AI-powered analysis

                • v1.0.4
                • 13.69
                • Published

                djelia

                Djelia JavaScript SDK - Advanced AI for African Languages

                • v2.0.0
                • 13.69
                • Published

                n8n-nodes-dudoxx

                n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription

                • v0.1.1
                • 13.29
                • Published

                openai-whisper-js

                openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription

                • v1.0.7
                • 13.24
                • Published

                merge-vtt

                A simple tool to merge multiple WebVTT (.vtt) files into a single file.

                • v1.0.4
                • 13.19
                • Published

                @rxtk/stt-deepgram

                👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speeh-to-text

                • v0.0.0
                • 12.41
                • Published

                autosub

                Automatically generate and overlay subtitles for any video.

                • v1.0.4
                • 12.37
                • Published

                gladia

                Official TypeScript SDK for Gladia - State-of-the-art Speech to Text API

                • v0.1.3
                • 12.00
                • Published

                transcription

                Documentation generator for ES6.

                • v0.2.1
                • 11.87
                • Published

                podcast-takeaways

                A system to download the most recent episode of a plugin, transcribe it with the OpenAI Whisper API and then use GPT to determine one takeaway to apply from the episode.

                  • v0.4.5
                  • 11.09
                  • Published

                  kana-transformer

                  Transform kana to en|ru language or vice versa, using specific transliteration system; convert one kana to the other syllabary

                  • v3.5.0
                  • 11.07
                  • Published

                  pronunciation-finder

                  An application for getting audio files with pronunciation from public dictionaries

                  • v0.8.0
                  • 10.96
                  • Published

                  cmu-pronouncing-dictionary-cjs

                  Common JS version of the 134,000+ words and their pronunciations in the CMU pronouncing dictionary

                  • v3.0.0
                  • 10.96
                  • Published

                  transcription-lib-grpc-js

                  Creates Live Transcription of a media input stream in multiple languages

                  • v1.0.2
                  • 10.91
                  • Published

                  twitter-reply

                  Strange Text Transliterator (GOTO: spongescribe)

                  • v0.0.0
                  • 10.58
                  • Published

                  audio-transcripter

                  Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.

                  • v2.0.1
                  • 10.50
                  • Published

                  transcription-words

                  Easy and crystal-clear API for transcription words.

                  • v1.2.1
                  • 10.12
                  • Published

                  video-summary

                  Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look

                  • v1.0.8
                  • 9.63
                  • Published

                  polyanno_storage

                  Node and Express backend for easy MongoDB storage of Polyanno annotations

                  • v0.1.5
                  • 9.56
                  • Published

                  twitter-search-bot

                  Strange Text Transliterator (GOTO: spongescribe)

                  • v0.0.0
                  • 9.56
                  • Published

                  @speechall/sdk

                  TypeScript SDK for the Speechall API - A powerful and flexible speech-to-text service

                  • v1.0.0
                  • 9.52
                  • Published

                  @voxextractlabs/vox-whisper

                  [![NPM](https://img.shields.io/npm/v/@voxextractlabs/vox-whisper?label=npm)](https://www.npmjs.com/package/@voxextractlabs/vox-whisper) [![License](https://img.shields.io/npm/l/@voxextractlabs/vox-whisper)](./LICENSE) [![Build Status](https://github.com/V

                  • v1.0.1
                  • 9.00
                  • Published

                  node-palladius

                  The Palladius system for transcribing Chinese characters into the Cyrillic alphabet

                  • v0.6.4
                  • 8.50
                  • Published

                  image-generation

                  Strange Text Transliterator (GOTO: spongescribe)

                  • v0.0.0
                  • 8.47
                  • Published

                  transcord

                  A simple recording and transcription module.

                  • v0.2.0
                  • 8.23
                  • Published

                  voicely

                  Voicely - A CLI tool to transcribe audio files and summarize the transcriptions using OpenAI's APIs.

                  • v1.1.5
                  • 7.91
                  • Published

                  @rxtk/stt-gcp

                  👂 RxJS operator for realtime speech-to-text (STT/S2T) using Google speeh-to-text

                  • v0.0.0
                  • 7.81
                  • Published

                  atc-transcription

                  React Native module for transcribing WAV files using WhisperKit

                    • v1.0.0
                    • 7.78
                    • Published

                    react-transcribe

                    React component for speech-to-text transcription with silence detection

                    • v0.1.0
                    • 7.78
                    • Published

                    nwhisper

                    Native Node.js bindings for OpenAI's Whisper using whisper.cpp. High-performance local speech-to-text with custom model support.

                    • v0.3.0
                    • 7.78
                    • Published

                    liblouis-js

                    javascript bindings for liblouis

                    • v0.2.0
                    • 7.43
                    • Published

                    spongescribebot

                    Strangetext Transcription - Use: 'spongescribe'

                    • v0.0.0
                    • 7.15
                    • Published

                    maketalk

                    A command-line tool to create video presentations with title cards and transcriptions

                      • v1.4.0
                      • 7.15
                      • Published

                      glaemscribe

                      Glǽmscribe (also written Glaemscribe) is a software dedicated to the transcription of texts between writing systems, and more specifically dedicated to the transcription of J.R.R. Tolkien's invented languages to some of his devised writing systems.

                      • v1.3.1
                      • 7.08
                      • Published

                      meeting-whisperer

                      CLI tool that transcribes audio/video files using AssemblyAI's API and outputs formatted markdown transcripts

                        • v0.1.0
                        • 7.03
                        • Published

                        node-deepgram

                        Node wrapper for Deepgram

                        • v1.0.10
                        • 7.00
                        • Published

                        karaoke-transcriber

                        A CLI tool to generate karaoke-style subtitles on videos using Whisper and FFmpeg.

                          • v1.0.3
                          • 7.00
                          • Published

                          multi-voice-sdk

                          A universal Text-to-Speech (TTS) and Speech-to-Text (STT) SDK supporting multiple providers (OpenAI, Google Gemini, Deepgram, Groq PlayAI, Cartesia, AssemblyAI) with audio merging capabilities

                            • v1.1.0
                            • 6.50
                            • Published

                            vidscript

                            AI-powered CLI tool that transforms video content into intelligent, structured notes and scripts

                            • v1.0.6
                            • 6.16
                            • Published

                            noapi-speech2text

                            Speech recognition library that uses web-based services to convert speech to text in multiple languages

                            • v1.0.3
                            • 5.23
                            • Published

                            friendlyjs

                            make friendly URLs by stripping out non lating chars, and convert other chars to their latin counterparts

                            • v0.0.1
                            • 5.16
                            • Published

                            voicekey

                            A CLI tool to transcribe voice to text with interactive UI

                            • v1.0.0
                            • 5.16
                            • Published

                            speak-precisely-sdk

                            Real-time speech transcription and translation SDK

                            • v1.0.1
                            • 5.16
                            • Published

                            @rxtk/stt-aws

                            👂 RxJS operator for realtime speech-to-text (STT/S2T) using AWS Transcribe

                            • v0.0.0
                            • 5.14
                            • Published

                            realtime-stt-client

                            TypeScript client library for Realtime Speech-to-Text server

                              • v0.0.1-poc.6
                              • 5.14
                              • Published

                              ai-meeting-summarizer

                              A simple Node.js library to transcribe and summarize meeting recordings using OpenAI's GPT model and Whisper.

                                • v1.0.1
                                • 5.12
                                • Published

                                dnable

                                Simple, lightweight, and fast Node.js module for enabling DNA sequences.

                                  • v0.2.1
                                  • 4.29
                                  • Published

                                  fireflies

                                  Fireflies.ai API wrapper

                                  • v0.0.1
                                  • 4.10
                                  • Published

                                  speechtotext-openaikey

                                  A React component for recording and transcribing audio using the Web Audio API and OpenAI.

                                    • v1.0.2
                                    • 3.96
                                    • Published

                                    alinkeo-core

                                    This is the official alinkeo core npm package

                                    • v1.0.0
                                    • 3.90
                                    • Published

                                    @lunarisapp/cmudict

                                    A JavaScript interface to the CMU Pronouncing Dictionary

                                    • v1.0.0
                                    • 3.90
                                    • Published

                                    n8n-nodes-asr

                                    N8N node for processing audio files via an ASR service

                                    • v0.1.1
                                    • 3.90
                                    • Published

                                    douyin-text-extractor

                                    Node.js + TypeScript library for extracting text from Douyin/TikTok videos

                                    • v1.1.2
                                    • 2.40
                                    • Published

                                    rev_ai

                                    Unofficial Rev AI Node.js client

                                    • v0.5.1
                                    • 2.31
                                    • Published

                                    lucidtalk-core

                                    Privacy-first P2P meeting transcription and AI SDK

                                    • v1.0.0
                                    • 2.31
                                    • Published

                                    subtitles-editor

                                    A React component for editing SRT and VTT subtitles directly in a textarea, styled with TailwindCSS.

                                    • v0.0.1
                                    • 2.29
                                    • Published

                                    meeshx-sdk

                                    Welcome to MeeshX! Choose your favorite provider and transcribe your audio content in less than 5 minutes.

                                      • v1.0.0
                                      • 2.28
                                      • Published

                                      whisper-stream-js

                                      CLI tool for real-time audio transcription using OpenAI's Whisper API

                                      • v1.0.0
                                      • 2.27
                                      • Published

                                      whisper-clipboard-cli

                                      Own your transcription workflow. Press Cmd+Shift+X, speak, get text in clipboard instantly.

                                        • v1.0.0
                                        • 0.00
                                        • Published

                                        ctrl.so

                                        Embedable vocal intelligence

                                        • v0.0.8
                                        • 0.00
                                        • Published

                                        openwhisper

                                        A library for AI-powered audio transcription with local and remote server fallback.

                                          • v1.0.1
                                          • 0.00
                                          • Published

                                          deepgram-media-transcriber

                                          A package to transcribe media files using Deepgram with speaker-labeled subtitles (SRT/VTT).

                                            • v1.0.8
                                            • 0.00
                                            • Published

                                            edit-wave-transcript

                                            react component for transcription with ability to edit the words as well as the words. synchronization is done between words and wave-forms.

                                            • v0.2.0
                                            • 0.00
                                            • Published

                                            @easyscribe/react

                                            React components for transcribing with easyscribe.org

                                            • v1.0.1
                                            • 0.00
                                            • Published

                                            amoeba-life-cycle

                                            A simple module that simulates the life cycle of an amoeba at the molecular level by using a (semi) TRNG.

                                            • v1.0.0
                                            • 0.00
                                            • Published

                                            tafrigh-cli

                                            CLI for using the tafrigh library.

                                              • v1.4.2
                                              • 0.00
                                              • Published

                                              @lunarity/a2s-cli

                                              A CLI tool for transcribing audio files to subtitles

                                              • v1.0.3
                                              • 0.00
                                              • Published