JSPM

Found 200 results for transcription

speech-into-text

SpeechToText is a lightweight, multi-language voice-to-text tool for real-time transcription in web apps.

  • v4.0.2
  • 36.69
  • Published

whisper-speech-to-text

A JavaScript library enabling in-browser audio recording and transcription using OpenAI's Whisper Speech-to-Text

  • v1.0.3
  • 35.47
  • Published

koshi-vox

Voice-To-Text recorder with sound notifications - optimized for macOS

  • v1.2.6
  • 35.36
  • Published

sherpa-onnx-linux-arm64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.11
  • 35.06
  • Published

@liveprompt/mcp-server

Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

  • v2.0.13
  • 34.21
  • Published

@picovoice/cheetah-web

Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

    • v2.3.0
    • 33.63
    • Published

    n8n-nodes-puter-ai

    Advanced n8n node for Puter.js AI with RAG agentic capabilities, document processing, audio transcription, Supabase integration, and cost-optimized model priorities

    • v2.0.4
    • 31.58
    • Published

    @thaleslaray/n8n-nodes-elevenlabs

    Nó n8n para integração com a API da ElevenLabs incluindo Speech-to-Text, Text-to-Speech e Conversational AI

    • v0.3.9
    • 31.06
    • Published

    sherpa-onnx-darwin-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 31.01
    • Published

    @theventures/caret

    Unofficial Node.js API client for the Caret HTTP API

    • v0.1.1
    • 30.87
    • Published

    apexify.js

    Unlimited AI models and Canvas library. Supports ts & js (supports front/back end).

      • v4.9.26
      • 30.15
      • Published

      dictate-button

      Dictate Button (Web Component)

      • v1.2.0
      • 29.48
      • Published

      aromanize

      Korean transliteration tool for JavaScript

      • v0.1.5
      • 29.04
      • Published

      susurro-audio

      🎙️ Real-time conversational audio with AI transcription. Build ChatGPT-style voice interfaces in minutes with <300ms latency

      • v2.1.1
      • 28.73
      • Published

      n8n-nodes-get-transcribe

      n8n node for GetTranscribe API integration - transcribe videos from Instagram, TikTok, YouTube and more

      • v0.1.2
      • 28.67
      • Published

      talisik-huntress

      A TypeScript library for extracting and working with YouTube video transcripts.

      • v1.1.7
      • 28.04
      • Published

      @fugood/whisper.node

      An another Node binding of whisper.cpp to make same API with whisper.rn as much as possible.

      • v1.0.3
      • 27.91
      • Published

      robinwood

      Steal money from big companies

      • v1.0.1
      • 27.49
      • Published

      audiopod-sdk

      AudioPod SDK for Node.js and React - Professional Audio Processing powered by AI

      • v1.2.0
      • 26.63
      • Published

      n8n-nodes-transcribe-audio

      Perform speech-to-text on audio files within your n8n workflows.This node provides local audio transcription, no internet or third-party APIs required for processing.

      • v0.1.23
      • 26.61
      • Published

      audio-to-text-node

      Backend audio file to text transcription using Web Speech API with Puppeteer

      • v0.1.2
      • 26.47
      • Published

      n8n-nodes-groq

      N8N community node for Groq API - Speech-to-Text transcription using Whisper AI. Convert audio to text with high accuracy. Perfect for WhatsApp voice messages, audio files, and voice automation workflows.

      • v0.2.0
      • 26.18
      • Published

      @adamhancock/transcribe-cli

      CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

      • v1.0.4
      • 25.88
      • Published

      @picovoice/leopard-web

      Leopard Speech-to-Text engine for web browsers (via WebAssembly)

        • v2.0.1
        • 25.46
        • Published

        aixblock-voice-ai-deepgram

        A React component for real-time transcription and voice agent interactions using Deepgram APIs

          • v0.0.7
          • 25.13
          • Published

          whisper-web-transcriber

          Real-time audio transcription in the browser using OpenAI's Whisper model via WebAssembly

          • v0.2.5
          • 25.07
          • Published

          mirador-textoverlay

          Mirador 3 plugin to render a hidden (but selectable) or visible text overlay

          • v0.3.8
          • 24.92
          • Published

          liveprompt-mcp-server

          Model Context Protocol server for liveprompt.ai - Enable external applications to connect and access meeting data

          • v1.0.1
          • 24.57
          • Published

          audio2textjs

          A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models

          • v1.0.5
          • 24.57
          • Published

          ugai

          A JavaScript/Node.js package for Akan language Text-to-Speech (TTS), Speech-to-Text (STT), and Spellchecker services

          • v1.1.0
          • 24.20
          • Published

          castleguard-sdk

          JavaScript SDK for interacting with CastleGuard APIs

            • v2.0.0
            • 24.20
            • Published

            assembly-ai-mcp

            Model Context Protocol server for AssemblyAI transcription services

              • v0.0.2
              • 23.93
              • Published

              liblouis-build

              pre-compiled builds of liblouis for js

              • v3.2.0-rc
              • 22.55
              • Published

              paragrafs

              A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.

              • v1.5.1
              • 22.02
              • Published

              universal-transcriber

              A simple universal transcriber for languages with unicode characters.

              • v1.0.0
              • 21.93
              • Published

              @sorenpeng/rtstt

              Real-time speech-to-text CLI tool using OpenAI Realtime API

              • v1.0.0
              • 21.32
              • Published

              whisper-clipboard-cli

              Own your transcription workflow. Press Cmd+Shift+X, speak, get text in clipboard instantly.

                • v1.0.1
                • 20.59
                • Published

                ai-code-writer

                An AI code writer application using OpenAI APIs for audio transcription and chat completion.

                • v3.1.0
                • 20.55
                • Published

                assemblyai-mcp-server

                Model Context Protocol server for AssemblyAI transcription services

                  • v0.0.1
                  • 20.46
                  • Published

                  real-time-speech-analyzer

                  Real-time speech analysis with local LLM using multiple concurrent analysis instructions

                  • v1.0.0
                  • 20.15
                  • Published

                  @adamhancock/transcribe

                  CLI tool for transcribing and summarizing MP4 recordings using Whisper and Ollama

                  • v1.0.5
                  • 19.71
                  • Published

                  tafrigh

                  A NodeJS library for transcribing audio/video to text.

                    • v4.0.2
                    • 19.55
                    • Published

                    liblouis

                    javascript bindings for liblouis

                    • v0.4.0
                    • 17.89
                    • Published

                    @daitanjs/speech

                    A library for text-to-speech (TTS) and speech-to-text (STT) operations using Google Cloud and OpenAI.

                    • v1.0.6
                    • 17.55
                    • Published

                    react-native-deepgram

                    React Native SDK for Deepgram's AI-powered speech-to-text, real-time transcription, and text intelligence APIs. Supports live audio streaming, file transcription, sentiment analysis, and topic detection for iOS and Android.

                    • v0.1.21
                    • 17.46
                    • Published

                    @elizaos/plugin-google-meet-cute

                    Google Meet integration plugin for ElizaOS - manage meetings, get participant info, and access meeting artifacts via Google Meet REST API

                    • v1.5.0
                    • 16.94
                    • Published

                    vidnavigator

                    Official JavaScript SDK for the VidNavigator Developer API

                    • v0.1.5
                    • 16.82
                    • Published

                    parakeet.js

                    NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web.

                    • v0.0.3
                    • 16.25
                    • Published

                    @chinchillaenterprises/mcp-recall

                    Event-driven MCP server for Recall.ai meeting transcription with enhanced speaker identification and local storage

                    • v1.1.0
                    • 16.08
                    • Published

                    react-speech-recognition-ui

                    A beautiful, production-ready voice transcription package for React applications using the Web Speech API

                    • v0.0.8
                    • 16.03
                    • Published

                    modern-greek-accentuation

                    accentuation, syllabification and transcription utilities for Modern Greek

                    • v1.2.1
                    • 15.76
                    • Published

                    gladia

                    Official TypeScript SDK for Gladia - State-of-the-art Speech to Text API

                    • v0.1.3
                    • 15.23
                    • Published

                    @voicefeedback/sdk

                    Modern voice feedback SDK with beautiful UI components and AI-powered analysis

                    • v1.0.4
                    • 15.13
                    • Published

                    whispermix

                    🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.

                    • v1.3.6
                    • 14.52
                    • Published

                    open-transcribe

                    AI-Powered Audio Transcription Desktop Application

                    • v1.2.4
                    • 14.41
                    • Published

                    spongescribe

                    StrangeText Transcription

                    • v0.6.5
                    • 13.91
                    • Published

                    twitter-reply-bot

                    base for twitter reply bot using autohook

                    • v1.5.1
                    • 13.76
                    • Published

                    djelia

                    Djelia JavaScript SDK - Advanced AI for African Languages

                    • v2.0.0
                    • 13.75
                    • Published

                    n8n-nodes-dudoxx

                    n8n community node package for interacting with the AssemblyAI API for speech-to-text transcription

                    • v0.1.1
                    • 13.22
                    • Published

                    openai-whisper-js

                    openai-whisper-js is a Node.js wrapper for the OpenAI Whisper library, enabling seamless audio transcription using Whisper models. This package simplifies the process of interacting with Whisper by providing a JavaScript interface to execute transcription

                    • v1.0.7
                    • 13.20
                    • Published

                    merge-vtt

                    A simple tool to merge multiple WebVTT (.vtt) files into a single file.

                    • v1.0.4
                    • 13.07
                    • Published

                    transcription-words

                    Easy and crystal-clear API for transcription words.

                    • v1.2.1
                    • 12.63
                    • Published

                    @rxtk/stt-deepgram

                    👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speeh-to-text

                    • v0.0.0
                    • 12.36
                    • Published

                    autosub

                    Automatically generate and overlay subtitles for any video.

                    • v1.0.4
                    • 12.33
                    • Published

                    transcription

                    Documentation generator for ES6.

                    • v0.2.1
                    • 11.16
                    • Published

                    podcast-takeaways

                    A system to download the most recent episode of a plugin, transcribe it with the OpenAI Whisper API and then use GPT to determine one takeaway to apply from the episode.

                      • v0.4.5
                      • 11.07
                      • Published

                      pronunciation-finder

                      An application for getting audio files with pronunciation from public dictionaries

                      • v0.8.0
                      • 10.92
                      • Published

                      cmu-pronouncing-dictionary-cjs

                      Common JS version of the 134,000+ words and their pronunciations in the CMU pronouncing dictionary

                      • v3.0.0
                      • 10.92
                      • Published

                      transcription-lib-grpc-js

                      Creates Live Transcription of a media input stream in multiple languages

                      • v1.0.2
                      • 10.82
                      • Published

                      twitter-reply

                      Strange Text Transliterator (GOTO: spongescribe)

                      • v0.0.0
                      • 10.53
                      • Published

                      audio-transcripter

                      Lightweight TypeScript library for transcribing audio files using Google Gemini 2.0 models. Supports local files, remote URLs, and Blobs.

                      • v2.0.1
                      • 10.40
                      • Published

                      voicescribe

                      Live speech transcription library with multi-language support.

                        • v0.1.0
                        • 10.37
                        • Published

                        kana-transformer

                        Transform kana to en|ru language or vice versa, using specific transliteration system; convert one kana to the other syllabary

                        • v3.5.0
                        • 9.74
                        • Published

                        video-summary

                        Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look

                        • v1.0.8
                        • 9.59
                        • Published

                        polyanno_storage

                        Node and Express backend for easy MongoDB storage of Polyanno annotations

                        • v0.1.5
                        • 9.48
                        • Published

                        twitter-search-bot

                        Strange Text Transliterator (GOTO: spongescribe)

                        • v0.0.0
                        • 9.48
                        • Published

                        @voxextractlabs/vox-whisper

                        [![NPM](https://img.shields.io/npm/v/@voxextractlabs/vox-whisper?label=npm)](https://www.npmjs.com/package/@voxextractlabs/vox-whisper) [![License](https://img.shields.io/npm/l/@voxextractlabs/vox-whisper)](./LICENSE) [![Build Status](https://github.com/V

                        • v1.0.1
                        • 8.91
                        • Published

                        node-palladius

                        The Palladius system for transcribing Chinese characters into the Cyrillic alphabet

                        • v0.6.4
                        • 8.46
                        • Published

                        image-generation

                        Strange Text Transliterator (GOTO: spongescribe)

                        • v0.0.0
                        • 8.44
                        • Published

                        fireflies

                        Fireflies.ai API wrapper

                        • v0.0.1
                        • 8.39
                        • Published

                        voicely

                        Voicely - A CLI tool to transcribe audio files and summarize the transcriptions using OpenAI's APIs.

                        • v1.1.5
                        • 7.95
                        • Published

                        @rxtk/stt-gcp

                        👂 RxJS operator for realtime speech-to-text (STT/S2T) using Google speeh-to-text

                        • v0.0.0
                        • 7.78
                        • Published

                        atc-transcription

                        React Native module for transcribing WAV files using WhisperKit

                          • v1.0.0
                          • 7.71
                          • Published

                          nwhisper

                          Native Node.js bindings for OpenAI's Whisper using whisper.cpp. High-performance local speech-to-text with custom model support.

                          • v0.3.0
                          • 7.71
                          • Published

                          react-transcribe

                          React component for speech-to-text transcription with silence detection

                          • v0.1.0
                          • 7.71
                          • Published

                          vidscript

                          AI-powered CLI tool that transforms video content into intelligent, structured notes and scripts

                          • v1.0.6
                          • 7.71
                          • Published

                          liblouis-js

                          javascript bindings for liblouis

                          • v0.2.0
                          • 7.61
                          • Published

                          transcord

                          A simple recording and transcription module.

                          • v0.2.0
                          • 7.61
                          • Published

                          maketalk

                          A command-line tool to create video presentations with title cards and transcriptions

                            • v1.4.0
                            • 7.18
                            • Published

                            glaemscribe

                            Glǽmscribe (also written Glaemscribe) is a software dedicated to the transcription of texts between writing systems, and more specifically dedicated to the transcription of J.R.R. Tolkien's invented languages to some of his devised writing systems.

                            • v1.3.1
                            • 7.04
                            • Published

                            meeting-whisperer

                            CLI tool that transcribes audio/video files using AssemblyAI's API and outputs formatted markdown transcripts

                              • v0.1.0
                              • 6.96
                              • Published

                              node-deepgram

                              Node wrapper for Deepgram

                              • v1.0.10
                              • 6.94
                              • Published

                              karaoke-transcriber

                              A CLI tool to generate karaoke-style subtitles on videos using Whisper and FFmpeg.

                                • v1.0.3
                                • 6.94
                                • Published

                                multi-voice-sdk

                                A universal Text-to-Speech (TTS) and Speech-to-Text (STT) SDK supporting multiple providers (OpenAI, Google Gemini, Deepgram, Groq PlayAI, Cartesia, AssemblyAI) with audio merging capabilities

                                  • v1.1.0
                                  • 6.65
                                  • Published

                                  @speechall/sdk

                                  TypeScript SDK for the Speechall API - A powerful and flexible speech-to-text service

                                  • v1.0.0
                                  • 6.65
                                  • Published

                                  noapi-speech2text

                                  Speech recognition library that uses web-based services to convert speech to text in multiple languages

                                  • v1.0.3
                                  • 5.25
                                  • Published

                                  spongescribebot

                                  Strangetext Transcription - Use: 'spongescribe'

                                  • v0.0.0
                                  • 5.25
                                  • Published

                                  speechtotext-openaikey

                                  A React component for recording and transcribing audio using the Web Audio API and OpenAI.

                                    • v1.0.2
                                    • 5.25
                                    • Published

                                    friendlyjs

                                    make friendly URLs by stripping out non lating chars, and convert other chars to their latin counterparts

                                    • v0.0.1
                                    • 5.14
                                    • Published

                                    voicekey

                                    A CLI tool to transcribe voice to text with interactive UI

                                    • v1.0.0
                                    • 5.14
                                    • Published

                                    speak-precisely-sdk

                                    Real-time speech transcription and translation SDK

                                    • v1.0.1
                                    • 5.14
                                    • Published

                                    @rxtk/stt-aws

                                    👂 RxJS operator for realtime speech-to-text (STT/S2T) using AWS Transcribe

                                    • v0.0.0
                                    • 5.09
                                    • Published

                                    realtime-stt-client

                                    TypeScript client library for Realtime Speech-to-Text server

                                      • v0.0.1-poc.6
                                      • 5.09
                                      • Published

                                      ai-meeting-summarizer

                                      A simple Node.js library to transcribe and summarize meeting recordings using OpenAI's GPT model and Whisper.

                                        • v1.0.1
                                        • 5.07
                                        • Published

                                        dnable

                                        Simple, lightweight, and fast Node.js module for enabling DNA sequences.

                                          • v0.2.1
                                          • 4.28
                                          • Published

                                          alinkeo-core

                                          This is the official alinkeo core npm package

                                          • v1.0.0
                                          • 3.89
                                          • Published

                                          @lunarisapp/cmudict

                                          A JavaScript interface to the CMU Pronouncing Dictionary

                                          • v1.0.0
                                          • 3.89
                                          • Published

                                          n8n-nodes-asr

                                          N8N node for processing audio files via an ASR service

                                          • v0.1.1
                                          • 3.89
                                          • Published

                                          douyin-text-extractor

                                          Node.js + TypeScript library for extracting text from Douyin/TikTok videos

                                          • v1.1.2
                                          • 2.45
                                          • Published

                                          lucidtalk-core

                                          Privacy-first P2P meeting transcription and AI SDK

                                          • v1.0.0
                                          • 2.33
                                          • Published

                                          rev_ai

                                          Unofficial Rev AI Node.js client

                                          • v0.5.1
                                          • 2.33
                                          • Published

                                          subtitles-editor

                                          A React component for editing SRT and VTT subtitles directly in a textarea, styled with TailwindCSS.

                                          • v0.0.1
                                          • 2.28
                                          • Published

                                          meeshx-sdk

                                          Welcome to MeeshX! Choose your favorite provider and transcribe your audio content in less than 5 minutes.

                                            • v1.0.0
                                            • 2.28
                                            • Published

                                            whisper-stream-js

                                            CLI tool for real-time audio transcription using OpenAI's Whisper API

                                            • v1.0.0
                                            • 2.25
                                            • Published

                                            ctrl.so

                                            Embedable vocal intelligence

                                            • v0.0.8
                                            • 0.00
                                            • Published

                                            openwhisper

                                            A library for AI-powered audio transcription with local and remote server fallback.

                                              • v1.0.1
                                              • 0.00
                                              • Published

                                              deepgram-media-transcriber

                                              A package to transcribe media files using Deepgram with speaker-labeled subtitles (SRT/VTT).

                                                • v1.0.8
                                                • 0.00
                                                • Published

                                                edit-wave-transcript

                                                react component for transcription with ability to edit the words as well as the words. synchronization is done between words and wave-forms.

                                                • v0.2.0
                                                • 0.00
                                                • Published

                                                @easyscribe/react

                                                React components for transcribing with easyscribe.org

                                                • v1.0.1
                                                • 0.00
                                                • Published

                                                amoeba-life-cycle

                                                A simple module that simulates the life cycle of an amoeba at the molecular level by using a (semi) TRNG.

                                                • v1.0.0
                                                • 0.00
                                                • Published

                                                tafrigh-cli

                                                CLI for using the tafrigh library.

                                                  • v1.4.2
                                                  • 0.00
                                                  • Published

                                                  @lunarity/a2s-cli

                                                  A CLI tool for transcribing audio files to subtitles

                                                  • v1.0.3
                                                  • 0.00
                                                  • Published