JSPM

whisper-onnx-speech-to-text

1.0.1
  • ESM via JSPM
  • ES Module Entrypoint
  • Export Map
  • Keywords
  • License
  • Repository URL
  • TypeScript Types
  • README
  • Created
  • Published
  • Downloads 30
  • Score
    100M100P100Q62000F
  • License MIT

Node.js plugin for speech recognition that works with OpenAI's Whisper models using ONNX.

Package Exports

  • whisper-onnx-speech-to-text
  • whisper-onnx-speech-to-text/dist/index.js

This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (whisper-onnx-speech-to-text) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.

Readme

whisper-onnx-speech-to-text

npm downloads npm downloads

Transcribe speech to text on node.js using OpenAI's Whisper models converted to cross-platform ONNX format

Installation

  1. Add dependency to project
npm install whisper-onnx-speech-to-text
  1. Download whisper model of choice
npx whisper-onnx-speech-to-text download

Usage

import { initWhisper } from 'whisper-onnx-speech-to-text';

const whisper = await initWhisper("base.en");

const transcript = await whisper.transcribe("example/sample.wav");

Result (JSON)

[
  {
    text: " And so my fellow Americans ask not what your country can do for you, ask what you can do for your country."
    chunks: [
       { timestamp: [0, 8.18],  text: " And so my fellow Americans ask not what your country can do for you" },
       { timestamp: [8.18, 11.06], text: " ask what you can do for your country." }
    ]
  }
]

API

initWhisper

The initWhisper() takes the name of the model and returns an instance of the Whisper class initialized with the chosen model.

Whisper

The Whisper class has the following methods:

  • transcribe(filePath: string, language?: string) : transcribes speech from wav file.
    • filePath: path to wav file
    • language: target language for recognition. Name format - the full name in English like 'spanish'
  • disposeModel() : dispose initialized model.

Made with