Package Exports

@derogab/stt-proxy

Readme

stt-proxy

A simple and lightweight proxy for seamless integration with multiple STT providers including Whisper.cpp.

Features

Multi-provider support: Switch between STT providers with environment variables.
TypeScript support: Full TypeScript definitions included.
Simple API: Single function interface for all providers.
Automatic provider detection: Automatically selects the best available provider based on environment variables.

Installation

npm install @derogab/stt-proxy

Quick Start

import { transcribe } from '@derogab/stt-proxy';

const result = await transcribe('/path/to/audio.wav');
console.log(result.text);

Configuration

The package automatically detects which STT provider to use based on your environment variables. Configure one or more providers:

Whisper.cpp (Local)

WHISPER_CPP_MODEL_PATH=/path/to/ggml-base.bin # Required, path to your GGML model file

Download models from HuggingFace:

curl -L -o ggml-base.bin https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.bin

API Reference

`transcribe(audio: string | Buffer, options?): Promise<TranscribeOutput>`

Transcribes audio to text using the configured STT provider. The package automatically manages provider initialization and cleanup.

Parameters:

audio: Path to audio file (string) or audio Buffer
options (optional): Transcription options

Returns:

Promise that resolves to an object with text property

Options Format:

type TranscribeOptions = {
  language?: string;   // Language code (e.g., 'en', 'es', 'fr')
  translate?: boolean; // Translate to English
};

Output Format:

type TranscribeOutput = {
  text: string;
};

Example:

// Transcribe from file path
const result1 = await transcribe('/path/to/audio.wav');
console.log(result1.text);

// Transcribe from Buffer
const audioBuffer = fs.readFileSync('/path/to/audio.wav');
const result2 = await transcribe(audioBuffer);
console.log(result2.text);

// With options
const result3 = await transcribe('/path/to/audio.wav', {
  language: 'en',
  translate: false
});
console.log(result3.text);

Provider Priority

The package selects providers in the following order:

Whisper.cpp (if WHISPER_CPP_MODEL_PATH is set)

If no providers are configured, the function throws an error.

Requirements

FFmpeg: Required for audio conversion.

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt install ffmpeg

# Windows (with Chocolatey)
choco install ffmpeg

Development

# Install dependencies
npm install

# Build the package
npm run build

Credits

STT Proxy is made with ♥ by derogab and it's released under the MIT license.

Contributors

Tip

If you like this project or directly benefit from it, please consider buying me a coffee:
🔗 bc1qd0qatgz8h62uvnr74utwncc6j5ckfz2v2g4lef
⚡️ derogab@sats.mobi
💶 Sponsor on GitHub

JSPM

@derogab/stt-proxy