@deepgram/sdk
Isomorphic Javascript client for Deepgram
Found 98 results for asr
Isomorphic Javascript client for Deepgram
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
Kaldi in-browser speech recognition based on a WASM build of the Vosk library
JavaScript client for Speechly Streaming API
Speech recognition module for react native using Vosk library
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
The **Inworld AI Node.js SDK** enables Developers to easily integrate AI characters into your Node.js environment.
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Polyfill for the Speech Recognition API using Speechly
React client for Speechly Streaming API
Picovoice Leopard Node.js binding
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
Fork of ccoreilly's vosk-browser v0.0.8
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
MMIR (Mobile Multimodal Interaction and Relay) library
Real-time speech recognition with Next-gen Kaldi
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
React component for Rhino Web SDK
abstract-state-router renderer for Svelte
The Inworld Runtime SDK is the first AI runtime built for consumer applications. Ship faster, automate operations, and experiment in real-time.
A powerful real-time communication SDK for voice interactions with Coze AI bots | 扣子官方实时通信 SDK,用于与 Coze AI bots 进行语音交互
aliyun nls sdk for nodejs
Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection
FlyCut Caption - AI-powered video subtitle editing React component with complete editing suite
提供百度语音 React Native 接口
TypeScript SDK for the Dwani API, supporting Chat, Vision, ASR, TTS, Translation, and Documents
The Inworld Three.js library for the Web SDK. Includes Innequin, and Ready Player Me avatars ready to be used in a Three.js scene.
On-device speech-to-text and voice control for web applications with Moonshine.
Perform speech-to-text on audio files within your n8n workflows.This node provides local audio transcription, no internet or third-party APIs required for processing.
Promise based implementation of Yandex Speech Kit API
React Native plugin for adding voice using Spokestack
JavaScript SDK for Cariva ASR
Picovoice Cheetah Node.js binding
Node.js SDK for Cariva ASR
FlyCut Caption - AI-powered video subtitle editing React component with complete editing suite
Browser SDK for Cariva ASR
JavaScript SDK for Cariva ASR
科大讯飞语音识别 SDK,支持浏览器中实时语音听写功能
小派智能语音问答
爱收入微信SDK
On-device speech-to-text and voice control for web applications with Moonshine.
speech to text functionality with minimum configuration and maximum compatibility
JavaScript SDK for Cariva ASR
node.js module for Yandex speech systems (ASR & TTS)
WebAssembly build of the Vosk library
React Native wrapper for Spokestack
tools for querying supported languages (ASR and TTS) and voices (TTS) for mmir speech plugins
ASR online decoding using Kaldi NNet3 GrammarFST
Accurate prayer times using custom algorithm for dynamic angles and nrel-spa for extreme precision
Browser client for Speechly API
tools for querying supported languages (ASR and TTS) and voices (TTS) by Nuance / Cerence SpeechKit
React client for connecting with AIMET global ASR
豆包(字节跳动)实时语音识别 SDK
adhan-clock is a prayer times calculation library for Muslims
Node.js wrapper for OpenAI Whisper speech recognition with TypeScript support
Cordova Nuance Ndev Plugin
Minimal cross-platform wrapper around NVIDIA/Riva streaming ASR WebSocket API with optional client-side silence detection.
typescript version of aispeech
a node-red integration of mozilla deepspeech
Enterprise Node.js SDK for Abena AI Services - ASR, TTS, Translation
百度飞桨语音服务,需要ffmpeg环境
kylin asr assistant
react gnani speech to text component
Yazıları süreye çevirir.
Plugin for the MMIR framework that adds state-machines for managing speech input/output states
Calculates WER, WCR, SER metrics
Node.JS library for Sber SmartSpeech Speech-to-Text with streaming recognition
Professional Islamic prayer times calculator with multiple calculation methods, adjustments, and caching. Used in Salat Now app by Anis Mosbah.
Really easy-to-use Typescript client for FunASR runtime server.
JavaScript client for Vatis Tech ASR services.
Adaptive dictation-mode speech recognizer ponyfill compatible with WebChat that gives the user time to think and stutter/stammer.
Islamic prayer times calculation with special support for Moroccan methods and Maliki madhab
Command line utility to evaluate Automated Speech Recognition (ASR) systems
Node.JS library for Yandex Cloud Speech-to-Text with streaming recognition
Isomorphic Javascript client for Deepgram
Pure browser PCM S16LE audio recorder via AudioWorklet for ASR (no MediaRecorder, no Opus).
统一的语音服务SDK,支持多个云服务商的ASR和TTS服务
Fork of ccoreilly's vosk-browser to enable mbr vectors on partial results. Praise kaldi
Creates Live Transcription of a media input stream in multiple languages
Isomorphic Javascript client for Deepgram
a node-red integration of the coqui stt component
One Button for Voice Input
Virlow Speech-to-Text API Node.js Package
The Inworld Three.js library for the Web SDK. Includes Innequin, and Ready Player Me avatars ready to be used in a Three.js scene.
JavaScript SDK for Cariva ASR
N8N node for processing audio files via an ASR service
Node.JS library for Tinkoff VoiceKit Speech-to-Text with streaming recognition
Manually update the browser's scroll position when using pushState routing with abstract-state-router
(fork of @m-abdi/voice2text)speech to text functionality with minimum configuration and maximum compatibility
sdk for fanolabs asr
[openai whisper-asr](https://github.com/ahmetoner/whisper-asr-webservice) 语音识别服务,支持一百多种语言+翻译,适配wechaty语音消息
Asr Vietspech library for convert audio to text with Vietnamese language.
Node.js bindings for Vosk speech recognition
kylin asr assistant
Isomorphic Javascript client for Deepgram