JSPM

Found 463 results for recognition

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.11
  • 39.43
  • Published

sherpa-onnx-linux-x64

Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

  • v1.12.11
  • 38.49
  • Published

vosk-koffi

Vosk node API based on Koffi.

  • v1.1.1
  • 38.29
  • Published

iink-ts

iinkTS is the fastest way to integrate handwriting panel and recognition in your webapp

  • v3.0.2
  • 38.03
  • Published

@scrypted/objectdetector

Scrypted Video Analysis Plugin. Installed alongside a detection service like OpenCV or TensorFlow.

    • v0.1.72
    • 37.78
    • Published

    sherpa-onnx-darwin-arm64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 36.51
    • Published

    @biopassid/face-sdk

    <h1 align="center"> <br> <a href="http://www.biopassid.com"><img src="https://uploads-ssl.webflow.com/5ec3d6d0293839cf102a656a/63a0d4cec83bbddea006d27a_biopassamarelo.svg" alt="BioPass ID" width="200"></a> <br>

    • v1.3.41
    • 35.82
    • Published

    sherpa-onnx-win-x64

    Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

    • v1.12.11
    • 35.17
    • Published

    @computer-use/libnut

    libnut is an N-API module for desktop automation with node

    • v4.2.0
    • 34.78
    • Published

    @amityeko/rnr-admin

    AmityEko Packaged Business Capabilities - Rewards and Recognitions moderation

      • v0.6.0
      • 34.56
      • Published

      artyom.js

      Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

      • v1.0.6
      • 34.33
      • Published

      sherpa-onnx-win-ia32

      Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

      • v1.12.11
      • 34.26
      • Published

      react-native-apple-shazamkit

      🎵 Powerful music recognition for React Native using Apple's ShazamKit. Identify songs, get metadata, and integrate with Apple Music seamlessly.

      • v1.0.17
      • 33.32
      • Published

      @amityeko/rnr-client

      AmityEko Packaged Business Capabilities - Rewards and Recognitions client

        • v0.6.0
        • 32.35
        • Published

        corti

        Replace window.SpeechRecognition with a mock object and automate your tests

        • v1.0.0
        • 32.18
        • Published

        dynamsoft-capture-vision-bundle

        The Dynamsoft Capture Vision Bundle module is a collection of Dynamsoft products and their dependent resources.

        • v3.0.6001
        • 32.11
        • Published

        @picovoice/cheetah-web

        Cheetah Speech-to-Text engine for web browsers (via WebAssembly)

          • v2.3.0
          • 32.01
          • Published

          @timebutt/face-api.js

          JavaScript API for face detection and face recognition in the browser with tensorflow.js

            • v0.23.2
            • 31.37
            • Published

            @picovoice/cobra-web

            Cobra VAD engine for web browsers (via WebAssembly)

              • v2.0.3
              • 31.36
              • Published

              opencv4nodejs-prebuilt

              Asynchronous OpenCV 4.x nodejs bindings with JavaScript and TypeScript API.

              • v5.3.4
              • 30.93
              • Published

              sherpa-onnx-linux-arm64

              Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

              • v1.12.11
              • 30.66
              • Published

              ocr-tools

              Various tools for OCR

              • v0.2.0
              • 30.04
              • Published

              @picovoice/rhino-web

              Rhino Speech-to-Intent engine for web browsers (via WebAssembly)

                • v3.0.3
                • 29.89
                • Published

                vosk-lib

                Vosk library for node, with type defenitions and multi-arch support.

                • v0.1.3
                • 27.81
                • Published

                sherpa-onnx-darwin-x64

                Speech-to-text, text-to-speech, speaker diarization, and speech enhancement using Next-gen Kaldi without internet connection

                • v1.12.11
                • 26.85
                • Published

                react-taggy

                A simple zero-dependency React component for tagging user-defined entities within a block of text.

                • v0.1.12
                • 26.81
                • Published

                face-api-tfjs-core-update

                JavaScript API for face detection and face recognition in the browser with tensorflow.js

                  • v1.0.10
                  • 26.42
                  • Published

                  audd.io

                  A NodeJS package used to interact with the music recognition API provided by Audd.io

                  • v3.0.2
                  • 26.09
                  • Published

                  tesseractocr

                  Node.js wrapper for Tesseract OCR CLI.

                  • v2.0.3
                  • 25.87
                  • Published

                  teachable-machine.js

                  A robust and optimized JavaScript library for integrating Google's Teachable Machine models, supporting various image sources and providing efficient classification capabilities.

                  • v2.0.2
                  • 25.66
                  • Published

                  vuforia-api

                  Node.js client for the Vuforia Web Services API (VWS API) and the Vuforia Web Query API (VWQ API)

                  • v0.3.2
                  • 25.63
                  • Published

                  nn

                  Fast and simple neural network for node.js

                  • v0.0.7
                  • 25.35
                  • Published

                  job-recognition

                  Library for finding all job titles in an arbitrary piece of text.

                  • v1.1.5
                  • 25.19
                  • Published

                  speechkitt

                  A flexible GUI for interacting with Speech Recognition

                  • v1.0.0
                  • 24.71
                  • Published

                  face-recognition-models

                  This repo contains the model files used by face-recognition.js, in order to easily install them via npm.

                  • v0.0.0
                  • 24.29
                  • Published

                  @picovoice/leopard-web

                  Leopard Speech-to-Text engine for web browsers (via WebAssembly)

                    • v2.0.1
                    • 24.23
                    • Published

                    @docutain/react-native-docutain-sdk

                    React Native plugin of the Docutain Document Scanner SDK for Android and iOS. High quality document scanning, data extraction, text recognition and PDF creation for your apps. Easily scan documents in your app.

                    • v2.0.0
                    • 23.72
                    • Published

                    xfyun-sdk

                    科大讯飞语音识别 SDK,支持浏览器中实时语音听写功能

                    • v1.0.2
                    • 23.51
                    • Published

                    face-recognition

                    Simple Node.js API for robust face detection and face recognition.

                    • v0.9.4
                    • 22.58
                    • Published

                    @moonshine-ai/moonshine-js

                    On-device speech-to-text and voice control for web applications with Moonshine.

                    • v0.1.29
                    • 22.47
                    • Published

                    modern-face-api

                    JavaScript API for face detection and face recognition in the browser with tensorflow.js

                    • v0.22.4
                    • 22.29
                    • Published

                    OneDollar.js

                    Implementation of the $1 Unistroke Recognizer, a two-dimensional template based gesture recognition, in CoffeeScript.

                    • v2.0.0
                    • 22.01
                    • Published

                    react-voice-search

                    React voice search component with audio visualization, speech recognition, and cross-browser support for Web Speech API. SSR-compatible with Next.js.

                    • v1.1.1
                    • 21.08
                    • Published

                    face-api.js.expo

                    JavaScript API/lib for face detection and face recognition in the browser with tensorflow.js on Expo/react-native

                      • v0.22.3
                      • 20.77
                      • Published

                      pockybot

                      Spark bot that handles team recognition

                      • v1.6.0
                      • 19.65
                      • Published

                      facenet

                      Solve face verification, recognition and clustering problems: a TensorFlow backed FaceNet implementation for Node.js.

                      • v0.10.3
                      • 19.42
                      • Published

                      opencv4nodejs-m1

                      Asynchronous OpenCV 4.x nodejs bindings with JavaScript and TypeScript API.

                      • v1.0.2
                      • 18.87
                      • Published

                      @nsfwspy/browser

                      A nudity/pornography image classifier built for web browsers.

                      • v1.2.0
                      • 18.74
                      • Published

                      @lipsurf/plugins

                      Plugins for the LipSurf Chrome extension. Each plugin adds a set of commands to LipSurf.

                      • v4.10.0
                      • 18.62
                      • Published

                      @adaptive-recognition/carmen-cloud-client

                      Node.js client for Carmen Cloud by Adaptive Recognition. Efficiently read license plates, recognize vehicle details, and process container, railway wagon, and US DOT codes.

                        • v2.2.0
                        • 18.47
                        • Published

                        node-openalpr

                        Node.js OpenALPR Bindings

                        • v1.1.1
                        • 18.46
                        • Published

                        podium-sdk

                        Podium Client JavaScript SDK

                        • v1.11.2
                        • 17.97
                        • Published

                        @docutain/react-native-docutain-sdk-barcode

                        The Docutain Barcode SDK for React Native brings high quality barcode / QR code scanning features to your mobile apps, known from the world famous Docutain document management app used by millions of users around the world.

                        • v1.0.0-alpha.1
                        • 17.44
                        • Published

                        anti-captcha

                        Captcha recognition services API wrapper.

                        • v0.0.3
                        • 17.20
                        • Published

                        recognition

                        Captcha recognition services API wrapper.

                        • v0.0.1
                        • 17.15
                        • Published

                        electron-speech

                        speech recognition cli and api for node using electron

                        • v1.0.7
                        • 17.03
                        • Published

                        simple-selfie-face-api

                        Fork of face-api.js. JavaScript API for face detection and face recognition in the browser with tensorflow.js

                          • v0.0.9
                          • 16.85
                          • Published

                          @lucyus/actionify

                          Actionify is a lightweight Node.js automation library for Windows, enabling seamless control of the mouse, keyboard, clipboard, screen, windows and sound, with additional features like OCR and more.

                          • v0.13.0
                          • 16.81
                          • Published

                          react-native-abbyy-mobile-capture-sample-core-api

                          ABBYY Mobile Capture React Native Module allows to use the Image Capture feature of ABBYY Mobile Capture in apps based on the [React Native](https://reactnative.dev/) framework.

                          • v1.0.3
                          • 16.81
                          • Published

                          speechify

                          Easily add speech to text functionality into your website

                          • v0.1.0
                          • 16.49
                          • Published

                          @aurally/speech-control

                          A class to handle microphone permissions, start and observe speech input

                          • v1.1.2
                          • 16.03
                          • Published

                          @spot-parking/node-openalpr

                          Node.js OpenALPR Bindings - Forked from @netPark/node-openalpr and catered for Singapore

                          • v1.1.1
                          • 15.75
                          • Published

                          @smart-cloud/tollingvision

                          TypeScript client for [Tolling Vision](https://tollingvision.com/) by [Smart Cloud Solutions Inc.](https://smart-cloud-solutions.com/).

                            • v2.6.0
                            • 15.47
                            • Published

                            kanji-recognition

                            Angular 6 library that uses Google service to recognize handwritten kanji

                            • v1.0.5
                            • 15.13
                            • Published

                            aws-rekognition

                            AWS Deep learning-based image recognition

                            • v0.0.2
                            • 14.84
                            • Published

                            ispikit

                            ispikit

                            • v1.0.3
                            • 14.24
                            • Published

                            @usefulsensors/moonshine-js

                            On-device speech-to-text and voice control for web applications with Moonshine.

                            • v0.1.21
                            • 14.15
                            • Published

                            skybiometry-login

                            Log users in with SkyBiometry's face recognition

                            • v1.3.2
                            • 14.15
                            • Published

                            @mastashake08/speech-kit

                            Package for simplifying the Speech Recognition and Speech Utterence process.

                            • v2.0.8
                            • 13.90
                            • Published

                            spremic

                            A simple JavaScript speech recognition library.

                            • v0.0.48
                            • 13.88
                            • Published

                            cordova-plugin-speech

                            This is cordova plugin for Speech Recognition and Text to Speech.

                            • v0.0.4
                            • 13.86
                            • Published

                            @vapi/node-yolo

                            Node.js interface for Yolo/Darknet

                            • v2.1.5
                            • 13.75
                            • Published

                            @autojs/opencv

                            Asynchronous OpenCV 3.x nodejs bindings with JavaScript and TypeScript API for Auto.js Pro.

                            • v5.6.14
                            • 13.63
                            • Published

                            @scanood/libnut

                            libnut is an N-API module for desktop automation with node

                            • v4.2.0
                            • 13.32
                            • Published

                            robbie-sdk

                            Robbie Visio SDK to send events for analaysis

                            • v0.1.29
                            • 12.94
                            • Published

                            handwriting

                            Handwriting and stroke recognition library

                            • v0.0.3
                            • 12.93
                            • Published

                            voice-speech-recognition

                            Simple wrapper extended functionalities of Speech Recognition embedded in browsers.

                            • v1.1.2
                            • 12.84
                            • Published

                            opencv4nodejs-lambda

                            Asynchronous OpenCV 3.x API for node.js, built to work on AWS lambda, forked from https://github.com/justadudewhohacks/opencv4nodejs

                            • v2.35.0
                            • 12.83
                            • Published

                            @scanood/libnut-linux

                            libnut is an N-API module for desktop automation with node

                            • v2.7.0
                            • 12.71
                            • Published

                            cmusphinxdict

                            Wrapper for CMU Sphinx Pronouncing Dictionary

                            • v0.0.9
                            • 12.71
                            • Published

                            tesseract-with-html5-camera

                            The objective of this package is to recongnize text from captured image from mobile camera or webcam. This package also have same look and feel of a native mobile camera app but with a react component.

                            • v0.1.7
                            • 12.61
                            • Published

                            image-to-text

                            decodes objects in a given image and gives back the keywords/text

                            • v1.0.8
                            • 12.61
                            • Published

                            speech-js

                            lib for recognition and synthesis of speech

                            • v0.1.1
                            • 12.60
                            • Published

                            graspjs

                            Grasp.js is a handgrip pattern recognition micro-library for mobile devices.

                            • v0.1.0
                            • 12.60
                            • Published

                            face-api-arousal

                            JavaScript API for face detection and face recognition in the browser with tensorflow.js

                            • v1.22.3
                            • 12.54
                            • Published

                            @scanood/libnut-win32

                            libnut is an N-API module for desktop automation with node

                            • v2.7.0
                            • 12.07
                            • Published

                            pastecapi

                            Lightweight module for Pastec image recognition API

                            • v1.3.2
                            • 11.88
                            • Published

                            async-ocrad

                            Async-await wrapper for ocrad.js

                              • v0.0.2
                              • 11.86
                              • Published

                              speech-recog-stream

                              A module to stream audio to a speech recognition server and get back the STT result"

                              • v1.0.8
                              • 11.82
                              • Published

                              cxchord

                              Midi Chord Recognizer

                              • v1.1.3
                              • 11.71
                              • Published

                              cloudsight

                              CloudSight image recognition API (unofficial)

                              • v1.1.3
                              • 11.71
                              • Published

                              myscript-angular

                              AngularJs integrations for MyScript by VisionObjects

                              • v0.8.2
                              • 11.55
                              • Published

                              falexa

                              Create your own verbal commands that map to custom Javascript functions

                              • v2.0.3
                              • 11.25
                              • Published

                              aws-transcribe-to-vtt

                              Turn JSON from Amazon AWS Transcribe into VTT files for use as subtitles.

                              • v1.0.6
                              • 11.22
                              • Published

                              textifyimage

                              textifyimage is a lightweight npm package that allows you to extract text from images effortlessly.

                              • v1.0.3
                              • 11.15
                              • Published

                              voicecapture-angular

                              `voicecapture-angular` is an Angular library designed to provide seamless voice capture and transcription capabilities for web applications. With an easy-to-use API, `voicecapture-angular` allows developers to integrate voice recognition features effortle

                              • v1.0.1
                              • 10.80
                              • Published

                              sap-leonardo

                              NPM module for SAP Leonardo Machine Learning Foundation - Functional Services https://api.sap.com/package/SAPLeonardoMLFunctionalServices

                              • v0.6.0
                              • 10.41
                              • Published

                              @nsfwspy/node

                              A nudity/pornography image classifier built for Node.js.

                              • v1.2.0
                              • 10.41
                              • Published

                              yactraq

                              Interface to Yactraq Speech2Topics API

                              • v0.0.1
                              • 10.13
                              • Published

                              name-recognition

                              Library for finding all the (USA-centric) names in an arbitrary piece of text.

                              • v1.3.1
                              • 10.12
                              • Published

                              tts-js

                              Synthetize text to speech using the browser speechSynthesis

                              • v1.0.1
                              • 10.12
                              • Published

                              opencv_faced_detect

                              light-weight library for face recognition including features such as eyes, nose and mouth. and make image

                              • v1.1.3
                              • 10.03
                              • Published

                              myscript-js

                              Javascript integrations for MyScript by VisionObjects

                              • v0.4.1
                              • 10.03
                              • Published

                              the-finger

                              JavaScript library to detect touch gestures: tap, double tap, press, long press, drag, flick, rotate, pinch, spread, pan, two-finger.

                              • v1.0.3
                              • 10.02
                              • Published

                              @freddydrodev/artyom

                              Artyom is a Robust Wrapper of the Google Chrome SpeechSynthesis and SpeechRecognition that allows you to create a virtual assistent

                              • v0.0.1
                              • 9.74
                              • Published

                              voice-node-library

                              Real-time voice bot library with STT, LLM, and TTS capabilities

                              • v1.0.2
                              • 9.42
                              • Published

                              franc-audio-to-text

                              <!-- demo --> [DEMO WITH NEXTJS](https://next-transcriber.vercel.app) ## Installation ```bash npm install franc-audio-to-text ``` ## Usage with Typescript ```typescript import TranscribeAudioToText from "franc-audio-to-text"; ```

                                • v2.0.4
                                • 9.42
                                • Published

                                ng-facial-recognition

                                Microsoft Project Oxford - AngularJS 1.x Facial Recognition API Wrapper (Face API)

                                • v1.2.0
                                • 9.42
                                • Published

                                robotjs-node10

                                This is a fork of octalmage/robotjs with prebuilts for node 10

                                • v0.5.4
                                • 9.21
                                • Published

                                cogserv-entity-linking

                                Node.js client for Microsoft Cognitive Services API - Entity Linking

                                • v1.0.0
                                • 9.21
                                • Published

                                @bilzo/node-ts-ocr

                                A simple wrapper around command-line utils to assist in PDF / Image OCR (Optical Character Recognition) processing using Tesseract.

                                • v1.0.17
                                • 9.21
                                • Published

                                face-api-recognition

                                JavaScript API for face detection and face recognition in the browser with tensorflow.js

                                  • v0.22.5
                                  • 9.20
                                  • Published

                                  vedansh-face-api

                                  JavaScript API for face detection and face recognition by Vedansh

                                    • v0.0.1
                                    • 9.14
                                    • Published

                                    speech-recognition-react

                                    A react library that encapsulates the native browser speech recognition api

                                    • v2.0.0
                                    • 9.14
                                    • Published

                                    bing-sydney-ai

                                    ``` npm install bing-sydney-ai ```

                                      • v1.0.1
                                      • 9.14
                                      • Published

                                      getusermedia-async

                                      A promise-based, awaitable, browser-independent getUserMedia function to get user's audio or video.

                                      • v1.0.0
                                      • 9.14
                                      • Published

                                      @neofaceid/web-sdk

                                      NeoFaceId Web SDK for facial authentication

                                      • v1.0.19
                                      • 8.96
                                      • Published

                                      koi-koi

                                      Koi is a time aware interactive bird written in THREE.js with voice recognition, and an artificial IQ.

                                      • v0.1.0
                                      • 8.87
                                      • Published

                                      facerecognitionlib

                                      >A modular and customizable face recognition camera utility for the web. Easily integrate webcam face detection and recognition with detector model, API calls, toast notifications, and custom UI.

                                        • v1.0.3
                                        • 8.87
                                        • Published

                                        node-ner

                                        NodeJS Named Entity Recognition, using Stanford NER (easy install)

                                        • v0.0.3
                                        • 8.75
                                        • Published

                                        robotjs-shade

                                        Node.js Desktop Automation.

                                        • v0.4.14
                                        • 8.75
                                        • Published

                                        robotjs_jm

                                        Node.js Desktop Automation. Forked for JM Robotics

                                        • v35.0.6
                                        • 8.75
                                        • Published

                                        robotjs-repack

                                        Node.js Desktop Automation.fork by ToDesktop/robotjs-prebuild

                                        • v0.6.4
                                        • 8.69
                                        • Published

                                        speedyspeech

                                        This is a module to quickly use the Web Speech API to recognize keywords as a user speaks.

                                        • v0.1.2
                                        • 8.63
                                        • Published

                                        @scvzerng/libnut

                                        libnut is an N-API module for desktop automation with node

                                        • v2.7.1
                                        • 8.63
                                        • Published

                                        ng-speech-recognition

                                        AngularJS directive to add Speech Recognition to your hybrid mobile application & AngularJS web app.

                                        • v2.0.1
                                        • 8.26
                                        • Published

                                        mumble-js

                                        A simple Javascript framework for adding voice commands to a web site using the web speech recognition API. Based on annyang.js.

                                        • v1.0.1
                                        • 8.13
                                        • Published

                                        ducks-dashboard

                                        Dashboard duck with filter, sorting, search & results

                                        • v1.1.1
                                        • 8.13
                                        • Published

                                        react-mrz-scanner

                                        A React component to scan MRZ on passports, visa cards, etc.

                                        • v1.0.1
                                        • 8.06
                                        • Published

                                        @suchipi/libnut-win32

                                        libnut is an N-API module for desktop automation with node

                                        • v2.7.1
                                        • 8.06
                                        • Published

                                        react-hanzi-lookup

                                        HanziLookUpJS, made for React. React functional component for Chinese handwriting recognition. Little set-up required.

                                        • v1.0.7
                                        • 8.05
                                        • Published

                                        @recognify/core

                                        Regognize everything in your browser

                                        • v1.0.2
                                        • 7.49
                                        • Published

                                        @edumolki/opencv4nodejs

                                        Asynchronous OpenCV 3.x / 4.x nodejs bindings with JavaScript and TypeScript API.

                                        • v1.0.11
                                        • 7.49
                                        • Published

                                        @geoffcox/pretty-good-nlp

                                        A simple natural language processing (NLP) recognizer you can use in minutes.

                                        • v1.0.0
                                        • 7.48
                                        • Published

                                        @puge/opencv4nodejs

                                        Asynchronous OpenCV 3.x nodejs bindings with JavaScript and TypeScript API.

                                        • v5.6.7
                                        • 7.48
                                        • Published

                                        kairos-api

                                        The Node.js client for the Kairos face recognition API.

                                        • v0.1.3
                                        • 7.43
                                        • Published

                                        hotword

                                        Hot Word Detection and EASY to build and use

                                          • v1.0.9
                                          • 7.43
                                          • Published

                                          @koush/opencv4nodejs

                                          Asynchronous OpenCV 3.x nodejs bindings with JavaScript and TypeScript API.

                                          • v5.7.2
                                          • 7.43
                                          • Published

                                          vac

                                          A language detection library named after the hindu goddess of communications and words Vāc.

                                          • v1.0.2
                                          • 7.43
                                          • Published

                                          quantum_neuron2

                                          A perceptron neuron that simply recognizes differences.

                                          • v1.0.1
                                          • 7.43
                                          • Published

                                          webvoicehub

                                          Voice commands for web applications.

                                            • v1.4.0
                                            • 7.42
                                            • Published