JSPM

  • Created
  • Published
  • Downloads 385
  • Score
    100M100P100Q103049F
  • License MIT

Web component for Corti Dictation

Package Exports

  • @corti/dictation-web

Readme

Corti Dictation SDK

Overview

The Corti Dictation SDK is a web component that enables real-time speech-to-text dictation using Corti's Dictation API. It provides a simple interface for capturing audio, streaming it to the API, and handling transcripts.

Note: OAuth 2.0 authentication is not handled by this SDK. The client must provide an API key or authorization token before using the component.

Installation

Include the SDK in your project by importing the JavaScript module:

npm i @corti/dictation-web

Then import the module like so:

// Import the Corti Dictation SDK
import '@corti/dictation-web';

Alternatively, use a CDN to start quickly (not recommended).

<script
  src="https://cdn.jsdelivr.net/npm/@corti/dictation-web/dist/bundle.min.js"
  preload
  type="module"
></script>

Usage

Demo

🚀 Hosted Demo

Basic Example

<!DOCTYPE html>
<html lang="en">
  <body>
    <corti-dictation></corti-dictation>
    <textarea
      id="transcript"
      placeholder="Transcript will appear here..."
    ></textarea>

    <script>
      import '@corti/dictation-web';
      const dictation = document.getElementById('transcript');
      dictation.setAccessToken('YOUR_AUTH_TOKEN'); // Note: Never hardcode tokens
      // Listen for events
      dictationEl.addEventListener('transcript', e => {
        document.getElementById('transcript').value += e.detail.data.text + ' ';
      });
    </script>
  </body>
</html>

API Reference

Properties

Property Type Description
devices Array List of available recording devices.
selectedDevice Object The selected device used for recording (MediaDeviceInfo).
recordingState String Current state of recording (stopped, recording, initializing and stopping, ).
dictationConfig Object Configuration settings for dictation.
languagesSupported String[] List of all language codes available for use on the SDK.
debug_displayAudio Boolean Overrides any device selection and instead uses getDisplayMedia to stream system audio. Should only be used for debugging

Methods

Method Description
toggleRecording() Starts or stops recording.
setAccessToken(access_token: string) Set the latest access token. This will return the server config.

Events

Event Description
ready Fired once the component is ready.
recording-state-changed Fired when the recording state changes. detail.state contains the new state.
recording-devices-changed Fired when the user switches recording devices or the list of recording devices changes. detail.devices contains the full devices list. detail.selectedDevice contains the current selected device.
transcript Fired when a new transcript is received. detail.data.text contains the transcribed text.
command Fired whenever a new command is detected.
audio-level-changed Fired when the input audio level changes. detail.audioLevel contains the new level.
error Fired on error. detail contains the full error.

Authentication

This SDK does not handle OAuth 2.0 authentication. The client must provide an API key or access token as a string using setAccessToken.

Usage Examples

Explore practical implementations and usage examples in the Demo Folder. These demos can also be run locally.

Styling

UI Overview

The default UI is designed to be slotted into existing applications seamlessly, however, it also supports custom styling as well as theming. The UI can be fully customized using CSS properties. Refer to our Styling Guide for detailed instructions.

License

This SDK is licensed under MIT.

Support

For issues or questions, contact Corti Support at support@corti.ai.