JSPM

@fr3k/666fr3k

1.0.0
  • ESM via JSPM
  • ES Module Entrypoint
  • Export Map
  • Keywords
  • License
  • Repository URL
  • TypeScript Types
  • README
  • Created
  • Published
  • Downloads 24
  • Score
    100M100P100Q82048F
  • License MIT

AI-powered continuous speech recognition and synthesis loop with web UI - Text-to-Speech and Speech-to-Text testing framework

Package Exports

  • @fr3k/666fr3k
  • @fr3k/666fr3k/index.js

This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (@fr3k/666fr3k) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.

Readme

666FR3K

πŸ”₯ AI-Powered Continuous Speech Recognition & Synthesis Loop System πŸ”₯

A comprehensive Text-to-Speech (TTS) and Speech-to-Text (STT) testing framework with continuous loopback capabilities, CLI tools, and web UI.

npm version License: MIT

🎯 Features

  • πŸ”Š Text-to-Speech: Convert text to natural speech using gTTS (Google Text-to-Speech)
  • 🎀 Speech-to-Text: Real-time transcription using Google Speech Recognition
  • πŸ”„ Continuous Loop: Automated TTSβ†’Audioβ†’STT testing pipeline
  • 🎧 Auto-Listener: Continuous microphone monitoring with voice command support
  • 🌐 Web UI: Beautiful responsive interface for all features
  • πŸ§ͺ Advanced Testing: Complex vocabulary and technical terminology tests
  • πŸ“Š Analytics: Performance metrics and accuracy tracking
  • ⚑ Fast: 1.6x speed playback for efficient testing

πŸ“¦ Installation

Quick Start (npx)

npx 666fr3k --help

Global Installation

npm install -g 666fr3k

Local Installation

npm install 666fr3k

πŸ”§ Prerequisites

Node.js Dependencies

All Node.js dependencies are installed automatically with the package.

Python Dependencies

Install Python dependencies:

# Automatic installation
npx 666fr3k install-deps

# Or manual installation
pip3 install gtts SpeechRecognition pyaudio

System Requirements

  • Node.js: v16.0.0 or higher
  • Python: 3.7 or higher
  • Audio: ffmpeg, ffplay (for audio playback)
  • Microphone: Required for STT features

Install System Audio Tools

Ubuntu/Debian:

sudo apt-get install ffmpeg portaudio19-dev python3-pyaudio

macOS:

brew install ffmpeg portaudio

Windows:

choco install ffmpeg

πŸš€ Usage

CLI Commands

1. Auto-Listener (Continuous Speech Recognition)

Listen continuously for speech and transcribe in real-time:

# Basic listening (text output only)
npx 666fr3k listen

# With voice responses
npx 666fr3k listen --speak

# Verbose mode
npx 666fr3k listen --speak --verbose

Voice Commands:

  • "stop listening" - Exit the listener
  • "status" - Show session statistics
  • "help" - Show available commands

2. TTS→STT Loop Testing

Run continuous loop tests to verify the pipeline:

# Run 5 cycles at 1.6x speed (default)
npx 666fr3k loop

# Custom cycles and speed
npx 666fr3k loop --cycles 10 --speed 1.8

# Run 20 cycles at normal speed
npx 666fr3k loop -c 20 -s 1.0

3. Verification Tests

Run comprehensive TTS and STT tests:

# Basic tests
npx 666fr3k test

# Advanced tests with complex vocabulary
npx 666fr3k test --advanced

# Custom loop count
npx 666fr3k test --loop 10

4. Web UI

Launch the web interface:

# Start on default port 3666
npx 666fr3k web

# Custom port
npx 666fr3k web --port 8080

Then open: http://localhost:3666

Web UI Features

The web interface provides:

  • TTS Panel: Convert text to speech with speed control
  • STT Panel: Live microphone transcription
  • Loop Testing: Run automated TTSβ†’STT cycles
  • Statistics Dashboard: Track usage and accuracy metrics
  • Real-time Updates: WebSocket-based live transcription

πŸ“š API Usage

JavaScript/Node.js

const { spawn } = require('child_process');

// Run TTS
const tts = spawn('npx', ['666fr3k', 'loop', '--cycles', '3']);
tts.stdout.on('data', (data) => {
  console.log(data.toString());
});

// Start auto-listener
const listener = spawn('npx', ['666fr3k', 'listen', '--speak']);

Python Integration

import subprocess

# Run loop test
result = subprocess.run(['npx', '666fr3k', 'loop', '--cycles', '5'])

# Start listener
listener = subprocess.Popen(['npx', '666fr3k', 'listen'])

HTTP API (Web Server)

# Start server
npx 666fr3k web --port 3666

TTS Endpoint:

curl -X POST http://localhost:3666/api/tts \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello world", "speed": 1.6}'

STT Endpoint:

curl -X POST http://localhost:3666/api/stt \
  -H "Content-Type: application/json" \
  -d '{"audioData": "<base64_audio>"}'

πŸ§ͺ Testing Examples

Basic Loop Test

npx 666fr3k loop --cycles 5

Output:

πŸ”„ CYCLE #1
πŸ“’ Input: "Hello, this is cycle number one..."
πŸ”Š Playing audio at 1.6x speed... βœ“
🎧 Transcribing... βœ“
πŸ“ Output: "hello this is cycle number one"
πŸ“Š Accuracy: 80.0%
βœ… SUCCESS

Advanced Vocabulary Test

npx 666fr3k test --advanced

Tests complex terminology including:

  • Technology & AI concepts
  • Medical & scientific terms
  • Legal & constitutional language
  • Quantum physics & mathematics
  • Economic & financial theory
  • Philosophical concepts
  • Cybersecurity terminology

Continuous Listening Session

npx 666fr3k listen --speak
🎀 666FR3K AUTO-LISTENER ACTIVATED
πŸ“’ I'm now listening continuously...
πŸ’‘ Say 'stop listening' to exit

[12:34:56] 🎀 Heard: "Hello, can you hear me?"
πŸ’¬ You said: Hello, can you hear me?

🎧 Listening...

πŸ“Š Performance Metrics

Test Results

  • Basic Loops: 80-100% accuracy on simple phrases
  • Complex Vocabulary: 16-72% accuracy on technical terms
  • Speed: 1.6x playback maintains 70%+ accuracy
  • Latency: <2 seconds per TTSβ†’STT cycle

Tested Domains

βœ… General conversation (90%+ accuracy) βœ… Technology terminology (80%+ accuracy) βœ… Economic concepts (72% accuracy) ⚠️ Medical terms (37% accuracy) ⚠️ Quantum physics (17% accuracy)

πŸ”§ Configuration

Environment Variables

# Set custom port for web server
export PORT=8080

# Python interpreter path
export PYTHON_BIN=/usr/bin/python3

Package Configuration

Edit package.json to customize:

{
  "scripts": {
    "start": "node index.js",
    "web": "node server.js",
    "test": "node test.js"
  }
}

πŸ› Troubleshooting

Microphone Not Working

# Test microphone access
python3 -c "import speech_recognition as sr; print(sr.Microphone.list_microphone_names())"

Audio Playback Issues

# Check ffplay installation
ffplay -version

# Test audio output
ffplay -nodisp test.mp3

Python Dependencies

# Reinstall dependencies
pip3 install --upgrade gtts SpeechRecognition pyaudio

# On macOS, if pyaudio fails:
brew install portaudio
pip3 install --global-option='build_ext' --global-option='-I/usr/local/include' --global-option='-L/usr/local/lib' pyaudio

Google API Rate Limits

If you encounter API timeouts:

  • Add delays between requests
  • Consider using offline STT alternatives
  • Check network connectivity

πŸ“– Command Reference

Command Description Options
listen Start continuous listening --speak, --verbose
loop Run TTS→STT loop test --cycles N, --speed X
test Run verification tests --advanced, --loop N
web Start web UI server --port N
install-deps Install Python dependencies -

🎨 Web UI Screenshots

Main Dashboard:

  • TTS Panel with text input and speed control
  • STT Panel with live transcription
  • Loop testing interface
  • Real-time statistics dashboard

🀝 Contributing

Contributions welcome! Please follow these guidelines:

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Add tests
  5. Submit a pull request

πŸ“ License

MIT License - see LICENSE file for details

πŸ‘€ Author

fr3k

πŸ™ Acknowledgments

  • gTTS: Google Text-to-Speech library
  • SpeechRecognition: Python speech recognition library
  • Google Cloud: Speech API
  • Express.js: Web server framework

πŸ“ˆ Roadmap

  • Offline STT support
  • Multiple TTS voice options
  • Real-time audio streaming
  • Docker containerization
  • Multi-language support
  • Custom wake words
  • Voice activity detection improvements
  • Mobile app

πŸ”₯ Quick Examples

Example 1: Quick Test

npx 666fr3k loop --cycles 3

Example 2: Voice Assistant Mode

npx 666fr3k listen --speak

Say "Hello" and hear the system respond!

Example 3: Web Interface

npx 666fr3k web
# Open http://localhost:3666

Example 4: Advanced Testing

npx 666fr3k test --advanced

Tests complex vocabulary across 7 domains!


Made with πŸ”₯ by fr3k

Version: 1.0.0 Last Updated: 2025-11-02