@promptbook/utils
Promptbook: Run AI apps in plain human language across multiple models and platforms
Found 73 results for multimodal
Promptbook: Run AI apps in plain human language across multiple models and platforms
The TypeScript library for building AI applications.
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
JavaScript client for Speechly Streaming API
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
React client for Speechly Streaming API
Polyfill for the Speech Recognition API using Speechly
Promptbook: Run AI apps in plain human language across multiple models and platforms
SDK for building AI agents with seamless voice-text context switching
Non-intrusive behavior-based emotion detection SDK (keyboard & mouse) — EmpathAI core.
Zero-dependency MCP server for AI-powered SVG icon generation with multimodal LLM support
🧬 ModelMix - Unified API for Diverse AI LLM.
Experimental ModelFusion features
Model Context Protocol (MCP) server for Lucid App integration with multimodal AI analysis
PDF multimodal conversion MCP tool for Claude Code and Gemini CLI
n8n community node for Google Gemini AI integration with text generation, file upload & analysis, and TTS (Text-to-Speech) support
The official TypeScript/JavaScript SDK for Channel3 AI Shopping API
n8n community node for SiliconFlow AI models - chat completions, vision language models, embeddings, and reranking
MCP server for Morphik multimodal database
A Node.js library harnessing the power of Bard's Large Language Model (LLM) for seamless chat experiences and streamlined accessibility to Google's Gemini. Empower your applications with advanced conversational AI, leveraging Bard's LLM to answer question
MMIR (Mobile Multimodal Interaction and Relay) library
火山引擎即梦AI多模态生成服务MCP工具
Official JavaScript/TypeScript library for the ModelPilot API - OpenAI-compatible interface for intelligent model routing
A native Capacitor plugin that embeds llama.cpp directly into mobile apps, enabling offline AI inference with comprehensive support for text generation, multimodal processing, TTS, LoRA adapters, and more.
A powerful Node.js interface for DuckDuckGo AI Chat with advanced configuration, rate limiting, and image support
Device sensor integration for multimodal interactions
React Native specific components and utilities for multimodal UI
Context awareness and memory management for multimodal interactions
Fork of Google's Gemini CLI by sidx1. CLI tool for accessing Gemini AI with enhancements by sidx1.
Multi-modal input fusion engine for simultaneous interaction handling
CLI tool combining multimodal AI analysis with RawTherapee's engine to generate optimized PP3 profiles for RAW photography. Features automatic histogram analysis for enhanced AI processing.
Promptbook: Run AI apps in plain human language across multiple models and platforms
MCP server with multimodal capabilities - process documents, images, videos, audio using Gemini Pro with 1M context window
A set of react components and hooks to help with multimodal input
Zero-shot multimodal classification SDK - classify text and images with custom labels, no training required
Professional AI prompt engineering toolkit with advanced template features, real-time dashboards, conditional logic, template inheritance, live monitoring, OpenRouter integration, and 310+ model support
Pocket-Sized Multimodal AI for Content Understanding and Generation
Browser client for Speechly API
Enterprise-grade AI integration bridge connecting Claude Code, Gemini CLI, and Google AI Studio with intelligent routing and advanced multimodal processing capabilities
Image to LaTeX with Llama 3.2 Vision.
A unified data-ingestion CLI that auto-detects and converts text, image, audio and tabular sources into standardized training datasets
A library for interacting with Google's Generative AI models in real-time
A library to easily integrate various LLM models and vendors into applications, with advanced features.
JavaScript SDK for NexusAI - AI Agent Platform for Businesses
A Chakra UI Multi Modal - one modal with multiple, switchable sections
Rate limiter middleware for Express.js that allows very tight limits while providing a seamless experience to the users.
Pocket-Sized Multimodal AI for Content Understanding and Generation
Componente de barra de pesquisa multimodal para React com suporte a texto multilinhas e imagens