@promptbook/utils
Promptbook: Run AI apps in plain human language across multiple models and platforms
Found 72 results for multimodal
Promptbook: Run AI apps in plain human language across multiple models and platforms
The TypeScript library for building AI applications.
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
JavaScript client for Speechly Streaming API
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
React client for Speechly Streaming API
Promptbook: Run AI apps in plain human language across multiple models and platforms
Polyfill for the Speech Recognition API using Speechly
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
SDK for building AI agents with seamless voice-text context switching
Zero-dependency MCP server for AI-powered SVG icon generation with multimodal LLM support
Non-intrusive behavior-based emotion detection SDK (keyboard & mouse) — EmpathAI core.
Experimental ModelFusion features
🧬 ModelMix - Unified API for Diverse AI LLM.
Model Context Protocol (MCP) server for Lucid App integration with multimodal AI analysis
PDF multimodal conversion MCP tool for Claude Code and Gemini CLI
n8n community node for Google Gemini AI integration with text generation, file upload & analysis, and TTS (Text-to-Speech) support
The official TypeScript/JavaScript SDK for Channel3 AI Shopping API
n8n community node for SiliconFlow AI models - chat completions, vision language models, embeddings, and reranking
A Node.js library harnessing the power of Bard's Large Language Model (LLM) for seamless chat experiences and streamlined accessibility to Google's Gemini. Empower your applications with advanced conversational AI, leveraging Bard's LLM to answer question
A set of react components and hooks to help with multimodal input
MMIR (Mobile Multimodal Interaction and Relay) library
MCP server for Morphik multimodal database
Official JavaScript/TypeScript library for the ModelPilot API - OpenAI-compatible interface for intelligent model routing
火山引擎即梦AI多模态生成服务MCP工具
A powerful Node.js interface for DuckDuckGo AI Chat with advanced configuration, rate limiting, and image support
React Native specific components and utilities for multimodal UI
Device sensor integration for multimodal interactions
Zero-shot multimodal classification SDK - classify text and images with custom labels, no training required
Fork of Google's Gemini CLI by sidx1. CLI tool for accessing Gemini AI with enhancements by sidx1.
Multi-modal input fusion engine for simultaneous interaction handling
Context awareness and memory management for multimodal interactions
CLI tool combining multimodal AI analysis with RawTherapee's engine to generate optimized PP3 profiles for RAW photography. Features automatic histogram analysis for enhanced AI processing.
Promptbook: Run AI apps in plain human language across multiple models and platforms
MCP server with multimodal capabilities - process documents, images, videos, audio using Gemini Pro with 1M context window
Pocket-Sized Multimodal AI for Content Understanding and Generation
Professional AI prompt engineering toolkit with advanced template features, real-time dashboards, conditional logic, template inheritance, live monitoring, OpenRouter integration, and 310+ model support
Enterprise-grade AI integration bridge connecting Claude Code, Gemini CLI, and Google AI Studio with intelligent routing and advanced multimodal processing capabilities
Image to LaTeX with Llama 3.2 Vision.
Browser client for Speechly API
A unified data-ingestion CLI that auto-detects and converts text, image, audio and tabular sources into standardized training datasets
A library for interacting with Google's Generative AI models in real-time
A library to easily integrate various LLM models and vendors into applications, with advanced features.
A native Capacitor plugin that embeds llama.cpp directly into mobile apps, enabling offline AI inference with comprehensive support for text generation, multimodal processing, TTS, LoRA adapters, and more.
A Chakra UI Multi Modal - one modal with multiple, switchable sections
Rate limiter middleware for Express.js that allows very tight limits while providing a seamless experience to the users.
Pocket-Sized Multimodal AI for Content Understanding and Generation
Componente de barra de pesquisa multimodal para React com suporte a texto multilinhas e imagens