promptbook
Promptbook: Run AI apps in plain human language across multiple models and platforms
Found 72 results for multimodal
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
React client for Speechly Streaming API
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Polyfill for the Speech Recognition API using Speechly
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
SDK for building AI agents with seamless voice-text context switching
Non-intrusive behavior-based emotion detection SDK (keyboard & mouse) — EmpathAI core.
Zero-dependency MCP server for AI-powered SVG icon generation with multimodal LLM support
Experimental ModelFusion features
🧬 ModelMix - Unified API for Diverse AI LLM.
Model Context Protocol (MCP) server for Lucid App integration with multimodal AI analysis
PDF multimodal conversion MCP tool for Claude Code and Gemini CLI
n8n community node for Google Gemini AI integration with text generation, file upload & analysis, and TTS (Text-to-Speech) support
n8n community node for SiliconFlow AI models - chat completions, vision language models, embeddings, and reranking
The official TypeScript/JavaScript SDK for Channel3 AI Shopping API
A set of react components and hooks to help with multimodal input
A Node.js library harnessing the power of Bard's Large Language Model (LLM) for seamless chat experiences and streamlined accessibility to Google's Gemini. Empower your applications with advanced conversational AI, leveraging Bard's LLM to answer question
MMIR (Mobile Multimodal Interaction and Relay) library
MCP server for Morphik multimodal database
火山引擎即梦AI多模态生成服务MCP工具
Official JavaScript/TypeScript library for the ModelPilot API - OpenAI-compatible interface for intelligent model routing
A powerful Node.js interface for DuckDuckGo AI Chat with advanced configuration, rate limiting, and image support
Device sensor integration for multimodal interactions
React Native specific components and utilities for multimodal UI
Zero-shot multimodal classification SDK - classify text and images with custom labels, no training required
Fork of Google's Gemini CLI by sidx1. CLI tool for accessing Gemini AI with enhancements by sidx1.
Context awareness and memory management for multimodal interactions
CLI tool combining multimodal AI analysis with RawTherapee's engine to generate optimized PP3 profiles for RAW photography. Features automatic histogram analysis for enhanced AI processing.
Multi-modal input fusion engine for simultaneous interaction handling
Promptbook: Run AI apps in plain human language across multiple models and platforms
MCP server with multimodal capabilities - process documents, images, videos, audio using Gemini Pro with 1M context window
Pocket-Sized Multimodal AI for Content Understanding and Generation
Professional AI prompt engineering toolkit with advanced template features, real-time dashboards, conditional logic, template inheritance, live monitoring, OpenRouter integration, and 310+ model support
Browser client for Speechly API
Enterprise-grade AI integration bridge connecting Claude Code, Gemini CLI, and Google AI Studio with intelligent routing and advanced multimodal processing capabilities
Image to LaTeX with Llama 3.2 Vision.
A unified data-ingestion CLI that auto-detects and converts text, image, audio and tabular sources into standardized training datasets
A library for interacting with Google's Generative AI models in real-time
A library to easily integrate various LLM models and vendors into applications, with advanced features.
A native Capacitor plugin that embeds llama.cpp directly into mobile apps, enabling offline AI inference with comprehensive support for text generation, multimodal processing, TTS, LoRA adapters, and more.
A Chakra UI Multi Modal - one modal with multiple, switchable sections
Rate limiter middleware for Express.js that allows very tight limits while providing a seamless experience to the users.
Pocket-Sized Multimodal AI for Content Understanding and Generation
Componente de barra de pesquisa multimodal para React com suporte a texto multilinhas e imagens