JSPM

Found 72 results for multimodal

promptbook

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 44.55
  • Published

@promptbook/javascript

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 42.73
  • Published

@promptbook/ollama

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 42.55
  • Published

@promptbook/wizard

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 42.31
  • Published

@promptbook/browser

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 42.28
  • Published

@promptbook/anthropic-claude

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 41.68
  • Published

@promptbook/cli

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 41.56
  • Published

@promptbook/deepseek

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 41.07
  • Published

@promptbook/website-crawler

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 40.81
  • Published

ptbk

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 40.15
  • Published

@promptbook/components

Promptbook: Run AI apps in plain human language across multiple models and platforms

  • v0.100.0-45
  • 39.41
  • Published

contextual-agent-sdk

SDK for building AI agents with seamless voice-text context switching

  • v1.3.2
  • 38.98
  • Published

empathai-core

Non-intrusive behavior-based emotion detection SDK (keyboard & mouse) — EmpathAI core.

  • v0.1.8
  • 35.79
  • Published

icon-generator-mcp

Zero-dependency MCP server for AI-powered SVG icon generation with multimodal LLM support

  • v0.5.0
  • 34.70
  • Published

modelmix

🧬 ModelMix - Unified API for Diverse AI LLM.

  • v3.8.4
  • 32.04
  • Published

lucid-mcp-server

Model Context Protocol (MCP) server for Lucid App integration with multimodal AI analysis

  • v0.1.5
  • 31.47
  • Published

botrun-pdf-multimodal

PDF multimodal conversion MCP tool for Claude Code and Gemini CLI

  • v1.0.2
  • 30.58
  • Published

n8n-nodes-gemini-ai

n8n community node for Google Gemini AI integration with text generation, file upload & analysis, and TTS (Text-to-Speech) support

  • v0.6.8
  • 29.78
  • Published

n8n-nodes-siliconflow

n8n community node for SiliconFlow AI models - chat completions, vision language models, embeddings, and reranking

  • v1.3.4
  • 26.70
  • Published

channel3-sdk

The official TypeScript/JavaScript SDK for Channel3 AI Shopping API

  • v1.0.1
  • 26.66
  • Published

bard-api-node

A Node.js library harnessing the power of Bard's Large Language Model (LLM) for seamless chat experiences and streamlined accessibility to Google's Gemini. Empower your applications with advanced conversational AI, leveraging Bard's LLM to answer question

  • v2.1.0
  • 24.62
  • Published

mmir-lib

MMIR (Mobile Multimodal Interaction and Relay) library

  • v7.0.1
  • 24.24
  • Published

@morphik/mcp

MCP server for Morphik multimodal database

  • v1.0.12
  • 23.82
  • Published

jimeng-ai-mcp

火山引擎即梦AI多模态生成服务MCP工具

  • v1.0.14
  • 22.84
  • Published

modelpilot

Official JavaScript/TypeScript library for the ModelPilot API - OpenAI-compatible interface for intelligent model routing

  • v1.0.0
  • 22.60
  • Published

duckduckgo-chat-interface

A powerful Node.js interface for DuckDuckGo AI Chat with advanced configuration, rate limiting, and image support

  • v1.1.5
  • 20.36
  • Published

@multiface.js/sensors

Device sensor integration for multimodal interactions

    • v1.0.5
    • 19.96
    • Published

    zerolabel

    Zero-shot multimodal classification SDK - classify text and images with custom labels, no training required

    • v1.0.18
    • 18.12
    • Published

    gemini-cli-sidx1fork

    Fork of Google's Gemini CLI by sidx1. CLI tool for accessing Gemini AI with enhancements by sidx1.

    • v1.0.3
    • 18.10
    • Published

    @multiface.js/context

    Context awareness and memory management for multimodal interactions

      • v1.0.5
      • 17.46
      • Published

      ai-pp3

      CLI tool combining multimodal AI analysis with RawTherapee's engine to generate optimized PP3 profiles for RAW photography. Features automatic histogram analysis for enhanced AI processing.

      • v2.1.2
      • 17.35
      • Published

      @multiface.js/fusion

      Multi-modal input fusion engine for simultaneous interaction handling

        • v1.0.5
        • 17.32
        • Published

        @promptbook/wizzard

        Promptbook: Run AI apps in plain human language across multiple models and platforms

        • v0.94.0
        • 16.17
        • Published

        gemini-multimodal-mcp

        MCP server with multimodal capabilities - process documents, images, videos, audio using Gemini Pro with 1M context window

        • v1.1.4
        • 15.03
        • Published

        @unum-cloud/uform

        Pocket-Sized Multimodal AI for Content Understanding and Generation

          • v3.1.2
          • 11.92
          • Published

          @callmedayz/ai-prompt-toolkit

          Professional AI prompt engineering toolkit with advanced template features, real-time dashboards, conditional logic, template inheritance, live monitoring, OpenRouter integration, and 310+ model support

          • v2.6.2
          • 11.22
          • Published

          claude-gemini-multimodal-bridge

          Enterprise-grade AI integration bridge connecting Claude Code, Gemini CLI, and Google AI Studio with intelligent routing and advanced multimodal processing capabilities

          • v1.0.4
          • 10.83
          • Published

          llama-latex

          Image to LaTeX with Llama 3.2 Vision.

          • v0.0.4
          • 10.67
          • Published

          unimodaly-ingest

          A unified data-ingestion CLI that auto-detects and converts text, image, audio and tabular sources into standardized training datasets

          • v1.0.0
          • 8.49
          • Published

          google-genai-live-lib

          A library for interacting with Google's Generative AI models in real-time

            • v0.1.4
            • 6.98
            • Published

            llmplug

            A library to easily integrate various LLM models and vendors into applications, with advanced features.

              • v0.1.0
              • 0.00
              • Published

              llama-cpp-capacitor

              A native Capacitor plugin that embeds llama.cpp directly into mobile apps, enabling offline AI inference with comprehensive support for text generation, multimodal processing, TTS, LoRA adapters, and more.

              • v0.0.3
              • 0.00
              • Published

              chakra-multi-modal

              A Chakra UI Multi Modal - one modal with multiple, switchable sections

              • v1.0.1
              • 0.00
              • Published

              rate-limiter-multimodal

              Rate limiter middleware for Express.js that allows very tight limits while providing a seamless experience to the users.

              • v1.0.1
              • 0.00
              • Published

              @ashvardanian/uform

              Pocket-Sized Multimodal AI for Content Understanding and Generation

                • v2.0.2
                • 0.00
                • Published

                multimodal-search-bar

                Componente de barra de pesquisa multimodal para React com suporte a texto multilinhas e imagens

                • v0.1.0
                • 0.00
                • Published