JSPM

Found 55 results for data-extraction

tavily-mcp

MCP server for advanced web search using Tavily

    • v0.2.9
    • 63.80
    • Published

    ofx-data-extractor

    A module written in TypeScript that provides a utility to extract data from an OFX file in Node.js and Browser

    • v1.4.8
    • 57.56
    • Published

    mcp-omnisearch

    MCP server for integrating Omnisearch with LLMs

      • v0.0.8
      • 49.57
      • Published

      agentql-mcp

      Model Context Protocol (MCP) server that integrates AgentQL data extraction capabilities.

      • v1.0.0
      • 46.59
      • Published

      puremd-mcp

      Model Context Protocol (MCP) server for pure.md, the markdown delivery network for LLMs

      • v1.0.3
      • 44.59
      • Published

      crawl4ai

      TypeScript SDK for Crawl4AI REST API - Bun & Node.js compatible

      • v1.0.1
      • 38.97
      • Published

      @aidalinfo/pdf-processor

      Powerful PDF data extraction library powered by AI vision models. Transform PDFs into structured, validated data using TypeScript, Zod, and AI providers like Scaleway and Ollama.

      • v1.0.13
      • 38.84
      • Published

      raggle-js

      JavaScript client for Raggle API

      • v0.2.55
      • 36.34
      • Published

      site-crawl

      A CLI tool to recursively crawl websites and download content

      • v1.1.0
      • 34.53
      • Published

      sate.js

      🍢 Skewer web data perfectly - Smart Indonesian web crawler library

      • v1.1.1
      • 32.49
      • Published

      n8n-nodes-crawl4ai

      n8n nodes for Crawl4AI web crawler and data extraction

      • v0.1.7
      • 32.46
      • Published

      stepwright

      A powerful web scraping library built with Playwright

      • v1.0.2
      • 31.89
      • Published

      revit-cli

      A scalable CLI tool for Revit communication and data manipulation

        • v0.1.1
        • 30.72
        • Published

        @scrapeops/n8n-nodes-scrapeops

        n8n community node for ScrapeOps Proxy, Parser, and Data APIs for web scraping and data extraction

        • v0.2.4
        • 29.73
        • Published

        llm-gen

        A CLI tool to extract text from a static Next.js export and generate llm.txt for LLM ingestion.

        • v1.0.3
        • 28.39
        • Published

        ollama-library-scraper

        A TypeScript library for scraping model information from the Ollama model library website. Extract details, tags, and metadata from ollama.com/library with a simple, type-safe API.

        • v1.0.0
        • 25.51
        • Published

        sanity-font-data-extractor

        Extract and analyze font data from documents in Sanity Studio with detailed typography information

        • v1.0.0
        • 23.55
        • Published

        pdf-tax-reader-cl

        PDF scraping library for Chilean tax documents. Extract emitter name, economic activities, and address from structured PDF documents like 'CARPETA TRIBUTARIA ELECTRÓNICA PARA SOLICITAR CRÉDITOS'

        • v1.0.0
        • 23.12
        • Published

        matter-json

        JSON front-matter parser and combiner. Minimal and perfect

        • v1.0.0
        • 20.16
        • Published

        xscrape

        A flexible and powerful library designed to extract and transform data from HTML documents using user-defined schemas

        • v3.0.4
        • 18.57
        • Published

        @monostate/node-scraper

        Intelligent web scraping with AI Q&A, PDF support and multi-level fallback system - 11x faster than traditional scrapers

        • v1.8.1
        • 17.82
        • Published

        cyber-mysql-openai

        Intelligent natural language to SQL translator with self-correction capabilities using OpenAI and MySQL

        • v0.1.10
        • 17.37
        • Published

        exceltables4js

        Convierte un objeto de tabla Excel a JSON.

        • v3.1.0
        • 17.03
        • Published

        jeopardy-json

        A tool that scrapes and transforms Jeopardy! games from the J! Archive into structured JSON for trivia platforms and developers.

        • v1.5.0
        • 16.31
        • Published

        cparse

        一个基于 Cheerio 的 HTML 解析和数据提取工具库

        • v2.2.0
        • 16.20
        • Published

        dataset-config

        Parse HTML data attributes into a structured object with automatic type conversion.

        • v1.0.0
        • 15.57
        • Published

        aim-guard-mcp

        AIM MCP Server :: Guard and Protect your MCPs & AI Chatting

        • v1.1.5
        • 14.64
        • Published

        web-scrapify

        A simple web scraper that can scrape product details from various e-commerce platforms.

        • v1.0.10
        • 14.32
        • Published

        @mseep/agentql-mcp

        Model Context Protocol (MCP) server that integrates AgentQL data extraction capabilities.

        • v1.0.0
        • 13.55
        • Published

        easy-csv-parser

        easy-csv-parser simplifies CSV data parsing in Node.js. Fetch, extract headers, and convert CSV files from URLs to JavaScript objects and JSON effortlessly. Ideal fordevelopers, data analysis, automation, and more.

        • v1.0.8
        • 13.12
        • Published

        @mcpflow.io/mcp-tavily-mcp-

        Tavily搜索 MCP 服务是一个兼容Model Context Protocol (MCP)协议的高级网络搜索工具,允许AI模型如Claude直接访问互联网上的实时信息。该服务提供两个核心工具:tavily-search用于智能网络搜索,支持按新闻、特定域名筛选;以及tavily-extract用于从网页中提取关键内容。作为专业的搜索解决方案,Tavily MCP 服务支持多种MCP客户端包括Cursor、Cline和Claude Desktop,帮助AI模型获取最新、最相关的网络信息,大幅提升其回答

          • v0.1.6
          • 12.62
          • Published

          js-harvester

          Harvester is a lightweight and highly optimized javascript library for extracting data from the DOM tree. It supports extraction of tag texts with specified types and attributes. it's tiny and has no dependencies and also works with Puppeteer

          • v0.3.14
          • 11.73
          • Published

          @mseep/puremd-mcp

          Model Context Protocol (MCP) server for pure.md, the markdown delivery network for LLMs

          • v1.0.3
          • 11.21
          • Published

          @mseep/tavily-mcp

          MCP server for advanced web search using Tavily

            • v0.1.4
            • 10.68
            • Published

            parse-json-to-csv

            This is a simple Javscript library that converts a JSON object to a CSV file.

              • v1.0.0
              • 10.08
              • Published

              matter-yaml

              YAML front-matter parser and combiner. Minimal and perfect

              • v1.1.0
              • 9.56
              • Published

              matter-toml

              TOML front-matter parser and combiner. Minimal and perfect

              • v1.0.0
              • 9.42
              • Published

              parsera-ts

              Official TypeScript SDK for Parsera.org API - Extract structured data from any webpage

                • v1.0.1
                • 6.07
                • Published

                lusail

                JavaScript implementation of Lusail, a domain-specific language for extracting structured data from HTML

                • v0.8.1
                • 6.07
                • Published

                social-profile-scraping

                A lightweight library for scraping public social media profiles, providing profile information such as usernames, pictures, and more.

                • v1.0.6
                • 5.97
                • Published

                crava

                AI-powered web scraping that extracts structured data as JSON

                  • v1.0.0
                  • 4.59
                  • Published

                  n8n-nodes-crawl4ai-fork

                  n8n nodes for Crawl4AI v0.7.4+ web crawler and data extraction (maintained fork)

                  • v1.8.8
                  • 0.00
                  • Published

                  n8n-nodes-firecrawl-tool

                  n8n node for Firecrawl v2 API - Web scraping, crawling, and data extraction tool for workflows and AI agents

                  • v0.1.0
                  • 0.00
                  • Published

                  web-scraper-ts

                  A powerful and flexible web scraper library built with TypeScript

                  • v1.0.0
                  • 0.00
                  • Published