JSPM

Found 1180 results for crawler

ikka-flix

Node.js library that provides an API for obtaining movie information from the FlixHQ website.

    • v1.0.1
    • 0.00
    • Published

    @black_meteor/check-link-status

    A simple module for checking the status code of every link in a given list of links.

    • v1.0.2
    • 0.00
    • Published

    gbif-crawler

    Crawler for the GBIF API

      • v1.0.1
      • 0.00
      • Published

      press2blogger

      Moving or backing up your Wordpress site to Blogger

      • v1.0.3
      • 0.00
      • Published

      @vladfrangu-dev/crawlee-cheerio

      The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

      • v3.3.0
      • 0.00
      • Published

      sleepsecure

      A lightweight scraper for SleepCycle's online SleepSecure data.

      • v0.0.2
      • 0.00
      • Published

      llmoptimizer

      Generate an llms.txt summary of your website/docs for LLMs (framework-agnostic with Vite/Next/Nuxt/Astro/Remix helpers).

      • v1.1.0
      • 0.00
      • Published

      copha

      A general framework for running custom network tasks.

      • v0.0.9
      • 0.00
      • Published

      wadejs-wzh

      A crawler for Zhihu

        • v1.0.2
        • 0.00
        • Published

        @systemfsoftware/apify

        The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

        • v3.2.5-patch.0
        • 0.00
        • Published

        snapcrawl-express-ssr

        Express middleware that serves pre-rendered HTML from SnapCrawl's SSR API to improve SEO for JavaScript apps.

          • v1.0.2
          • 0.00
          • Published

          @vladfrangu-dev/crawlee-cli

          The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

          • v3.2.0
          • 0.00
          • Published

          @vladfrangu-dev/crawlee

          The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

          • v3.3.0
          • 0.00
          • Published

          siege-crawler

          This CLI tool finds same-domain URLs in a web page and requests them to find even more URLs until the server crashes (or the benchmark ends). It is used to test a server's maximum capacity or to find glitches that users might encounter.

          • v1.1.7
          • 0.00
          • Published

          crawler-lian

          A configuration-based crawler framework

            • v1.0.7
            • 0.00
            • Published

            scrapester

            JavaScript SDK for Scrapester API

            • v0.1.0
            • 0.00
            • Published

            olxba-scraper

            A web scraper for the Bosnian listings site olx.ba

              • v1.0.0
              • 0.00
              • Published

              @sanv/apify

              The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

              • v1.1.4
              • 0.00
              • Published

              key-crawler

              This library provides support for traversing objects and their values while providing information on the traversal state, pathing to target values, and the ability to manipulate said pathing to easily move to related values.

              • v1.2.0
              • 0.00
              • Published

              s_spider

              A fast crawler CLI built with pyppteer; it can crawl SPAs (single-page applications)

              • v1.0.2
              • 0.00
              • Published

              @cassidyai/crawlee-root

              The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

              • v3.7.3
              • 0.00
              • Published

              craw

              A website crawler library for Node.js

                • v1.0.0
                • 0.00
                • Published

                kp-scraper

                A web scraper for the Serbian listings site KupujemProdajem

                  • v1.2.0
                  • 0.00
                  • Published

                  x-ray-build

                  A helper that builds an x-ray based on a schema

                  • v1.2.0
                  • 0.00
                  • Published

                  @isfeng/bing-dict

                  A command-line Bing dictionary that obtains query results from Bing Dictionary by crawling.

                    • v1.1.0
                    • 0.00
                    • Published

                    log-crawler

                    Crawl the log of any command

                    • v1.0.0
                    • 0.00
                    • Published

                    turtlefly

                    A crawler framework based on NodeJS.

                    • v2.1.0
                    • 0.00
                    • Published

                    yt-dude

                    A package for crawling and downloading videos from YouTube

                    • v1.0.13
                    • 0.00
                    • Published

                    scrapping_engine

                    Scrapes content from a website

                      • v1.0.0
                      • 0.00
                      • Published

                      parkour

                      Parkour the web like a yamakazi

                      • v1.0.0
                      • 0.00
                      • Published

                      limador

                      Powerful Scraping and Crawling library with anti-scraping, scalability, storage, static/dynamic contents, monitoring UI and more. Ready to deploy on cloud instances or serverless.

                      • v0.0.3
                      • 0.00
                      • Published

                      croton

                      Nexstack Node.js library that provides an API for obtaining movie information from a website.

                      • v2.1.0
                      • 0.00
                      • Published

                      @atlach/html-extract

                      Extract information using a specified rule string

                      • v0.1.0
                      • 0.00
                      • Published

                      web-scraper-mcp-puppeteer

                      An MCP-based web scraping server with built-in Puppeteer headless browser support

                        • v1.0.3
                        • 0.00
                        • Published

                        web-scraper-ts

                        A powerful and flexible web scraper library built with TypeScript

                        • v1.0.0
                        • 0.00
                        • Published

                        next-llms-generator

                        Generate LLM-friendly text files from Next.js applications by crawling sitemaps and extracting content

                        • v0.1.0
                        • 0.00
                        • Published

                        crawl_fetcher

                        A web crawler. Automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.

                          • v1.0.0
                          • 0.00
                          • Published

                          @vladfrangu-dev/crawlee-jsdom

                          The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

                          • v3.3.0
                          • 0.00
                          • Published

                          @ryzl/hltv

                          The unofficial HLTV Node.js API

                          • v3.3.4
                          • 0.00
                          • Published

                          hatcher

                          Provides APIs by simple configuration.

                          • v0.2.1
                          • 0.00
                          • Published

                          jasg

                          Just Another Sitemap Generator

                          • v0.0.11
                          • 0.00
                          • Published

                          pathik

                          High-performance web crawler implemented in Go with JavaScript bindings

                            • v0.3.11
                            • 0.00
                            • Published

                            @ugenu.io/crawler

                            A module for scraping with either an Electron browser (webview, BrowserWindow, BrowserView) or HTTP requests (axios)

                            • v1.1.2
                            • 0.00
                            • Published

                            puppeteer-prerender-next

                            Fetch the pre-rendered content, meta tags, links, and Open Graph data of a webpage, especially for Single-Page Applications (SPAs)

                            • v0.15.0
                            • 0.00
                            • Published

                            malkovich-malkovich

                            A lightweight and simple API for web crawling built on chromium puppeteer

                            • v0.0.1
                            • 0.00
                            • Published

                            markdownify-cli

                            Convert a website to static markdown.

                            • v1.0.4
                            • 0.00
                            • Published

                            @sesamestrong/json-scraper

                            A tool to allow for quick running of JSON-based scrapers using request-promise and jsonframe-cheerio.

                              • v4.5.0
                              • 0.00
                              • Published

                              crawler-main

                              A search tool and crawler for Wikipedia articles

                              • v0.0.1
                              • 0.00
                              • Published

                              scdl-node

                              A SoundCloud downloader made with Node.js and TypeScript

                                • v1.0.1
                                • 0.00
                                • Published

                                is-google

                                Verify that a request is from Google crawlers using Google's DNS verification steps

                                • v1.0.2
                                • 0.00
                                • Published

                                is-baidu

                                Verify that a request is from Baidu crawlers using Baidu's DNS verification

                                • v1.0.2
                                • 0.00
                                • Published

                                map-tiles-crawler

                                Memory efficient and synchronous downloader of map tiles. Allows for a fast and easy approach to make map tiles (from a WMS) available offline.

                                • v1.0.2
                                • 0.00
                                • Published

                                molehill

                                Webcrawler for data mining and unification purposes

                                • v0.2.9
                                • 0.00
                                • Published

                                scrano

                                A Node.js crawler framework modeled on Scrapy

                                • v0.0.19
                                • 0.00
                                • Published

                                @pingid/shears

                                Functional web scraping in TypeScript

                                • v0.0.0-alpha.4
                                • 0.00
                                • Published

                                guozaoke-mcp-server

                                An MCP (Model Context Protocol) server for retrieving information from the Guozaoke (过早客) forum

                                • v1.1.0
                                • 0.00
                                • Published