JSPM

Found 177 results for crawling

@datasco/sdk

Datasco API SDK for Node.js to collect any data from any website

  • v1.0.4
  • 5.64
  • Published

proxidoor

proxidoor helps you make HTTP requests through a rotating proxy. You can use it for tasks such as web scraping, web crawling, and more.

  • v1.0.3
  • 5.64
  • Published

friday-sdk

Official JavaScript/TypeScript SDK for the Friday API

  • v0.2.2
  • 5.40
  • Published

cspider

Distributed web crawler powered by Headless Chrome

  • v0.0.6
  • 5.31
  • Published

goose-browser-environment

Environment for the Goose parser that allows it to run in a common browser

  • v1.0.4
  • 5.31
  • Published

spider-stealth

A Node.js scraping framework built on puppeteer-extra (to use a headless Chrome/Chromium browser). Has the ability to solve reCaptcha

  • v1.2.2
  • 5.31
  • Published

p4k-api

web scraper for album reviews from pitchfork

  • v1.4.3
  • 5.30
  • Published

nocrawler

Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they are downloaded, asynchronously. Scraping should be simple and fun!

  • v0.0.1
  • 5.30
  • Published

crawling-typer

Transform your text with dynamic typing animations! crawling-typer lets you display an array of strings one at a time, each with its own color. Customize typing speed, delete speed, and pauses between strings. Enjoy full control with loop counts, post-loo

  • v1.1.1
  • 5.30
  • Published

magnet-getter

An API to get magnet links using Puppeteer.

  • v1.1.0
  • 4.35
  • Published

spider-stealth-core

A Node.js scraping framework built on puppeteer-extra (to use a headless Chrome/Chromium browser). Has the ability to solve reCaptcha. The core module without browser installation

  • v1.3.4
  • 4.27
  • Published

dynamic-crawling

Aims to run CRAWLING routines from a JSON file using XPath, accepting for each step a callback function that receives the value and can pass it on to the next step.

  • v1.0.2
  • 4.20
  • Published

planisphere

A straightforward sitemap generator written in TypeScript.

  • v1.0.1
  • 4.20
  • Published

robinbot

robin web crawling engine with nodejs

  • v0.9.0
  • 4.15
  • Published

imdb-scrapi

An API to get data off of IMDB using Puppeteer.

  • v1.0.2
  • 4.08
  • Published

declarative-scraper

Simple & Human-Friendly HTML Scraper with JSON-LD support

  • v0.1.1
  • 4.08
  • Published

miniscraper

Minimalist Node.js web scraper and crawler working with under-the-hood JSDOM

  • v0.3.2
  • 4.08
  • Published

fiend

The most advanced web crawler for JavaScript

  • v0.1.0
  • 4.02
  • Published

@stacksleuth/browser-agent

StackSleuth in-house browser automation agent for debugging and user simulation

  • v0.2.1
  • 4.01
  • Published

keyworm

keyword mention crawler

  • v0.1.1
  • 4.01
  • Published

style-crawl

Package to find style links from the site you want

  • v1.1.2
  • 4.01
  • Published

crawler-mod

based on node-crawler

  • v0.0.1
  • 2.50
  • Published

instagram-crawling

Simple Instagram Crawling without using public API

  • v1.1.2
  • 2.50
  • Published

jason-the-miner

Harvesting data at the <html> mine.

  • v1.1.1
  • 2.46
  • Published

skrap

Easily scrape web pages by providing JSON recipes

  • v0.1.1
  • 2.46
  • Published

crawline

Web crawler

  • v0.0.0
  • 2.43
  • Published

node-pool-scraper

Node.js web scraping utility powered by puppeteer pool

  • v0.1.6
  • 2.37
  • Published

node-crawling-framework

NodeJs crawling & scraping framework heavily inspired by Scrapy (Python)

  • v0.0.1-alpha.2
  • 2.35
  • Published

ccht

A simple command-line tool to crawl and test your website

  • v0.1.2
  • 2.35
  • Published

wight-backend-web

A Wight backend for fetching static web pages

  • v0.1.0
  • 2.35
  • Published

spamlet

spamlet is an efficient and simple crawler for playwright

  • v0.1.6
  • 2.35
  • Published

crt-scrapper

Easily create a scraper api with the @web/scrapper library, which includes a scraper and advanced events for your website.

  • v1.0.4
  • 2.35
  • Published

gumo

A web-crawler and scraper that extracts data from a family of nested dynamic webpages with added enhancements to assist in knowledge mining applications.

  • v1.0.7
  • 0.00
  • Published

hcr

Easy To Use Web Crawler

  • v1.4.1
  • 0.00
  • Published

@subtitles/providers

Providers are the core of applications, where the subtitles are collected. Each provider exports a unique strategy for gathering data. From legendastv's web scraping from opensubtitle API usage, you can collect subtitles from your favorite tv shows and mo

  • v0.3.0-beta.2
  • 0.00
  • Published

press2blogger

Moving or backing up your Wordpress site to Blogger

  • v1.0.3
  • 0.00
  • Published

parkour

Parkour the web like a yamakazi

  • v1.0.0
  • 0.00
  • Published

ig-scrap-cache

Scrape and cache Instagram data using Redis

  • v3.0.0
  • 0.00
  • Published

n8n-nodes-firecrawl-tool

n8n node for Firecrawl v2 API - Web scraping, crawling, and data extraction tool for workflows and AI agents

  • v0.1.2
  • 0.00
  • Published

sitemaps-getter

A tool to get sitemaps from websites and crawl them

  • v1.0.3
  • 0.00
  • Published

nstock

naver stock data crawler

  • v0.1.0-beta
  • 0.00
  • Published

malkovich-malkovich

A lightweight and simple API for web crawling built on chromium puppeteer

  • v0.0.1
  • 0.00
  • Published