Found 1180 results for crawler

This CLI tool will find same domain urls in a web page and requesting them to find even more urls until server crash (or at the end of benchmark). It is used to test maximun capacity of server or finding for glitches that users might encounter.

crawler-lian

A configuration - based crawler framework

hugopoi-webcrawler

Web crawling tool

@jacoblincool/crawler-cli

Jacob's Crawler, the CLI version.

olxba-scraper

A web scraper for the Bosnian listings site olx.ba

@sanv/apify

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

key-crawler

This library provides support for traversing objects and their values while providing information on the traversal state, pathing to target values, and the ability to manipulate said pathing to easily move to related values.

gsutil-crawler

Get product info by barcode

s_spider

A fast crawler cli with pyppteer, this crawler can crawl SPA(single page application)

@cassidyai/crawlee-root

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

craw

a website-crawler library for nodejs

kp-scraper

A web scraper for the Serbian listings site KupujemProdajem

@studiowebux/sitemap

Sitemap plugin

x-ray-build

A helper that build a x-ray based on a schema

@isfeng/bing-dict

A Bing command line dictionary, which obtains the query results of bing dictionary by crawler.

log-crawler

Crawl the log of any command

turtlefly

A crawler framework based on NodeJS.

yt-dude

here is a package for crawling and downloading videos from youtube

dht-peer-crawler

A fast and stable DHT crawler.

scrapping_engine

To scrap the content from the web site

@repsi/enum-ret

A util tool

parkour

Parkour the web like a yamakazi

@akilio/site-data

Website schema based crawler

@liqd-js/useragent

User Agent

limador

Powerful Scraping and Crawling library with anti-scraping, scalability, storage, static/dynamic contents, monitoring UI and more. Ready to deploy on cloud instances or serverless.

croton

Nexstack Nodejs library that provides an Api for obtaining the movies information website.

crawler-request-undefined-message-gone

HTTP request module customized for crawlers.

x-ray-crawler-upgraded

x-ray's crawler

@atlach/html-extract

Get information using the string of the specified rule

just-crawl

crawler

web-scraper-mcp-puppeteer

基于 MCP 的网页爬取服务器，内置 Puppeteer 无头浏览器支持

web-scraper-ts

A powerful and flexible web scraper library built with TypeScript

next-llms-generator

Generate LLM-friendly text files from Next.js applications by crawling sitemaps and extracting content

crawl_fetcher

A web crawler. Automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.

@vladfrangu-dev/crawlee-templates

Templates for the crawlee projects

@vladfrangu-dev/crawlee-jsdom

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.