Found 1180 results for crawler

ikka-flix

Nodejs library that provides an Api for obtaining the movies information from FlixHQ website.

@black_meteor/check-link-status

A simple module, which can use for checking all link status code, given links. Below shows how to use the module

press2blogger

Moving or backing up your Wordpress site to Blogger

@vladfrangu-dev/crawlee-cheerio

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

sleepsecure

A lightweight scraper for SleepCycle's online SleepSecure data.

crawler-request-tt-message-gone

HTTP request module customized for crawlers.

llmoptimizer

Generate an llms.txt summary of your website/docs for LLMs (framework-agnostic with Vite/Next/Nuxt/Astro/Remix helpers).

copha

a general framework for running custom network tasks.

limit-request-promise-native

Rate-limiting/throttling for limit-request-native

@systemfsoftware/apify

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

snapcrawl-express-ssr

Express middleware that serves pre-rendered HTML from SnapCrawl's SSR API to improve SEO for JavaScript apps.

@vladfrangu-dev/crawlee-cli

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

@vladfrangu-dev/crawlee

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

This CLI tool will find same domain urls in a web page and requesting them to find even more urls until server crash (or at the end of benchmark). It is used to test maximun capacity of server or finding for glitches that users might encounter.

crawler-lian

A configuration - based crawler framework

hugopoi-webcrawler

Web crawling tool

@jacoblincool/crawler-cli

Jacob's Crawler, the CLI version.

scrapester

JavaScript SDK for Scrapester API

olxba-scraper

A web scraper for the Bosnian listings site olx.ba

@sanv/apify

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

key-crawler

This library provides support for traversing objects and their values while providing information on the traversal state, pathing to target values, and the ability to manipulate said pathing to easily move to related values.

gsutil-crawler

Get product info by barcode

s_spider

A fast crawler cli with pyppteer, this crawler can crawl SPA(single page application)

@cassidyai/crawlee-root

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

craw

a website-crawler library for nodejs

kp-scraper

A web scraper for the Serbian listings site KupujemProdajem

@studiowebux/sitemap

Sitemap plugin

x-ray-build

A helper that build a x-ray based on a schema

@isfeng/bing-dict

A Bing command line dictionary, which obtains the query results of bing dictionary by crawler.

log-crawler

Crawl the log of any command

turtlefly

A crawler framework based on NodeJS.

@mkusaka/sitemap-crawler

Extract content from sitemap URLs and save as markdown files

yt-dude

here is a package for crawling and downloading videos from youtube

dht-peer-crawler

A fast and stable DHT crawler.

scrapping_engine

To scrap the content from the web site

@repsi/enum-ret

A util tool

parkour

Parkour the web like a yamakazi

@akilio/site-data

Website schema based crawler

@liqd-js/useragent

User Agent

limador

Powerful Scraping and Crawling library with anti-scraping, scalability, storage, static/dynamic contents, monitoring UI and more. Ready to deploy on cloud instances or serverless.

croton

Nexstack Nodejs library that provides an Api for obtaining the movies information website.

crawler-request-undefined-message-gone

HTTP request module customized for crawlers.

x-ray-crawler-upgraded

x-ray's crawler

@atlach/html-extract

Get information using the string of the specified rule

just-crawl

crawler

web-scraper-mcp-puppeteer

基于 MCP 的网页爬取服务器，内置 Puppeteer 无头浏览器支持

web-scraper-ts

A powerful and flexible web scraper library built with TypeScript

next-llms-generator

Generate LLM-friendly text files from Next.js applications by crawling sitemaps and extracting content

crawl_fetcher

A web crawler. Automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.

@vladfrangu-dev/crawlee-templates

Templates for the crawlee projects

@vladfrangu-dev/crawlee-jsdom

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.