cheerio
The fast, flexible & elegant library for parsing and manipulating HTML and XML.
Found 1975 results for scraper
A specification compliant robots.txt parser with wildcard (*) matching support.
JavaScript SDK for Firecrawl API
Apify API client for JavaScript
Node.js scraper module for Open Graph and Twitter Card info
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
A library to easily scrape metadata from an article on the web using Open Graph, JSON+LD, regular HTML metadata, and series of fallbacks.
Templates for the crawlee projects
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
A simple YouTube video downloader for audio and video formats with resolution and quality options.
Browserless scraper module
Lazy way to download images from Duck Duck Go search results in bulk
The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
scrapes app data from google play store
Download website to a local directory (including all css, images, js, etc.)
A simple and efficient package to scrape and parse captions (subtitles) from YouTube videos, supporting both user-submitted and auto-generated captions with language options.
Request a url and scrape the metadata from its HTML using Node.js or the browser.
Scraper for oEmbed, Twitter Cards and Open Graph metadata - fast and Promise-based
A fully managed unofficial TikTok API with OAuth capabilities
Search and get apk from aptoide.
scrape data from the itunes app store
Fetches direct links for Facebook videos. Created by Mr Nima, using JavaScript.
JavaScript SDK for Firecrawl API
Web scraper for Bing.
A simple yet powerful module to retrieve organic search results and much more from Google.
An unofficial tiktok downloader scraper for download video, audio and images using tiktok link.
api wrapper from api.xfarr.com
n8n node for browser automation using Puppeteer
Tiktok scraper for Node.js
A module to search and scrape google. This is not sponsored, supported, or affiliated with Google Inc.
Tool for extracting content from YouTube videos and web pages
Crawler is a ready-to-use web spider that works with proxies, asynchrony, rate limit, configurable request pools, jQuery, and HTTP/2 support.
Node.js library to receive live stream chat events like comments and gifts from TikTok LIVE.
Get the direct download link, name, size, and MIME type of a file from a MediaFire link.
Node.js module for interacting with the Euler Stream TikTok LIVE API.
Simple, lightweight and expressive web scraping with Node.js
A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and AppLinks.
A lightweight robots.txt parser for Node.js with support for wildcards, caching and promises.
x-ray's crawler
Implementation of Twitter internal API in TypeScript
The fast, flexible & elegant library for parsing and manipulating HTML and XML.
Simple package to download audio & fetch basic details of the song from reverbnation.
A port of n0madic/twitter-scraper to Node.js.
OpenAPI client for twitter-openapi-typescript-generated
structure any website
A web scraper for NodeJs
Scrape instagram posts from Username, Hashtag or Location pages. Download media and save them to a ZIP archive. Create JSON/CSV files with a post information. No login required
An unofficial web scraper for the sinhalasub.lk website.
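A general-purpose Block crawler framework based on Playwright, supporting bounded concurrency, progress resumption, and single-page or single-Block processing modes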
Add advanced selector support to cheerio
Simple API by Caliph
Dead simple license scraper and validator with zero dependencies.
Get meta data from any url (http/https) and support group by property
Javascript scraper module for Open Graph and Twitter Card info
Scrape module
impit-based HTTP client implementation for Crawlee. Impersonates browser requests to avoid bot detection.
A scraper library for WhatsApp bots or RESTful APIs
Cheerio (http://cheeriojs.github.io/cheerio/) fork that uses parse5 HTML-parser (https://github.com/inikulin/parse5) as an underlying platform
Elegant implementation of core jQuery designed for the server
Dependency free module for scraping and crawling websites using [Crawlbase](https://crawlbase.com) API
Gifted-Dls: Social media (YouTube, TikTok, Facebook, Instagram, Twitter, Spotify, +18) downloaders and some API tools
Get data from soundcloud easily.
Scraper for oEmbed, Twitter Cards and Open Graph metadata - fast and Promise-based
Scraper builder for social platforms such as TikTok, Instagram, Facebook, and Pinterest.
The library provides convenient access to the Outscraper API. Allows using Outscraper's services from your code. See https://outscraper.com for details.
A JavaScript package for non-browser environments that leverages [Genius API](https://genius.com/developers) to find (and scrape) song lyrics and album covers.
Tiny, fast, and elegant implementation of core jQuery designed specifically for the server
A scraper for https://bandcamp.com
A toolkit for identifying ROM files, fetching Hasheous metadata, and launching games with EmulatorJS
Nodejs library that provides high-level APIs for obtaining information on various entertainment media such as books, movies, comic books, anime, manga, and so on.
JS client for WebcrawlerAPI
Search for and download applications from Aptoide
A slim module for scraping Facebook event data in milliseconds.
Turn any webpage into structured data using LLMs
📦 A scraper package serving anime information from hianimez.to
A JS package for scraping recipes from the web.
Unofficial Node.js client for the Webshrinker APIs available at https://www.webshrinker.com
Lightning-fast, enterprise-grade HTTP client for modern JavaScript. Full HTTP/2 support, intelligent cookie management, multiple adapters (HTTP, Fetch, cURL, XHR), streaming, proxy support (HTTP/HTTPS/SOCKS), and cross-environment compatibility.
ScrapingAnt API client for JavaScript
Makes it simple to interface with Garmin Connect to get or set any data point
Makes it simple to interface with Garmin Connect to get or set any data point
A fully-typed api wrapper for a favicon scraper
Downloads images/videos from Coomer/Kemono, Bunkr, GoFile, Reddit-NSFW user posts
JavaScript SDK for Firecrawl API
Fast, token-efficient web content extraction - fetch web pages and convert to clean Markdown
A JavaScript library that allows for the quick transformation of DOM documents into useful formats.
Tiny, fast, and elegant implementation of core jQuery designed specifically for the server
Simple scraper for Google images using Puppeteer
crawl youtube without api key (search videos channels or get all channel/playlist's videos)
MCP server that scrapes Chinese Bing search results and web page content, with support for Chinese-language search
Simple script for downloading Youtube comments without using the Youtube API
Wyzie Lib (https://sub.wyzie.ru/)
ČSFD API in JavaScript. Amazing NPM library for scraping csfd.cz :)
Pitchfork.com scraper that gets best new music info
Yet another node torrent scraper based on x-ray. (Support iptorrents, torrentleech, torrent9, Yyggtorrent, ThePiratebay, torrentz2, 1337x, KickassTorrent, Rarbg, TorrentProject, Yts, Limetorrents, Eztv)
Scraper library for WhatsApp bots or REST APIs
Distributed web crawler powered by Headless Chrome
An easy-to-use Node web crawler storing cookies, following redirects, traversing pages and submitting forms.
Amazon Scraper. You can scrape products from amazon search result and you can also scrape reviews from a specific product
Makes it simple to interface with Garmin Connect to get or set any data point
Scrape meta, link & open graph tags from the head of document as a JavaScript object (JSON)
A TypeScript library that returns all the information about any Medium article (title, pageContent, main image URL, author name, author image URL, etc.) given only a link to the article.
Core SDK and runtime for the JoyBoy parser ecosystem
A webscraper for the NoCopyrightSounds website to provide an API
A CLI tool for scraping and compiling documentation or other multi page content from websites and NPM packages into a single markdown file.
Movies Sinhala Subtitle Download Scraper
A package to bypass Cloudflare's protection
n8n node for requesting webpages using Puppeteer
Modern TypeScript library for collecting public Instagram content with smart delays, mobile-first approach, and media support
Scraper next gen based on x-ray (2.3.2)
A port of n0madic/twitter-scraper to Node.js.
A powerful, modular web scraping and crawling library for Node.js, inspired by crawl4ai. Features stealth mode, LLM extraction, and markdown processing.
911Proxy Universal Web Scraper MCP Server - supports HTML extraction and screenshots
A Facebook event scraper that extracts events via both HTML-embedded data and the GraphQL API.
Lightweight async scraper for Google News
Makes it simple to interface with Garmin Connect CN to get or set any data point
Extract recipe data from the web effortlessly
Reddit scraper for fetching posts and comments via the official API with automatic caching
HarvestAPI provides LinkedIn data scraping tools for real-time, high-performance scraping at a low cost. The API allows searching for LinkedIn `jobs`, `companies`, `profiles`, and `posts` using a wide range of filters.
get data from tableau dashboards
A Node.js library that parses information from PAGASA's Severe Weather Bulletin page and turns it into various formats.
Scrape linktree profile data
A web crawler for Nodejs.
Extract emails from text and also from a site page
Fast, lightweight Open Graph, Twitter Card, and structured data extractor for Node.js with caching and validation
Link Meta Extractor. Extract metadata information from any http/https url. Simply pass a url string to the function and wait for the metadata results.
MCP server that scrapes Chinese Bing search results and web page content, with support for Chinese-language search
News extraction and scraping. Article Parsing
Scrape Bandcamp content
Allows parsing of PAGASA TCB PDF files into pagasa-parser Bulletins.
Scraper for Google search pages based on keywords and other parameters like localisation
A Node.js package that fetches GeeksForGeeks user profile, stats and solved problem details using Puppeteer.
TypeScript library for scraping and parsing stories from various Vietnamese novel websites
n8n community node for advanced browser automation using Puppeteer API with stealth mode, device emulation, advanced selectors (XPath, ARIA, Text, Pierce), screenshots, PDFs, cookies, geolocation, and multi-session contexts
Distributed web crawler powered by Headless Chrome
Life expectancy and lifestyle effects data scraper with curated datasets
Generates storm signal images from PAGASA Parser bulletins.
A TypeScript library for interacting with the Aniworld anime streaming platform or similarly structured services like s.to.
Scrape sneakers data using dynamic parameters like proxy (rotating support), cookie, country and currency.
A simple yet powerful module to retrieve organic search results and much more from Google.
A simple and lightweight NekoPoi scraper.
Web scraper for NodeJS
Assorted scrapers for downloading and searching
A client library for accessing the CF Bypass.
Node.js scraper module for Open Graph and Twitter Card info
Crawler (spider) of site web pages by domain name
Client for the Panopticon news monitoring system
Derana
Node.js scraper module for Open Graph and Twitter Card info, based on https://github.com/jshemas/openGraphScraper
Browser & Node.JS cross-compatible module for the Euler Stream WebSocket service.
React Native scraper module for Open Graph and Twitter Card info
MCP server for Crawlbase API - enables web scraping through Model Context Protocol
Nexara is a lightweight Node.js module designed to fetch and parse web content efficiently. It also provides a utility for capturing webpage screenshots.
n8n node for browser automation using Puppeteer
A Twitter/X client for agents with scraping and posting capabilities
A package to scrape character pages from Fandom wikis. Scrapes only the character info section and the list of all catalogued characters.
KenPom.com API wrapper - College basketball statistics for Node.js with CLI
A gateway to TradingView's data for your Node.js application!
Pinterest automation library with TypeScript and Playwright - undetected features included
scrapes app data from google play store with proxy support
Scrape anime data from different sources (only anime-sama.org, animepahe and crunchyroll for the moment)
Minimalistic package to asynchronously scrape data from Google News.
simple multi-level scraper json input/output
A simple scraper by kelvdra.
Fully-typed TikTok web API client that REALLY works—handles URL signing, msToken rotation, retries, and helpers for profiles, posts, and challenges
A tool for downloading videos or photos from social media, also useful as a utility for your own applications, such as bots
Classify and extract structured data from anywhere
Scrape publicly available jobs on LinkedIn using a headless browser
Very complete scraper for the famous porn website pornhub
A zero-dependency caption scraper for YouTube ✨
A web scraper that fetches fighters, events, rankings, records, and detailed fight statistics from UFC.com.
TikTok video downloader CLI. Download one or multiple TikTok videos directly from URLs with configurable save paths and concurrent downloads.
TypeScript library to fetch Costa Rican exchange rates from Banco Central de Costa Rica (BCCR)
A Node.js module to scrape data from Google Play Store
Nodejs library that provides high-level APIs for obtaining information on various entertainment media such as books, movies, comic books, anime, manga, and so on.
DuckDuckGo search scraper with TypeScript API
A Twitter scraper that uses Nitter to fetch tweets without authentication
Instagram scraper without authenticated API
ImageFap gallery downloader
scrapes app data from google play store
A fast, intelligent web documentation scraper that converts website documentation into Markdown format
TypeScript rewrite of google-play-scraper with throttling, retries, and strong typing.
JSDOM with extra tweak ( jquery / cheerio / request )
Powerful web content extraction SDK with URL normalization and intelligent scraping - https://github.com/anisirji/llm-web-extractor
CLI tool to scrape business place data from ValueSERP API
NodeJS script to scrape the entire database of dbgest.com / bedetheque.com (approx. 260.000+ albums)
Tiny, fast, and elegant implementation of core jQuery designed specifically for the server
An unofficial instagram downloader scraper for download videos and images using instagram link.
Ultra-minimal recursive web crawler
Declarative HTML data extraction library with schema-based selectors
TypeScript-first multi-platform social media scraper without API keys
Get user data and posts by scraping Instagram's user page. Without API key or oAuth!
Configurable website scraper in typescript
Xnxx Search and information scraper
Get Clean Reading Content from every web page
An unofficial scraper for Sinhala cricket news from the sporty.lk site. Created by MrNima.
An automated YouTube poster package with an included database that makes it easy to post new video uploads or streams to a channel. Definable per YouTube channel, fast, and reliable; fetches all channel data and all videos.
A scraper for https://bandcamp.com
A simple scraper.
A complete and versatile web scraper.
n8n node for browser automation using Puppeteer. With extra modifications for cartier.
Node-RED module cheerio a fast, flexible & lean implementation of core jQuery designed specifically for the server.
Introducing Scrappey, your comprehensive website scraping solution provided by Scrappey.com. With Scrappey's powerful and user-friendly API, you can effortlessly retrieve data from websites, including those protected by Cloudflare. Join Scrappey today and
Generator tool to scrape Salesforce documentation and consumer library to access object schemas. See README for both usages.
scrape data from the itunes app store
Recursive web crawler that finds and crawls subdomains, pages, and images.
It parses the html and collects the requested data as desired.
Modern TypeScript library to scrape application data from the iTunes/Mac App Store
A TypeScript client for the Narro inlink API, for easily querying web page metadata via a cloud endpoint.
Simple API YouTube downloader (not official from en.loader.to)
A powerful multi-platform media downloader supporting YouTube, TikTok, Pinterest, and more. Built for easy expansion and clean API usage.
scrapes app data from google play store. Forked from facundoolano/google-play-scraper
A command-line tool acting as an MCP (ModelContextProtocol) server, using Playwright to crawl web content for AI models.
A command-line tool for scraping historical sale prices from BoardGameGeek's marketplace and generating valuation reports for your board game collection.
A simple XVideos scraper that scrapes video data and the downloadable video source and returns a promise/JSON object.
http interceptor to hoomanize cloudflare requests
Structured data extraction from html/webpages.
This is an unofficial API for google dictionary.
Crawlyx is an open-source command-line interface (CLI) based web crawler built using Node.js. It is designed to crawl websites and extract useful information like links, images, and text. It is lightweight, fast, and easy to use.
A simple Telegram channel scraper
The pirate bay client
Extract phone numbers from arbitrary text strings
A port of n0madic/twitter-scraper to Node.js.
x-ray's crawler
Fetch SoundCloud resources through API v2
🚀 MCP SERVER FIXED v3.7.9! Resolved import errors, middleware conflicts, type hints - NOW WORKING PERFECTLY!
n8n node for browser automation using Puppeteer
Tiny, fast, and elegant implementation of core jQuery designed specifically for the server
n8n node for browser automation using Puppeteer
structure any website
Scrape all settlements from Indonesian banks
Google Translate scraper for Illyria Translate
Scrape web as easy as possible
MCP server for gathering market intelligence from mobile app stores (Apple App Store & Google Play)
A lightweight, robust, and framework-agnostic HTML metadata parser for Node.js and the browser. Designed to extract Open Graph, Twitter Cards, and standard metadata with strict validation support.
A powerful multi-platform media downloader supporting YouTube, TikTok, Pinterest, and more. Built for easy expansion and clean API usage.
Scrapes the web serial Worm and its sequel Ward into an ebook format
A port of n0madic/twitter-scraper to Node.js.
Google search scraper with captcha solving support
Node.js - google