JSPM

  • Created
  • Published
  • Downloads 373
  • Score
    100M100P100Q120055F
  • License AGPL-3.0-or-later

Fast web fetcher for AI agents - stealth mode, crawl mode, page actions, structured extraction, PDF parsing, smart escalation from simple HTTP to headless browser

Package Exports

  • webpeel

Readme

WebPeel — Web fetching for AI agents

npm version PyPI version npm downloads GitHub stars CI TypeScript AGPL v3 License

Turn any web page into AI-ready markdown. Smart escalation. Stealth mode. Free to start.

Website · Docs · Playground · Dashboard · Discussions


Quick Start

# Zero install — just run it
npx webpeel https://news.ycombinator.com
# Stealth mode (bypass bot detection)
npx webpeel https://protected-site.com --stealth

# Crawl a website
npx webpeel crawl https://example.com --max-pages 20

# Search the web
npx webpeel search "best AI frameworks 2026"

# Autonomous agent (BYOK LLM)
npx webpeel agent "Find the founders of Stripe" --llm-key sk-...

First 25 fetches work instantly, no signup. After that, sign up free for 125/week.

Why WebPeel?

Feature WebPeel Firecrawl Jina Reader MCP Fetch
Free tier ✅ 125/wk recurring 500 one-time ❌ Cloud only ✅ Unlimited
Smart escalation ✅ HTTP→Browser→Stealth Manual
Stealth mode ✅ All plans ⚠️ Limited
Firecrawl-compatible ✅ Drop-in replacement ✅ Native
Self-hosting ✅ Docker compose ⚠️ Complex N/A
Autonomous agent ✅ BYOK any LLM ⚠️ Locked
MCP tools ✅ 9 tools 3 0 1
License ✅ MIT AGPL-3.0 Proprietary MIT
Pricing Free / $9 / $29 $0 / $16 / $83 Custom Free

Install

# Node.js
npm install webpeel        # or: pnpm add webpeel

# Python
pip install webpeel

# Global CLI
npm install -g webpeel

Usage

Node.js

import { peel } from 'webpeel';

const result = await peel('https://example.com');
console.log(result.content);    // Clean markdown
console.log(result.metadata);   // { title, description, author, ... }
console.log(result.tokens);     // Estimated token count

// With options
const advanced = await peel('https://example.com', {
  render: true,           // Browser for JS-heavy sites
  stealth: true,          // Anti-bot stealth mode
  maxTokens: 4000,        // Limit output
  includeTags: ['main'],  // Filter HTML tags
});

Python

from webpeel import WebPeel

client = WebPeel()  # Free tier, no key needed

result = client.scrape("https://example.com")
print(result.content)  # Clean markdown

results = client.search("python web scraping")
job = client.crawl("https://docs.example.com", limit=100)

Zero dependencies. Pure Python 3.8+. Full SDK docs →

MCP Server

9 tools for Claude Desktop, Cursor, VS Code, and Windsurf:

webpeel_fetch · webpeel_search · webpeel_crawl · webpeel_map · webpeel_extract · webpeel_batch · webpeel_agent · webpeel_summarize · webpeel_brand

{
  "mcpServers": {
    "webpeel": {
      "command": "npx",
      "args": ["-y", "webpeel", "mcp"]
    }
  }
}

Install in Claude Desktop Install in VS Code

Where to add this config: Claude Desktop → ~/Library/Application Support/Claude/claude_desktop_config.json · Cursor → Settings → MCP Servers · VS Code → ~/.vscode/mcp.json · Windsurf → ~/.codeium/windsurf/mcp_config.json

Docker (Self-Hosted)

git clone https://github.com/webpeel/webpeel.git
cd webpeel && docker compose up

Full API at http://localhost:3000. MIT licensed — no restrictions.

Features

🎯 Smart Escalation

Automatically uses the fastest method, escalates only when needed:

HTTP Fetch (200ms)  →  Browser Rendering (2s)  →  Stealth Mode (5s)
     80% of sites          15% of sites             5% of sites

🎭 Stealth Mode

Bypass Cloudflare and bot detection. Masks browser fingerprints, navigator properties, WebGL vendor.

npx webpeel https://protected-site.com --stealth

🕷️ Crawl & Map

Crawl websites with link following, sitemap discovery, robots.txt compliance, and deduplication.

npx webpeel crawl https://docs.example.com --max-pages 100
npx webpeel map https://example.com --max-urls 5000

🤖 Autonomous Agent (BYOK)

Give it a prompt, it researches the web using your own LLM key.

npx webpeel agent "Compare pricing of Notion vs Coda" --llm-key sk-...

📊 More Features

Feature CLI Node.js Python API
Structured extraction
Screenshots
Branding extraction
Change tracking
Token budget
Tag filtering
Image extraction
AI summarization
Batch processing
PDF extraction

Integrations

Works with LangChain, LlamaIndex, CrewAI, Dify, and n8n. Integration docs →

Hosted API

Live at api.webpeel.dev — Firecrawl-compatible endpoints.

# Fetch a page (free, no auth needed for first 25)
curl "https://api.webpeel.dev/v1/fetch?url=https://example.com"

# With API key
curl "https://api.webpeel.dev/v1/fetch?url=https://example.com" \
  -H "Authorization: Bearer wp_..."

Pricing

Plan Price Weekly Fetches Burst Extra Usage
Free $0 125/wk 25/hr
Pro $9/mo 1,250/wk 100/hr ✅ from $0.001
Max $29/mo 6,250/wk 500/hr ✅ from $0.001

Extra credit costs: fetch $0.002, search $0.001, stealth $0.01. Resets every Monday. All features on all plans. Compare with Firecrawl →

Development

git clone https://github.com/webpeel/webpeel.git
cd webpeel
npm install && npm run build
npm test

See CONTRIBUTING.md for guidelines.

Documentation · Playground · API Reference · npm · PyPI · Migration Guide · Blog · Discussions

Star History

Star History Chart

License

This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0).

What this means:

  • ✅ Free to use, modify, and distribute
  • ✅ Free for personal and commercial use
  • ⚠️ If you run a modified version as a network service, you must release your source code under AGPL-3.0

Need a commercial license? Contact us at support@webpeel.dev for proprietary/enterprise licensing.

Note: Versions 0.7.1 and earlier were released under MIT. Those releases remain MIT-licensed.

© WebPeel


Like WebPeel? ⭐ Star us on GitHub — it helps others discover the project!