Search results

1000+ packages found

The fast, flexible & elegant library for parsing and manipulating HTML and XML.

published version 1.1.0, 8 days ago · 17939 dependents · licensed under MIT
47,121,272

A specification-compliant robots.txt parser with wildcard (*) matching support.

published version 3.0.1, 2 years ago · 84 dependents · licensed under MIT
5,105,869

JavaScript SDK for Firecrawl API

published version 1.25.5, 11 days ago · 37 dependents · licensed under MIT
347,826

Browserless scraper module

published version 5.0.1, a year ago · 16 dependents · licensed under GPL-3.0-or-later
424,190

Apify API client for JavaScript

published version 2.12.5, 19 days ago · 32 dependents · licensed under Apache-2.0
250,512

Node.js scraper module for Open Graph and Twitter Card info

published version 6.10.0, 2 months ago · 75 dependents · licensed under MIT
286,263

A library to easily scrape metadata from an article on the web using Open Graph, JSON-LD, regular HTML metadata, and a series of fallbacks.

published version 5.47.1, 10 days ago · 74 dependents · licensed under MIT
184,185

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 27 dependents · licensed under Apache-2.0
183,204

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 6 dependents · licensed under Apache-2.0
153,751

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 7 dependents · licensed under Apache-2.0
149,426

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 4 dependents · licensed under Apache-2.0
129,994

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 1 dependent · licensed under Apache-2.0
127,460

A lazy way to download images in bulk from DuckDuckGo search results

published version 0.1.11, 4 years ago · 0 dependents · licensed under MIT
124,223

Request a url and scrape the metadata from its HTML using Node.js or the browser.

published version 5.2.1, 21 days ago · 21 dependents · licensed under MIT
133,620

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 3 dependents · licensed under Apache-2.0
131,915

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 1 dependent · licensed under Apache-2.0
127,189

Templates for Crawlee projects

published version 3.13.7, 10 days ago · 1 dependent · licensed under Apache-2.0
127,583

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 5 dependents · licensed under Apache-2.0
130,798

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 59 dependents · licensed under Apache-2.0
128,618

The scalable web crawling and scraping library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

published version 3.13.7, 10 days ago · 1 dependent · licensed under Apache-2.0
124,516