StormCrawler
StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
- Open Source
StormCrawler Alternatives
The best StormCrawler alternatives based on verified products, community votes, reviews and other factors.
- Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
- Apache Nutch is a highly extensible and scalable open source web crawler software project.
- Clear. Fast. Unlimited. Residential & Mobile Proxies For Best Price.
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web...
- Solr is an open source enterprise search server based on Lucene search library, with XML/HTTP and...
- Turn the web into a database!
- A Platform for Data Crawling and Scraping For Business Developers
- ACHE is a web crawler for domain-specific search.
- Apify is a web scraping and automation platform that can turn any website into an API.
- Common Crawl
- Octoparse provides easy web scraping for anyone. Its advanced web crawler allows users to turn web pages into structured spreadsheets within clicks.
- Easily build scalable web scrapers
- ParseHub is a free web scraping tool. With its advanced web scraper, extracting data is as easy as clicking the data you need.
- HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility.