Scrapy - Scrapy is a fast and powerful scraping and web crawling framework.
StormCrawler - StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
Apify - Apify is a web scraping and automation platform that can turn any website into an API.
Apache Nutch - Apache Nutch is a highly extensible and scalable open source web crawler software project.
ParseHub - ParseHub is a free web scraping tool. With its advanced web scraper, extracting data is as easy as clicking the data you need.
Heritrix - Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web...