Scrapy - Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
Apify - Apify is a web scraping and automation platform that can turn any website into an API.
StormCrawler - StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
Heritrix - Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web...
Mixnode - Turn the web into a database!
ProxyCrawl - ProxyCrawl stay anonymous while crawling the web. Avoid captchas, blocks and proxies. Crawling and scraping protection