
Apache Nutch

Apache Nutch is a highly extensible and scalable open source web crawler software project.

Top 4 Open-Source Alternatives to Apache Nutch

Scrapy, StormCrawler, Mixnode, ScrapeHero

Summary

The top open-source alternatives to Apache Nutch are Scrapy, StormCrawler, and Mixnode. The list is ordered in part by the number of mentions each product has on reliable external sources.
  1. Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling 97 social mentions

  2. StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling

  3. Mixnode | Turn the web into a database!
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling

  4. ScrapeHero | A web scraping service to collect data from websites, without any programming or DIY tools.
    Pricing:
    • Open Source
    • Freemium
    • Free Trial
    • $5.00 / month

    #API #Web Scraping #Data Dashboard 1 social mention

