Software Alternatives & Reviews

Apache Nutch

Apache Nutch is a highly extensible and scalable open source web crawler software project. subtitle

Apache Nutch Alternatives

The best Apache Nutch alternatives based on verified products, community votes, reviews and other factors.
Latest update:

  1. 11
    /scrapy-alternatives

    Scrapy | A Fast and Powerful Scraping and Web Crawling Framework

    Open Source

  2. 11
    /stormcrawler-alternatives

    StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.

    Open Source

  3. Try for free

    Get proxy servers featuring IPv4, HTTP/HTTPs, and SOCKS4/5 protocols. Choose from static and rotating IP addresses. ProxyCompass is here to support your business around the clock.

    Try for free paid Free Trial $15.0 / Monthly (5 US Proxies)

  4. /heritrix-alternatives

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web...

  5. /mixnode-alternatives

    Turn the web into a database!

    Open Source

  6. /commoncrawl-alternatives
  7. /ache-crawler-alternatives

    ACHE is a web crawler for domain-specific search.

  8. /apache-solr-alternatives

    Solr is an open source enterprise search server based on Lucene search library, with XML/HTTP and...

    Open Source

  9. /apify-alternatives

    Apify is a web scraping and automation platform that can turn any website into an API.

  10. /httrack-alternatives

    HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility.

    Open Source

  11. /crawlbase-alternatives

    A Platform for Data Crawling and Scraping For Business Developers

    paid $99.0 / Monthly

  12. /octoparse-alternatives

    Octoparse provides easy web scraping for anyone. Our advanced web crawler, allows users to turn web pages into structured spreadsheets within clicks.

  13. /content-grabber-alternatives

    Content Grabber is an automated web scraping tool.

  14. /puppeteer-alternatives

    Puppeteer is a Node library which provides a high-level API to control headless Chrome or Chromium...

Suggest an alternative
If you think we've missed something, please suggest an alternative to Apache Nutch.

Generic Apache Nutch discussion

Log in or Post with