Software Alternatives & Reviews

Heritrix

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web...

Top 10 Open-Source Alternatives to Heritrix

Scrapy StormCrawler Apache Nutch Apache Solr Algolia Meilisearch DuckDuckGo Manticore search Typesense ItemsAPI

Summary

The top open-source alternatives to Heritrix are Scrapy, StormCrawler, and Apache Nutch. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. 1
    Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Data 93 social mentions

  2. StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Data

  3. Apache Nutch is a highly extensible and scalable open source web crawler software project.
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Data 2 social mentions

  4. Solr is an open source enterprise search server based on Lucene search library, with XML/HTTP and...
    Pricing:
    • Open Source

    #Custom Search Engine #Custom Search #Search Engine 17 social mentions

  5. Algolia's Search API makes it easy to deliver a great search experience in your apps & websites. Algolia Search provides hosted full-text, numerical, faceted and geolocalized search.
    Pricing:
    • Open Source

    #Search API #Custom Search #Custom Search Engine 3 social mentions

  6. Ultra relevant, instant, and typo-tolerant full-text search API
    Pricing:
    • Open Source

    #Search Engine #Custom Search Engine #Custom Search 3 social mentions

  7. The Internet privacy company that empowers you to seamlessly take control of your personal information online, without any tradeoffs.
    Pricing:
    • Open Source

    #Search Engine #Web Search #Internet Search 1669 social mentions

  8. Typo tolerant, delightfully simple, open source search 🔍
    Pricing:
    • Open Source

    #Custom Search Engine #Custom Search #Search Engine 52 social mentions

  9. ItemsAPI is open source search API for creating mobile and web application
    Pricing:
    • Open Source

    #Custom Search Engine #Custom Search #Search API

Suggest an alternative
If you think we've missed something, please suggest an alternative to Heritrix.
Please use the Feedback button if you think any of the listed products shouldn't be regarded as open-source.

Heritrix discussion

Log in or Post with