Software Alternatives & Reviews


Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web...

Heritrix Alternatives

The best Heritrix alternatives based on verified products, community votes, reviews and other factors.
Latest update:

  1. Scrapy | A Fast and Powerful Scraping and Web Crawling Framework

    Open Source

  2. StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.

    Open Source

  3. Clear. Fast. Unlimited. Residential & Mobile Proxies For Best Price .

    Try for free paid Free Trial $3.0 (3$ per 1 Gb)

  4. Apache Nutch is a highly extensible and scalable open source web crawler software project.

    Open Source

  5. Turn the web into a database!

  6. ACHE is a web crawler for domain-specific search.

  7. Solr is an open source enterprise search server based on Lucene search library, with XML/HTTP and...

    Open Source

  8. Algolia's Search API makes it easy to deliver a great search experience in your apps & websites. Algolia Search provides hosted full-text, numerical, faceted and geolocalized search.

    Open Source

  9. HTTrack is a free (GPL, libre/free software) and easy-to-use offline browser utility.

    Open Source

  10. grab-site is a crawler for archiving websites to WARC files.

  11. Elasticsearch is an open source, distributed, RESTful search engine.

  12. Apify is a web scraping and automation platform that can turn any website into an API.

  13. Ultra relevant, instant, and typo-tolerant full-text search API

    Open Source

Suggest an alternative
If you think we've missed something, please suggest an alternative to Heritrix.

Generic Heritrix discussion

Log in or Post with