Software Alternatives, Accelerators & Startups

Top 12 Open-Source Alternatives to Diffbot

Diffbot
ScrapeHero Webhose.io Zyte Scrapy Webtap.ai CoffeeScript Dataflow Kit Ocean Protocol StormCrawler AirCode

Summary

The top open-source alternatives to Diffbot are ScrapeHero, Webhose.io, and Zyte. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. A web scraping service to collect data from websites, without any programming or DIY tools.
    Pricing:
    • Open Source
    • Freemium
    • Free Trial
    • $5.0 / Monthly

    #API #Web Scraping #Data Dashboard 1 social mentions

  2. Webhose.
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling 1 social mentions

  3. 3
    We're Zyte (formerly Scrapinghub), the central point of entry for all your web data needs.
    Pricing:
    • Open Source
    • Freemium
    • Free Trial

    #Web Scraping #Data Extraction #Web Crawling 1 social mentions

  4. 4
    Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling 97 social mentions

  5. Extract data from any website using natural language queries—no coding needed.
    Pricing:
    • Open Source
    • $19.99 / Monthly (Pro Plan. Access to our AI-powered web scraper.)

    #Web Scraping #Data Extraction #AI

  6. Unfancy JavaScript
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Data Analysis 25 social mentions

  7. A cloud-based web scraping platform. Extract data from websites and automate workflows on the web.
    Pricing:
    • Open Source
    • Paid
    • Free Trial
    • $5.0 / Usage

    #Web Scraping #Website Screenshots #Data Extraction

  8. The open-source & privacy-preserving data sharing protocol
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Developer Tools

  9. StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.
    Pricing:
    • Open Source

    #Web Scraping #Data Extraction #Web Crawling

  10. 10
    Serverless Node.js stack for API development
    Pricing:
    • Open Source

    #Productivity #Web Scraping #Data Extraction

  11. Get News Data with API
    Pricing:
    • Open Source

    #API Tools #Data Extraction #Web Crawling 16 social mentions

  12. 12
    Colly is a scraping framework to extract structured data from websites.
    Pricing:
    • Open Source

    #Web Scraping #Browser Testing #Tool 9 social mentions

Suggest an alternative
If you think we've missed something, please suggest an alternative to Diffbot.
Please use the Feedback button if you think any of the listed products shouldn't be regarded as open-source.

Diffbot discussion

Log in or Post with