Software Alternatives & Reviews

Web Scraping Open Knowledge

  1. Common Crawl

    #Search Engine #Web Scraping #Data Extraction 90 social mentions

  2. Morph: A Heroku for Scrapers. Get structured data out of the web.
    This is the one I know about: https://morph.io/ and https://github.com/openaustralia/morph#readme (AGPLv3). They used to sit at the intersection of "Heroku for scrapers" and DoltHub (e.g. https://www.dolthub.com/repositories/dolthub/us-businesses/data/master/businesses), since the scrapers would run and then make their data available as CSV, SQLite, or similar. But when I just tried to load one of the morph.io scrapers, the page only said "creating new template", so I'm guessing they've gone the way of ScraperWiki.com, which preceded them: it turns out that hosted compute for free isn't free.

    1 social mention
