Software Alternatives, Accelerators & Startups

Current problems and mistakes of web scraping in Python and tricks to solve them!

Scrapy Apify
  1. 1
    Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
    Pricing:
    • Open Source
    One might ask, what about Scrapy? I'll be honest: I don't really keep up with their updates. But I haven't heard about Zyte doing anything to bypass TLS fingerprinting. So out of the box Scrapy will also be blocked, but nothing is stopping you from using curl_cffi in your Scrapy Spider.

    #Web Scraping #Data Extraction #Web Crawling 97 social mentions

  2. 2
    Apify is a web scraping and automation platform that can turn any website into an API.
    Developed by Apify, it is a Python adaptation of their famous JS framework crawlee, first released on Jul 9, 2019.

    #Web Scraping #Data Extraction #Web Crawling 26 social mentions

Discuss: Current problems and mistakes of web scraping in Python and tricks to solve them!

Log in or Post with