Software Alternatives & Reviews

Creating an advanced search engine with PostgreSQL

Vespa.ai Scrapy
  1. Store, search, rank and organize big data
    Pricing:
    • Open Source

    #Search Engine #Custom Search Engine #Custom Search 19 social mentions

  2. 2
    Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
    Pricing:
    • Open Source
    If you're looking for a turn-key solution, I'd have to dig a little. I generally write a scraper in python that dumps into a database or flat file (depending on number of records I'm hunting). Scraping is a separate subject, but once you write one you can generally reuse relevant portions for many others. If you can get adept at a scraping framework like Scrapy you can do it fairly quickly, but there aren't many tools that work out of the box for every site you'll encounter. Once you've written the spider, it's generally able to be rerun for updates unless the site code is dramatically altered. It really comes down to how brittle the spider is coded (i.e. Hunting for specific heading sizes or fonts or something) instead of grabbing the underlying JSON/XHR that doesn't usually change frequently. 1. https://scrapy.org.

    #Web Scraping #Data Extraction #Data 93 social mentions

Discuss: Creating an advanced search engine with PostgreSQL

Log in or Post with