A powerful, fast and high-fidelity HTML to PDF conversion API.
Code examples and package ready for Node, Python and PHP developers.
Advanced features are available, including watermarking and encryption!
Based on our record, Xapian seems to be more popular. It has been mentiond 7 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Recoll is free/open source (GPL) that can index PDFs and search them very quickly. It uses Xapian under the hood. I have over 165,000 documents indexed on an old laptop running Linux and can query them all in a split second. Source: 7 months ago
+ xapian which has been around a while, and while gpl licensed, is quite capable https://xapian.org/. - Source: Hacker News / over 1 year ago
Tangentially related if you need search without the clustering and high availability story of elastic search and friends I highly recommend Xapian. Its like the SQLite of search. Single library that provides the basic set of features you would expect in a quality search experience: facets, ranked search, boolean operators, stemming etc etc. https://xapian.org/. - Source: Hacker News / over 1 year ago
For fast searching, it usually requires indexing the files in question. There are a number of text-file indexing solutions, many of which use xapian, sphinx, or lucene/solr under the hood. Based on conditions (watching files/directories, cron jobs, new-mail triggers, etc), they'll add/remove files to the index, and you can then use a corresponding command to compose queries across that data. If it's indexed, it... Source: over 2 years ago
There is also xapian/recoll https://xapian.org/ which works great for "desktop" search. - Source: Hacker News / over 2 years ago
ElasticSearch - Elasticsearch is an open source, distributed, RESTful search engine.
DocRaptor - As the only API powered by the Prince HTML-to-PDF engine, DocRaptor provides the best support for complex PDFs with powerful support for headers, page breaks, page numbers, flexbox, watermarks, accessible PDFs, and much more
ElasticHQ - Tool for ElasticSearch management and monitoring.
pdflayer - Free, powerful HTML to PDF API supporting both URL and raw HTML conversion. Unlimited document size, lightning-fast and compatible PHP, Python, Ruby, etc.
Kaizen - Kaizen is an ElasticSearch GUI for Windows, Mac and Linux, written in JavaFX as a cross-platform desktop application.
PDFCrowd - Pdfcrowd is a Web/HTML to PDF online service. Convert HTML to PDF online in the browser or in your PHP, Python, Ruby, .NET, Java apps via the REST API.