Based on our record, Vespa.ai should be more popular than Sphinx Search. It has been mentiond 20 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
In cases where a company possesses a strong technological foundation and faces a substantial workload demanding advanced vector search capabilities, its ideal solution lies in adopting a specialized vector database. Prominent options in this domain include Chroma (having raised $20 million), Zilliz (having raised $113 million), Pinecone (having raised $138 million), Qdrant (having raised $9.8 million), Weaviate... - Source: dev.to / 11 days ago
If you're serious about scaling up, definitely consider Vespa (https://vespa.ai). At serious scale, Vespa will likely knock all the other options out of the park. - Source: Hacker News / about 1 year ago
Yahoo released their geographic data catalogue under open license and it still lives on as https://whosonfirst.org/ Afaik https://en.wikipedia.org/wiki/Apache_ZooKeeper started at Yahoo https://vespa.ai/ was Yahoo's search engine for news and other content product, now spinned off (https://techcrunch.com/2023/10/04/yahoo-spins-out-vespa-its-search-tech-into-an-independent-company/). - Source: Hacker News / about 1 year ago
I think https://vespa.ai/ has the right approach in this space by focusing on being hybrid - vectors alone aren't great for production use cases, it's the combining of vectors+text that lets you use ranking to get meaningful result. (I'm an investor so I'm biased; but it's also the reason why I invested). - Source: Hacker News / over 1 year ago
So what’s the catch? Why is this not everywhere? Because IR is not quite NLP — it hasn’t gone fully mainstream, and a lot of the IR frameworks are, quite frankly, a bit of a pain to work with in-production. Some solid efforts to bridge the gap like Vespa [1] are gathering steam, but it’s not quite there. [1] https://vespa.ai. - Source: Hacker News / over 1 year ago
Sphinx is a search engine that can be integrated into a website to provide advanced search functionality such as full-text, Boolean, and faceted search. It is a powerful open-source search engine that can handle large amounts of data and quickly return results. - Source: dev.to / about 2 years ago
Have been using Sphinx. It does some processing around suffixes, tenses, and so on, and looks at word proximity (BM25), but is definitely limited. Source: about 2 years ago
Lucene is the thing you think you need. Elastic Search is a nice wrapper for it. But these are Java, so maybe you want Sphinx Search (C++) or MeiliSearch (Rust). Source: over 2 years ago
Using a natural language search will almost certainly be a better solution and PHP may not be the best tool for this task. Figure out how you are going to get the text out of the PDF and where you are going to put it. Look at things like sphinx and full text search in boolean mode for doing the keyword matching. Source: over 2 years ago
In practice though you don't do any of this, you get a library to do it for you. I've used Sphinx Search in the past for some fairly hefty (In the order of terabytes), and there's a good book covering how to get it all set up and started. Source: almost 3 years ago
Meilisearch - Ultra relevant, instant, and typo-tolerant full-text search API
ElasticSearch - Elasticsearch is an open source, distributed, RESTful search engine.
Typesense - Typo tolerant, delightfully simple, open source search 🔍
Apache Solr - Solr is an open source enterprise search server based on Lucene search library, with XML/HTTP and...
Milvus - Vector database built for scalable similarity search Open-source, highly scalable, and blazing fast.
Algolia - Algolia's Search API makes it easy to deliver a great search experience in your apps & websites. Algolia Search provides hosted full-text, numerical, faceted and geolocalized search.