Based on our records, Vespa.ai appears to be far more popular than Apache Doris: we have tracked 19 links to Vespa.ai but only 1 mention of Apache Doris. We track product recommendations and mentions on public social media platforms and blogs. These can help you identify which product is more popular and what people think of it.
If you're serious about scaling up, definitely consider Vespa (https://vespa.ai). At serious scale, Vespa will likely knock all the other options out of the park. - Source: Hacker News / 19 days ago
Yahoo released their geographic data catalogue under open license and it still lives on as https://whosonfirst.org/. Afaik https://en.wikipedia.org/wiki/Apache_ZooKeeper started at Yahoo. https://vespa.ai/ was Yahoo's search engine for news and other content product, now spinned off (https://techcrunch.com/2023/10/04/yahoo-spins-out-vespa-its-search-tech-into-an-independent-company/). - Source: Hacker News / 3 months ago
I think https://vespa.ai/ has the right approach in this space by focusing on being hybrid - vectors alone aren't great for production use cases, it's the combining of vectors+text that lets you use ranking to get meaningful result. (I'm an investor so I'm biased; but it's also the reason why I invested). - Source: Hacker News / 3 months ago
So what’s the catch? Why is this not everywhere? Because IR is not quite NLP — it hasn’t gone fully mainstream, and a lot of the IR frameworks are, quite frankly, a bit of a pain to work with in-production. Some solid efforts to bridge the gap like Vespa [1] are gathering steam, but it’s not quite there. [1] https://vespa.ai. - Source: Hacker News / 4 months ago
When it comes to search I cannot disagree more. https://vespa.ai is a purpose built search engine. If you start bolting search onto your database, your relevance will be terrible, you'll be rewriting a lot of table stakes tools/features from scratch, and your technical debt will skyrocket. - Source: Hacker News / 10 months ago
As an open-source real-time data warehouse, Apache Doris provides semi-structured data processing capabilities, and the newly-released version 2.1.0 makes a stride in this direction. Before V2.1, Apache Doris stores semi-structured data as JSON files. However, during query execution, the real-time parsing of JSON data leads to high CPU and I/O consumption in addition to high query latency, especially when the... - Source: dev.to / about 1 month ago
Typesense - Typo tolerant, delightfully simple, open source search 🔍
ClickHouse - ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time.
Meilisearch - Ultra relevant, instant, and typo-tolerant full-text search API
StarRocks - StarRocks offers the next generation of real-time SQL engines for enterprise-scale analytics. Learn how we make it easy to deliver real-time analytics.
Algolia - Algolia's Search API makes it easy to deliver a great search experience in your apps & websites. Algolia Search provides hosted full-text, numerical, faceted and geolocalized search.
Apache Hive - Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.