Annoy VS Weaviate

Annoy

Annoy is a C++ library with Python bindings to search for points in space that are close to a given query point.

Weaviate

Welcome to Weaviate

Landing page //
2023-10-10

Landing page //
2023-05-10

Does Asking for Reviews Annoy My Customers?

Weaviate videos

+ Add

Introducing the Weaviate Vector Search Engine!

Category Popularity

0-100% (relative to Annoy and Weaviate)

Weaviate

Search Engine

23 23%

Search Engine

77% 77

Utilities

27 27%

Utilities

73% 73

Custom Search Engine

36 36%

Custom Search Engine

64% 64

Databases

0 0%

Databases

100% 100

User comments

Share your experience with using Annoy and Weaviate. For example, how are they different and which one is better?

Social recommendations and mentions

Annoy might be a bit more popular than Weaviate. We know about 35 links to it since March 2021 and only 28 links to Weaviate. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Annoy mentions (35)

Do we think about vector dbs wrong?
The focus on the top 10 in vector search is a product of wanting to prove value over keyword search. Keyword search is going to miss some conceptual matches. You can try to work around that with tokenization and complex queries with all variations but it's not easy. Vector search isn't all that new a concept. For example, the annoy library (https://github.com/spotify/annoy), an open source embeddings database. - Source: Hacker News / 10 months ago
Vector Databases 101
If you want to go larger you could still use some simple setup in conjunction with faiss, annoy or hnsw. Source: 12 months ago
Calculating document similarity in a special domain
I then use annoy to compare them. Annoy can use different measures for distance, like cosine, euclidean and more. Source: about 1 year ago
Can Parquet file format index string columns?
Yes you can do this for equality predicates if your row groups are sorted . This blog post (that I didn't write) might add more color. You can't do this for any kind of text searching. If you need to do this with file based storage I'd recommend using a vector based text search and utilize a ANN index library like Annoy. Source: about 1 year ago
[D]: Best nearest neighbour search for high dimensions
If you need large scale (1000+ dimension, millions+ source points, >1000 queries per second) and accept imperfect results / approximate nearest neighbors, then other people have already mentioned some of the best libraries (FAISS, Annoy). Source: about 1 year ago

Weaviate mentions (28)

How to choose the right type of database
Weaviate: An open-source, cloud-native vector database built for scalable and fast vector searches. It's particularly effective for semantic search applications, combining full-text search with vector search for AI-powered insights. - Source: dev.to / 4 months ago
7 Vector Databases Every Developer Should Know!
Weaviate is an open-source vector search engine with out-of-the-box support for vectorization, classification, and semantic search. It is designed to make vector search accessible and scalable, supporting use cases such as semantic text search, automatic classification, and more. - Source: dev.to / 4 months ago
Qdrant, the Vector Search Database, raised $28M in a Series A round
Congrats to them! What have your experiences with vector databases been? I've been using https://weaviate.io/ which works great, but just for little tech demos, so I'm not really sure how to compare one versus another or even what to look for really. - Source: Hacker News / 5 months ago
How Modern SQL Databases Are Changing Web Development - #4 Into the AI Era
A RAG implementation's quality and performance highly depend on the similarity-based search of embeddings. The challenge arises from the fact that embeddings are usually high-dimensional vectors, and the knowledge base may have many documents. It's not surprising that the popularity of LLM catalyzed the development of specialized vector databases like Pinecone and Weaviate. However, SQL databases are also evolving... - Source: dev.to / 6 months ago
Make Notion search great again: Vector Database
To find semantically similar texts we need to calculate the distance between vectors. While we have just a few short texts we can brute-force it: calculate the distance between our query and each text embedding one by one and see which one is the closest. When we deal with thousands or even millions of entries in our database, however, we need a more efficient way of comparing vectors. Just like for any other way... - Source: dev.to / 7 months ago

What are some alternatives?

When comparing Annoy and Weaviate, you can also consider the following products

txtai - AI-powered search engine

Qdrant - Qdrant is a high-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Scikit-learn - scikit-learn (formerly scikits.learn) is an open source machine learning library for the Python programming language.

Milvus - Vector database built for scalable similarity search Open-source, highly scalable, and blazing fast.

pgvecto.rs - Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database. - tensorchord/pgvecto.rs

Vectara Neural Search - Neural search as a service API with breakthrough relevance

Annoy vs txtai

Annoy vs Qdrant

Annoy vs Scikit-learn

Annoy vs Milvus

Annoy vs pgvecto.rs

Annoy vs Vectara Neural Search

Weaviate vs txtai

Weaviate vs Qdrant

Weaviate vs Scikit-learn

Weaviate vs Milvus

Weaviate vs pgvecto.rs