CommonCrawl VS Brave Search

Brave Search

Private search that puts you first, not big tech

Landing page //
2023-10-16

Landing page //
2023-03-24

CommonCrawl

Website: commoncrawl.org
Categories: #Search Engine #Web Scraping #Data Extraction #Internet Search

Edit details

Brave Search

Website: search.brave.com
Categories: #Android #iPhone #Web App #Search Engine #Private Search Engine

Edit details

CommonCrawl videos

No CommonCrawl videos yet. You could help us improve this page by suggesting one.

+ Add video

Brave Search videos

+ Add

Introducing Brave Search beta

Category Popularity

0-100% (relative to CommonCrawl and Brave Search)

Brave Search

Search Engine

13 13%

Search Engine

87% 87

Web Scraping

100 100%

Web Scraping

0% 0

Android

0 0%

Android

100% 100

Internet Search

12 12%

Internet Search

88% 88

User comments

Share your experience with using CommonCrawl and Brave Search. For example, how are they different and which one is better?

Reviews

These are some of the external sources and on-site user reviews we've used to compare CommonCrawl and Brave Search

CommonCrawl Reviews

We have no reviews of CommonCrawl yet.
Be the first one to post

Brave Search Reviews

Notgiv Inmaname

Random guy at Hypothetical Inexistent Organization of Proffesional Dumbasses | over 2 years ago

Brave Search is way better than other search engines.
In contrast to other "private" search engines (except for Presearch and SearX), it doesn't have trackers, or not nearly as many. This information can be verified by installing uBlock Origin and ClearURLs, which detect 0 and 2 trackers respectively, against for example DuckDuckGo's nearly 10 and 19. Other alternatives are SearX (No trackers AT ALL, still kinda user-friendly) and Presearch (A bit easier to use but a tiny bit worse for privacy, it has 1 more tracking element).

🏁 Competitors: DuckDuckGo, Mojeek, StartPage, Presearch, Searx, Google, Bing, Yahoo, Ecosia, Ask.com, Qwant

👍 Pros: Good search results|Not too many trackers|Not buggy|Easy to use

👎 Cons: Has 2 tracking url elements

The Next Google

“Brave Search can operate as stand-alone, the rest cannot as they rely on Google or Bing. Most search engines are not independent search engines, and while they may provide some value, they are qualitatively different from what Brave Search is doing. Independence is not something directly actionable, but it’s a fundamental property. Independence means that Brave Search would...

Source: dkb.io

Social recommendations and mentions

Based on our record, Brave Search should be more popular than CommonCrawl. It has been mentiond 328 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

CommonCrawl mentions (91)

Ask HN: Who is hiring? (May 2024)
Common Crawl Foundation | REMOTE | Full and part-time | https://commoncrawl.org/ | web datasets I'm the CTO at the Common Crawl Foundation, which has a 17 year old, 8. - Source: Hacker News / 2 days ago
Ask HN: How does one implement web plagiarism?
Https://commoncrawl.org/ is a non-profit which offers a pre-crawled dataset. The specifics of individual tools probably vary. I imagine most tools would be based on academic datasets. - Source: Hacker News / 4 months ago
Things are about to get a lot worse for Generative AI
Should the NYT not sue https://commoncrawl.org/ ? OpenAI just used the data from commoncrawl for training. - Source: Hacker News / 4 months ago
Indexing a Billion Pages
What you’re likely referring to is Common Crawl: https://commoncrawl.org. - Source: Hacker News / 4 months ago
Interview with Viktor Lofgren from Marginalia Search
> ... a project called "Nutch" would allow web users to crawl the web themselves. Perhaps that promise is similar to the promises being made about "AI" today. The project did not turn out to be used in the way it was predicted (marketed), or even used by web users at all. Actually Nutch is used to produce the Common Crawl[0] and 60% of GPT-3's training data was Common Crawl[1], so in a way it is being used... - Source: Hacker News / 5 months ago

Brave Search mentions (328)

DuckDuckGo AI Chat
Pretty cool! I use Brave Search (https://search.brave.com) and it too got AI results a few months ago. They're quite helpful! - Source: Hacker News / 15 days ago
Are my searches tracked by google if I us google seach in Brave?
Best way to protect yourself from that is to use other search engines that do not track you (I really like Brave Search, but if you want Google results without tracking try Startpage). Source: 5 months ago
Google Pays $21B for Search Monopoly: How "Free" Tech Markets Repress
Instead of DuckDuckGo and Ecosia whose use Bing Search, they should share real alternative like https://kagi.com/ or https://search.brave.com. - Source: Hacker News / 6 months ago
Ask HN: Are there valid Google Search alternatives?
No need to pay for Kagi imo https://search.brave.com/. - Source: Hacker News / 6 months ago
Tell HN: DuckDuckGo Search Results have jumped the shark
It's been a while. https://search.brave.com/. - Source: Hacker News / 6 months ago

What are some alternatives?

When comparing CommonCrawl and Brave Search, you can also consider the following products

Scrapy - Scrapy | A Fast and Powerful Scraping and Web Crawling Framework

DuckDuckGo - The Internet privacy company that empowers you to seamlessly take control of your personal information online, without any tradeoffs.

StormCrawler - StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.

Google - Google Search, also referred to as Google Web Search or simply Google, is a web search engine developed by Google. It is the most used search engine on the World Wide Web

Apache Nutch - Apache Nutch is a highly extensible and scalable open source web crawler software project.

Searx - Open source metasearch engine

CommonCrawl vs Scrapy

CommonCrawl vs DuckDuckGo

CommonCrawl vs StormCrawler

CommonCrawl vs Google

CommonCrawl vs Apache Nutch

CommonCrawl vs Searx

Brave Search vs Scrapy

Brave Search vs DuckDuckGo

Brave Search vs StormCrawler

Brave Search vs Google