MongoDB might be a bit more popular than Apache Tika. We know about 18 links to it since March 2021 and only 17 links to Apache Tika. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
In this article, we’ll build a CLI tool using the Rig AI framework and MongoDB for retrieval-augmented generation (RAG). This tool will store summarized conversations in a database and retrieve them when needed, enabling the AI to maintain context over time. - Source: dev.to / 2 months ago
Have a Mongo database holding the various phrases we're going to use and potentially configuration data for the frontend as well. - Source: dev.to / 9 months ago
It's also worth mentioning that Perseid provides out-of-the-box support for React, VueJS, Svelte, MongoDB, MySQL, PostgreSQL, Express and Fastify. - Source: dev.to / 8 months ago
Does anyone know if the most basic Elastic Cluster instance of DocumentDB carries any monthly fixed cost or is it just on-demand cost? Another words if I run like 10,000 queries against the DB per month, what kind of bill would I expect? This is for a super small app. I am currently using mongodb free tier , but want to migrate everything to AWS. Can't seem to find a straight answer to the pricing question. Source: over 2 years ago
You can use either MongoDB.com's dashboard (if you host a remote database) or Mongo Compass to run queries on the data or you can modify the express middleware with your own queries. I'm still working on the API, so it's not very robust yet. I will update this when it is. Source: over 2 years ago
Strongly recommend using Apache Tika[1] for this. It's industry standard for ubiquitous document text extraction. You can take the text output from Tika, chunk it with something like Chonkie[2], and embed it for your search index. -[1]https://tika.apache.org/ -[2]https://chonkie.ai/. - Source: Hacker News / about 1 month ago
Apache Tika could help extract the relevant bits of PDFs, couldnt it? https://tika.apache.org/. - Source: Hacker News / 12 months ago
Apache Tika has worked well for me in the past, ended up running it on an AWS Lambda https://tika.apache.org/. - Source: Hacker News / almost 2 years ago
If you accept running Java, the Apache Tika is extremely good at parsing content (https://tika.apache.org/). - Source: Hacker News / almost 2 years ago
Apache Tika can spit out text from lots of formats. I've used it with grep (or rg) to make a small scale searching of local folders. Tika does a really good job at OCR for finding if text is in a file. Source: about 2 years ago
Redis - Redis is an open source in-memory data structure project implementing a distributed, in-memory key-value database with optional durability.
Apache Archiva - Apache Archiva is an extensible repository management software.
PostgreSQL - PostgreSQL is a powerful, open source object-relational database system.
highlight.js - Highlight.js is a syntax highlighter written in JavaScript. It works in the browser as well as on the server.
MySQL - The world's most popular open source database
Asklayer - Get real answers from your customers with Asklayers surveys, quizzes, polls and more. Works on any website with zero code and includes enterprise level features such auto-segmentation, user tagging, branching, NPS & CSAT calculation.