If you want image to text I would recommend https://github.com/tesseract-ocr/tesseract. - Source: Reddit / 23 days ago
> (…) better than Tesseract Isn’t Tesseract also neural network-based? https://github.com/tesseract-ocr/tesseract. - Source: Hacker News / 2 months ago
The ocr filter in ffmpeg is powered by the Tesseract library. As you will often find in ffmpeg, the build within ffmpeg has only a subset of the functionality of the original library - at least, for the moment. There's always the possibility of APIs being expanded in later ffmpeg releases. And it is open source of course, so there's the option of instigating those changes yourself - or using the original library... - Source: dev.to / 2 months ago
After that you would use Tesseract-OCR to OCR the pages. Tesseract is a open source multiplatform OCR software. If the typeface is something non standard you would have to train the recognition engine on your data. - Source: Reddit / 3 months ago
Alternatively, look into Tesseract. Allows you to do offline/local OCR; it might be a better option if you're on a tight budget with a huge image dataset. You could also look into training Tesseract with your own annotated text images for better results if you find the base model doesn't suit your needs. - Source: Reddit / 3 months ago
That sounds like it will almost certainly require custom scripting because your use case is unique. You can probably break down the problem into multiple steps which are easier to address. There is some decent pdf software out there that can handle OCR (optical character recognition) though barcodes specifically are a bit harder to get opensource solutions for - the main one being tesseract... - Source: Reddit / 3 months ago
Rescribe is front-end for Google's Tesseract OCR engine. You can run rescribe against a folder/directory of image files (e.g. pngs). - Source: Reddit / 3 months ago
Tesseract (And it has many GUI-based applications as well.). - Source: Reddit / 4 months ago
I have done this in the past - take a screenshot of the webpage and OCR that using https://github.com/tesseract-ocr/tesseract (you can do all that in Python) , but that's not using Selenium at all. - Source: Reddit / 5 months ago
If you want to code it yourself, that could be a fun project! You could for example look at tools like pdftotext if your PDF is machine generated or OCR tools like tesseract if PDF are scans. - Source: Reddit / 5 months ago
> Would love to find a cheaper (local) option vs AWS How about tesseract (https://github.com/tesseract-ocr/tesseract). - Source: Hacker News / 6 months ago
If you are grabbing from your webbrowser then the solution is scraping the source code rather than screen shots. If you are pulling from something like a game or a program that just presents a GUI with prerendered text then OCR is what you are looking for. OCR stands for Optical Charachter Recognition and the most popular offline library to use is called tessaract. - Source: Reddit / 7 months ago
Afterwards you can try an existing optical character recognition pipeline e.g. Tesseract https://github.com/tesseract-ocr/tesseract to try to parse the text. - Source: Reddit / 7 months ago
Oh, that might work, I'll take a look thank you. I was playing around with some ideas[Googling stuff] last night and came across https://github.com/tesseract-ocr/tesseract which looks like it might be promising if I can keep the image fairly well controlled. - Source: Reddit / 7 months ago
Under the hood it uses OCRmyPDF which in its turn uses tesseract which means almost any language can be detected (they advertise it with - "supports more then 100 languages"). Yes, it supports Greek language as well. - Source: Reddit / 7 months ago
Take the slides and use OCR software to extract the text. Then there are various Web sites where a block of text can be pasted in and standard reading scores (e.g. Flesch-Kincaid) calculated. - Source: Reddit / 7 months ago
For OCR I used Tesseract https://github.com/tesseract-ocr/tesseract which is available via toltec on device. If you want to look at more modern approaches https://huggingface.co/microsoft/trocr-base-handwritten might be interesting. - Source: Reddit / 8 months ago
Whatever you draw in rnote could be sent to OCR api's: OCR Https://github.com/tesseract-ocr/tesseract TESSERACT OCR APIs in rust Https://houqp.github.io/leptess/leptess/index.html. - Source: Reddit / 8 months ago
In order to implement text recognition, I needed to use another library, Tesseract. It is an optical character recognition engine made by Google. There is also a .NET wrapper for this library. I had to use a few tricks to properly match arbitrary text with a library that is best suited for digitized books, and cleverly use page segmentation methods. You will find how I used Tesseract in this C# class. - Source: dev.to / 9 months ago
For linux (or GNOME more specifically) there is Frog[1]. It uses Tesseract OCR[2] under the hood. [1]: https://flathub.org/apps/details/com.github.tenderowl.frog [2]: https://github.com/tesseract-ocr/tesseract. - Source: Hacker News / 9 months ago
Tesseract has Rust bindings and has been around for a while. It's not clear from your description what you're trying to accomplish, but I think for OCR, tesseract is an easy OSS way to start. - Source: Reddit / 10 months ago
Do you know an article comparing Tesseract to other products?
Suggest a link to a post with product alternatives.