Extract Tables by Docsumo might be a bit more popular than Engauge Digitizer. We know of 2 links to it since March 2021 and 2 links to Engauge Digitizer. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
I wonder how many implementations there were of that sort of thing. I wrote one for an HP flatbed plotter sometime in the 80s to digitize plots from papers, but I never heard of another. I don't remember the implementation, but it must have been cheaper hardware than the one shown, and only had a plastic lens as the sight. (Nowadays you have http://markummitchell.github.io/engauge-digitizer for instance.) I... - Source: Hacker News / almost 3 years ago
Nice, I've used Engauge Digitizer in the past http://markummitchell.github.io/engauge-digitizer/. - Source: Hacker News / about 3 years ago
This is neat. Over at Docsumo [0] we have a combination of ML plus NLP + Computer Vision algorithms to detect the tables: [0] - https://docsumo.com/free-tools/extract-tables-from-pdf-images. - Source: Hacker News / almost 3 years ago
Our older pipelines use image-processing-based approaches. However, they had too many assumptions in them (for instance, header texts, column types, etc). Now, we've moved on to an ML-based approach to train generic models that can be applied to a variety of documents for table structure recognition. [0] - https://docsumo.com/free-tools/extract-tables-from-pdf-images. - Source: Hacker News / about 3 years ago
WebPlotDigitizer - Web-based tool to extract numerical data from plots, images and maps.
VancePDF - As an AI OCR-driven PDF solution provider, VancePDF offers high-quality PDF processing services online.
Plot Digitizer - All-in-One Tool to Extract Data from Graphs, Plots & Images
DocParser - Extract data from PDF files & automate your workflow with our reliable document parsing software. Convert PDF files to Excel, JSON or update apps with webhooks.
DataThief III - DataThief III is a program to extract (reverse engineer) data points from a graph.
Docsumo - Extract Data from Unstructured Documents - Easily. Efficiently. Accurately.