Based on our record, DataThief III should be more popular than Extract Tables by Docsumo. It has been mentiond 3 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
This is neat. Over Docsumo [0] we have a combination of ML plus NLP+Computer Vision algorithms to detect the tables: [0] - https://docsumo.com/free-tools/extract-tables-from-pdf-images. - Source: Hacker News / almost 3 years ago
Our older pipelines use image-processing-based approaches. However, they had too much assumptions in them (for instance, header texts, column types, etc). Now, we've moved onto to ML-based approach to train generic models that can be applied to variety of documents for table structure recognition. [0] - https://docsumo.com/free-tools/extract-tables-from-pdf-images. - Source: Hacker News / about 3 years ago
I don’t know if this is still up-to-date, but a decade ago I used to use this tool in cases where I couldn’t access numbers behind published graphs: https://datathief.org/. - Source: Hacker News / over 1 year ago
Use https://datathief.org/ to convert this chart to a data table if you want to try it. Source: over 1 year ago
Use something like DataThief (https://datathief.org/) to get the waveform as data points. This depends on having a pretty clear picture of the waveform. Source: over 2 years ago
VancePDF - As an AI OCR-driven PDF solution provider, VancePDF offers high-quality PDF processing services online.
WebPlotDigitizer - WebPlotDigitizer - Web based tool to extract numerical data from plots, images and maps.
DocParser - Extract data from PDF files & automate your workflow with our reliable document parsing software. Convert PDF files to Excel, JSON or update apps with webhooks.
Engauge Digitizer - This open source, digitizing software converts an image file showing a graph or map, into numbers.
Docsumo - Extract Data from Unstructured Documents - Easily. Efficiently. Accurately.
g3data - g3data is used for extracting data from graphs.