Scikit-learn - scikit-learn (formerly scikits.learn) is an open source machine learning library for the Python programming language.
DocParser - Extract data from PDF files & automate your workflow with our reliable document parsing software. Convert PDF files to Excel, JSON or update apps with webhooks.
Pandas - Pandas is an open source library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
Amazon Textract - Easily extract text and data from virtually any document using Amazon Textract. Textract goes beyond simple optical character recognition (OCR) to also identify the contents of fields in forms and information stored in tables.
NumPy - NumPy is the fundamental package for scientific computing with Python.
ABBYY FineReader - With FineReader 16, ABBYY's latest PDF editor software, you can easily convert files such as PDF to Excel or PDF to Word, and edit, share, and collaborate on documents.