Scikit-learn - scikit-learn (formerly scikits.learn) is an open source machine learning library for the Python programming language.
Amazon Textract - Easily extract text and data from virtually any document using Amazon Textract. Textract goes beyond simple optical character recognition (OCR) to also identify the contents of fields in forms and information stored in tables.
Pandas - Pandas is an open source library providing high-performance, easy-to-use data structures and data analysis tools for the Python.
DocParser - Extract data from PDF files & automate your workflow with our reliable document parsing software. Convert PDF files to Excel, JSON or update apps with webhooks.
NumPy - NumPy is the fundamental package for scientific computing with Python
FlexiCapture - ABBYY FlexiCapture brings together the best NLP, machine learning, and advanced recognition capabilities into a single, enterprise-scale platform to handle every type of document. Available in the Cloud, on premise or as SDK.