Pandas - Pandas is an open source library providing high-performance, easy-to-use data structures and data analysis tools for the Python.
Apache Kudu - Apache Kudu is Hadoop's storage layer to enable fast analytics on fast data.
Jupyter - Project Jupyter exists to develop open-source software, open-standards, and services for interactive computing across dozens of programming languages. Ready to get started? Try it in your browser Install the Notebook.
Amazon EMR - Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.
PyCharm - Python & Django IDE with intelligent code completion, on-the-fly error checking, quick-fixes, and much more...
BlueData - BlueData's software platform makes it easier, faster and more cost-effective for organizations to deploy Big Data infrastructure on-premises.