In software development we often unit test our code (hopefully). And code written for Spark is no different. So here I want to run through an example of building a small library using PySpark and unit testing it. I'm using Visual Studio Code as my editor here, mostly because I think it's brilliant, but other editors are available. - Source: dev.to / 17 days ago
Amazon Redshift - Learn about Amazon Redshift cloud data warehouse.
Hadoop - Open-source software for reliable, scalable, distributed computing
Apache Kylin - OLAP Engine for Big Data
Hive - Seamless project management and collaboration for your team.
Apache Druid - Fast column-oriented distributed data store
Hortonworks - Hadoop-Related