Apache Flink is another popular open source distributed data streaming engine that performs stateful computations over bounded and unbounded data streams. This framework is written in Scala and Java and is ideal for complex data stream computations. - Source: dev.to / 28 days ago
This post continues a series on data-driven development best practices. These are specifically made for software systems with highly complicated integration points, but are applicable to many different situations. This series uses stream processing with Apache Flink for the examples. - Source: dev.to / 10 days ago
In software development we often unit test our code (hopefully). And code written for Spark is no different. So here I want to run through an example of building a small library using PySpark and unit testing it. I'm using Visual Studio Code as my editor here, mostly because I think it's brilliant, but other editors are available. - Source: dev.to / 21 days ago
Spring Framework - The Spring portfolio has many projects, including Spring Framework, Spring IO Platform, Spring Cloud, Spring Boot, Spring Data, Spring Security...
Hadoop - Open-source software for reliable, scalable, distributed computing
Spark - Spark helps you take your inbox under control. Instantly see what’s important and quickly clean up the rest. Spark for Teams allows you to create, discuss, and share email with your colleagues
Hive - Seamless project management and collaboration for your team.
Grails - An Open Source, full stack, web application framework for the JVM
Hortonworks - Hadoop-Related