All Course

featured project

Apache Spark, Scala and Kafka

Spark is an open source processing engine built, in spark we have ecosystems like Spark SQL, Streaming, Mlib, Graphx, for processing the data we use Scala as a programming language and Apache kafka is most advanced feature of big data used for streaming data integrated with java API’s. ...

view details
featured project

Apache Kafka

Understand Kafka and its components. Kafka cluster deployment on Hadoop and YARN Understanding real time Kafka streaming. ...

view details
featured project

Spark

Apache Spark is a lightning fast cluster computing designed for fast computation. Spark executes in memory data processing and runs much faster than Hadoop Map Reduce. Learners will get trained in-depth spark concepts with Scala programming and its components such as Spark Streaming, Spark SQL, Spark RDD, Spark MLlib and Spark Graphx. ...

view details
featured project

HADOOP

Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. This brief tutorial provides a quick introduction to Big Data, Map Reduce algorithm, and Hadoop Distributed File System. ...

view details