Description: Distributions provide performance and functionality enhancements over the base open source code Apache provides. In this course, you'll learn about the various distributions available and common maintenance tasks in a Hadoop environment.

Target Audience: Individuals who wish to understand key concepts and features of Hadoop and its tools

Duration: 00:43

Description: Apache Ranger is used to provide data security across a Hadoop implementation. In this course, you'll learn about installing Ranger and Ranger authentication considerations, as well as customizing services to run Ranger alongside Hadoop.

Target Audience: Individuals who wish to understand key concepts and features of Hadoop and its tools

Duration: 00:56

Description: Hadoop can be used with Amazon EMR to process vast amounts of data. In this course, you'll get an introduction to using Hadoop with Amazon EMR.

Target Audience: Individuals who wish to understand key concepts and features of Hadoop and its tools

Duration: 00:54

Description: Clusters are used to store and analyze large volumes of data in a distributed computer environment. This course outlines the best practices to follow when implementing clusters in Hadoop.

Target Audience: Individuals who wish to understand key concepts and features of Hadoop and its tools

Duration: 00:53

Description: This course covers the HDFS architecture and its main building blocks. In addition, subjects such as data replication, communication protocols, and accessibility are introduced.

Target Audience: Individuals who wish to understand key concepts and features of Hadoop and its tools

Duration: 00:38