Course details
Apache Hadoop is an open-source software framework for distributed storage and distributed processing of large data sets on computer clusters built from commodity hardware.
In this course we'll discuss several important aspects of Hadoop, such as HDFS (Hadoop Distributed File System), MapReduce, Hive, HBase, and Pig.
First we'll cover an overview of Big Data: what Big Data is, facts about Big Data, example scenarios, and Hadoop cluster architecture. Then we'll move on to HDFS: its components and architecture, including the NameNode, Secondary NameNode, and DataNode.
The next module is about MapReduce. Here we'll talk about the Map phase and the Reduce phase, the architecture of MapReduce, Combiners, and Reducers.
The next module is about Pig. Here we'll see what Apache Pig is, why it matters, the Pig Latin language, and where to avoid Pig.
Then we'll talk about HBase: its use cases, general commands, DDL and DML in HBase, how to create, delete, and integrate tables in HBase, and a lot more.
So start learning Hadoop today.
Updated on 25 August, 2016