Price: AED 3,357

    Course details

    Overview

    This course is designed for developers who create applications and analyze Big Data in Apache Hadoop on Windows using Pig and Hive. Topics include: Hadoop, YARN, the Hadoop Distributed File System (HDFS), MapReduce, Sqoop and the HiveODBC ;

    Duration    
    4 days    
     
    Course Objectives


    • Describe Hadoop and Hadoop and YARN  
    • Describe the Hadoop ecosystem  
    • List Components & deployment options for HDP on Windows  
    • Describe the HDFS architecture
    • Use the Hadoop client to input data into HDFS  
    • Transfer data between Hadoop and Microsoft SQL Server
    • Describe the MapReduce and YARN architecture
    • Run a MapReduce job on YARN  
    • Write a Pig script  • Define advanced Pig relations  
    • Use Pig to apply structure to unstructured Big Data
    • Invoke a Pig User-Defined Function  
    • Use Pig to organize and analyze Big Data  
    • Describe how Hive tables are defined and implemented  
    • Use Hive windowing functions  
    • Define and use Hive file formats  
    • Create Hive tables that use the ORC file format  
    • Use Hive to run SQL-like queries to perform data analysis  
    • Use Hive to join datasets  
    • Create ngrams and context ngrams using Hive  
    • Perform data analytics  
    • Use HCatalog with Pig and Hive  
    • Install and configure HiveODBC Driver for Windows  
    • Import data from Hadoop into Microsoft Excel  
    • Define a workflow using Oozie     

      Updated on 27 June, 2018

    Eligibility / Requirements

    Students should be familiar with programming principles and have experience in software development. SQL knowledge and familiarity with Microsoft Windows is also helpful. No prior Hadoop knowledge is ;

    About Agilitics Pte. Ltd.

    Agilitics Pte. Ltd. is Singapore headquartered, Data and Business Analytics focussed company. We are the real experts of the big data domain. 

    Established in 2013, Head quartered at Singapore,

    Agilitics Pte Ltd is a leading Big Data Analytics and Agile Consulting and Training solutions provider

    Our Tagline is Agility + Analytics Delivered.

    We offer a comprehensive range of Big data ecosystem and Agile management solution, services and expertise for Information Management, Data Analytics, Machine Learning, Artificial Intelligence and Smart City Solutions

    See all Agilitics Pte. Ltd. courses
    Courses you can instantly connect with... Do an online course on Operating Systems starting now. See all courses

    Is this the right course for you?

    Rate this page

    Didn't find what you were looking for ?

    or