Course details
This course is essential to all software engineers, programmers, Data analysts, database administrators and anyone looking to become great at big data.
- You will learn how to use the most popular software in the Big Data industry at moment, using batch processing as well as realtime processing.
- This course will give you enough background to be able to talk about real problems and solutions with experts in the industry.
- You will gain an experience with Ingesting data into Hadoop file system, working with data in batch and Stream processing, Deliver visualization and insights of Data and too many technical skills.
Fundamentals of Big Data
- Intro to Distributed Systems & HDFS
- Exploring Big Data Ecosystem & Distributions
- Basic Intro to NoSQL
- Intro to Hadoop
- Introduction to MapReduce
- Intro to YARN
- Basic Hadoop Cluster Implementation
- Intro to SQL-On-Hadoop
- Intro to Hive
- Basic Implementation of Hive & Hive Server 2
- Ingesting Data into Hadoop
- Intro to Sqoop
- Concept of Stream Processing
- Into to Kafka
Advanced Level: 5 Days / 40 Hours
MapReduce In Depth
- Hadoop Architecture In Depth
- Intro to Apache Zookeeper
- Advanced Cluster Implementation
- Advanced Hive Architecture
- Ingesting Data Into Hive Using Sqoop
- HiveQL
- Intro to Apache HBase
- Intro to ETL Concepts
- Intro to Data Flow using Apache Nifi
- Implementing Apache Nifi Cluster
- Simple Integration Project with Apache Nifi
- Ingesting Data into Hadoop
- Implementing Kafka Cluster
- Building Simple Kafka Producer & Consumer Using Python
- Spark Implementation
- Simple Data Analysis with Apache Spark
Big Data Workshop: 5 Days / 40 Hours
Project 1: Data Ingestion from RDBMS into HIVE (ORC File)
- Using Sqoop to Connect to RDBMS(Oracle / SQL Server)
- Creating Hive Table on ORC File
- Hive Table Performance Tuning
- Using Kafka to Connect to RDBMS(Oracle / SQL Server)
- Kafka Advanced Cluster Configuration
- Ingesting Data into Hadoop/Hive
- Spark Cluster Implementation
- Using PySpark
- Processing Data With Spark
- Stream Processing With Spark
- Integrating Spark with Hive
- Ingesting Data with Spark into Hive
- Using Spark SQL
- Implementing Apache Zeppelin
- Integrating Apache Zeppeling With Spark
- Visualize Data With Zeppeling
Eligibility / Requirements
Attending Course :Big Data Systems ( Level 1 )
To gain the most from the workshop, the following is required:
Knowledge of Programming, Basic of “Java, Scala, Or Python”
Knowledge of Relational Database Management Systems.
Basic knowledge of operating systems & Network.
Course Location
About CLS Learn
Since 1995, CLS Learning solutions is leading the technology learning market in Egypt, the Middle East, and Africa. With our wide network of international partners, trainers, instructors, and technology leaders; we are able to deliver top notch training programs to our students and technology professionals.
25 Years in the market.
We delivered over 4,200 courses to 63,500 professionals in our centers.
We delivered 1,200 courses to 18,240 corporate employees on Site.
See all CLS Learn courses- JavaScript Full stack web developer virtual internship Virtual Bootcamp + Internship at LaimoonAED 1,449Duration: Upto 30 Hours
- EGP 1,023
EGP 4,629Duration: 28 Hours - Big Data Hadoop: SQL & NoSQL Skill-UpEGP 663Duration: Upto 23 Hours