Course details

Overview

This course provides a hands-on learning experience for data scientists and Hadoop developers, covering Spark Core, Data Frames, ML-Lib and Spark-ML, with a focus on pre-processing machine learning ;
 
Duration
24 hours (online)
4 days (ILT) 
 
Course Topics

• Introduction to Apache Spark
• The Spark RDD API: Transformations & Actions
Class Datasets: Crime Reports & Weather
• Performance Tuning for Spark
• Spark SQL and DataFrames API
• Working with Spark DataFrames - Basic
• Working with Spark DataFrames - Joins
• Working with Spark DataFrames - Advanced
• Machine Learning with Spark - Mllib & Spark-ML API
• Spark-ML: A pipeline abstraction
• Building Predictive Models with Spark-ML
• Tree-based Predictive Models with Spark-ML
• Optimizing Supervised Learning Models with Spark-ML
• Clustering with Spark-ML
• Recommender Systems with Spark-ML
• The future of Spark 
 
  Updated on 16 May, 2018

Eligibility / Requirements

Participants must have a basic understanding of Hadoop and a good working knowledge of Python.  Participants must understand basic machine learning concepts and algorithms and have a solid understanding of SQL or ;

About Agilitics Pte. Ltd.

Agilitics Pte. Ltd. is Singapore headquartered, Data and Business Analytics focussed company. We are the real experts of the big data domain. 

Established in 2013, Head quartered at Singapore,

Agilitics Pte Ltd is a leading Big Data Analytics and Agile Consulting and Training solutions provider

Our Tagline is Agility + Analytics Delivered.

We offer a comprehensive range of Big data ecosystem and Agile management solution, services and expertise for Information Management, Data Analytics, Machine Learning, Artificial Intelligence and Smart City Solutions

See all Agilitics Pte. Ltd. courses
Courses you can instantly connect with... Do an online course on Data Science starting now. See all courses

Is this the right course for you?

Didn't find what you were looking for ?

or