Course details

Welcome to this course: Data Science - Sparklyr Basics for Beginners. Apache Spark is increasingly adopted for developing distributed applications. Over the past year, transforming the world with data has typically been achieved by disrupting and changing real processes in real industries. To operate at this level, you need to build data science solutions of substance: solutions that solve real problems. The Spark SQL APIs provide an optimized interface that helps developers build such applications quickly and easily.

In this course, you will:

  • Understand the differences between working with data frames in R and Spark
  • Learn to perform exploratory data analysis in Spark using sparklyr
  • Learn how to connect to Spark locally or to a remote Spark cluster
  • Learn how to build data products in R that don't rely on storing big data locally
  • Learn how to interact with data in Apache Spark through sparklyr and Spark SQL

By the end of this course, you will be able to use Spark as a big data operating system, understand how to implement advanced analytics on the new APIs, and see how easy it is to use Spark in day-to-day tasks.

Updated on 18 February, 2018