Course details

SparkR is an R package that provides a light-weight frontend to use Apache Spark from R. In Spark 2.0.2, SparkR provides a distributed data frame implementation that supports operations like selection, filtering, aggregation etc. (similar to R data frames) but on large datasets. SparkR also supports distributed machine learning using MLlib.

You will learn how to create spark cluster in Databricks.

You will learn how to create dataframes and grouping data and aggregating data.

+ Read More

Courses you can instantly connect with... Do an online course on Big Data starting now. See all courses