Course details

This course explains what the Parquet format is, what its advantages are, and how to create a Hive table in Parquet format on Cloudera. Although many file formats exist, Parquet stands out and is widely used across frameworks and languages alongside Hadoop. In our Hadoop cluster, the biggest table is now stored in Parquet; it currently takes 270 TB of HDFS storage (810 TB of raw storage with 3x replication) and serves as the primary source of data for most of the higher-level aggregated tables. Parquet is especially good for queries that read particular columns from a "wide" table (one with many columns), since only the needed columns are read and I/O is minimized. The course is useful for students and Big Data developers who want to learn about Apache Parquet.

The course is useful for all developers.
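
As a rough illustration of the point about wide tables, the toy sketch below compares a row-oriented layout with a column-oriented one when a query needs a single column. It is not the Parquet file format or any Parquet library; the sample table, the column names and the helper functions are all made up for this example.

```python
# Toy illustration only: plain Python lists, not the Parquet file format.
# The table contents and column names are invented for this example.
rows = [
    {"id": 1, "name": "a", "city": "x", "amount": 10.0},
    {"id": 2, "name": "b", "city": "y", "amount": 20.0},
    {"id": 3, "name": "c", "city": "z", "amount": 30.0},
]

# Row-oriented layout: each record keeps all of its fields together.
row_store = [tuple(r.values()) for r in rows]

# Column-oriented layout: the values of each column are kept together.
column_store = {key: [r[key] for r in rows] for key in rows[0]}


def sum_amount_row_store(store):
    """Sum the 'amount' field; return (total, number of values touched)."""
    total = 0.0
    touched = 0
    for record in store:
        touched += len(record)  # the whole record is read to reach one field
        total += record[3]      # 'amount' is the fourth field
    return total, touched


def sum_amount_column_store(store):
    """Sum the 'amount' column; return (total, number of values touched)."""
    values = store["amount"]    # only the needed column is read
    return sum(values), len(values)


row_total, row_touched = sum_amount_row_store(row_store)
col_total, col_touched = sum_amount_column_store(column_store)

print("row layout:   ", row_total, "values touched:", row_touched)
print("column layout:", col_total, "values touched:", col_touched)
```

Both layouts give the same total, but the column layout touches only the values of the one column the query asks for. The gap grows with the number of columns in the table.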


Updated on 22 March, 2018
