Hadoop Project: Perform basic big data analysis on an airline dataset using the big data tools Pig, Hive, and Impala.

In this Hive project, you will design a data warehouse for e-commerce environments.

In this Hadoop project, you will use a sample application log file from an application server to demonstrate a scaled-down server log processing pipeline.

This is a continuation of the previous Hive project "Tough engineering choices with large datasets in Hive Part - 1", where we will work on processing big data sets using Hive.

In this Hadoop Hive project, explore how to use Hive efficiently with various file formats such as JSON, CSV, ORC, and Avro, and compare their relative performance.

In this Apache Spark SQL project, we will go through provisioning data for retrieval using Spark SQL.

