In this hadoop project, we are going to be continuing the series on data engineering by discussing and implementing various ways to solve the hadoop small file problem.
In this big data project, we will embark on real-time data collection and aggregation from a simulated real-time system using Spark Streaming.
In this hive project, you will design a data warehouse for e-commerce environments.
In this big data spark project, we will do Twitter sentiment analysis using spark streaming on the incoming streaming data.
Hive Project -Learn to write a Hive program to find the first unique URL, given 'n' number of URL's.
In this hadoop project, you will be using a sample application log file from an application server to a demonstrated scaled-down server log processing pipeline.
In this hadoop project, learn about the features in Hive that allow us to perform analytical queries over large datasets.
In this Spark project, we are going to bring processing to the speed layer of the lambda architecture which opens up capabilities to monitor application real time performance, measure real time comfort with applications and real time alert in case of security
The goal of this spark project for students is to explore the features of Spark SQL in practice on the latest version of Spark i.e. Spark 2.0.
The goal of this hadoop project is to apply some data engineering principles to Yelp Dataset in the areas of processing, storage, and retrieval.