Each project comes with 2-5 hours of micro-videos explaining the solution.
Get access to 50+ solved projects with iPython notebooks and datasets.
Add project experience to your Linkedin/Github profiles.
In this PySpark project, you will simulate a complex real-world data pipeline based on messaging. This project is deployed using the following tech stack - NiFi, PySpark, Hive, HDFS, Kafka, Airflow, Tableau and AWS QuickSight.
In this hadoop project, you will be using a sample application log file from an application server to a demonstrated scaled-down server log processing pipeline.
In this spark streaming project, we are going to build the backend of a IT job ad website by streaming data from twitter for analysis in spark.
Learn to install and setup Oozie.
You will learn to configure workflows to run jobs in hadoop with these Hadoop Oozie projects.
The key to mastering Oozie is knowing the right configuration parameters to get the job done and these big data projects on Oozie will let you master the super-efficient admin assistant.
These hadoop projects for practice on Oozie will help you make workflow development, maintenance and troubleshooting easier when building big data applications.