Each project comes with 2-5 hours of micro-videos explaining the solution.
Get access to 50+ solved projects with IPython notebooks and datasets.
Add project experience to your LinkedIn/GitHub profiles.
In this Hackerday, we will demonstrate how to build an ETL pipeline on streaming datasets using Kafka. We will use the trips and fares datasets from the New York Taxi and Limousine Commission to show how to ingest data in real time, join it to other streaming datasets, and store the results in a database.
We will answer questions such as what each driver earns per hour and which drivers are near a given location at any point in time, among other things.
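To make the join and the hourly income report concrete, here is a minimal sketch in plain Python. It is not the project's Kafka pipeline: it works on small in-memory lists instead of topics, and the field names (`driver_id`, `pickup_datetime`, `total_amount`) and the join key are assumptions chosen for illustration, not the dataset's confirmed schema. A real streaming join would also need a time window and state store, which this sketch leaves out.

```python
from collections import defaultdict
from datetime import datetime


def hour_bucket(ts: str) -> str:
    """Truncate a 'YYYY-MM-DD HH:MM:SS' timestamp to the start of its hour."""
    dt = datetime.strptime(ts, "%Y-%m-%d %H:%M:%S")
    return dt.strftime("%Y-%m-%d %H:00")


def join_trips_and_fares(trips, fares):
    """Inner-join trip and fare records on (driver_id, pickup_datetime).

    The join key is a hypothetical choice; trips without a fare are dropped.
    """
    fares_by_key = {(f["driver_id"], f["pickup_datetime"]): f for f in fares}
    for trip in trips:
        fare = fares_by_key.get((trip["driver_id"], trip["pickup_datetime"]))
        if fare is not None:
            yield {**trip, **fare}


def hourly_income(joined):
    """Sum total_amount per (driver_id, hour), i.e. a one-hour tumbling window."""
    totals = defaultdict(float)
    for rec in joined:
        key = (rec["driver_id"], hour_bucket(rec["pickup_datetime"]))
        totals[key] += rec["total_amount"]
    return dict(totals)


# Made-up sample records, for illustration only.
trips = [
    {"driver_id": "d1", "pickup_datetime": "2013-01-01 00:05:00", "trip_distance": 1.2},
    {"driver_id": "d1", "pickup_datetime": "2013-01-01 00:40:00", "trip_distance": 3.0},
    {"driver_id": "d2", "pickup_datetime": "2013-01-01 01:10:00", "trip_distance": 0.8},
    {"driver_id": "d1", "pickup_datetime": "2013-01-01 01:15:00", "trip_distance": 2.1},
]
fares = [
    {"driver_id": "d1", "pickup_datetime": "2013-01-01 00:05:00", "total_amount": 10.5},
    {"driver_id": "d1", "pickup_datetime": "2013-01-01 00:40:00", "total_amount": 4.5},
    {"driver_id": "d2", "pickup_datetime": "2013-01-01 01:10:00", "total_amount": 8.0},
]

income = hourly_income(join_trips_and_fares(trips, fares))
for (driver, hour), amount in sorted(income.items()):
    print(driver, hour, amount)
```

In a Kafka-based version, the two lists would be replaced by consumers on a trips topic and a fares topic, and the per-hour totals would be written to the database rather than printed.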
In this Hadoop project, we continue the series on data engineering by discussing and implementing various ways to solve the Hadoop small file problem.
In this Spark project, we are going to bring processing to the speed layer of the lambda architecture, which opens up capabilities to monitor application real-time performance, measure real-time comfort with applications, and raise real-time alerts in case of security
In this big data project, we will talk about Apache Zeppelin. We will write code, write notes, build charts, and share them, all in a single data analytics environment using Hive, Spark, and Pig.