Each project comes with 2-5 hours of micro-videos explaining the solution.
Get access to 50+ solved projects with iPython notebooks and datasets.
Add project experience to your Linkedin/Github profiles.
I have worked for more than 15 years in Java and J2EE and have recently developed an interest in Big Data technologies and Machine learning due to a big need at my workspace. I was referred here by a... Read More
I have extensive experience in data management and data processing. Over the past few years I saw the data management technology transition into the Big Data ecosystem and I needed to follow suit. I... Read More
In Dezyre's Hadoop hands-on training course, we perform two different projects that require us to stream data from twitter in real time. Most of these hadoop projects are a production scenario which will then involve analyzing the project in a batch mode and representing to end users.
But what if the decision that needs the streamed data is time sensitive? This means that we must stream that data and analyze it in motion. After analysis, the result must be presented as the streaming is taking place.
An example of a use of such system is to analyze public response to any event in real time like a political speech, a sports game, an economic news and much more. People with the access to quality real-time data can then position themselves for profit in such circumstance.
Spark Project - Discuss real-time monitoring of taxis in a city. The real-time data streaming will be simulated using Flume. The ingestion will be done using Spark Streaming.
The goal of this IoT project is to build an argument for generalized streaming architecture for reactive data ingestion based on a microservice architecture.
The goal of this hadoop project is to apply some data engineering principles to Yelp Dataset in the areas of processing, storage, and retrieval.