100+ Solved Projects
Learn by working on real projects
Top Instructors
Learn from Industry Experts
Lifetime access
Learn at your own pace

Best Sellers in “Big Data & Hadoop”

Big Data Project Analyze a streaming log file by integrating Kafka and Kylin

Analyze a streaming log file by integrating Kafka and Kylin

In this project, we are going to analyze streaming logfile dataset by integrating Kafka and Kylin.
4.6
$15  $9
Big Data Project Data analysis and Collaboration using Apache Zeppelin, Pig and Hive

Data analysis and Collaboration using Apache Zeppelin, Pig and Hive

In this project, we will talk about Apache Zeppelin. We will write code, write notes, build charts and share all in one single data analytics environment using Hive, Spark and Pig.
4.6
$15  $9
Big Data Project Online Analytical processing and visualization of retail data with Apache Kylin

Online Analytical processing and visualization of retail data with Apache Kylin

In this project, we will be performing an OLAP cube design using the AdventureWorks dataset. The deliverable for this session will be to design a cube, build and implement it using Kylin, query the cube and even connect familiar tools (like Excel) with our new cube.
4.6
$15  $9
Big Data Project Analyze and Visualize Online Review Data using Spark,Elasticsearch,Sqoop and Kibana

Analyze and Visualize Online Review Data using Spark,Elasticsearch,Sqoop and Kibana

In this project, we will use the yelp review dataset to analyze businesses, reviews and ingest the final output of our data processing in Elasticsearch and use the visualization tool in the ELK stack to visualize various kinds of ad-hoc reports from the data.
4.6
$15  $9
Big Data Project Mine Github Social Profile by building a Spark GraphX Algorithm and a Network Crawler

Mine Github Social Profile by building a Spark GraphX Algorithm and a Network Crawler

In this project, we will look at how to mine and make sense of connections in a simple way by building a Spark GraphX Algorithm and a Network Crawler.
4.6
$15  $9
Big Data Project Build a data pipeline based on messaging using Spark and Hive

Build a data pipeline based on messaging using Spark and Hive

In this project, we will simulate a simple real-world batch data pipeline based on messaging using Spark and Hive.
4.6
$15  $9
Big Data Project Building Real-Time Data Pipelines with Kafka Connect

Building Real-Time Data Pipelines with Kafka Connect

In this big data project, we will see how data ingestion and loading is done with Kafka connect APIs while transformation will be done with Kafka Streaming API.
4.6
Best Seller $15  $9
Big Data Project Implementing Slow Changing Dimensions in a Data Warehouse using Hive and Spark

Implementing Slow Changing Dimensions in a Data Warehouse using Hive and Spark

In this hackerday, we will look at the various types of SCDs and how to implements SCDs in Hive and Spark.
4.7
$15  $9
Big Data Project Tough engineering choices with large datasets in Hive Part - 2

Tough engineering choices with large datasets in Hive Part - 2

It is continuation of the previous hackerday "Tough engineering choices with large datasets in Hive Part - 1", where we will work on processing big data sets using Hive.
4.9
$15  $9
Big Data Project Design a Hadoop Architecture

Design a Hadoop Architecture

Learn to design Hadoop Architecture and understand how to store data using data acquisition tools in Hadoop.
4.8
$9  $3
Big Data Project Using Apache Hive for Real-Time Queries and Analytics

Using Apache Hive for Real-Time Queries and Analytics

Learn to write a Hadoop Hive Program for real-time querying.
4.7
$9  $3
Big Data Project Tough engineering choices with large datasets in Hive Part - 1

Tough engineering choices with large datasets in Hive Part - 1

Towards mastery in Hive in processing big datasets.
4.6
$15  $9
Big Data Project Finding Unique URL's using Hadoop Hive

Finding Unique URL's using Hadoop Hive

Learn to write a Hive program to find the first unique URL, given 'n' number of URL's.
4.9
$9  $3
Big Data Project Spark SQL on Spark 2

Spark SQL on Spark 2

In this hackerday, we will explore the features of Spark SQL in practice.
4.6
$15  $9
Big Data Project Real time log processing using streaming architecture 2

Real time log processing using streaming architecture 2

In this hackerday, we will be performing a real time processing of log entries from applications, using Kafka for the streaming architecture in a microservice sense.
4.7
$15  $9
Big Data Project Real time log processing using streaming architecture

Real time log processing using streaming architecture

In this hackerday, we are going to bring processing to the speed layer of the lambda architecture which opens up capabilities to monitor application real time performance, measure real time comfort with applications and real time alert in case of security
4.6
$15  $9
Big Data Project SQL Analytics with Hive

SQL Analytics with Hive

In this project, we will look the features in Hive that allow us to perform analytical queries over large datasets.
4.8
$15  $9
Big Data Project General architecture for building IOT infrastructure

General architecture for building IOT infrastructure

In this project, our goal is to build an argument for generalized streaming architecture for reactive data ingestion based on a microservice architecture. 
4.9
$15  $9
Big Data Project Measuring impact of US Non-Farm Payroll Report on some FX Markets

Measuring impact of US Non-Farm Payroll Report on some FX Markets

In this project, we want to measure by how much NFP has triggered moves in past markets.
4.8
$15  $9
Big Data Project Data Engineering on Yelp Dataset - NoSQL Storage

Data Engineering on Yelp Dataset - NoSQL Storage

In this project, we will use two NoSQL databases(HBase and MongoDB) to store Yelp business attributes and also learn how to retrieve these data for processing or query.
4.9
$15  $9
Big Data Project Yelp data processing using Spark and Neo4j

Yelp data processing using Spark and Neo4j

In this project, we are going to do network analysis using a graph database so that we can find patterns in how a social network affects business reviews and ratings.
4.8
$15  $9
Big Data Project Yelp Data Processing using Spark and Hive Part 2

Yelp Data Processing using Spark and Hive Part 2

In this project, we going to continue building the data warehouse and will do further data processing to deliver different kinds of data products.
4.7
$15  $9
Big Data Project Yelp Data Processing Using Spark And Hive Part 1

Yelp Data Processing Using Spark And Hive Part 1

In this project, we will continue from a previous hackerday session "Data engineering on Yelp Datasets using Hadoop tools" and will focus on doing the entire data processing using spark.
4.9
$17  $11
Big Data Project Real-time data collection and Spark Streaming Aggregation

Real-time data collection and Spark Streaming Aggregation

In this big data project, we will embark on real-time data collection and aggregation from a simulated real-time system using Spark Streaming.
4.4
Best Seller $17  $11
Big Data Project Data processing with Spark SQL

Data processing with Spark SQL

In this project, we will go through provisioning data for retrieval using Spark SQL.
4.5
$15  $9
Big Data Project Analysing Big Data with Twitter Sentiments using Spark Streaming

Analysing Big Data with Twitter Sentiments using Spark Streaming

In this big data spark project, we will do Twitter sentiment analysis using spark streaming on the incoming streaming data.
4.8
$15  $9
Big Data Project Solving the Hadoop Small File Problem

Solving the Hadoop Small File Problem

In this project, we are going to be continuing the series on data engineering by discussing and implementing various ways to solve the Hadoop Big Data problem.
4.6
$15  $9
Big Data Project E-Commerce Data Warehouse

E-Commerce Data Warehouse

In this project, we are going to be designing a data warehouse for a retail shop.
4.8
Best Seller $17  $11
Big Data Project Real-time Auto Tracking with Spark-Redis

Real-time Auto Tracking with Spark-Redis

In our project, we will discuss real-time monitoring of taxis in a city. The real-time data streaming will be simulated using Flume. The ingestion will be done using Spark Streaming.
4.7
$15  $9
Big Data Project Airline Online Performance

Airline Online Performance

In this project, we are going to make big data available and accessible.
4.8
$15  $9
Big Data Project Data engineering on Yelp Datasets using Hadoop tools

Data engineering on Yelp Datasets using Hadoop tools

In this project, we will be applying some data engineering principles to the Yelp Dataset in the areas of processing, storage, and retrieval.
4.6
$15  $9
Big Data Project Job portal service

Job portal service

In this project, we are going to build the backend of a IT job ad website.
4.7
$15  $9
Big Data Project Processing web server log

Processing web server log

In this project, we will be using a sample application log file from an application server to demonstrated a scaled-down server log processing pipeline.
4.8
Best Seller $17  $11
Big Data Project Analysis of Community Interactions using Spark GraphX

Analysis of Community Interactions using Spark GraphX

In this project, we will be doing an analysis of the level and strength of interactions across areas of coverage of a telecom provider between different areas in the city of Milan.
4.7
$15  $9
Big Data Project Building a Data warehouse using Spark on Hive

Building a Data warehouse using Spark on Hive

In this project we will build a Hive data warehouse from a raw dataset stored in HDFS and present the data in a relational structure so that querying the data will be natural.
4.8
Best Seller $20  $14
Big Data Project Visualise Daily Wikipedia Trends using Hive, Zepellin Notebooks and Airflow

Visualise Daily Wikipedia Trends using Hive, Zepellin Notebooks and Airflow

In this project, we'll work with Apache Airflow and write scheduled workflow, which will download data from Wikipedia archives, upload to S3, process them in HIVE and finally analyze on Zeppelin Notebooks.
4.6
$15  $9
Big Data Project Analyse movie ratings data for better movie recommendation

Analyse movie ratings data for better movie recommendation

In this project, we will be working on Hive and HQL to analyze movie ratings using MovieLens data for better movie recommendation.
4.5
$15  $9
Big Data Project Making real time decision on incoming data using Flume and Kafka

Making real time decision on incoming data using Flume and Kafka

In this project, we will work on ingesting data from a source using Apache Flume and Kafka to make a real-time decision on incoming data.
4.7
$20  $14
Big Data Project Predict Song Preferences using PigLatin DataFu and LZO Codec

Predict Song Preferences using PigLatin DataFu and LZO Codec

In this challenge, we will discover songs for those artists that are associated with the culture of different countries.
4.6
$15  $9
Big Data Project Denormalize JSON data related to Field Service Management and analyse using Hive

Denormalize JSON data related to Field Service Management and analyse using Hive

In this hive project, you will work on denormalizing the JSON data and create HIVE scripts with ORC file format.
4.7
$15  $9
Big Data Project Visualizing Website Clickstream Data with Apache Hadoop

Visualizing Website Clickstream Data with Apache Hadoop

Analyze clickstream data of a website using Hadoop Hive to increase sales by optimizing every aspect of the customer experience on the website from the first mouse click to the last.
4.8
$15  $9
Big Data Project Data Mining Project on Yelp Dataset using Hadoop Hive

Data Mining Project on Yelp Dataset using Hadoop Hive

Use the Hadoop ecosystem to glean valuable insights from the Yelp dataset. You will be analyzing the different patterns that can be found in the Yelp data set, to come up with various approaches in solving a business problem.
4.7
$15  $9