Using Apache Hive for Real-Time Queries and Analytics

Using Apache Hive for Real-Time Queries and Analytics

Learn to write a Hadoop Hive Program for real-time querying.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

Customer Love

Read All Reviews

Arvind Sodhi

VP - Data Architect, CDO at Deutsche Bank

I have extensive experience in data management and data processing. Over the past few years I saw the data management technology transition into the Big Data ecosystem and I needed to follow suit. I... Read More

James Peebles

Data Analytics Leader, IQVIA

This is one of the best of investments you can make with regards to career progression and growth in technological knowledge. I was pointed in this direction by a mentor in the IT world who I highly... Read More

What will you learn

Learn to write real-time queries in Apache Hive.

Project Description

The log file contains entries like user A visited page 1, user B visited page 3, user C visited page 2, user D visited page no 4.

How will you implement a Hadoop job for this to answer the following queries in real-time:

  1. Which page was visited by user C more than 4 times in a day?
  2. Which page was visited by only one user exactly 3 times in a day?

Similar Projects

In this Spark project, we are going to bring processing to the speed layer of the lambda architecture which opens up capabilities to monitor application real time performance, measure real time comfort with applications and real time alert in case of security

In this hadoop project, learn about the features in Hive that allow us to perform analytical queries over large datasets.

The goal of this IoT project is to build an argument for generalized streaming architecture for reactive data ingestion based on a microservice architecture. 

Curriculum For This Mini Project