Using Apache Hive for Real-Time Queries and Analytics

Learn to write a Hadoop Hive program for real-time querying.

Videos

Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with IPython notebooks and datasets.

Project Experience

Add project experience to your LinkedIn/GitHub profiles.

What will you learn

Learn to write real-time queries in Apache Hive.

Project Description

The log file contains entries such as: user A visited page 1, user B visited page 3, user C visited page 2, user D visited page 4.

How would you implement a Hadoop job over this log to answer the following queries in real time?

  1. Which page was visited by user C more than 4 times in a day?
  2. Which page was visited by only one user exactly 3 times in a day?
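
As a rough illustration of how the two queries above could be expressed, here is a minimal sketch that uses Python's built-in sqlite3 module as a local stand-in for Hive. The table name `visits`, the columns `visit_date`, `user_id` and `page`, and the sample rows are all hypothetical; the project description does not specify a schema. The SQL uses only GROUP BY, HAVING, COUNT(*) and a subquery in FROM, which Hive also supports, so the statements should port to HQL with little change. Query 2 is read here as "pages where exactly one user made exactly 3 visits on a given day", which is one possible interpretation of the wording.

```python
import sqlite3

# Hypothetical sample log: (visit_date, user_id, page), one row per visit.
DAY = "2024-01-01"
rows = (
    [(DAY, "C", "page2")] * 5    # user C visits page2 five times
    + [(DAY, "C", "page1")] * 2  # user C visits page1 twice
    + [(DAY, "A", "page1")] * 3  # only user A visits page1 exactly 3 times
    + [(DAY, "B", "page3")] * 3  # two users visit page3 exactly 3 times,
    + [(DAY, "D", "page3")] * 3  # so page3 should not match query 2
    + [(DAY, "B", "page4")]
)

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE visits (visit_date TEXT, user_id TEXT, page TEXT)")
conn.executemany("INSERT INTO visits VALUES (?, ?, ?)", rows)

# Query 1: pages that user C visited more than 4 times in a day.
Q1 = """
SELECT visit_date, page, COUNT(*) AS visit_count
FROM visits
WHERE user_id = 'C'
GROUP BY visit_date, page
HAVING COUNT(*) > 4
"""

# Query 2 (assumed reading): pages where exactly one user made
# exactly 3 visits on a given day.
Q2 = """
SELECT visit_date, page
FROM (
    SELECT visit_date, page, user_id, COUNT(*) AS n
    FROM visits
    GROUP BY visit_date, page, user_id
) t
WHERE n = 3
GROUP BY visit_date, page
HAVING COUNT(*) = 1
"""

q1_result = conn.execute(Q1).fetchall()
q2_result = conn.execute(Q2).fetchall()
print(q1_result)
print(q2_result)
```

In Hive, the same statements would run against a table defined over the parsed log files. The inner query of Q2 first counts visits per user, page and day, and the outer query then keeps pages for which only one user reached a count of 3.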

Similar Projects

In this Hadoop Hive project, you will use Hive and HQL to analyze movie ratings from the MovieLens dataset for better movie recommendations.

Hive project: learn to write a Hive program to find the first unique URL, given 'n' URLs.

In this project, we will show how to build an ETL pipeline on streaming datasets using Kafka.

Curriculum For This Mini Project

24-Nov-2017
13m