Using Apache Hive for Real-Time Queries and Analytics

Using Apache Hive for Real-Time Queries and Analytics

Learn to write a Hadoop Hive Program for real-time querying.

Videos

Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

Learn to write real-time queries in Apache Hive.

Project Description

The log file contains entries like user A visited page 1, user B visited page 3, user C visited page 2, user D visited page no 4.

How will you implement a Hadoop job for this to answer the following queries in real-time:

  1. Which page was visited by user C more than 4 times in a day?
  2. Which page was visited by only one user exactly 3 times in a day?

Similar Projects

In this big data project, we will be performing an OLAP cube design using AdventureWorks database. The deliverable for this session will be to design a cube, build and implement it using Kylin, query the cube and even connect familiar tools (like Excel) with our new cube.

In this big data project, we will look at how to mine and make sense of connections in a simple way by building a Spark GraphX Algorithm and a Network Crawler.

In this project, we will take a look at three different SQL-on-Hadoop engines - Hive, Phoenix, Impala and Presto.

Curriculum For This Mini Project

24-Nov-2017
13m