Finding Unique URL's using Hadoop Hive

Finding Unique URL's using Hadoop Hive

Hive Project -Learn to write a Hive program to find the first unique URL, given 'n' number of URL's.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

Learn to write a Hive program

Project Description

You have a file that contains 200 billion URLs. How will you find the first unique URL using Hadoop Hive? 

Similar Projects

Hive Project- Understand the various types of SCDs and implement these slowly changing dimesnsion in Hadoop Hive and Spark.

In this project, we will take a look at three different SQL-on-Hadoop engines - Hive, Phoenix, Impala and Presto.

The goal of this hadoop project is to apply some data engineering principles to Yelp Dataset in the areas of processing, storage, and retrieval.

Curriculum For This Mini Project