Finding Unique URLs using Hadoop Hive

Hive Project: Learn to write a Hive program to find the first unique URL, given 'n' URLs.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with IPython notebooks and datasets.

Project Experience

Add project experience to your LinkedIn/GitHub profiles.

What will you learn

Learn to write a Hive program

Project Description

You have a file that contains 200 billion URLs. How will you find the first unique URL using Hadoop Hive? 
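
One way to approach the question is to group the rows by URL, keep only the groups that contain a single row, and return the one that appears earliest in the file. The sketch below is an illustration under stated assumptions, not the project's actual solution. It assumes a hypothetical table `urls` with columns `line_no` (the URL's position in the original file) and `url`; Hive does not preserve file order on its own, so a position column of this kind would need to be added when the data is loaded. The query uses only standard SQL that is intended to be valid HiveQL as well, and SQLite serves purely as a local stand-in so the logic can be run without a cluster.

```python
import sqlite3

# Standard SQL intended to be valid HiveQL too.
# The table and column names (urls, line_no, url) are hypothetical.
FIRST_UNIQUE_URL_QUERY = """
SELECT url, MIN(line_no) AS first_seen
FROM urls
GROUP BY url
HAVING COUNT(*) = 1
ORDER BY first_seen
LIMIT 1
"""


def first_unique_url(urls):
    """Return the first URL that occurs exactly once, or None if there is none."""
    # In-memory SQLite database used only as a stand-in for a Hive table.
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE urls (line_no INTEGER, url TEXT)")
        # Record each URL together with its 1-based position in the input.
        conn.executemany(
            "INSERT INTO urls (line_no, url) VALUES (?, ?)",
            enumerate(urls, start=1),
        )
        row = conn.execute(FIRST_UNIQUE_URL_QUERY).fetchone()
        return row[0] if row else None
    finally:
        conn.close()


if __name__ == "__main__":
    sample = ["a.com", "b.com", "a.com", "c.com"]
    print(first_unique_url(sample))
```

At the scale of 200 billion rows, the `GROUP BY` does the heavy lifting across the cluster, while the final `ORDER BY ... LIMIT 1` only has to sort the URLs that survived the `HAVING` filter.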

Similar Projects

In this project, we will take a look at four different SQL-on-Hadoop engines: Hive, Phoenix, Impala and Presto.

In this big data project, we will discover songs by artists who are associated with different cultures across the globe.

In this NoSQL project, we will use two NoSQL databases (HBase and MongoDB) to store Yelp business attributes and learn how to retrieve this data for processing or querying.

Curriculum For This Mini Project