Learn to write a Hive program to find the first unique URL, given 'n' number of URL's.
Users who bought this project also bought
What will you learn
Learn to write a Hive program
What will you get
Access to recording of the complete project
Access to all material related to project like data files, solution files etc.
It is expected that students have a fair knowledge of Hadoop and Hive.
You have a file that contains 200 billion URLs. How will you find the first unique URL using Hadoop MapReduce?
Senior Hadoop Engineer at Sirius Computer Solutions
Abhishek has a corporate experience for 5 years in the fields of Hadoop R&D, Big Data technologies, Hadoop administration, IBM Netezza Database Administration, Data Warehousing, Data Mining (Netezza, Oracle PL/SQL and Microsoft SQL Server), Development, ETL and Advanced analytics. He has a vast exposures on various pro see more...gramming and query languages such as Linux, python, PLSQL, HQL, PIG, Java, VBA, R, SAS and Awk. He has also conceptualized and developed standardized procedures, SQL and VBA codes to streamline various processes. During his one year Big Data (Data Scientist) program at TCS, he also holds an exposure to LP modelling, Operational Research, Supply Chain Management, Machine Learning, Natural programming Language and many more which can add value to any business.