In Eclipse you have to add the all the jars that are from hadoop.mapred.* packages. Please right click on the project -> properties ->Java Build path -> Libraries Tab -> Add external jars and add the following
/usr/lib/hadoop - add hadoop-annotations.jar,hadoop-auth.jar,hadoop-common.jar
/usr/lib/hadoop/client - Add all jars
/usr/lib/hadoop/lib - Add all jars.
Once these jars are added, Eclipse will recognize the MapReduce programs
Text data requires special preparation before you can start using it for any machine learning project.In this ML project, you will learn about applying Machine Learning models to create classifiers and learn how to make sense of textual data.
In this spark project, we will continue building the data warehouse from the previous project Yelp Data Processing Using Spark And Hive Part 1 and will do further data processing to develop diverse data products.
The goal of this Spark project is to analyze business reviews from Yelp dataset and ingest the final output of data processing in Elastic Search.Also, use the visualisation tool in the ELK stack to visualize various kinds of ad-hoc reports from the data.