The goal of this Spark project is to analyze business reviews from the Yelp dataset and ingest the processed output into Elasticsearch. It also uses Kibana, the visualization tool in the ELK stack, to build ad-hoc reports from the data.
- Ingesting data from a relational database using Sqoop
- Ingesting data from a relational database directly into Spark
- Processing relational data in Spark
- Ingesting processed data into Elasticsearch
- Visualizing review analytics using Kibana
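
The Spark-side steps above (direct JDBC ingestion, processing, and writing to Elasticsearch) could be sketched roughly as follows in PySpark. This is a minimal sketch under stated assumptions, not the project's actual code: the MySQL source, the `review` table, the `business_id` and `stars` columns, the index name, and the helper names `jdbc_options`, `es_options`, and `run_pipeline` are all hypothetical. It also assumes the JDBC driver and the elasticsearch-hadoop connector jars are on the Spark classpath.

```python
from typing import Dict


def jdbc_options(host: str, database: str, table: str,
                 user: str, password: str) -> Dict[str, str]:
    """Options for Spark's built-in JDBC source (MySQL is an assumption)."""
    return {
        "url": f"jdbc:mysql://{host}:3306/{database}",
        "dbtable": table,
        "user": user,
        "password": password,
    }


def es_options(nodes: str, index: str, port: int = 9200) -> Dict[str, str]:
    """Options for the elasticsearch-hadoop Spark SQL connector."""
    return {"es.nodes": nodes, "es.port": str(port), "es.resource": index}


def run_pipeline(spark, source: Dict[str, str], sink: Dict[str, str]) -> None:
    """Read reviews over JDBC, aggregate per business, write to Elasticsearch."""
    # Imported here so the option helpers above work without Spark installed.
    from pyspark.sql import functions as F

    # Ingest the relational table directly into a DataFrame.
    reviews = spark.read.format("jdbc").options(**source).load()

    # Hypothetical processing step: review count and mean rating per business.
    summary = reviews.groupBy("business_id").agg(
        F.count("*").alias("review_count"),
        F.avg("stars").alias("avg_stars"),
    )

    # Index the aggregated rows so Kibana can query them.
    (summary.write.format("org.elasticsearch.spark.sql")
        .options(**sink)
        .mode("append")
        .save())
```

With a running `SparkSession` named `spark`, a call might look like `run_pipeline(spark, jdbc_options("localhost", "yelp", "review", "user", "secret"), es_options("localhost", "yelp_review_summary"))`. The Sqoop path is not shown here because it runs as a separate command-line import rather than inside the Spark job.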