1-844-696-6465 (US)        +91 77600 44484        help@dezyre.com

Spark Project-Analysis and Visualization on Yelp Dataset

The goal of this Spark project is to analyze business reviews from Yelp dataset and ingest the final output of data processing in Elastic Search.Also, use the visualisation tool in the ELK stack to visualize various kinds of ad-hoc reports from the data.

Users who bought this project also bought

What will you learn

  • Ingesting data from relational database using Sqoop
  • Ingesting data from relational database directly into Spark
  • Processing relational data in Spark
  • Ingesting processed data into Elasticsearch
  • Visualizing review analytics using Kibana

What will you get

  • Access to recording of the complete project
  • Access to all material related to project like data files, solution files etc.


  • A fair knowledge of Big Data on Hadoop, Spark.
  • No knowledge of Elasticsearch is required
  • An installation of Hadoop environment or sandbox is required. We recommend Cloudera or Hortonworks Sandbox.

Project Description

Most businesses seek to get reviews on their goods and services one way or another. It is a most basic way for the business to improve their efficiency and subsequently their bottom-line. Get the review is not only the issue, ability to extract and visualize analytics from review data is critical to business success.

In Apache Spark Project, we will use the yelp review dataset to analyze businesses and reviews over a period of time. Perhaps we will spot potential gaps in service delivery or see how business thrive in different scenarios.

Beyond processing this data, we will ingest the final output of our data processing in Elasticsearch and use the visualization tool in the ELK stack to visualize various kinds of ad-hoc reports from the data.



Big Data & Enterprise Software Engineer

I am passionate about software development, databases, data analysis and the android platform. My native language is java but no one has stopped me so far from learning and using angular and node.js. Data and data analysis is thrilling and so are my experiences with SQL on Oracle, Microsoft SQL Server, Postgres and MyS see more...