1-844-696-6465 (US)        +91 77600 44484        help@dezyre.com
data-analysis-collaboration-using-zeppelin.jpg

Data Analysis and Visualisation using Spark and Zeppelin

In this big data project, we will talk about Apache Zeppelin. We will write code, write notes, build charts and share all in one single data analytics environment using Hive, Spark and Pig.
4.64.6

Users who bought this project also bought

What will you learn

  • Apache Zeppelin: What it is and how it works
  • Installing Zeppelin interpreters
  • Running Spark, Hive and Pig code on your notebook
  • Writing markdown notes or narrative text.
  • Collaboration or Sharing your book with others
  • Discuss other notebook alternatives like (Jupyter or Databricks notebooks)

What will you get

  • Access to recording of the complete project
  • Access to all material related to project like data files, solution files etc.

Prerequisites

  • A fair knowledge of Hadoop, Hive, and Spark SQL
  • A working Hadoop distribution sandbox (eg. Cloudera QuickStart VM or Hortonworks HDP sandbox)

Project Description

A notebook is a code execution environment that allows for creating, sharing code and its execution, visualization and other text information (like markups). It enables an interactive computing in the area of data exploration or analysis. It is logical to a sharable Grunt shell for Pig, or scala shell and PySpark shell for Spark, or beeline for Hive but with visualization, discovery and collaboration.

In this big data Project, we will talk about one of this notebook - Apache Zeppelin. With Zeppelin, we will do a number of data analysis by answering some questions on the crime dataset using Hive, Spark and Pig. We will prepare some chart to better represent our results and finally share our results with the collaborative or sharing feature of the notebook.

On completing this big data project using zeppelin, participants will have known what Zeppelin is, gained the ability to install new interpreters, use Zeppelin for performing data analysis, sharing results with their friends or colleagues. Also, the participant will be informed of other notebooks in the data ecosystem like Jupyter or the databricks cloud notebooks.

Instructors

 
Michael

Big Data & Enterprise Software Engineer

I am passionate about software development, databases, data analysis and the android platform. My native language is java but no one has stopped me so far from learning and using angular and node.js. Data and data analysis is thrilling and so are my experiences with SQL on Oracle, Microsoft SQL Server, Postgres and MyS see more...