1-844-696-6465 (US)        +91 77600 44484        help@dezyre.com
chicago-crime-data-analysis-on-apache-spark.jpg

Chicago Crime Data Analysis on Apache Spark

In this project, we will look at running various use cases in the analysis of crime data sets using Apache Spark.
Event Date
18th
Aug - 2018
08:00am - 10:30am PST
19th
Aug - 2018
08:00am - 10:30am PST
What are the prerequisites for this project?
  • No knowledge of Spark is required
  • Requires the download and setup of any Hadoop sandbox having Spark or Spark 2

What will you learn

  • Spark's DataFrame vs Dataset
  • Type-safe UDF in Spark
  • Rollup functions in Spark
  • Windowing functions in Spark
  • Running your spark code in Apache Zeppelin

Project Description

In this Hackerday, we will look at running various use cases in the analysis of crime datasets using Apache Spark.
This is a back-to-basics Hackerday session that is going to be very expository for those who have never written spark application or are new to writing spark application using Scala. We will explore the Spark SQL UDF and as well as roll-up and windowing functions.

We will also do a final submission of our application on Apache Zeppelin to submit our application to our friends. We will try to run some of our code in both 1.x and 2.x versions of Spark. However, you are recommended to start moving completely to Spark 2.x.
 

Instructors

 
Michael

Big Data & Enterprise Software Engineer

I am passionate about software development, databases, data analysis and the android platform. My native language is java but no one has stopped me so far from learning and using angular and node.js. Data and data analysis is thrilling and so are my experiences with SQL on Oracle, Microsoft SQL Server, Postgres and MyS see more...

What is Hackerday?

Stay updated in technology trends by working on projects

Live online coding sessions led by industry experts

Build 2-4 projects a month each lasting 6 hours designed to teach you advanced concepts

Code in groups and connect with your community