1-844-696-6465 (US)        +91 77600 44484        help@dezyre.com

Spark integration and analysis with NoSQL Databases 2 - Cassandra

In this project, we will look at Cassandra and how it is suited for especially in a hadoop environment, how to integrate it with spark, installation in our lab environment.
What are the prerequisites for this project?
  • It is expected that students have a fair knowledge of Big Data and hadoop particularly Spark.
  • Installation of a hadoop quickstart VM.
  • Installation of MongoDB and Cassandra in your vm or host machine.

What will you learn

  • Exploratory look at cassandra
  • Data modelling in Cassandra
  • Use cases Cassandra in the enterprise
  • Spark integration using our dataset
  • Materialized Views
  • Comparing Analytical queries of MongoDB and Cassandra
  • Spark Datasources

Project Description

In the last hackerday, we looked at NoSQL databases and their roles in today's enterprise. We talked about design choices with respect to document-oriented and wide-columnar datbases, and conclude by doing hands-on exploration of MongoDB, its integration with spark and writing analytical queries using the MongDB query structures.
Like we also noted, Spark has a benefit of being very extensible to quite a number of storage platforms beyond hadoop. This means that as spark developers, we can write and read from virtually any popular storage platform while building our data pipeline.
In this hackerday, we will conclude that session by take a look at Cassandra. We will look at what it is suited for especially in a hadoop environment, how to integrate it with spark, installation in our lab environment, modelling the UK MOT vehicle testing dataset that we used on MongoDB in the first part. Once loaded, anyone can at anytime, perform analytical queries on the tables.



Big Data & Enterprise Software Engineer

I am passionate about software development, databases, data analysis and the android platform. My native language is java but no one has stopped me so far from learning and using angular and node.js. Data and data analysis is thrilling and so are my experiences with SQL on Oracle, Microsoft SQL Server, Postgres and MyS see more...

What is Hackerday?

Stay updated in technology trends by working on projects

Live online coding sessions led by industry experts

Build 2-4 projects a month each lasting 6 hours designed to teach you advanced concepts

Code in groups and connect with your community