
Implementing Slowly Changing Dimensions in a Data Warehouse using Hive and Spark

In this hackerday, we will look at the various types of slowly changing dimensions (SCDs) and how to implement them in Hive and Spark.
Event Date
Nov - 2017
07:00pm - 09:30pm PST
What are the prerequisites for this project?
  • Students are expected to have a fair knowledge of Big Data and Hadoop.
  • An installation of the Cloudera QuickStart VM.
  • This project assumes that you have a good knowledge of Hadoop. If not, we recommend that you take the Big Data and Hadoop course first.

What will you learn

  • What a slowly changing dimension (SCD) is
  • Types of slowly changing dimensions
  • Updates and transactions in Hive
  • Implementing SCD Types 2 and 3 in Hive
  • Implementing SCD Types 2 and 3 in Spark

Project Description

One of the broadest uses of Hadoop today is building a data warehousing platform on top of a data lake. And in building a data warehouse, the traditions left to us by Kimball and Inmon are still very much in play.

While not every one of the legacy rules should be implemented as-is on a big data platform, the issue of slowly changing dimensions is still front and center.

A slowly changing dimension is a warehouse dimension whose attributes rarely change. However, when they do change, there should be a systematic approach to capturing that change. Examples of SCDs are customer and product information.
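
To make the idea of "capturing a change" concrete, here is a minimal sketch in plain Python rather than Hive or Spark. It assumes the common Kimball-style definitions: Type 2 keeps full history by closing the current row and adding a new one, while Type 3 keeps only the previous value in an extra column. The column names (key, start_date, end_date, is_current, previous_city) and the customer data are hypothetical and are not taken from the session material.

```python
from datetime import date


def apply_scd2(rows, key, new_attrs, effective):
    """Type 2 sketch: close the current row for `key` and append a new version."""
    current = next((r for r in rows if r["key"] == key and r["is_current"]), None)
    if current is not None:
        if all(current.get(k) == v for k, v in new_attrs.items()):
            return rows  # nothing changed, so history stays as it is
        # Expire the old version instead of overwriting it.
        current["end_date"] = effective
        current["is_current"] = False
    new_row = dict(current) if current is not None else {"key": key}
    new_row.update(new_attrs)
    new_row.update(start_date=effective, end_date=None, is_current=True)
    rows.append(new_row)
    return rows


def apply_scd3(row, column, new_value):
    """Type 3 sketch: remember only the previous value in a side column."""
    if row[column] != new_value:
        row["previous_" + column] = row[column]
        row[column] = new_value
    return row


# Hypothetical customer dimension: customer 1 moves from Austin to Denver.
customers = []
apply_scd2(customers, 1, {"city": "Austin"}, date(2017, 1, 1))
apply_scd2(customers, 1, {"city": "Denver"}, date(2017, 11, 1))

# The same move under Type 3 changes one row in place.
type3_row = apply_scd3({"key": 1, "city": "Austin"}, "city", "Denver")
```

After the two Type 2 calls, the list holds two rows for customer 1: the Austin row is closed with an end date and the Denver row is flagged as current. The Type 3 row holds Denver as the city and Austin as the previous city, so any earlier history would be lost on the next change.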

In this hackerday, we will look at the various types of SCDs and how to implement them in Hive and Spark.



Big Data & Enterprise Software Engineer

I am passionate about software development, databases, data analysis and the Android platform. My native language is Java, but no one has stopped me so far from learning and using Angular and Node.js. Data and data analysis are thrilling, and so are my experiences with SQL on Oracle, Microsoft SQL Server, Postgres and MyS...

What is Hackerday?

Stay up to date on technology trends by working on projects

Live online coding sessions led by industry experts

Build 2-4 projects a month, each lasting 6 hours and designed to teach you advanced concepts

Code in groups and connect with your community