1-844-696-6465 (US)        +91 77600 44484        help@dezyre.com
online-analytical-processing-visualization-of-retail-data-apache-kylin.jpg

Implementing OLAP on Hadoop using Apache Kylin

In this big data project, we will be performing an OLAP cube design using AdventureWorks database. The deliverable for this session will be to design a cube, build and implement it using Kylin, query the cube and even connect familiar tools (like Excel) with our new cube.
4.64.6

Users who bought this project also bought

What will you learn

  • Apache Kylin and how it works?
  • Installing Apache Kylin in our Quickstart VM
  • Design star schema on our AdventureWorks database
  • Implementing our star schema in Kylin
  • Writing aggregate queries against a Kylin cube
  • Connecting a visualization tool

What will you get

  • Access to recording of the complete project
  • Access to all material related to project like data files, solution files etc.

Prerequisites

  • A fair knowledge of Hadoop, Hive, HBase and MapReduce
  • A fair knowledge of data warehousing and OLAP design
  • A working Hadoop distribution sandbox (eg. Cloudera QuickStart VM or Hortonworks HDP sandbox)

Project Description

Apache Kylin : Implementing OLAP on Hadoop platform

Perform OLAP on Hadoop big data platform has been a burden for a while, primarily due to high latency of queries. A different open source project like impala, presto and even apache hawq have tried to fix the problem with an MPP style of query execution architecture, but with an even larger dataset, performing query aggregation which is key to OLAP queries is still far from desirable.

Apache Kylin (kylin.apache.org) is a Distributed Analytics Engine that provides SQL interface and multidimensional analysis (OLAP) on the large dataset using MapReduce or Spark. This means that I can answer classical MDX questions in the Hadoop platform with a decent amount of latency.

In this big data project, we will be performing an OLAP cube design using the AdventureWorks dataset. The deliverable for this hadoop  be to design a cube, build and implement it using Kylin, query the cube and even connect familiar tools (like Excel) with our new cube.

Instructors

 
Michael

Big Data & Enterprise Software Engineer

I am passionate about software development, databases, data analysis and the android platform. My native language is java but no one has stopped me so far from learning and using angular and node.js. Data and data analysis is thrilling and so are my experiences with SQL on Oracle, Microsoft SQL Server, Postgres and MyS see more...