1-844-696-6465 (US)        +91 77600 44484        help@dezyre.com
sql-analytics-with-hive.jpg

Hadoop Project for Beginners-SQL Analytics with Hive

In this hadoop project, learn about the features in Hive that allow us to perform analytical queries over large datasets.
4.84.8

Users who bought this project also bought

What will you learn

  • Hive high-level recap
  • Data ingestion/transformation using Sqoop, Spark and Hive
  • File formats and query performance
  • Writing aggregate queries using UDAFs.
  • Aggregation using widowing functions.
  • Query optimizations in Hive

What will you get

  • Access to recording of the complete project
  • Access to all material related to project like data files, solution files etc.

Prerequisites

  • Installation Cloudera Quickstart VM.
  • It is also essential for students to have a fair knowledge of Hadoop.
  • No knowledge of Hive is required for this hackerday as we will do a little overview of hive before we begin.

Project Description

In this hive project, we want to take a deeper dive into some analytical features in Hive. Using SQL is still very dominant and will remain so for the nearest features. Most big data tools have been adapted to allow users interact with them using the familiar SQL language. This is because of years of knowledge and skill that has gone into training, acceptance, tooling, standards development and re-engineering. So in many cases, using these cool features of SQL to access data solves a lot of analytical questions without ever needing us to resort to machine learning, BI or data mining.

In this big data project, we want to look at these features in Hive that allows us to perform analytical queries over large datasets.

We will be using the adventure works dataset in a MySQL dataset. Therefore, there will be a need to ingest and transform the data before we proceed to analytics.

Instructors

 
Michael

Big Data & Enterprise Software Engineer

I am passionate about software development, databases, data analysis and the android platform. My native language is java but no one has stopped me so far from learning and using angular and node.js. Data and data analysis is thrilling and so are my experiences with SQL on Oracle, Microsoft SQL Server, Postgres and MyS see more...

Curriculum For This Mini Project

 
  Overview
00:11:00
  SerDes
00:03:22
  Cloning the dataset
00:02:29
  Understanding the dataset
00:05:27
  Load the data
00:06:57
  Query the data
00:11:42
  Create a Sqoop job
00:19:03
  Executing the Sqoop job
00:15:23
  Why is append used ?
00:04:55
  Build hive tables on top of the data
00:11:20
  Troubleshooting hive table
00:03:06
  Using Parquet and xpath
00:18:18
  Select statement
00:12:16
  Use case based aggregations
00:09:39
  Q&A - the problem statement
00:14:22
  Q&A - Hive versus myql database
00:08:16
  Enhancing aggregate functions
00:25:40
  Grouping sets
00:08:34
  Rollup versus Cube
00:16:41
  Windowing analytic functions
00:18:47
  Properties of windowing analytic functions
00:24:32
  Solving an example - finding %
00:16:41