Learn how you can build Big Data Projects
External script plugin, UDF and UDAF for Hive
You can check the materials that I presented in the previous session from below link:
Jun 10 2014 03:40 AM
Demand prediction of driver availability using multistep time series analysis
In this supervised learning machine learning project, you will predict the availability of a driver in a specific area by using multi step time series analysis.
Implementing Slow Changing Dimensions in a Data Warehouse using Hive and Spark
Hive Project- Understand the various types of SCDs and implement these slowly changing dimesnsion in Hadoop Hive and Spark.
Predict Employee Computer Access Needs in Python
Data Science Project in Python- Given his or her job role, predict employee access needs using amazon employee database.
Event Data Analysis using AWS ELK Stack
This Elasticsearch example deploys the AWS ELK stack to analyse streaming event data. Tools used include Nifi, PySpark, Elasticsearch, Logstash and Kibana for visualisation.
Expedia Hotel Recommendations Data Science Project
In this data science project, you will contextualize customer data and predict the likelihood a customer will stay at 100 different hotel groups.
Forecast Inventory demand using historical sales data in R
In this machine learning project, you will develop a machine learning model to accurately forecast inventory demand based on historical sales data.
Walmart Sales Forecasting Data Science Project
Data Science Project in R-Predict the sales for each department using historical markdown data from the Walmart dataset containing data of 45 Walmart stores.
Perform Time series modelling using Facebook Prophet
In this project, we are going to talk about Time Series Forecasting to predict the electricity requirement for a particular house using Prophet.
Web Server Log Processing using Hadoop
In this hadoop project, you will be using a sample application log file from an application server to a demonstrated scaled-down server log processing pipeline.
Mercari Price Suggestion Challenge Data Science Project
Data Science Project in Python- Build a machine learning algorithm that automatically suggests the right product prices.
If anyone has a problem to run Hive in CDH3 VM, try this:
Getting error when accessing HIVE commands
HCatalog Providing interoperability across data processing tools such as Pig, MapReduce, and Hive?
Hive Assignment - Nasdaq Top Dividend
A couple of hive questions
Hive Thrift Server takes Forever to Start
Recap of Hadoop News for September 2018
Recap of Hadoop News for August 2018
AWS vs Azure-Who is the big winner in the cloud war?
Top 5 Reasons to Learn AWS
Top 50 AWS Interview Questions and Answers for 2018
Recap of Hadoop News for July 2018
You have not activated your email address. We have emailed you an activation code - please enter it below