If anyone has a problem running Hive in the CDH3 VM, try this.
Don't lose time like I did. I had the same issue and resolved it as follows:
To grant write access to the Derby metastore database directory used by Hive, run this from inside that directory:
sudo chmod a+rwx . --recursive
(Be warned that this gives read, write, and execute permissions to all users, but this shouldn't be a problem. You can change the r-w-x bits if you want something stricter.)
Then clear the lock files from inside the directory '/var/lib/hive/metastore/metastore_db':
sudo rm *.lck
After that, running the command 'show databases;' should work, as shown below:
hive> show databases;
Time taken: 3.134 seconds
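For anyone who hits this more than once, the two steps above can be wrapped in a small shell function. This is only a rough sketch, not an official Cloudera or Hive tool: the function name clear_derby_locks and the SUDO variable are made up for illustration, and it assumes the chmod and the lock cleanup both target the directory you pass in. It is probably safest to run it only when no Hive session is using the metastore.

```shell
#!/bin/sh
# Sketch only: clear_derby_locks and SUDO are hypothetical names, not part of Hive or CDH.
# Assumption: the chmod and the *.lck cleanup both apply to the directory given as $1.

clear_derby_locks() {
    dir="$1"

    # Bail out if the directory does not exist, instead of touching the wrong place.
    if [ ! -d "$dir" ]; then
        echo "no such directory: $dir" >&2
        return 1
    fi

    # Step 1: give all users read/write/execute on the whole directory tree.
    ${SUDO:-} chmod -R a+rwx "$dir" || return 1

    # Step 2: remove leftover lock files; -f keeps rm quiet if there are none.
    ${SUDO:-} rm -f "$dir"/*.lck
}
```

On the VM it would be called with something like SUDO=sudo clear_derby_locks /var/lib/hive/metastore/metastore_db, using the path mentioned above. Leaving SUDO unset runs the same commands as the current user.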
May 27 2014 07:00 AM
As per today's class, move away from CDH3. CDH4 was released in Oct 2013, and CDH5 is now available.
May 31 2014 09:28 PM