Learn how you can build Big Data Projects
Webinar recording for Oct 18th
Vamsi Krishna Reddy
Could you please help in finding the recording for Oct 18th session , First session for this batch (Oct 18 weekend batch)
Oct 19 2015 01:34 AM
Identifying Product Bundles from Sales Data Using R Language
In this data science project in R, we are going to talk about subjective segmentation which is a clustering technique to find out product bundles in sales data.
Explore features of Spark SQL in practice on Spark 2.0
The goal of this spark project for students is to explore the features of Spark SQL in practice on the latest version of Spark i.e. Spark 2.0.
Real-Time Log Processing in Kafka for Streaming Architecture
The goal of this apache kafka project is to process log entries from applications in real-time using Kafka for the streaming architecture in a microservice sense.
Spark Project -Real-time data collection and Spark Streaming Aggregation
In this big data project, we will embark on real-time data collection and aggregation from a simulated real-time system using Spark Streaming.
Tough engineering choices with large datasets in Hive Part - 2
This is in continuation of the previous Hive project "Tough engineering choices with large datasets in Hive Part - 1", where we will work on processing big data sets using Hive.
Predict Credit Default | Give Me Some Credit Kaggle
In this data science project, you will predict borrowers chance of defaulting on credit loans by building a credit score prediction model.
Data Science Project on Wine Quality Prediction in R
In this R data science project, we will explore wine dataset to assess red wine quality. The objective of this data science project is to explore which chemical properties will influence the quality of red wines.
Finding Unique URL's using Hadoop Hive
Hive Project -Learn to write a Hive program to find the first unique URL, given 'n' number of URL's.
Data Science Project-All State Insurance Claims Severity Prediction
Data science project in R to develop automated methods for predicting the cost and severity of insurance claims.
Hadoop Project-Analysis of Yelp Dataset using Hadoop Hive
The goal of this hadoop project is to apply some data engineering principles to Yelp Dataset in the areas of processing, storage, and retrieval.
Google Drive link
could you please share the winutils link and also other links that discussed in class
Cannnot see link to join the live class for today 06/06/2018
Deb - Can u or support team upload another copy of Aug 4th recording?
Webinar recordings for this past Sunday & Monday
Recap of Hadoop News for September 2018
Recap of Hadoop News for August 2018
AWS vs Azure-Who is the big winner in the cloud war?
Top 5 Reasons to Learn AWS
Top 50 AWS Interview Questions and Answers for 2018
Recap of Hadoop News for July 2018
You have not activated your email address. We have emailed you an activation code - please enter it below