Predict Credit Default | Give Me Some Credit Kaggle

Predict Credit Default | Give Me Some Credit Kaggle

In this data science project, you will predict borrowers chance of defaulting on credit loans by building a credit score prediction model.

Videos

Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

Understanding the problem statement
Importing the dataset
Initializing necessary libraries and understanding its use
Performing basic EDA on the dataset
Visualizing outliers using boxplot and whiskers plot
Fixing the outliers using IQR method
Creating new features form existing features/Feature engineering
Preparing the dataset for fitting it into a model
Applying Logistics Regression
Understanding "Sensitivity" and "Specificity"
ROC and AUC curve as evaluation metrics
Sampling technique called as Bagging and boosting technique
Applying Gradient Boosting and Random Forest
Making final predictions

Project Description

Banks often depend on credit score prediction models to approve or deny a loan request. A good prediction model is necessary for a bank so that they can provide maximum credit without exceeding the risk threshold. This data science project uses credit score dataset which has fairly large volume of data (250K). The predictive models will be build following various approaches - random forests, graident boosting and logistic regression. At the end of the project you will build a predictive model that will automatically score each applicant with a credit score which is human readable and easy to interpret.

Similar Projects

In this R data science project, we will explore wine dataset to assess red wine quality. The objective of this data science project is to explore which chemical properties will influence the quality of red wines.

In this machine learning project, we will predict which coupons a customer will buy.

In this project, we are going to talk about H2O and functionality in terms of building Machine Learning models.

Curriculum For This Mini Project

Problem Statement
03m
Classification vs Prediction
03m
Import Data Set
02m
Data Set Exploration
20m
Data Validation - Missing Values
38m
Outliers Overview
05m
Boxplot to Identify Outliers
01m
Quantile Method
09m
Replace Outliers
02m
What is Linear Regression?
15m
What is Logistic Regression?
06m
Convert Data Type to factor
13m
Build a Regression Model
11m
Model Summary
02m
Prediction
01m
Calculate Event Rate
07m
ROC Curve method
13m
Decision Trees
14m
Prediction using Trees
12m
Sampling Techniques
07m