Predicting Loan Default

Predicting Loan Default

In this project, we will automate the loan eligibility process (real-time) based on customer details while filling the online application form.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

Customer Love

Read All Reviews


Lead Consultant, ITC Infotech

The project orientation is very much unique and it helps to understand the real time scenarios most of the industries are dealing with. And there is no limit, one can go through as many projects... Read More

Ray Han

Tech Leader | Stanford / Yale University

I think that they are fantastic. I attended Yale and Stanford and have worked at Honeywell,Oracle, and Arthur Andersen(Accenture) in the US. I have taken Big Data and Hadoop,NoSQL, Spark, Hadoop... Read More

What will you learn

Understanding the Problem Statement and Importing the Dataset
Performing basic EDA to get Insights into the data
Importing the necessary libraries
Using Info function to check for null values and datatypes
Imputing null values using suitable methods
Converting categorical values into numerical vectors
Plotting barplot of the dependent variable versus Independent variable
Using Boxplot for identifying outliers
Seperating dependent and Independent columns for training the model
Using train_test_split function for creating training and testing dataset
Understanding and Implementing Standardization
Applying ensemble model using Random Forest Classifier
Applying Decision Tree Classifier using AdaBoost
Applying ensembling model Voting Classifier
Applying Liner Model Logistic Regression
Plotting graphs for weight coefficients for different variables
Defining a function for performing Cross-Validation and calculating accuracy simultaneously
Applying Gradient Boosting Classifier and feature selection to extract best features for GBC
Extracting best features for Random Forest Classifier
Using the selected features for training the final model
Making predictions using the trained model and saving the predictions

Project Description

About Company
Dream Housing Finance company deals in all home loans. They have a presence across all urban, semi-urban and rural areas. Customer first applies for the home loan after that company validates the customer eligibility for the loan.

The company wants to automate the loan eligibility process (real-time) based on customer detail provided while filling online application form. These details are Gender, Marital Status, Education, Number of Dependents, Income, Loan Amount, Credit History and others. To automate this process, they have given a problem to identify the customer's segments, those are eligible for loan amount so that they can specifically target these customers. Here they have provided a partial data set.


  1. Anaconda Continuum Python 64-bit
  2. Seaborn for visualization


Similar Projects

In this project, we will build a model to predict the purchase amount of customers against various products which will help a retail company to create personalized offer for customers against different products.

There are different time series forecasting methods to forecast stock price, demand etc. In this machine learning project, you will learn to determine which forecasting method to be used when and how to apply with time series forecasting example.

The goal of this machine learning project is to predict which products existing customers will use next month based on their past behaviour and that of similar customers.

Curriculum For This Mini Project

02h 19m