Predicting Loan Default

Predicting Loan Default

In this project, we will automate the loan eligibility process (real-time) based on customer details while filling the online application form.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

Customer Love

Read All Reviews

Ray Han

Tech Leader | Stanford / Yale University

I think that they are fantastic. I attended Yale and Stanford and have worked at Honeywell,Oracle, and Arthur Andersen(Accenture) in the US. I have taken Big Data and Hadoop,NoSQL, Spark, Hadoop... Read More

Camille St. Omer

Artificial Intelligence Researcher, Quora 'Most Viewed Writer in 'Data Mining'

I came to the platform with no experience and now I am knowledgeable in Machine Learning with Python. No easy thing I must say, the sessions are challenging and go to the depths. I looked at graduate... Read More

What will you learn

Understanding the Problem Statement and Importing the Dataset
Performing basic EDA to get Insights into the data
Importing the necessary libraries
Using Info function to check for null values and datatypes
Imputing null values using suitable methods
Converting categorical values into numerical vectors
Plotting barplot of the dependent variable versus Independent variable
Using Boxplot for identifying outliers
Seperating dependent and Independent columns for training the model
Using train_test_split function for creating training and testing dataset
Understanding and Implementing Standardization
Applying ensemble model using Random Forest Classifier
Applying Decision Tree Classifier using AdaBoost
Applying ensembling model Voting Classifier
Applying Liner Model Logistic Regression
Plotting graphs for weight coefficients for different variables
Defining a function for performing Cross-Validation and calculating accuracy simultaneously
Applying Gradient Boosting Classifier and feature selection to extract best features for GBC
Extracting best features for Random Forest Classifier
Using the selected features for training the final model
Making predictions using the trained model and saving the predictions

Project Description

About Company
Dream Housing Finance company deals in all home loans. They have a presence across all urban, semi-urban and rural areas. Customer first applies for the home loan after that company validates the customer eligibility for the loan.

The company wants to automate the loan eligibility process (real-time) based on customer detail provided while filling online application form. These details are Gender, Marital Status, Education, Number of Dependents, Income, Loan Amount, Credit History and others. To automate this process, they have given a problem to identify the customer's segments, those are eligible for loan amount so that they can specifically target these customers. Here they have provided a partial data set.


  1. Anaconda Continuum Python 64-bit
  2. Seaborn for visualization


Similar Projects

In this data science project, you will be working on building a machine learning model that can identify nerve structures in a data set of ultrasound images of the neck. This will help enhance catheter placement and contribute to a more pain free future.

In this project, we are going to talk about insurance forecast by using regression techniques.

In this project, we will use traditional time series forecasting methods as well as modern deep learning methods for time series forecasting.

Curriculum For This Mini Project

02h 19m