Loan Default Risk Prediction Machine Learning Project

In this project, we are going to predict how capable each applicant is of repaying a loan.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your LinkedIn/GitHub profiles.

What you will learn

  • Understanding the problem statement and importing the file

  • Initializing the libraries and understanding their use

  • Using the info and describe functions and extracting information from the results

  • Checking for the null values and performing necessary imputations

  • Plotting histograms and bar plots of numerical variables against the target variable for advanced EDA

  • How to analyze categorical variables using graphs

  • How to plot a heatmap and a FacetGrid in seaborn

  • Creating new features from existing features (Feature Engineering)

  • Understanding one-hot and label encoding and their implementation

  • Applying the Random Forest ensemble method and extracting important features using its feature importances (feature_importances_ in scikit-learn)

  • The difference between a deep learning model and an ML model

  • Creating a function for extensive Feature Engineering and Pre-processing of the Dataset

  • Preparing dataset for LightGBM

  • Initializing parameters for LightGBM

  • Selecting the right metrics according to the Dataset

  • Training the model and making predictions

  • Plotting graphs of different metrics and models to select the best one
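
To make the encoding item in the list above concrete, here is a minimal sketch of the difference between label encoding and one-hot encoding, written in plain Python so it runs without any libraries. The column name, its values, and both helper functions are hypothetical illustrations, not taken from the project's notebook or dataset.

```python
def label_encode(values):
    """Map each distinct category to an integer (categories sorted alphabetically)."""
    categories = sorted(set(values))
    mapping = {cat: idx for idx, cat in enumerate(categories)}
    return [mapping[v] for v in values], mapping


def one_hot_encode(values):
    """Turn each value into a 0/1 vector with one position per category."""
    categories = sorted(set(values))
    rows = [[1 if v == cat else 0 for cat in categories] for v in values]
    return rows, categories


# Hypothetical categorical column, used only for illustration.
contract_type = ["cash", "revolving", "cash", "consumer"]

labels, mapping = label_encode(contract_type)
rows, categories = one_hot_encode(contract_type)

print(mapping)     # category -> integer code
print(labels)      # one integer per row
print(categories)  # column order of the one-hot vectors
print(rows)        # one 0/1 vector per row
```

Label encoding produces a single integer column, which implies an ordering between categories that may not exist; one-hot encoding avoids that at the cost of one extra column per category. In practice, library helpers such as pandas' get_dummies or scikit-learn's LabelEncoder are the usual way to do this on a real dataset.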

Project Description

Home Credit makes use of a variety of alternative data--including telco and transactional information--to predict their clients' repayment abilities. Many people struggle to get loans due to insufficient or non-existent credit histories. And, unfortunately, this population is often taken advantage of by untrustworthy lenders. Home Credit strives to broaden financial inclusion for the unbanked population by providing a positive and safe borrowing experience.

Similar Projects

Big Data Project Predict Quora Question Pairs Meaning using NLP in Python
The goal of this NLP project is to predict which of the provided Quora question pairs contain two questions with the same meaning.
Big Data Project Applying Deep Learning to Time Series Forecasting with Python
In this project, we will use traditional time series forecasting methods as well as modern deep learning methods for time series forecasting.
Big Data Project Predict Employee Computer Access Needs in Python
Data science project in Python: given an employee's job role, predict their access needs using the Amazon employee database.
Big Data Project Learn to prepare data for your next machine learning project
Text data requires special preparation before you can start using it for any machine learning project. In this ML project, you will learn about applying machine learning models to create classifiers and learn how to make sense of textual data.