Data Science Project in Python on BigMart Sales Prediction

The goal of this data science project is to build a predictive model and find out the sales of each product at a given Big Mart store.

Videos

Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

  • Understanding the problem Statement

  • Importing the Dataset and performing basic EDA

  • Checking for the null values and describing the variables

  • Imputation of the Null-Values using pivot tables

  • Feature Engineering/ Creating New features

  • Using seaborn to understand the contribution of the categorical values on target variables

  • Using boxplot for identifying outliers

  • Fixing categorical variables using Label and One hot encoding

  • Applying Linear, Bayesian Regression models

  • Applying ensemble bagging models like Random Forest and Bagging models

  • Applying boosting models like Gradient Boosting Tree and XGboost

  • Applying Neural Network model MLPRegressor

  • Making function for On spot-checking and selecting the best for hyperparameter tuning

  • Defining function for HyperParameter tuning

  • Standardization and effect of Standardization

  • Understanding Robust Scaler and Normalization

  • Implementing Robust Scaler and Normalization

  • Concluding the final model and predicting for the test data set

  • Saving the model using Joblib

Project Description

The data scientists at BigMart have collected 2013 sales data for 1559 products across 10 stores in different cities. Also, certain attributes of each product and store have been defined. The aim of this data science project is to build a predictive model and find out the sales of each product at a particular store.

Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

 The data has missing values as some stores do not report all the data due to technical glitches. Hence, it will be required to treat them accordingly.

Similar Projects

Big Data Project Build a predictive model for Otto Group Product Classification
Build a predictive model to correctly classify products between 9 product categories (fashion, electronics, etc.) using the Otto Group dataset.
Big Data Project Expedia Hotel Recommendations Data Science Project
In this data science project, you will contextualize customer data and predict the likelihood a customer will stay at 100 different hotel groups.
Big Data Project Implement Back-Propagation Algorithm for Classification Problems
In this machine learning project, we will implement Back-propagation Algorithm from scratch for classification problems.
Big Data Project Loan Default Risk Prediction Machine Learning Project
In this project, we are going to predict how capable each applicant is repaying a loan.

Curriculum For This Mini Project

 
  11-Jun-2016
03h 46m