Data Science Project in Python on BigMart Sales Prediction

The goal of this data science project is to build a predictive model and find out the sales of each product at a given Big Mart store.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with IPython notebooks and datasets.

Project Experience

Add project experience to your LinkedIn/GitHub profiles.

Customer Love

Hiren Ahir

Microsoft Azure SQL Server Developer, BI Developer

I'm a graduate student who came into the job market and found that a university degree wasn't sufficient to get a well-paying job. I aimed at the hottest technology in the market, Big Data, but the word BigData...


Lead Consultant, ITC Infotech

The project orientation is unique, and it helps in understanding the real-time scenarios that most industries are dealing with. And there is no limit; one can go through as many projects...

What will you learn

Understanding the problem statement
Importing the dataset and performing basic EDA
Checking for null values and describing the variables
Imputing null values using pivot tables
Feature engineering: creating new features
Using seaborn to understand how the categorical variables contribute to the target variable
Using box plots to identify outliers
Encoding categorical variables using label and one-hot encoding
Applying linear and Bayesian regression models
Applying bagging ensemble models such as Random Forest and Bagging regressors
Applying boosting models such as Gradient Boosting Trees and XGBoost
Applying the neural network model MLPRegressor
Writing a function for spot-checking models and selecting the best ones for hyperparameter tuning
Defining a function for hyperparameter tuning
Standardization and effect of Standardization
Understanding Robust Scaler and Normalization
Implementing Robust Scaler and Normalization
Concluding the final model and predicting for the test data set
Saving the model using Joblib
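
The spot-checking step in the list above could look roughly like the following. This is a minimal sketch, not the project's own code: the helper name `spot_check`, the synthetic data from `make_regression`, and the choice of four scikit-learn regressors are all assumptions made for illustration. It scores each candidate with cross-validation and ranks them so that the strongest ones can be taken forward to hyperparameter tuning.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import BayesianRidge, LinearRegression
from sklearn.model_selection import cross_val_score


def spot_check(models, X, y, cv=3):
    """Return {name: RMSE}, sorted from lowest (best) to highest.

    The RMSE is the square root of the mean cross-validated MSE.
    `spot_check` is a hypothetical helper, not part of scikit-learn.
    """
    scores = {}
    for name, model in models.items():
        neg_mse = cross_val_score(
            model, X, y, cv=cv, scoring="neg_mean_squared_error"
        )
        scores[name] = float(np.sqrt(-neg_mse.mean()))
    return dict(sorted(scores.items(), key=lambda kv: kv[1]))


# Synthetic stand-in for the prepared features and the sales target (assumption).
X, y = make_regression(n_samples=200, n_features=8, noise=10.0, random_state=0)

models = {
    "linear": LinearRegression(),
    "bayesian_ridge": BayesianRidge(),
    "random_forest": RandomForestRegressor(n_estimators=50, random_state=0),
    "gradient_boosting": GradientBoostingRegressor(random_state=0),
}

results = spot_check(models, X, y)
best_name = next(iter(results))  # first key has the lowest RMSE
for name, rmse in results.items():
    print(f"{name}: RMSE = {rmse:.2f}")
```

Ranking on a single metric keeps the comparison simple; on the real data, the same function could be rerun after each scaling variant to see whether standardization or robust scaling changes the ordering.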

Project Description

The data scientists at BigMart have collected 2013 sales data for 1559 products across 10 stores in different cities. Also, certain attributes of each product and store have been defined. The aim of this data science project is to build a predictive model and find out the sales of each product at a particular store.

Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

The data has missing values because some stores do not report all of the data due to technical glitches, so these values must be treated accordingly.
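
One way to treat such missing values is to impute them from a pivot table of group means. The sketch below runs on a toy DataFrame; the column names `Item_Identifier` and `Item_Weight` are assumptions chosen for illustration and may not match the project's files.

```python
import numpy as np
import pandas as pd

# Toy data; column names are illustrative assumptions.
df = pd.DataFrame({
    "Item_Identifier": ["A", "A", "B", "B", "B", "C"],
    "Item_Weight": [9.0, np.nan, 5.0, 7.0, np.nan, np.nan],
})

# Pivot table of the mean weight per item (missing values are ignored).
item_avg = df.pivot_table(
    values="Item_Weight", index="Item_Identifier", aggfunc="mean"
)

# Fill each missing weight with the mean for that item, where one exists.
missing = df["Item_Weight"].isna()
df.loc[missing, "Item_Weight"] = (
    df.loc[missing, "Item_Identifier"].map(item_avg["Item_Weight"])
)

# Items with no observed weight at all (item "C") fall back to the overall mean.
df["Item_Weight"] = df["Item_Weight"].fillna(df["Item_Weight"].mean())

print(df)
```

Filling by group first keeps the imputed value close to what the same product weighs elsewhere; the overall mean is only a fallback for products that never report a weight.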

Similar Projects

Forecast the business for the upcoming years by exploring hidden trends, calculating machine productivity, making extrapolations and assumptions, and summarizing answers through visualizations.

In this machine learning project, we will build a predictive model to find out the sales of each product at a particular store.

In this machine learning project, we will implement the backpropagation algorithm from scratch for classification problems.

Curriculum For This Mini Project

The Business Problem
Exploring The Dataset
Exploratory Data Analysis (EDA) - Outliers
Exploratory Data Analysis (EDA) - Graphs
Converting Categorical To Numerical
Separating Training And Test Data
Running The Models
Hyperparameter Tuning XGB And GBR
Standard Scaling
Robust Scaling
Final Predictions On The Test Dataset
Saving The Final Model
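
The last few curriculum steps (robust scaling, final predictions, saving the model) might be combined as in the sketch below. It is an illustration under assumptions: the data is synthetic, the final model is taken to be a gradient boosting regressor, and the file name `bigmart_model.joblib` is hypothetical. Wrapping the scaler and the model in one pipeline means the saved file carries the fitted scaling along with the estimator.

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import RobustScaler

# Synthetic stand-in for the prepared features and sales target (assumption).
X, y = make_regression(n_samples=200, n_features=8, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# RobustScaler centers on the median and scales by the interquartile range,
# so outliers influence the scaling less than with standard scaling.
model = make_pipeline(RobustScaler(), GradientBoostingRegressor(random_state=0))
model.fit(X_train, y_train)
preds = model.predict(X_test)

# Save the fitted pipeline and load it back; the path is hypothetical.
path = os.path.join(tempfile.mkdtemp(), "bigmart_model.joblib")
joblib.dump(model, path)
restored = joblib.load(path)
restored_preds = restored.predict(X_test)

print("Predictions match after reload:", np.allclose(preds, restored_preds))
```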