Data Science Project in Python on BigMart Sales Prediction

The goal of this data science project is to build a predictive model that estimates the sales of each product at a given BigMart store.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your LinkedIn/GitHub profiles.

Customer Love

Shailesh Kurdekar

Solutions Architect at Capital One

I have worked for more than 15 years in Java and J2EE and have recently developed an interest in Big Data technologies and Machine learning due to a big need at my workspace. I was referred here by a...

Camille St. Omer

Artificial Intelligence Researcher, Quora 'Most Viewed Writer' in 'Data Mining'

I came to the platform with no experience and now I am knowledgeable in Machine Learning with Python. No easy thing I must say, the sessions are challenging and go to the depths. I looked at graduate...

What you will learn

Understanding the problem statement
Importing the Dataset and performing basic EDA
Checking for the null values and describing the variables
Imputing null values using pivot tables
Feature engineering / creating new features
Using seaborn to understand how categorical values contribute to the target variable
Using boxplots to identify outliers
Encoding categorical variables using label and one-hot encoding
Applying linear and Bayesian regression models
Applying bagging ensemble models such as Random Forest and Bagging
Applying boosting models such as Gradient Boosting Trees and XGBoost
Applying the neural network model MLPRegressor
Writing a function for spot-checking models and selecting the best ones for hyperparameter tuning
Defining a function for hyperparameter tuning
Standardization and its effect
Understanding Robust Scaler and Normalization
Implementing Robust Scaler and Normalization
Choosing the final model and predicting on the test data set
Saving the model using Joblib
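
The spot-checking step in the list above can be sketched roughly as follows. This is a minimal illustration rather than the project's actual code: the helper name spot_check, the synthetic data, and the use of cross-validated RMSE as the comparison metric are all assumptions made for the example.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import BayesianRidge, LinearRegression
from sklearn.model_selection import cross_val_score


def spot_check(models, X, y, cv=5):
    """Return {name: mean cross-validated RMSE}, ordered from best to worst."""
    scores = {}
    for name, model in models.items():
        # scikit-learn reports negated MSE so that "higher is better" holds.
        neg_mse = cross_val_score(model, X, y, cv=cv, scoring="neg_mean_squared_error")
        scores[name] = float(np.sqrt(-neg_mse).mean())
    return dict(sorted(scores.items(), key=lambda kv: kv[1]))


# Synthetic data standing in for the prepared features (assumption, not BigMart data).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + rng.normal(scale=0.1, size=200)

candidates = {
    "linear": LinearRegression(),
    "bayesian_ridge": BayesianRidge(),
    "random_forest": RandomForestRegressor(n_estimators=50, random_state=0),
    "gradient_boosting": GradientBoostingRegressor(random_state=0),
}
results = spot_check(candidates, X, y)
best_name = next(iter(results))  # lowest RMSE comes first
```

The best few names in results would then be the natural candidates to pass on to a hyperparameter tuning function.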

Project Description

The data scientists at BigMart have collected 2013 sales data for 1559 products across 10 stores in different cities. Certain attributes of each product and store have also been defined. The aim of this data science project is to build a predictive model that estimates the sales of each product at a particular store.

Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

The data has missing values because some stores do not report all the data due to technical glitches, so these gaps must be handled before modeling.
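
One way to treat such missing values is pivot-table imputation: fill a numeric gap with the mean for the same product, and a categorical gap with the most common value for the same kind of store. The sketch below uses a toy table; the column names (item_id, item_weight, outlet_type, outlet_size) and all values are hypothetical and are not taken from the project's dataset.

```python
import numpy as np
import pandas as pd

# Toy data with hypothetical column names; not the real BigMart schema.
df = pd.DataFrame({
    "item_id": ["A", "A", "B", "B", "B", "C"],
    "item_weight": [9.3, np.nan, 5.0, 5.0, np.nan, 12.5],
    "outlet_type": ["Grocery", "Grocery", "Supermarket",
                    "Supermarket", "Grocery", "Supermarket"],
    "outlet_size": ["Small", None, "Medium", "Medium", "Small", None],
})

# Numeric column: mean weight per item (missing values are ignored by the mean).
weight_by_item = df.pivot_table(values="item_weight", index="item_id", aggfunc="mean")
missing_weight = df["item_weight"].isna()
df.loc[missing_weight, "item_weight"] = (
    df.loc[missing_weight, "item_id"].map(weight_by_item["item_weight"])
)

# Categorical column: most frequent size per outlet type.
size_by_type = df.pivot_table(
    values="outlet_size",
    index="outlet_type",
    aggfunc=lambda s: s.mode().iat[0],
)
missing_size = df["outlet_size"].isna()
df.loc[missing_size, "outlet_size"] = (
    df.loc[missing_size, "outlet_type"].map(size_by_type["outlet_size"])
)
```

If an item or outlet type has no observed value at all, the lookup returns nothing to fill with, so a global fallback (for example the overall mean or mode) would still be needed.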

Similar Projects

In this data science project, you will build a machine learning model that can identify nerve structures in a data set of ultrasound images of the neck. This will help improve catheter placement and contribute to a more pain-free future.

In this project, we are going to predict how capable each applicant is of repaying a loan.

Build a predictive model to correctly classify products between 9 product categories (fashion, electronics, etc.) using the Otto Group dataset.

Curriculum For This Mini Project

The Business Problem
Exploring The Dataset
Exploratory Data Analysis (EDA) - Outliers
Exploratory Data Analysis (EDA) - Graphs
Converting Categorical To Numerical
Separating Training And Test Data
Running The Models
Hyperparameter Tuning XGB And GBR
Standard Scaling
Robust Scaling
Final Predictions On The Test Dataset
Saving The Final Model
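
The scaling and model-saving steps at the end of the curriculum can be illustrated with a short sketch. It is not the project's code: the synthetic data, the Ridge model used as a stand-in for the final model, and the file name bigmart_model.joblib are assumptions made for the example.

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import RobustScaler, StandardScaler

# Synthetic features with one extreme outlier in the first column (assumption).
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
X[0, 0] = 1000.0
y = 2.0 * X[:, 1] + rng.normal(scale=0.1, size=100)

# StandardScaler centers on the mean, which the outlier drags upward;
# RobustScaler centers on the median and scales by the interquartile range.
standard = StandardScaler().fit(X)
robust = RobustScaler().fit(X)
print("mean of column 0:  ", standard.mean_[0])
print("median of column 0:", robust.center_[0])

# Bundling scaler and model in one pipeline means both are saved together.
model = make_pipeline(RobustScaler(), Ridge(alpha=1.0)).fit(X, y)

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "bigmart_model.joblib")  # hypothetical file name
    joblib.dump(model, path)
    restored = joblib.load(path)

same_predictions = np.allclose(model.predict(X), restored.predict(X))
```

Saving the whole pipeline rather than the bare model avoids having to re-fit or separately store the scaler when predicting on the test data later.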