Data Science Project in Python on BigMart Sales Prediction

Data Science Project in Python on BigMart Sales Prediction

The goal of this data science project is to build a predictive model and find out the sales of each product at a given Big Mart store.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

Customer Love

Read All Reviews

Arvind Sodhi

VP - Data Architect, CDO at Deutsche Bank

I have extensive experience in data management and data processing. Over the past few years I saw the data management technology transition into the Big Data ecosystem and I needed to follow suit. I... Read More

Camille St. Omer

Artificial Intelligence Researcher, Quora 'Most Viewed Writer in 'Data Mining'

I came to the platform with no experience and now I am knowledgeable in Machine Learning with Python. No easy thing I must say, the sessions are challenging and go to the depths. I looked at graduate... Read More

What will you learn

Understanding the problem Statement
Importing the Dataset and performing basic EDA
Checking for the null values and describing the variables
Imputation of the Null-Values using pivot tables
Feature Engineering/ Creating New features
Using seaborn to understand the contribution of the categorical values on target variables
Using boxplot for identifying outliers
Fixing categorical variables using Label and One hot encoding
Applying Linear, Bayesian Regression models
Applying ensemble bagging models like Random Forest and Bagging models
Applying boosting models like Gradient Boosting Tree and XGboost
Applying Neural Network model MLPRegressor
Making function for On spot-checking and selecting the best for hyperparameter tuning
Defining function for HyperParameter tuning
Standardization and effect of Standardization
Understanding Robust Scaler and Normalization
Implementing Robust Scaler and Normalization
Concluding the final model and predicting for the test data set
Saving the model using Joblib

Project Description

The data scientists at BigMart have collected 2013 sales data for 1559 products across 10 stores in different cities. Also, certain attributes of each product and store have been defined. The aim of this data science project is to build a predictive model and find out the sales of each product at a particular store.

Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

 The data has missing values as some stores do not report all the data due to technical glitches. Hence, it will be required to treat them accordingly.

Similar Projects

In this project, we will use traditional time series forecasting methods as well as modern deep learning methods for time series forecasting.

This data science in python project predicts if a loan should be given to an applicant or not. We predict if the customer is eligible for loan based on several factors like credit score and past history.

In this machine learning project, we will use binary leaf images and extracted features, including shape, margin, and texture to accurately identify plant species using different benchmark classification techniques.

Curriculum For This Mini Project

The Business Problem
Exploring The Dataset
Exploratory Data Analysis (eda) - Outliers
Exploratory Data Analysis (eda) - Graphs
Converting Categorical To Numerical
Seperating Training And Test Data
Running The Models
Hyper Parameter Tuning XGB And GBR
Standard Scaling
Robust Scaling
Final Predictions On The Test Dataset
Saving The Final Model