Walmart Sales Forecasting Data Science Project

Walmart Sales Forecasting Data Science Project

Data Science Project in R-Predict the sales for each department using historical markdown data from the Walmart dataset containing data of 45 Walmart stores.

Videos

Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

Customer Love

Read All Reviews

James Peebles

Data Analytics Leader, IQVIA

This is one of the best of investments you can make with regards to career progression and growth in technological knowledge. I was pointed in this direction by a mentor in the IT world who I highly... Read More

Hiren Ahir

Microsoft Azure SQL Sever Developer, BI Developer

I'm a Graduate student and came into the job market and found a university degree wasn't sufficient to get a good paying job. I aimed at hottest technology in the market Big Data but the word BigData... Read More

What will you learn

What you will learn
Understand the Problem Statement
Perform basic EDA to familiarize with the data
Take care of missing values and datatype issues in the data
Understand the unique key in different data and merging the data
Perform Univariate analysis for both numeric and categorical variables
Perform Bi-variate analysis to identify redundant variables
Plot Trend of each predictor with the target variable
Do in-depth analysis on the impact of Date/Week on Sales
Create new features that might add value to the model
Define a function for each set of code that might need to be repeated again
Prepare the data for modelling
Make prediction using statistical techniques
Make model using machine learning techniques
Create time series ARIMA models and learn to give their parameters
Perform Hyper-parameter tuning to get the best parameters
Learn how to make predictions where data is sparse
Compare the performance of different models using multiple metrics

Project Description

Every Departmental store chain like Walmart wants to predict the store sales in the nearby future so that inventory planning can be done. Along with that, sales prediction helps to increase/decrease store staff based on the rush (More sales can mean more customers are coming to the stores). Also, it is always a good idea to do sales and revenue forecasting to better understand the company's cash-flows and overall growth.

For inventory planning, you also need to know what products (or category of products aka department) will be utilised more. Under-stock some products and your sales are hit. Over-stock items like perishables and you run into losses if the product expires. That's why the sales prediction is done at a combination of store and department level (and sometimes even at product level for high-selling products).

In this problem, we have been given the sales data of 45 stores based on store, department and week. The size and type of each store has been provided. Holiday weeks have been marked. Along with these, price markdown data (almost like discount data) has been given. A few macro-indicators like CPI, Unemployment rate, Fuel price etc. are also provided.

Similar Projects

In this project, we are going to work on Deep Learning using H2O to predict Census income.

Given a partial trajectory of a taxi, you will be asked to predict its final destination using the taxi trajectory dataset.

In this data science project in R, we are going to talk about subjective segmentation which is a clustering technique to find out product bundles in sales data.

Curriculum For This Mini Project

Problem Statement
02m
Exploratory Data Analysis - Sales Data
07m
Exploratory Data Analysis - Stores Data
05m
Data Pre-processing - Imputing Missing Values
07m
Data Pre-processing - Merging Data
05m
Data Pre-processing - Splitting The Data
08m
Univariate Analysis
08m
Bivariate Analysis
07m
Dependent Variables Trends - 1
06m
Dependent Variables Trends - 2
06m
Date Trends - 1
07m
Date Trends - 2
07m
Feature Creation
07m
Building The Model - 1
08m
Building The Model - 2
05m
Building The Model - 3
07m
Building The Model - 4
06m
Building The Model - 5
06m
Building The Model - 6
05m
Model Comparsion
07m
Conclusion
06m