Walmart Sales Forecasting Data Science Project

Walmart Sales Forecasting Data Science Project

Data Science Project in R-Predict the sales for each department using historical markdown data from the Walmart dataset containing data of 45 Walmart stores.

Videos

Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

What you will learn
Understand the Problem Statement
Perform basic EDA to familiarize with the data
Take care of missing values and datatype issues in the data
Understand the unique key in different data and merging the data
Perform Univariate analysis for both numeric and categorical variables
Perform Bi-variate analysis to identify redundant variables
Plot Trend of each predictor with the target variable
Do in-depth analysis on the impact of Date/Week on Sales
Create new features that might add value to the model
Define a function for each set of code that might need to be repeated again
Prepare the data for modelling
Make prediction using statistical techniques
Make model using machine learning techniques
Create time series ARIMA models and learn to give their parameters
Perform Hyper-parameter tuning to get the best parameters
Learn how to make predictions where data is sparse
Compare the performance of different models using multiple metrics

Project Description

Every Departmental store chain like Walmart wants to predict the store sales in the nearby future so that inventory planning can be done. Along with that, sales prediction helps to increase/decrease store staff based on the rush (More sales can mean more customers are coming to the stores). Also, it is always a good idea to do sales and revenue forecasting to better understand the company's cash-flows and overall growth.

For inventory planning, you also need to know what products (or category of products aka department) will be utilised more. Under-stock some products and your sales are hit. Over-stock items like perishables and you run into losses if the product expires. That's why the sales prediction is done at a combination of store and department level (and sometimes even at product level for high-selling products).

In this problem, we have been given the sales data of 45 stores based on store, department and week. The size and type of each store has been provided. Holiday weeks have been marked. Along with these, price markdown data (almost like discount data) has been given. A few macro-indicators like CPI, Unemployment rate, Fuel price etc. are also provided.

Similar Projects

In this machine learning project, you will uncover the predictive value in an uncertain world by using various artificial intelligence, machine learning, advanced regression and feature transformation techniques.

In this data science project in R, we are going to talk about subjective segmentation which is a clustering technique to find out product bundles in sales data.

Machine Learning Project in R- Predict the customer churn of telecom sector and find out the key drivers that lead to churn. Learn how the logistic regression model using R can be used to identify the customer churn in telecom dataset.

Curriculum For This Mini Project

Problem Statement
02m
Exploratory Data Analysis - Sales Data
07m
Exploratory Data Analysis - Stores Data
05m
Data Pre-processing - Imputing Missing Values
07m
Data Pre-processing - Merging Data
05m
Data Pre-processing - Splitting The Data
08m
Univariate Analysis
08m
Bivariate Analysis
07m
Dependent Variables Trends - 1
06m
Dependent Variables Trends - 2
06m
Date Trends - 1
07m
Date Trends - 2
07m
Feature Creation
07m
Building The Model - 1
08m
Building The Model - 2
05m
Building The Model - 3
07m
Building The Model - 4
06m
Building The Model - 5
06m
Building The Model - 6
05m
Model Comparsion
07m
Conclusion
06m