Walmart Sales Forecasting Data Science Project

Data Science Project in R-Predict the sales for each department using historical markdown data from the Walmart dataset containing data of 45 Walmart stores.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

  • Understanding the problem statement and importing the file

  • Performing basic EDA

  • Merging multi datasets on basis of unique columns

  • How to study a merged dataset?

  • Using groupby function to analyze the effect of multiple columns

  • Plotting a time-series plot

  • How to analyze a time series graph

  • Seasonality and Trend analysis

  • How to decompose a time-series dataset to remove any trends

  • ARIMA model and its insights

  • How to fit dataset into an ARIMA model for training

  • Selecting the most important features for increasing prediction accuracy

  • Making final predictions using the most important selected features

  • Saving the made predictions into CSV format

Project Description

We have been provided with historical sales Data of 45 Walmart stores located in different regions. Each store contains many departments and we have to project the sales for each department in each store.

To add to the challenge, selected holiday markdown events are included in the dataset. These markdowns are known to affect sales, but it is challenging to predict which departments are affected and the extent of the impact.

Similar Projects

Big Data Project Data Science Project-All State Insurance Claims Severity Prediction
Data science project in R to develop automated methods for predicting the cost and severity of insurance claims.
Big Data Project Taxi Trajectory Prediction-Predict the destination of taxi trips
Given a partial trajectory of a taxi, you will be asked to predict its final destination using the taxi trajectory dataset.
Big Data Project Deep Learning with Keras in R to Predict Customer Churn
In this deep learning project, we will predict customer churn using Artificial Neural Networks and learn how to model an ANN in R with the keras deep learning package.
Big Data Project Choosing the right Time Series Forecasting Methods
There are different time series forecasting methods to forecast stock price, demand etc. In this machine learning project, you will learn to determine which forecasting method to be used when and how to apply with time series forecasting example.

Curriculum For This Mini Project

  Import Data Files
  Explore Data Set
  Calculate Average sales by Store
  Calculate Average sales by Department
  Average Sales by Store and Department
  Test Data Set
  Sample Submission Data
  Check null values
  Submitting to Kaggle
  Problem with current solution
  Recap of Code
  Avg Sales by Store, Department, Holiday & week
  Forecast Methods - Overview
  Arima Model
  Holt-Winters Forecasting
  How Machine Learning works