Data Science Project - Instacart Market Basket Analysis

Data Science Project - Instacart Market Basket Analysis

Data Science Project - Build a recommendation engine which will predict the products to be purchased by an Instacart consumer again.

Videos

Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

Understanding the problem statement
Importing a training dataset and testing from AWS
Installing necessary libraries and understanding its use
Standard MBA or Market basket analysis
Using a predictive model to estimate the demand for a particular product
Product recommendation engine using collaborative filtering
Merging the relevant CSV files
Applying the minimum support criteria to identify most frequent item set
Eclat algorithm and Apriori algorithm
Visualizing the target variable with variation in time
Converting the variables to suitable datatypes
Visualization of a time series
Visualization using ggplot
Convert the available information to a transactional dataset
Converting the rules into a data frame
Sorting the values before recommending it to the company

Project Description

Whether you shop from meticulously planned grocery lists or let whimsy guide your grazing, our unique food rituals define who we are. Instacart, a grocery ordering and delivery app aim to make it easy to fill your refrigerator and pantry with your personal favorites and staples when you need them. After selecting products through the Instacart app, personal shoppers review your order and do the in-store shopping and delivery for you.

Instacart’s data science team plays a big part in providing this delightful shopping experience. Currently, they use transactional data to develop models that predict which products a user will buy again, try for the first time, or add to their cart next during a session. Recently, Instacart open-sourced this data - see their blog post on 3 Million Instacart Orders, Open Sourced.

In this data science project, we are going to use this anonymized data on customer orders over time to predict which previously purchased products will be in a user’s next order.

Similar Projects

In this data science project in R, we are going to talk about subjective segmentation which is a clustering technique to find out product bundles in sales data.

In this data science project, you will learn to predict churn on a built-in dataset using Ensemble Methods in R.

Given a partial trajectory of a taxi, you will be asked to predict its final destination using the taxi trajectory dataset.

Curriculum For This Mini Project

Problem Statement Overview
05m
Import Libraries
06m
Market Basket Analysis
04m
Transaction Set
01m
Association Rules
08m
Steps for creating Association Rules
02m
Read the Data Set files
06m
Explore the Data Set
01m
Which day receives most orders?
14m
Which department is purchased most?
12m
Exploratory Data Analysis
08m
Recoding the variables
04m
Graphs
07m
Prior Orders Placed
01m
Number of Items ordered
04m
Association Rule Mining
18m
Apriori Algorithm
02m
Creating Association Rules
09m
Product Recommendations
09m
Convert Rule to DataFrame
09m
Remove Redundant Rules
06m
Conclusion
00m