How to plot a scatter matrix in R?

This recipe helps you plot a scatter matrix in R


Recipe Objective

A scatter matrix, or scatter plot matrix, is an N x N grid of scatter plots, where N is the number of variables. It displays the bivariate relationship between every pair of variables in the dataset, so all the pairwise relationships can be explored in a single graph.

At least two numeric variables are needed to plot a scatter matrix, although the plot is most useful with three or more. The diagonal divides the matrix into an upper-right half and a lower-left half; both halves show the same variable pairs, with the axes swapped.
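
To make the layout concrete, here is a small sketch that counts the cells for the three variables used in this recipe. The object names (vars, n_cells, unique_pairs, pair_table) are chosen only for illustration.

```r
# Column names of the three variables used later in this recipe
vars <- c("Age", "Annual.Income..k..", "Spending.Score..1.100.")

n_vars <- length(vars)

# The grid has one row and one column per variable
n_cells <- n_vars^2

# Number of distinct variable pairs; each pair is drawn twice,
# once above and once below the diagonal
unique_pairs <- choose(n_vars, 2)

# One column per distinct pair of variables
pair_table <- combn(vars, 2)

cat("cells:", n_cells, " diagonal cells:", n_vars,
    " distinct pairs:", unique_pairs, "\n")
print(pair_table)
```

With three variables this gives 9 cells: 3 on the diagonal and 6 scatter plots covering 3 distinct pairs.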

This recipe demonstrates how to plot a scatter matrix in R.

STEP 1: Loading required library and dataset

Dataset description: The dataset contains basic data about customers visiting a supermarket mall. The variables we are interested in are annual income (in 1000s), spending score and age, stored in the columns Annual.Income..k.., Spending.Score..1.100. and Age.

# Data manipulation package
library(dplyr)
library(tidyverse)

# reading a dataset
customer_seg = read.csv('R_121_Mall_Customers.csv')

# selecting the required variables using the select() function
customer_seg_var = select(customer_seg, Age, Annual.Income..k.., Spending.Score..1.100.)

# summary of the selected variables
glimpse(customer_seg_var)

Observations: 200
Variables: 3
$ Age                     19, 21, 20, 23, 31, 22, 35, 23, 64, 30, 67, 35…
$ Annual.Income..k..      15, 15, 16, 16, 17, 17, 18, 18, 19, 19, 19, 19…
$ Spending.Score..1.100.  39, 81, 6, 77, 40, 76, 6, 94, 3, 72, 14, 99, 1…
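
The dotted column names in the output above are what read.csv() typically produces when the file's headers contain spaces or symbols, because it passes them through make.names() by default. The sketch below illustrates this; the raw header strings are an assumption, since the recipe only shows the names after import.

```r
# ASSUMPTION: these raw headers are a guess at what the CSV file contains;
# the recipe only shows the column names after import.
raw_headers <- c("Age", "Annual Income (k$)", "Spending Score (1-100)")

# read.csv() (with its default check.names = TRUE) converts headers into
# syntactically valid R names, replacing each invalid character with "."
clean_names <- make.names(raw_headers)

print(clean_names)
```

Under that assumption, the result matches the three column names selected in this step.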

STEP 2: Plotting a scatter matrix

We use the pairs() function from base R's graphics package to plot a scatter matrix.

Syntax: pairs(x, col = , pch = , labels = , main = )


  1. x = a data frame or numeric matrix whose columns are the variables to plot
  2. col = used to change the colour of the points
  3. pch = used to change the shape of the points
  4. labels = used to change the labels on the diagonal
  5. main = used to give a title to the graph

pairs(customer_seg_var, col = "green", pch = 19, labels = c("Age", "Annual Income", "Spending Score"), main = "Scatter Matrix")
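
Because the two halves of the matrix repeat the same pairs, one optional variation is to use them for different views. The sketch below is not part of the original recipe: it uses the upper.panel and lower.panel arguments of pairs() to show correlation coefficients above the diagonal and smoothed scatter plots below it. The data frame demo_var is synthetic stand-in data (random values, made up for illustration) so the block runs without the CSV file, and panel_cor is a hypothetical helper defined here.

```r
# Synthetic stand-in for customer_seg_var; the values are random and
# only mimic the column names, not the real customer data
set.seed(42)
demo_var <- data.frame(
  Age = sample(18:70, 200, replace = TRUE),
  Annual.Income..k.. = sample(15:137, 200, replace = TRUE),
  Spending.Score..1.100. = sample(1:100, 200, replace = TRUE)
)

# Hypothetical helper: write the Pearson correlation in the panel
# instead of drawing points
panel_cor <- function(x, y, ...) {
  # temporarily rescale the panel to the unit square, restore on exit
  old_par <- par(usr = c(0, 1, 0, 1))
  on.exit(par(old_par))
  text(0.5, 0.5, sprintf("r = %.2f", cor(x, y)))
}

pairs(demo_var,
      labels = c("Age", "Annual Income", "Spending Score"),
      lower.panel = panel.smooth,  # scatter plot with a smoothed trend line
      upper.panel = panel_cor,     # correlation coefficient as text
      pch = 19,
      main = "Scatter Matrix")
```

To try this on the real data, replace demo_var with customer_seg_var from STEP 1.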
