How to optimize a function using SGD in R

This recipe helps you optimize a function using SGD in R
Last Updated: 10 Jun 2022

Get access to Data Science projects View all Data Science projects

DATA SCIENCE PROJECTS IN R DATA CLEANING PYTHON DATA MUNGING MACHINE LEARNING RECIPES PANDAS CHEATSHEET ALL TAGS

Recipe Objective -How to optimize a function using SGD in R?

The SGD or Stochastic gradient descent SGD is defined as a variant of gradient descent in which instead of performing computations on whole dataset which is mostly redundant and inefficient, Stochastic Gradient Descent only computes on the small subset or the random selection of the data examples. The stochastic gradient descent produces the same performance as a regular gradient descent whenever the learning rate is low. The stochastic gradient descent performs one update at a time and thus it is usually much faster and also used to learn online. The Stochastic Gradient Descent performs the frequent updates with a high variance which causes the objective function to fluctuate heavily that is at much faster rate. The stochastic gradient descent enables itself to jump to the new and potentially better local minima as it keeps overshooting.

This recipe explains what is SGD optimizer, what are its benefits and how it can be excecuted.

Implementing SGD optimizer.

Step 1: Installing and Loading keras package to build neural network using keras.

# Installing Packages install.packages("keras") # Loading packages library(keras)

Step 2: Loading MNIST handwritten digit dataset which comes pre-loaded in keras package.

# Loading the data mnist <- dataset_mnist()

Step 3: Train and test dataset containing images are prepared using MNIST dataset.

# Preparing train data and test data training_images <- mnist$train$x training_labels <- mnist$train$y testing_images <- mnist$test$x testing_labels <- mnist$test$y

Step 4: Transform the train images data and test image data into the double array of [0, 255] shape (60000, 28 * 28) with values between 0 and 1.

# Reshaping train data and test data training_images <- array_reshape(training_images, c(60000, 28 * 28)) training_images <- training_images / 255 testing_images <- array_reshape(testing_images, c(10000, 28 * 28)) testing_images <- testing_images / 255

Step 5: Labels are prepared by categorically encoding them.

# Preparing Labels training_labels <- to_categorical(training_labels) testing_labels <- to_categorical(testing_labels)

Step 6: Model is build using dense layers with relu and softmax activation.

# Model Buidling neural_network <- keras_model_sequential() %>% layer_dense(units = 512, activation = "relu", input_shape = c(28 * 28)) %>% layer_dense(units = 10, activation = "softmax")

Step 7: Model is compiled with optimizer SGD, loss as categorical entropy and accuracy as metrics.

# Model Compiling neural_network %>% compile( optimizer = "sgd", loss = "categorical_crossentropy", metrics = c("accuracy")

Step 8: Model is fitted. neural_network %>% fit(training_images, training_labels, epochs = 3, batch_size = 64)

Step 9: Model performance is evaluated on testing dataset of images and test dataset of labels.

# Model performance metric <- neural_network %>% evaluate(testing_images, testing_labels)

What Users are saying..

Anand Kumpatla

Sr Data Scientist @ Doubleslash Software Solutions Pvt Ltd

ProjectPro is a unique platform and helps many people in the industry to solve real-life problems with a step-by-step walkthrough of projects. A platform with some fantastic resources to gain... Read More

How to optimize a function using SGD in R

Recipe Objective -How to optimize a function using SGD in R?

Implementing SGD optimizer.

Anand Kumpatla

Relevant Projects

You might also like

Relevant Projects