Classifying Handwritten Digits using MNIST Dataset

Classifying Handwritten Digits using MNIST Dataset

The goal of this data science project is to take an image of a handwritten single digit, and determine what that digit is.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

Customer Love

Read All Reviews

Nathan Elbert

Senior Data Scientist at Tiger Analytics

This was great. The use of Jupyter was great. Prior to learning Python I was a self taught SQL user with advanced skills. I hold a Bachelors in Finance and have 5 years of business experience.. I... Read More

Camille St. Omer

Artificial Intelligence Researcher, Quora 'Most Viewed Writer in 'Data Mining'

I came to the platform with no experience and now I am knowledgeable in Machine Learning with Python. No easy thing I must say, the sessions are challenging and go to the depths. I looked at graduate... Read More

What will you learn

Unzipping folders and loading the dataset
Visualizing different images available in the dataset
Using the summary function for basic EDA
Understanding left-skew and right-skew of the dataset
Preprocessing the train dataset for initial predictions
Apply ensemble model Random Forest for predictions
Use the Importance function in R for extracting the necessary features
Plotting graphs for feature versus MeanDecresedGini
Hyper-parameter tuning Random Forest and selecting the best parameters for this model
Plotting graphs for against parameters and OOB errors
Importing FNN library and using K-nearest neighbors as the training model
Importing XGBoost and converting Dataset into DMatrix for performing predictions
Defining parameters and performing Cross Folds validation using XGBoost model
Predicting using XGBoost and saving the predictions in form of CSV
Installing h2o package for using complete RAM and CPU cores available
Initializing an h2o cluster
Initializing a DeepLearning Neural Networks model
Defining , Understanding parameters and Training Neural Networks for predictions
Plotting Confusion matrix and interpreting the result
Predicting the result and saving it in the form of CSV
Shutting down the h2o created cluster

Project Description

Data scientists looking for their first machine learning or data science project begin by trying the handwritten digit recognition problem. The Digit Recognizer data science project makes use of the popular MNIST database of handwritten digits, taken from American Census Bureau employees. The dataset consists of already pre-processed and formatted 60,000 images of 28x28 pixel handwritten digits. With the use of image recognition techniques and a chosen machine learning algorithm, a program can be built to accurately read the handwritten digits with 95% accuracy. The accuracy rate can be higher based on the chosen machine learning algorithm,

Similar Projects

Data Science Project in Python- Build a machine learning algorithm that automatically suggests the right product prices.

In this machine learning project, you will build a model to predict the purchase amount of customer against various products which will help the company create personalized offer for customers against different products.

In this data science project, we will predict the credit card fraud in the transactional dataset using some of the predictive models.

Curriculum For This Mini Project

04h 29m