Build a predictive model for Otto Group Product Classification

Build a predictive model for Otto Group Product Classification

Build a predictive model to correctly classify products between 9 product categories (fashion, electronics, etc.) using the Otto Group dataset.


Each project comes with 2-5 hours of micro-videos explaining the solution.

Code & Dataset

Get access to 50+ solved projects with iPython notebooks and datasets.

Project Experience

Add project experience to your Linkedin/Github profiles.

What will you learn

Understanding the problem statement and importing the file
Initializing the libraries and understand it's use
Check for null values and necessary imputations
Use and interpret the summary function in R
Use a Box plot to identify the outliers and handle them
Applying ensembling model Decision Tree for a Multi-Class Classification problem
Preparing the dataset and training the model
Parameter tuning for better results
How to use Confusion Matrix for a Multi-Class Classification problem
Making final predictions from the trained model

Project Description

The dataset has 93 features for more than 200,000 products with the Test Data containing 144K rows and Training Data containing 61K rows. This data science project is a supervised , multinomial classification problem. The dataset has a total of 9 possible product lines and the objective is accurately make class predictions on 144,000 unlabeled products based on the 93 features for the products provided in the dataset.

Similar Projects

In this machine learning project, you will build a model to predict the purchase amount of customer against various products which will help the company create personalized offer for customers against different products.

Deep Learning Project using Keras Deep Learning Library to predict the effect of Genetic Variants to enable personalized Medicine.

In this machine learning project, we will use binary leaf images and extracted features, including shape, margin, and texture to accurately identify plant species using different benchmark classification techniques.

Curriculum For This Mini Project

03h 58m