How to use Regression Metrics in Python?
MACHINE LEARNING RECIPES DATA CLEANING PYTHON DATA MUNGING PANDAS CHEATSHEET     ALL TAGS

How to use Regression Metrics in Python?

How to use Regression Metrics in Python?

This recipe helps you use Regression Metrics in Python

0

Recipe Objective

In a dataset after applying a regression model how to evaluate it. There are many metrics that we can use. We will be using mean absolute error , mean squared error and R squared.

So this is the recipe on how we we can use Regression Metrics in Python.

Step 1 - Import the library

from sklearn import datasets from sklearn import tree, model_selection from sklearn.model_selection import train_test_split

We have imported datasets, tree, model_selection and test_train_split which will be needed for the dataset.

Step 2 - Setting up the Data

We have imported inbuilt wine dataset and stored data in x and target in y. We have used to split the data by test train split. Then we have used model_selection.KFold. seed = 42 dataset = datasets.load_wine() X = dataset.data y = dataset.target X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25) kfold = model_selection.KFold(n_splits=10, random_state=seed)

Step 3 - Training model and calculating Metrics

Here we will be using DecisionTreeRegressior as a model model = tree.DecisionTreeRegressor() Now we will be calculating different metrics. We will be using cross validation score to calculate the metrices. So we will be printing the mean and standard deviation of all the scores.

  • Calculating Mean Absolute Error
  • scoring = "neg_mean_absolute_error" results = model_selection.cross_val_score(model, X_train, y_train, cv=kfold, scoring=scoring) print("Mean Absolute Error: ", results.mean()); print("Standard Deviation: ", results.std())
  • Calculating Mean squared error
  • scoring = "neg_mean_squared_error" results = model_selection.cross_val_score(model, X_train, y_train, cv=kfold, scoring=scoring) print(); print("Mean Squared Error: ", results.mean()); print("Standard Deviation: ", results.std())
  • Calculating R squared value
  • scoring = "r2" results = model_selection.cross_val_score(model, X_train, y_train, cv=kfold, scoring=scoring) print(); print("R squared val: ", results.mean()); print("Standard Deviation: ", results.std())
So the output comes as:

Mean Absolute Error:  -0.12692307692307692
Standard Deviation:  0.09994715425303413

Mean Squared Error:  -0.13351648351648354
Standard Deviation:  0.10845352186546801

R squared val:  0.7997306366386379
Standard Deviation:  0.13923964626776147
​

Relevant Projects

Perform Time series modelling using Facebook Prophet
In this project, we are going to talk about Time Series Forecasting to predict the electricity requirement for a particular house using Prophet.

Choosing the right Time Series Forecasting Methods
There are different time series forecasting methods to forecast stock price, demand etc. In this machine learning project, you will learn to determine which forecasting method to be used when and how to apply with time series forecasting example.

Customer Churn Prediction Analysis using Ensemble Techniques
In this machine learning churn project, we implement a churn prediction model in python using ensemble techniques.

Data Science Project in Python on BigMart Sales Prediction
The goal of this data science project is to build a predictive model and find out the sales of each product at a given Big Mart store.

Sequence Classification with LSTM RNN in Python with Keras
In this project, we are going to work on Sequence to Sequence Prediction using IMDB Movie Review Dataset​ using Keras in Python.

Predict Churn for a Telecom company using Logistic Regression
Machine Learning Project in R- Predict the customer churn of telecom sector and find out the key drivers that lead to churn. Learn how the logistic regression model using R can be used to identify the customer churn in telecom dataset.

Machine Learning project for Retail Price Optimization
In this machine learning pricing project, we implement a retail price optimization algorithm using regression trees. This is one of the first steps to building a dynamic pricing model.

Learn to prepare data for your next machine learning project
Text data requires special preparation before you can start using it for any machine learning project.In this ML project, you will learn about applying Machine Learning models to create classifiers and learn how to make sense of textual data.

Predict Census Income using Deep Learning Models
In this project, we are going to work on Deep Learning using H2O to predict Census income.

German Credit Dataset Analysis to Classify Loan Applications
In this data science project, you will work with German credit dataset using classification techniques like Decision Tree, Neural Networks etc to classify loan applications using R.