How to use k fold cross validation in sklearn

This recipe shows how to use k-fold cross-validation in sklearn. K-fold cross-validation is a method for estimating the performance of an ML algorithm on a dataset by dividing a limited dataset into k non-overlapping folds.
Last Updated: 20 Jul 2022


Recipe Objective - How to use k-fold cross-validation in sklearn?

In sklearn, this procedure is provided by the KFold cross-validator in sklearn.model_selection.

K-fold cross-validation estimates how well an ML algorithm performs on a dataset. It divides a limited dataset into k non-overlapping folds. A total of k models are fit, each is evaluated on its held-out fold, and the mean performance is reported.

Each fold is then used once as validation while the k - 1 remaining folds form the training set.
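The procedure described above can be sketched as a loop that fits one model per fold and averages the scores. This is a minimal illustration, not part of the original recipe: the synthetic dataset from make_classification, the LogisticRegression model, and the choice of 5 folds are all assumptions made for the example.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold

# Synthetic data, used only for illustration (assumption, not from the recipe)
X, y = make_classification(n_samples=100, n_features=5, random_state=0)

# 5 folds is an arbitrary choice for this sketch
kf = KFold(n_splits=5, shuffle=True, random_state=0)

scores = []
for train_index, test_index in kf.split(X):
    # Fit a fresh model on the k - 1 training folds
    model = LogisticRegression(max_iter=1000)
    model.fit(X[train_index], y[train_index])
    # Evaluate on the single held-out fold (accuracy for a classifier)
    scores.append(model.score(X[test_index], y[test_index]))

# The reported performance is the mean over the k folds
mean_score = float(np.mean(scores))
print(scores)
print(mean_score)
```

Each pass through the loop trains on different folds, so every sample is used for evaluation exactly once.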


Example

Step 1: Import Libraries

from sklearn.model_selection import KFold
import numpy as np

# create the range 1 to 25
rn = range(1, 26)

Step 2: Creating Folds

# To demonstrate how the data are split, we create 3-fold and 5-fold splitters.
# split() yields the locations (indices) of the train and test samples.
kf5 = KFold(n_splits=5, shuffle=False)
kf3 = KFold(n_splits=3, shuffle=False)

# KFold returns the indices of the data, not the values.
# Our range goes from 1 to 25, so the indices run from 0 to 24.
for train_index, test_index in kf3.split(rn):
    print(train_index, test_index)

[ 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24] [0 1 2 3 4 5 6 7 8]
[ 0  1  2  3  4  5  6  7  8 17 18 19 20 21 22 23 24] [ 9 10 11 12 13 14 15 16]
[ 0  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16] [17 18 19 20 21 22 23 24]
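The output above comes from the unshuffled 3-fold splitter, so each test fold is a contiguous block of indices. As a hedged sketch of the other case, the following uses a 5-fold splitter with shuffle=True. The random_state value and the variable names are assumptions chosen for this illustration, and np.arange stands in for the recipe's range.

```python
import numpy as np
from sklearn.model_selection import KFold

data = np.arange(1, 26)  # same values as the recipe's range 1 to 25

# random_state=42 is an arbitrary choice made for this sketch
kf5_shuffled = KFold(n_splits=5, shuffle=True, random_state=42)

test_folds = []
for train_index, test_index in kf5_shuffled.split(data):
    test_folds.append(test_index)
    print(train_index, test_index)

# Even when shuffled, every index lands in exactly one test fold
all_test_indices = np.sort(np.concatenate(test_folds))
print(np.array_equal(all_test_indices, np.arange(25)))
```

With 25 samples and 5 folds, each test fold holds 5 indices and each training set holds the remaining 20.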

KFold returns indices, not the data itself. To see the actual values, use "np.take" on a NumPy array or ".iloc" on a pandas object.

# to get the values from our data, we use np.take() to look up the values at the given indices
for train_index, test_index in kf3.split(rn):
    print(np.take(rn, train_index), np.take(rn, test_index))

[10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25] [1 2 3 4 5 6 7 8 9]
[ 1  2  3  4  5  6  7  8  9 18 19 20 21 22 23 24 25] [10 11 12 13 14 15 16 17]
[ 1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17] [18 19 20 21 22 23 24 25]
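For pandas data, the same indices can be passed to ".iloc". The sketch below assumes a small hypothetical DataFrame with a single column named "value"; neither the frame nor the column name comes from the recipe.

```python
import pandas as pd
from sklearn.model_selection import KFold

# Hypothetical DataFrame holding the values 1 to 25 (assumption for illustration)
df = pd.DataFrame({"value": range(1, 26)})

kf3 = KFold(n_splits=3, shuffle=False)

fold_sizes = []
for train_index, test_index in kf3.split(df):
    # .iloc selects rows by integer position, which is what KFold yields
    train_df = df.iloc[train_index]
    test_df = df.iloc[test_index]
    fold_sizes.append((len(train_df), len(test_df)))
    print(test_df["value"].tolist())

print(fold_sizes)
```

Because 25 is not divisible by 3, the first test fold has 9 rows and the other two have 8, matching the NumPy output above.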
