How to use Porter Stemmer in nltk

This recipe helps you use Porter Stemmer in nltk
Last Updated: 23 Feb 2023

Get access to Data Science projects View all Data Science projects

MACHINE LEARNING RECIPES DATA CLEANING PYTHON DATA MUNGING PANDAS CHEATSHEET ALL TAGS

Recipe Objective

As we have discussed before what is stemming, So it is nothing but reducing the words or chopping the words into their root forms for e.g eating becomes eat and so on. So in stemming there are different stemmers and we are going to discuss PortersStemmer the most popularly used one.

Porters Stemmer It is a type of stemmer which is mainly known for Data Mining and Information Retrieval. As its applications are limited to the English language only. It is based on the idea that the suffixes in the English language are made up of a combination of smaller and simpler suffixes, it is also majorly known for its simplicity and speed. The advantage is, it produces the best output from other stemmers and has less error rate.

Recipe Objective

Step 1 - Import the NLTK library and from NLTK import PorterStemmer

import nltk from nltk.stem import PorterStemmer

Step 2 - Creat a variable and store PorterStemmer into it

ps = PorterStemmer()

Step 3 - lets see how to use PorterStemmer

print(ps.stem('bat')) print(ps.stem('batting'))

bat

from the above we can say that the word bat and batting has reduced to bat lets try with some more examples

print(ps.stem('code')) print(ps.stem('coding')) print(ps.stem('coder')) print(ps.stem('coded'))

code

coder

code

So, we have observed that it is working for the words like code, coding, coded but not working for coder because if the word has at least one vowel and consonant plus EED ending, change the ending to 'EE' for e.g agreed become agree.

What Users are saying..

Ray han

Tech Leader | Stanford / Yale University

I think that they are fantastic. I attended Yale and Stanford and have worked at Honeywell,Oracle, and Arthur Andersen(Accenture) in the US. I have taken Big Data and Hadoop,NoSQL, Spark, Hadoop... Read More

Relevant Projects

Machine Learning Projects

Data Science Projects

Python Projects for Data Science

Data Science Projects in R

Machine Learning Projects for Beginners

Deep Learning Projects

Neural Network Projects

Tensorflow Projects

NLP Projects

Kaggle Projects

IoT Projects

Big Data Projects

Hadoop Real-Time Projects Examples

Spark Projects

Data Analytics Projects for Students

Relevant Projects

Loan Eligibility Prediction in Python using H2O.ai

In this loan prediction project you will build predictive models in Python using H2O.ai to predict if an applicant is able to repay the loan or not.

View Project Details

Avocado Machine Learning Project Python for Price Prediction

In this ML Project, you will use the Avocado dataset to build a machine learning model to predict the average price of avocado which is continuous in nature based on region and varieties of avocado.

View Project Details

Classification Projects on Machine Learning for Beginners - 1

Classification ML Project for Beginners - A Hands-On Approach to Implementing Different Types of Classification Algorithms in Machine Learning for Predictive Modelling

View Project Details

Deep Learning Project for Time Series Forecasting in Python

Deep Learning for Time Series Forecasting in Python -A Hands-On Approach to Build Deep Learning Models (MLP, CNN, LSTM, and a Hybrid Model CNN-LSTM) on Time Series Data.

View Project Details

Build ARCH and GARCH Models in Time Series using Python

In this Project we will build an ARCH and a GARCH model using Python

View Project Details

Learn to Build an End-to-End Machine Learning Pipeline - Part 1

In this Machine Learning Project, you will learn how to build an end-to-end machine learning pipeline for predicting truck delays, addressing a major challenge in the logistics industry.

View Project Details

Time Series Analysis with Facebook Prophet Python and Cesium

Time Series Analysis Project - Use the Facebook Prophet and Cesium Open Source Library for Time Series Forecasting in Python

View Project Details

Customer Churn Prediction Analysis using Ensemble Techniques

In this machine learning churn project, we implement a churn prediction model in python using ensemble techniques.

View Project Details

Learn How to Build a Logistic Regression Model in PyTorch

In this Machine Learning Project, you will learn how to build a simple logistic regression model in PyTorch for customer churn prediction.

View Project Details

Llama2 Project for MetaData Generation using FAISS and RAGs

In this LLM Llama2 Project, you will automate metadata generation using Llama2, RAGs, and AWS to reduce manual efforts.

View Project Details

How to use Porter Stemmer in nltk

Recipe Objective

Table of Contents

Step 1 - Import the NLTK library and from NLTK import PorterStemmer

Step 2 - Creat a variable and store PorterStemmer into it

Step 3 - lets see how to use PorterStemmer

Ray han

Relevant Projects

You might also like

Relevant Projects