DATA MUNGING
DATA CLEANING PYTHON
MACHINE LEARNING RECIPES
PANDAS CHEATSHEET
ALL TAGS
# How to determine Pearsons correlation in Python?

# How to determine Pearsons correlation in Python?

This recipe helps you determine Pearsons correlation in Python

Pearson"s correlation is very important statical data that we need many times. We can calculate it manually but it takes time.

So this is the recipe on how we can determine Pearson"s correlation in Python

```
import matplotlib.pyplot as plt
import statistics as stats
import pandas as pd
import random
import seaborn as sns
```

We have imported stats, seaborn and pandas which is needed.

We have created a empty dataframe and then added rows to it with random numbers.
```
df = pd.DataFrame()
df["x"] = random.sample(range(1, 100), 75)
df["y"] = random.sample(range(1, 100), 75)
print(); print(df.head())
```

We hawe defined a function with differnt steps that we will see.

- We have calculated mean and standard deviation of x and length of x
- We atre calculating mean and standard deviation of y
- We are calculating standard score by dividing difference of observation and mean with standard deviation. We have done this for both X and Y

```
def pearson(x,y):
n = len(x)
standard_score_x = []; standard_score_y = [];
mean_x = stats.mean(x)
standard_deviation_x = stats.stdev(x)
```

```
mean_y = stats.mean(y)
standard_deviation_y = stats.stdev(y)
```

```
for observation in x:
standard_score_x.append((observation - mean_x)/standard_deviation_x)
for observation in y:
standard_score_y.append((observation - mean_y)/standard_deviation_y)
return (sum([i*j for i,j in zip(standard_score_x, standard_score_y)]))/(n-1)
```

```
result = pearson(df.x, df.y)
print()
print("Pearson"s correlation coefficient is: ", result)
sns.lmplot("x", "y", data=df, fit_reg=True)
plt.show()
```

x y 0 96 62 1 1 81 2 27 73 3 55 26 4 83 93 Pearson"s correlation coefficient is: -0.006387074440361877

Use the Amazon Reviews/Ratings dataset of 2 Million records to build a recommender system using memory-based collaborative filtering in Python.

In this machine learning resume parser example we use the popular Spacy NLP python library for OCR and text classification.

In this deep learning project, we will predict customer churn using Artificial Neural Networks and learn how to model an ANN in R with the keras deep learning package.

In this data science project, you will contextualize customer data and predict the likelihood a customer will stay at 100 different hotel groups.

PySpark Project-Get a handle on using Python with Spark through this hands-on data processing spark python tutorial.

In this ensemble machine learning project, we will predict what kind of claims an insurance company will get. This is implemented in python using ensemble machine learning algorithms.

In this machine learning project, you will develop a machine learning model to accurately forecast inventory demand based on historical sales data.

In this data science project in R, we are going to talk about subjective segmentation which is a clustering technique to find out product bundles in sales data.

In this project, we are going to work on Sequence to Sequence Prediction using IMDB Movie Review Datasetâ€‹ using Keras in Python.

In this machine learning project, you will uncover the predictive value in an uncertain world by using various artificial intelligence, machine learning, advanced regression and feature transformation techniques.