Basic Statistics for Machine Learning -Attached



1

I have some documentation / material on the basic statistics that are needed for machine learning. I am attaching the same and who ever want to review before the class that will be great, We can make use of our class much better / stremlined.

If you guys have other materials, let us share those too.

 

Regards,

Raj

 

 



8 Answer(s)


0

Thank You Raj.  I have not recieved anything.  I thought we did not have to open ticket ourselves and Ms. Sakshi is going to resolve the issue and get the materials to all of us. Anyway i opened a ticket today just in case. 

Regards

Rekha

 

 


0

That is awesome Suresh!  I agree we can and should make better use of our class opportunites - we we get from class/teacher, but also from each other - random sharing and discussions.

Apparently I must have had a prior relationship with a site called codecademy.com - they had my email.  I renewed my password and have a 7day trial to an excellent tutorial that has hands on coding (numpy matplotlib pandas) - all with in 20 minutes.  That course is named 'Learning Statistics with Python'.  Someone let me know if they try it.  I am enjoying it, and scraping the code as I go.

Here is the first code example.  They don't really explain where the data files come from, and used one proprietary libaray, but I believe the code will work asside from that.  In the code sample below I am including the comments they say about the lib.

++++

# Import packages
import codecademylib

    # codecademylib, which is a package that Codecademy uses to plot
    # your histogram in the panel to the right. Don't worry about this
    # library. Any Python development environment that you may use
    # will take care of this for you.


import numpy as np
import pandas as pd

# Import matplotlib pyplot
from matplotlib import pyplot as plt

# Read in transactions data
mu, sigma = 800, 100 # mean and standard deviation
burrito_calories = np.random.normal(mu, sigma, 320)

# Save transaction times to a separate numpy array
plt.hist(burrito_calories, range=(250, 1250), bins=100,  edgecolor='black')
plt.title("Calories in a Burrito Bowl", fontsize = 24)
plt.xlabel("Calories", fontsize=18)
plt.ylabel("Count", fontsize=18)

plt.show()

++++


0

Thanks you again !.  

 

Regards

Rekha


0

Hi Suresh:

Thanks for sharing. I found a couple of free ones for statistics based on R but unable to upload here due to the size.

Thanks

Indu


0

This one seems to be a good one.



0

Hi Indu 

if the files are big, we can create a sub directory in the Googel drive that had been shared and we can post our materials in that. 

Regards,

Raj


0

Hello Guys,

 

Could you please let me know if you have access to all the relevant python codes so that we can continue the next session seamlessly


0

I have access to files posted in google drive 02Mar and 03Mar folders, total 6 files.

 

Regards

Rekha