The Central Limit Theorem

The Central Limit Theorem is a pretty important concept in statistics. It states that, even if the original probability distribution isn’t normal, the mean of the samples taken from this distribution is distributed normally as the number of samples increases. What does this mean? Let’s try an experiment. You’re going to need python (I’m using […]

(Re)Learning Machine Learning – Logistic Regression

This time we’re going to do logistic regression. Logistic regression uses the same idea as linear regression (see my previous post) to classify input as one of two different classes. Instead of a linear function (one that looks like a straight line) we use a log-based function (making it “logistic”), and even though we’re using […]

(Re)Learning Machine Learning – Linear Regression

I am writing this as I relearn the basics of machine learning to record my thoughts and hopefully help someone (probably me) in the future. I find that I learn best when I implement the algorithms myself, even if I have to make less-efficient versions of built-in functions that many tutorials gloss over. If you’d […]