How to add a column to a pandas DataFrame
Pandas offers many methods for adding and dropping content. We have covered how to drop a column and how to drop a row...
How to convert Google Colabs from Python 2 to Python 3
I don't know why Google Colabs defaults to Python 2. Regardless, it's extremely easy to change your notebook to Python 3. It's so easy...
What is probability mass function?
A probability mass function (PMF) is a probability distribution over discrete variables.
First, a probability mass function is typically denoted with a capital P.
Second,...
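To make the definition concrete, here is a minimal sketch that estimates a PMF from observed discrete samples. The function name `empirical_pmf` and the sample die rolls are assumptions made up for illustration; they are not from the original post. It only shows that P assigns each discrete value a probability and that those probabilities sum to 1.

```python
from collections import Counter
from fractions import Fraction


def empirical_pmf(samples):
    """Estimate P(x) for each distinct value x as its relative frequency.

    Hypothetical helper for illustration only.
    """
    counts = Counter(samples)
    total = len(samples)
    # Exact fractions avoid floating-point rounding when summing to 1.
    return {value: Fraction(count, total) for value, count in counts.items()}


# Made-up observations of a discrete variable (for example, die rolls).
rolls = [1, 2, 2, 3, 3, 3]
pmf = empirical_pmf(rolls)

for value in sorted(pmf):
    print(f"P({value}) = {pmf[value]}")
```

Each probability lies between 0 and 1, and the values over all outcomes add up to exactly 1, which is what separates a PMF from an arbitrary table of counts.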
Feature Engineering: LabelEncoder sklearn example
A big part of machine learning is data cleanup and feature engineering. A good tool in your data science toolkit will be label encoding....
How to select the best model from multiple machine learning algorithms
There are many ways to figure out which machine learning algorithm is the best for your dataset. One, you can just pick one and...
Data Wrangling Part 2: Cleaning up Ohio Crime Data for Machine Learning
In a previous post, I discuss cleaning public Ohio crime data. As I start to get deeper into the data, and go through years...
How to run iPython notebook online for Machine Learning projects
Recently, Google had a Kaggle image contest with test and train image dataset files that were well over a TB in size. My MacBook...
What is an epoch in machine learning?
An epoch is one pass through an entire dataset. This can be in random order. You can also batch your epoch so that you...
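As a rough illustration of epochs and batches, the sketch below walks a small dataset in random order, one full pass per epoch, split into fixed-size batches. The function name `iterate_epochs`, its parameters, and the toy dataset are assumptions for this example, not something taken from the post or from a particular library.

```python
import random


def iterate_epochs(dataset, num_epochs, batch_size, seed=0):
    """Yield (epoch_index, batch) pairs; each epoch visits every item once.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)
    for epoch in range(num_epochs):
        # Reshuffle the indices so each pass sees the data in a new order.
        order = list(range(len(dataset)))
        rng.shuffle(order)
        for start in range(0, len(order), batch_size):
            batch = [dataset[i] for i in order[start:start + batch_size]]
            yield epoch, batch


# Toy dataset of 10 items, 2 epochs, batches of 4 (the last batch is smaller).
data = list(range(10))
batches = list(iterate_epochs(data, num_epochs=2, batch_size=4))

for epoch, batch in batches:
    print(epoch, batch)
```

With 10 items and a batch size of 4, each epoch produces three batches (4, 4, and 2 items), and every item appears exactly once per epoch.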
Machine Learning NFL: My rule changes for punts
The NFL has created a competition in an attempt to reduce concussions on punt plays. Read more about the competition in my previous post....
How to fail and still succeed – a little data science fun
This past week has been brutal on my sinuses. And being that I'm the type who has to find some science in nearly everything....