Data Wrangling: Cleaning up Ohio Crime Data for Machine Learning
Often it seems like the biggest part of machine learning is actually acquiring and cleaning up data. The state of Ohio provides crime data...
How to run iPython notebook online for Machine Learning projects
Recently, Google had a Kaggle image contest with test and train image dataset files that were well over a TB in size. My MacBook...
How to drop rows in a pandas dataframe
A previous tutorial showed you how to drop columns in a pandas dataframe. Now we will look at how to drop rows in a...
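As a quick taste of what dropping rows can look like, here is a minimal sketch. The DataFrame, its column names, and its values are made up for illustration and are not taken from the tutorial. It shows two common approaches: dropping by index label with `DataFrame.drop`, and keeping only the rows that pass a boolean condition.

```python
import pandas as pd

# Hypothetical example data, invented for this sketch.
df = pd.DataFrame(
    {
        "name": ["a", "b", "c", "d"],
        "score": [12, 7, 30, 5],
    }
)

# Drop rows by index label. drop() returns a new DataFrame
# and leaves the original untouched.
dropped_by_label = df.drop(index=[0, 2])

# "Drop" rows by condition: keep only rows where score is at least 10.
kept_by_condition = df[df["score"] >= 10]

print(dropped_by_label)
print(kept_by_condition)
```

Both calls leave `df` itself unchanged, so you can assign the result back to `df` if you want the change to stick.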
Analyzing NFL Concussion data for Kaggle Data Science Competition
Recently, I entered the NFL Concussion on punt returns contest for data scientists. It wasn't the normal machine learning problem. In fact, it is...
What is probability mass function?
A probability mass function gives the probability of each possible value of a discrete random variable.
First, a probability mass function is conventionally denoted with a capital P.
Second,...
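To make the idea concrete, here is a small sketch of a probability mass function for one discrete variable: the sum of two fair six-sided dice. The dice example and the function name `pmf_two_dice` are my own illustration, not part of the original post. Exact fractions are used so the probabilities add up to exactly 1.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def pmf_two_dice():
    """Return P(X = s) for X = the sum of two fair six-sided dice."""
    # Count how many of the 36 equally likely outcomes give each sum.
    counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))
    total = sum(counts.values())
    # Divide each count by the number of outcomes to get a probability.
    return {s: Fraction(c, total) for s, c in counts.items()}


pmf = pmf_two_dice()
for value in sorted(pmf):
    print(f"P(X = {value}) = {pmf[value]}")
```

Each value gets its own probability, and summing over every possible value gives 1, which is the defining property of a probability mass function.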
Use Google Colab and Kaggle Data with bonus: fastai2
I was just running through this process and thought it might be helpful for others:
What to do in Kaggle:
Step 1. Go to your Kaggle...
How to fail and still succeed – a little data science fun
This past week has been brutal on my sinuses. And being that I'm the type who has to find some science in nearly everything....
What is an epoch in machine learning?
An epoch is one pass through an entire dataset. This can be in random order. You can also batch your epoch so that you...
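Here is a rough sketch of that idea in plain Python, assuming a dataset small enough to hold in a list. The helper name `iterate_epochs` and its parameters are hypothetical and exist only for this illustration. Each epoch visits every item exactly once, in a freshly shuffled order, and hands the items out in batches.

```python
import random


def iterate_epochs(data, num_epochs, batch_size, seed=0):
    """Yield (epoch_number, batch) pairs; each epoch covers all of data once."""
    rng = random.Random(seed)
    for epoch in range(num_epochs):
        # Reshuffle the visiting order at the start of every epoch.
        order = list(range(len(data)))
        rng.shuffle(order)
        # Walk through the shuffled order one batch at a time.
        for start in range(0, len(order), batch_size):
            batch = [data[i] for i in order[start:start + batch_size]]
            yield epoch, batch


data = list(range(10))
for epoch, batch in iterate_epochs(data, num_epochs=2, batch_size=4):
    print(epoch, batch)
```

With 10 items and a batch size of 4, each epoch produces three batches, and the last one is smaller because the dataset does not divide evenly.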
How to run notebook in Google Cloud for Machine Learning
Step one: Go to Google Cloud Platform and create a Project.
Step two: Click Activate Cloud Shell.
Step three: In the console you will want to...
How I got a Perfect Score on a Kaggle NLP with Disaster Tweets Competition
I love Kaggle. I love the competition and testing my skills against brilliant data scientists from around the world. Today I decided to get...