Data Wrangling Part 2: Cleaning up Ohio Crime Data for Machine Learning
In a previous post, I discuss cleaning public Ohio crime data. As I start to get deeper into the data, and go through years...
What is probability mass function?
Probability mass function is recognized as a probability that is distributed over discrete variables.
First, probability mass function is always denoted with the capital P.
Second,...
How to import data into Python to run Machine Learning projects
One of the most important aspects of running any machine learning program is the ability to pull in data from various sources and of...
What is Conditional Probability and formula?
Conditional probability is used to find out the probability of some event happening given that some other event has happened. Easy right?
Therefore, conditional probability...
How to add a column to pandas dataframe
Pandas allow for many methods for adding and dropping content. We have covered how to drop a column and how to drop a row...
Analyzing NFL Concussion data for Kaggle Data Science Competition
Recently, I entered the NFL Concussion on punt returns contest for data scientists. It wasn't the normal machine learning problem. In fact, it is...
Machine Learning NFL: My rule changes for punts
The NFL has created a competition in an attempt to reduce concussions on punt plays. Read more about the competition in my previous post....
Set up the best parameters for Deep Learning RNN with Grid Search
Most of the time I've spent on Kaggle contests have been hyperparameter optimization. It's a major part of deep learning and fine tuning if...
How to fail and still succeed – a little data science fun
This past week has been brutal on my sinuses. And being that I'm the type who has to find some science in nearly everything....
Genetic Programming is an awesome way to tackle machine learning problems
I don't know how I missed out on genetic programming. I'm still trying to pick my jaw up off the ground. It reminds me...













