Analyzing NFL Concussion data for Kaggle Data Science Competition
Recently, I entered the NFL Concussion on punt returns contest for data scientists. It wasn't the normal machine learning problem. In fact, it is...
How to fail and still succeed – a little data science fun
This past week has been brutal on my sinuses. And being that I'm the type who has to find some science in nearly everything....
How I wrote a terrible machine learning Nirvana song
We all see articles on how this is successful or that is successful and we get to brag about our successes. But what about...
How to fillna in pandas in Python
We are going to show you how to fillna using pandas in Python. No dataset is going to come perfect and ready to go....
What is Conditional Probability and formula?
Conditional probability is used to find out the probability of some event happening given that some other event has happened. Easy right?
Therefore, conditional probability...
Data Wrangling Part 2: Cleaning up Ohio Crime Data for Machine Learning
In a previous post, I discuss cleaning public Ohio crime data. As I start to get deeper into the data, and go through years...
Machine Learning: Creating vectors, matrix and arrays
To get started in machine learning there are some basics that you need to know and understand before you can ever just start coding...
How to run iPython notebook online for Machine Learning projects
Recently Google had a Kaggle image contest with test and train image dataset files that were well over a TB in size. My Macbook...
How I got a Perfect Score on a Kaggle NLP with Disaster Tweets Competition
I love Kaggle. I love the competition and testing my skills against brilliant data scientists from around the world. Today I decided to get...
What is probability mass function?
Probability mass function is recognized as a probability that is distributed over discrete variables.
First, probability mass function is always denoted with the capital P.
Second,...