Data Wrangling Part 2: Cleaning up Ohio Crime Data for Machine Learning
In a previous post, I discuss cleaning public Ohio crime data. As I start to get deeper into the data, and go through years...
What is the difference between data science vs data analytics?
Perhaps you are at the beginning of your career or making a change in your career and want to know the difference between data...
Analyzing NFL Concussion data for Kaggle Data Science Competition
Recently, I entered the NFL Concussion on punt returns contest for data scientists. It wasn't the normal machine learning problem. In fact, it is...
How I wrote a terrible machine learning Nirvana song
We all see articles on how this is successful or that is successful and we get to brag about our successes. But what about...
Data Wrangling: Cleaning up Ohio Crime Data for Machine Learning
Often it seems like the biggest part of machine learning is actually acquiring and cleaning up data. The state of Ohio provides crime data...
Machine Learning Algorithms for beginners
Below are some basics on machine learning algorithms. This is for beginners who are trying to get a better understanding about machine learning and...
Feature Engineering: LabelEncoder sklearn example
A big part of machine learning is data cleanup and feature engineering. A good tool in your data science toolkit will be label encoding....
Genetic Programming is an awesome way to tackle machine learning problems
I don't know how I missed out on genetic programming. I'm still trying to pick my jaw up off the ground. It reminds me...
Use Google Colab and Kaggle Data with bonus: fastai2
I was just running through this process and thought it might be helpful for others:
What to do in Kaggle:
Step 1. Go to your Kaggle...
How to convert Google Colabs from Python 2 to Python 3
I don't know why Google Colabs defaults to Python 2, regardless, it's extremely easy to change your notebook to Python 3. It's so easy...













