Data Wrangling: Cleaning up Ohio Crime Data for Machine Learning
Often it seems like the biggest part of machine learning is actually acquiring and cleaning up data. The state of Ohio provides crime data...
How to fail and still succeed – a little data science fun
This past week has been brutal on my sinuses. And being that I'm the type who has to find some science in nearly everything....
How to convert Google Colabs from Python 2 to Python 3
I don't know why Google Colabs defaults to Python 2, regardless, it's extremely easy to change your notebook to Python 3. It's so easy...
Machine Learning: How to pull Google Sheets data into Colabs
If you are using Google Colabs it would make sense to want to pull in a CSV file for a machine learning project. It...
Use Google Colab and Kaggle Data with bonus: fastai2
I was just running through this process and thought it might be helpful for others:
What to do in Kaggle:
Step 1. Go to your Kaggle...
What is Conditional Probability and formula?
Conditional probability is used to find out the probability of some event happening given that some other event has happened. Easy right?
Therefore, conditional probability...
Data Wrangling Part 2: Cleaning up Ohio Crime Data for Machine Learning
In a previous post, I discuss cleaning public Ohio crime data. As I start to get deeper into the data, and go through years...
How to make the first row in your spreadsheet or dataframe the header in...
If you have imported a CSV file into your notebook and use Pandas to view the dataframe you might find that the header of...
What is an epoch in machine learning?
An epoch is one pass through an entire dataset. This can be in random order. You an also batch your epoch so that you...
Set up the best parameters for Deep Learning RNN with Grid Search
Most of the time I've spent on Kaggle contests have been hyperparameter optimization. It's a major part of deep learning and fine tuning if...












