How to select the best model from multiple machine learning algorithms
There are many ways to figure out which machine learning algorithm is the best for your dataset. One, you can just pick one and...
Use Google Colab and Kaggle Data with bonus: fastai2
I was just running through this process and thought it might be helpful for others:
What to do in Kaggle:
Step 1. Go to your Kaggle...
How I wrote a terrible machine learning Nirvana song
We all see articles on how this is successful or that is successful and we get to brag about our successes. But what about...
How to make the first row in your spreadsheet or dataframe the header in...
If you have imported a CSV file into your notebook and use Pandas to view the dataframe you might find that the header of...
What is probability mass function?
Probability mass function is recognized as a probability that is distributed over discrete variables.
First, probability mass function is always denoted with the capital P.
Second,...
How to add a column to pandas dataframe
Pandas allow for many methods for adding and dropping content. We have covered how to drop a column and how to drop a row...
Set up the best parameters for Deep Learning RNN with Grid Search
Most of the time I've spent on Kaggle contests have been hyperparameter optimization. It's a major part of deep learning and fine tuning if...
What is an epoch in machine learning?
An epoch is one pass through an entire dataset. This can be in random order. You an also batch your epoch so that you...
Data Wrangling Part 2: Cleaning up Ohio Crime Data for Machine Learning
In a previous post, I discuss cleaning public Ohio crime data. As I start to get deeper into the data, and go through years...
Feature Engineering: LabelEncoder sklearn example
A big part of machine learning is data cleanup and feature engineering. A good tool in your data science toolkit will be label encoding....














