How to add a column to a pandas dataframe
Pandas offers many methods for adding and dropping content. We have covered how to drop a column and how to drop a row...
How to run an IPython notebook online for Machine Learning projects
Recently, Google ran a Kaggle image contest whose test and train image dataset files were well over a TB in size. My MacBook...
How to select the best model from multiple machine learning algorithms
There are many ways to figure out which machine learning algorithm is the best for your dataset. One, you can just pick one and...
Data Wrangling: Cleaning up Ohio Crime Data for Machine Learning
Often it seems like the biggest part of machine learning is actually acquiring and cleaning up data. The state of Ohio provides crime data...
Analyzing NFL Concussion data for Kaggle Data Science Competition
Recently, I entered the NFL contest for data scientists on concussions during punt returns. It wasn't a typical machine learning problem. In fact, it is...
Feature Engineering: LabelEncoder sklearn example
A big part of machine learning is data cleanup and feature engineering. Label encoding is a good tool to have in your data science toolkit...
What is the difference between data science and data analytics?
Perhaps you are at the beginning of your career or making a change in your career and want to know the difference between data...
Genetic Programming is an awesome way to tackle machine learning problems
I don't know how I missed out on genetic programming. I'm still trying to pick my jaw up off the ground. It reminds me...
How to make the first row in your spreadsheet or dataframe the header in...
If you have imported a CSV file into your notebook and use Pandas to view the dataframe, you might find that the header of...
How I got a Perfect Score in the Kaggle NLP with Disaster Tweets Competition
I love Kaggle. I love the competition and testing my skills against brilliant data scientists from around the world. Today I decided to get...