Feature Engineering: LabelEncoder sklearn example
A big part of machine learning is data cleanup and feature engineering. A good tool in your data science toolkit will be label encoding....
What is an epoch in machine learning?
An epoch is one pass through an entire dataset. This can be in random order. You an also batch your epoch so that you...
Machine Learning: How to pull Google Sheets data into Colabs
If you are using Google Colabs it would make sense to want to pull in a CSV file for a machine learning project. It...
What is the difference between data science vs data analytics?
Perhaps you are at the beginning of your career or making a change in your career and want to know the difference between data...
Data Wrangling Part 2: Cleaning up Ohio Crime Data for Machine Learning
In a previous post, I discuss cleaning public Ohio crime data. As I start to get deeper into the data, and go through years...
What is Conditional Probability and formula?
Conditional probability is used to find out the probability of some event happening given that some other event has happened. Easy right?
Therefore, conditional probability...
How to drop rows in a pandas dataframe
A previous tutorial showed you how to drop columns in a pandas dataframe. Now we will look at how to drop rows in a...
How to import data into Python to run Machine Learning projects
One of the most important aspects of running any machine learning program is the ability to pull in data from various sources and of...
Machine Learning: Creating vectors, matrix and arrays
To get started in machine learning there are some basics that you need to know and understand before you can ever just start coding...
Analyzing NFL Concussion data for Kaggle Data Science Competition
Recently, I entered the NFL Concussion on punt returns contest for data scientists. It wasn't the normal machine learning problem. In fact, it is...













