Month: September 2020
11 posts
What On Earth Is The Support Vector Machines (SVM) ML Model?
Support vector machines are supervised classification models, within which each observation (or data point) is plotted on an…
Boost Your Random Forest Machine Learning Model Accuracy With Gradient Boosted Machines
In a traditional random forest, there is parallel learning. In the below, we can see that each model…
My Three Favourite Supervised Regression Machine Learning Model Options
In this article, we’re going to cover what models you could use to predict continuous numeric values. You…
Three Methods For Feature Engineering and Selection For Data Engineers
As with the previous sections in this series, there is a little overlap – but not a huge…
How Do We Go About Data Exploration As Data Engineers?
In the previous article, I highlighted the below phases as being part of the ML workflow. Data cleansing,…
How Do Data Scientists Carry Out Data Cleaning?
When we’re working through a data science problem, there really are a few main steps which we need…
Outlier and Anomaly Dection Using Isolation Forest For Data Scientists
Detecting outliers in highly dimensional data is hard. There are so many observations across a large number of…
Calculating Distance Between Two Geo Points In Python
As you may be aware, I am a Python tutor online and quite often I get asked pretty…
A UK Postcode Validation Script In Python
The below script takes the input of a UK postcode and ensures that it matches a valid format.…
Making a simple hangman game in Python
Today I thought it might be cool to make a super simple little text based game in Python…