Distribution Plots in Seaborn: I created a project in a Jupyter Notebook under Anaconda that’s called Distribution Plots in Seaborn. For this […] March 26, 2024 in Data Visualization tagged plot / seaborn / chart / distribution by Mike
One-Hot Encoding of Categorical Variables: Are you a data analyst or are you working on a data analysis project and you are wondering […] March 24, 2024 in Data Analytics tagged convert / encode / ordinal / hierarchy / categorical / dummies / data by Mike
Python Floating Point Numbers: One important thing to know when you are programming in Python is how Python deals with floating point […] March 21, 2024 in Python tagged number / float / point / round by Mike
Decision Tree – Only Six Rows: This is a very simple example of building a decision tree model on a very small dataset that […] March 20, 2024 in Machine Learning tagged python / sklearn / decisiontreeclassifier / decision / simple / tree / Decision Trees by Mike
Decision Tree Workflow: Are you working in Python? Do you want to build a decision tree? Let’s work through this workflow […] March 20, 2024 in Machine Learning tagged decision / tree / python / Decision Trees by Mike
Decision Trees and Random Forests: A random forest is a collection of decision trees whose results are aggregated into one final result. Their […] March 19, 2024 in Machine Learning tagged forest / random / decision / tree / Decision Trees by Mike
Titanic Logistic Regression: This post will discuss the building of a logistic regression model on the Titanic dataset provided by Kaggle. […] March 18, 2024 in Data Analytics tagged table / pivot / regression / impute / logistic / Titanic by Mike
Data Imputation of Age: Exploratory Data Analysis (EDA) has six main practices. The six main practices of EDA are discovering, structuring, cleaning, […] March 17, 2024 in Python tagged clean / apply / python / missing / pandas / groupby / function / boxplot / search / seaborn / replace / impute / Pandas EDA Cleaning by Mike
Seaborn Style and Color: On the seaborn website there is an article called Controlling Figure Aesthetics. There is another article called Choosing […] March 12, 2024 in Python tagged colour / visualization / color / chart / seaborn / matplotlib / data / style / Seaborn by Mike
Seaborn Introduction: What is seaborn? Seaborn is a visualization library for making statistical graphics in Python. It builds on top […] March 12, 2024 in Python tagged graph / python / EDA / seaborn / matplotlib / visualize / graphics / library / chart / Seaborn by Mike
A Titanic Dataset: In data analytics and data science circles, this is a very famous dataset. Many people who are learning […] March 12, 2024 in Data Analytics tagged dataset / Titanic by Mike
EDA Discovering with Visuals: Are you exploring a dataset using Python? If you are in your initial discovery phase, you might want […] March 11, 2024 in Python tagged visualization / EDA / Pandas EDA Discovery by Mike