2 Anscombe’s QuartetWhat is Anscombe’s Quartet? It has four datasets with nearly identical summary statistics. It underscores the limitations of […] May 23, 2024 in Data Analytics tagged quartet / data / visualization / statistics / EDA by Mike
The Gapminder FoundationWikipedia says: “Gapminder Foundation is a non-profit venture registered in Stockholm, Sweden, that promotes sustainable global development and […] April 13, 2024 in Data Analytics tagged data / world / statistics / foundation / gapminder by Mike
EDA Validating with PythonExploratory Data Analysis (EDA) has six main practices. The six main practices of EDA are discovering, structuring, cleaning, […] March 30, 2024 in Data Analytics tagged validation / validate by Mike
HR Analytics Job Prediction ProjectThis is a dataset on Kaggle. The title of the Kaggle project is the same as the title […] March 27, 2024 in Data Analytics tagged data / resources / employee / human / dataset / HR by Mike
One-Hot Encoding of Categorical VariablesAre you a data analyst or are you working on a data analysis project and you are wondering […] March 24, 2024 in Data Analytics tagged dummies / data / convert / encode / ordinal / hierarchy / categorical by Mike
Titanic Logistic RegressionThis post will discuss the building of a logistic regression model on the Titanic dataset provided by Kaggle. […] March 18, 2024 in Data Analytics tagged impute / logistic / Titanic / table / pivot / regression by Mike
A Titanic DatasetIn data analytics and data science circles, this is a very famous dataset. Many people who are learning […] March 12, 2024 in Data Analytics tagged dataset / Titanic by Mike
The diamonds Dataset in PythonThe Diamonds dataset comes with seaborn, which is a Python library. Below is some Python code you can […] March 6, 2024 in Data Analytics tagged dataset / seaborn / diamonds / python by Mike
Data Structuring & Cleaning with MikeWhat are the steps to cleaning your data in a data analysis project? This article combines a few […] March 4, 2024 in Data Analytics tagged data / clean / analysis / EDA / exploratory / cleaning / Data Cleaning by Mike
Good Data SourcesYou are looking at a dataset or a series of datasets and you are wondering if the data […] January 18, 2024 in Data Analytics tagged rock / governance / data / ROCCC / credible / accurate / good by Mike
Google Analytics IntroductionGoogle Analytics is a software service offered by Google that tracks and reports website and app traffic. Google […] November 7, 2023 in Data Analytics tagged analytics / GA4 / Google / Google Analytics by Mike
Data Imputation of Negative NumbersExploratory Data Analysis (EDA) has six main practices. The six main practices of EDA are discovering, structuring, cleaning, […] October 14, 2023 in Data Analytics tagged data / clean / outliers / impute / imputation / errors / estimate / Pandas EDA Cleaning by Mike