One-Hot Encoding of Categorical VariablesAre you a data analyst or are you working on a data analysis project and you are wondering […] March 24, 2024 in Data Analytics tagged encode / ordinal / hierarchy / categorical / dummies / data / convert by Mike
Seaborn Style and ColorAt the seaborn website there is an article called Controlling Figure Aesthetics. There is another article called Choosing […] March 12, 2024 in Python tagged style / colour / visualization / color / chart / seaborn / matplotlib / data / Seaborn by Mike
Remove Bad Characters in PandasIf you are working with a pandas DataFrame and you need to remove some faulty characters that should […] March 5, 2024 in Python tagged data / clean / characters / strip / bad by Mike
Data Structuring & Cleaning with MikeWhat are the steps to cleaning your data in a data analysis project? This article combines a few […] March 4, 2024 in Data Analytics tagged EDA / exploratory / cleaning / data / clean / analysis / Data Cleaning by Mike
The Iris Dataset in PythonAccording to Wikipedia, “The Iris flower data set or Fisher’s Iris data set is a multivariate data set […] February 1, 2024 in Python tagged dataset / correlation / iris / flower / heatmap / data by Mike
Data Types in pandasAre you performing data analysis on a dataset? In other words, are you performing exploratory data analysis (EDA)? […] January 31, 2024 in Python tagged data / types / Pandas Data Types by Mike
Good Data SourcesYou are looking at a dataset or a series of datasets and you are wondering if the data […] January 18, 2024 in Data Analytics tagged ROCCC / credible / accurate / good / rock / governance / data by Mike
Filtering a Data Set in RHow do you filter a data set in R? Suppose you want to only see certain rows of […] January 16, 2024 in R Language tagged data / filter / dataset by Mike
Types of Machine LearningThere are several types of machine learning. When considering which type of machine learning model to use, it’s […] October 15, 2023 in Machine Learning tagged machine / dataset / AI / data / learning / types / Machine Learning Overview by Mike
Data Imputation of Negative NumbersExploratory Data Analysis (EDA) has six main practices. The six main practices of EDA are discovering, structuring, cleaning, […] October 14, 2023 in Data Analytics tagged outliers / impute / imputation / errors / estimate / data / clean / Pandas EDA Cleaning by Mike
Data WarehousesWhat is a data warehouse? It is a central data hub of data used for reporting and data […] September 12, 2023 in Data Engineering tagged data / warehouse by Mike
Data ArchitectureWhat is data architecture? It’s not a simple task to define data architecture, partly because the field is […] September 12, 2023 in Data Engineering tagged data / database / design / architecture by Mike