Use isinstance to Check Data Type: Are you a data professional working with a pandas dataset and have you found that one or more […] April 7, 2024 in Python tagged function / type / object / isinstance / custom / data by Mike
Display All of the Columns in a DataFrame: Are you working with pandas in a Python project? It can be very helpful to ensure that all […] April 6, 2024 in Python tagged max / show / dataset / all / pandas / dataframe / option / display / columns / pandas DataFrame by Mike
Encode Category to Ordinal Numeric: For this example, I will use Python. Are you a data professional working with a dataset that contains […] April 6, 2024 in Python tagged category / encode / ordinal / categorical / pandas / dummy by Mike
Correlation Heatmap in Python: A heatmap is a type of data visualization that depicts the magnitude of an instance or set of […] April 6, 2024 in Python tagged seaborn / heatmap / data / visualization / create / chart / python / correlation by Mike
Python’s None and pandas’ NaN: In Python, the built-in value for null is None. To experiment with this I have created a Jupyter […] April 4, 2024 in Python tagged None / NaN / empty / null by Mike
Split a Delimited String in a DataFrame: Do you need to split a string inside a column of a pandas DataFrame? Do you want to […] April 4, 2024 in Python tagged string / parse / split / delimited / pandas / comma by Mike
Parse a String in Python: Do you need to parse a delimited string in Python? Is your string comma-delimited? Are there several strings […] April 3, 2024 in Python tagged pandas / comma / parse / python / delimited by Mike
Google Books: Google Books is a project developed by the company Google to create a fully digitized library available to […] April 2, 2024 in Learning tagged books / free / ebooks / research by Mike
Bag of Words: Wikipedia says: “The bag-of-words model is a model of text which uses a representation of text that is […] April 2, 2024 in Machine Learning tagged NLP / processing / language / natural / NLP by Mike
EDA Validating with Python: Exploratory Data Analysis (EDA) has six main practices. The six main practices of EDA are discovering, structuring, cleaning, […] March 30, 2024 in Data Analytics tagged validation / validate by Mike
Interquartile Range (IQR): The middle 50% of your data is called the interquartile range, or IQR. The interquartile range is the […] March 27, 2024 in Statistics tagged quartile / median / irq / range / Statistics by Mike
HR Analytics Job Prediction Project: This is a dataset on Kaggle. The title of the Kaggle project is the same as the title […] March 27, 2024 in Data Analytics tagged data / resources / employee / human / dataset / HR by Mike