Map ReduceWhat is Map Reduce? First of all, this is an advanced topic, however we need to be aware […] May 18, 2024 in Data Science tagged data / programming / model / big / key-value / pairs / hadoop / Big Data by Mike
Apache Spark IntroductionWhat is Apache Spark? Apache Spark is an open-source, distributed processing system used for big data workloads. Apache […] May 18, 2024 in Data Science tagged Spark / data / distributed by Mike
Data.WorldWhat is Data World? It’s an organization with a website at data.world. If you are a data professional […] May 18, 2024 in Data Science tagged data / community by Mike
Ben Schneiderman’s Information Seeking MantraDr. Ben Shneiderman’s “information seeking mantra” is a foundational guideline in the field of data visualization. The mantra […] May 16, 2024 in Data Visualization tagged data / visualization by Mike
DBeaver DatabaseDBeaver is a SQL client software application and a database administration tool for relational databases. DBeaver Community is […] May 15, 2024 in Database tagged database / dataset / data by Mike
Visualizing DataVisualizing data is the most intuitive way to interpret it, so it’s an invaluable skill. It is much […] April 13, 2024 in Statistics tagged data / visualization / variable / chart / bar / graph / Statistics by Mike
The Classification of DataAre you starting out in your exploration of what it is like to be a data professional? This […] April 13, 2024 in Statistics tagged qualitative / data / Quantitative / categorical / Statistics by Mike
The Gapminder FoundationWikipedia says: “Gapminder Foundation is a non-profit venture registered in Stockholm, Sweden, that promotes sustainable global development and […] April 13, 2024 in Data Analytics tagged data / world / statistics / foundation / gapminder by Mike
International GDP DatasetsWe have several options for accessing datasets that compare countries and include indicators like GDP per capita. This […] April 11, 2024 in Datasets tagged data / product / world / dataset / gross / domestic / GDP by Mike
Use isinstance to Check Data TypeAre you a data professional working with a pandas dataset and have you found that one or more […] April 7, 2024 in Python tagged object / isinstance / custom / data / function / type by Mike
Correlation Heatmap in PythonA heatmap is a type of data visualization that depicts the magnitude of an instance or set of […] April 6, 2024 in Python tagged chart / python / correlation / seaborn / heatmap / data / visualization / create by Mike
HR Analytics Job Prediction ProjectThis is a dataset on Kaggle. The title of the Kaggle project is the same as the title […] March 27, 2024 in Data Analytics tagged data / resources / employee / human / dataset / HR by Mike