Analyzing the sinking of the Titanic – Data Analysis with Python (Course V)
July 12, 2020 2020-08-04 10:49
What is Exploratory Data Analysis (EDA)?
Whenever a data scientist receives a new dataset, do they start building a Machine Learning model directly?
Well, the answer is no. They start exploring the dataset in a quest to better understand the different features present in the dataset and their relationship to each other. In other words, they perform Exploratory Data Analysis or simply, EDA.
As a formal definition, Exploratory Data Analysis (EDA) is an approach for data analysis that makes use of various analytical and graphical techniques to:
- better understand the data
- extract important variables for data modelling
- detect outliers and anomalies
- generate and test one or more hypotheses about the data
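To make these goals concrete, here is a minimal sketch of each one applied to a tiny table of passenger records. The records, field names (`sex`, `age`, `fare`, `survived`), and the 1.5 × IQR outlier rule are assumptions made up for illustration; they are not the real Titanic data or necessarily the approach used later in this course. The sketch deliberately uses only the Python standard library so it runs anywhere.

```python
from collections import defaultdict
import statistics

# Hypothetical toy records, invented for illustration (NOT real Titanic data).
passengers = [
    {"sex": "male",   "age": 22,   "fare": 7.25,  "survived": 0},
    {"sex": "male",   "age": None, "fare": 8.05,  "survived": 0},
    {"sex": "female", "age": 27,   "fare": 13.0,  "survived": 1},
    {"sex": "male",   "age": 54,   "fare": 26.0,  "survived": 0},
    {"sex": "female", "age": 4,    "fare": 30.0,  "survived": 1},
    {"sex": "female", "age": 35,   "fare": 53.1,  "survived": 1},
    {"sex": "male",   "age": 40,   "fare": 80.0,  "survived": 1},
    {"sex": "female", "age": 38,   "fare": 512.0, "survived": 0},
]

# 1. Better understand the data: how many rows, and how many missing values per field?
n_rows = len(passengers)
missing = {key: sum(p[key] is None for p in passengers) for key in passengers[0]}

# 2. Look at a candidate variable for modelling: summarise the fare.
fares = sorted(p["fare"] for p in passengers)
median_fare = statistics.median(fares)

# 3. Detect outliers with the common 1.5 * IQR rule (one heuristic among many).
q1, _, q3 = statistics.quantiles(fares, n=4)
iqr = q3 - q1
outliers = [f for f in fares if f < q1 - 1.5 * iqr or f > q3 + 1.5 * iqr]

# 4. Generate a hypothesis ("survival rate differs by sex") and check it on the data.
survived_by_sex = defaultdict(list)
for p in passengers:
    survived_by_sex[p["sex"]].append(p["survived"])
survival_rate = {sex: statistics.mean(flags) for sex, flags in survived_by_sex.items()}

print("rows:", n_rows)
print("missing values:", missing)
print("median fare:", median_fare)
print("fare outliers:", outliers)
print("survival rate by sex:", survival_rate)
```

On a real dataset the same four steps are usually done with a data analysis library and plots rather than hand-written loops, but the questions being asked of the data stay the same.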
A good data scientist has excellent EDA skills, and in this course we will focus on developing those skills in you.