tags:
- Data-Science
When no value is stored in a certain observation within a row, it's a missing value problem.
Missing values in data rows are problematic because most ML algorithms can’t handle null value. therefor either their column should be removed which reduces training data size, or they should be replaced with sensible and meaningful values which is called imputation.
Missing values cause two problems:
Understand effects of missing values:
Types if missing values:
The mechanisms by which missing fields are introduced in a dataset, can help us in choosing the best solution to handle them. Business Understanding or statistical tests can help us in assuming such variables.
Solutions: