Datascience-Query

Select all the approaches which can be used for dealing with missing values while cleaning the data

Drop the rows which contain missing values
Drop the entire column which has missing values
Replace missing values with the interquartile range
Replace missing values with the standard deviation
Replace missing values with the median

There are two ways you can handle missing values.

  1. Deletion.
    You can delete rows or columns use dropna()
  2. Imputation.
    You can replace the NaN values with.
    a) Median or Mean.
    b) or anything number that would like numbers or string etc.

But you don’t replace with interquartile range or std. as they are prone to outliers.

All the best!

Thanks for your valuable comments.

1 Like