Data cleaning - sideround/project-ml-onlineshop GitHub Wiki

  1. Replace badly formatted strings with a normalized form (e.g. "FeB" -> "feb").
  2. Fix string values that appear in float columns.
  3. Outliers: decide case by case whether the value makes sense; if it does not, replace it with the mean.
  4. NaN: fill missing values with a forward fill (ffill).
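
The four steps above could be sketched with pandas roughly as follows. This is a minimal illustration, not the project's actual code: the column names `Month` and `Duration`, the helper `clean`, and the sample values are hypothetical. The list does not say how outliers are detected or how unparseable strings are handled, so the 1.5 * IQR rule and the coercion of such strings to NaN are assumptions made only for this example.

```python
import pandas as pd


def clean(df: pd.DataFrame, text_col: str, float_col: str) -> pd.DataFrame:
    """Apply the four cleaning steps to one text column and one float column."""
    out = df.copy()

    # 1. Normalize badly formatted strings ("FeB" -> "feb").
    out[text_col] = out[text_col].str.strip().str.lower()

    # 2. Strings in a float column: parse what can be parsed; anything else
    #    becomes NaN (assumption) so that step 4 can fill it.
    out[float_col] = pd.to_numeric(out[float_col], errors="coerce").astype(float)

    # 3. Outliers: flagged here with the 1.5 * IQR rule (assumption), then
    #    replaced by the mean of the remaining, non-outlier values.
    q1, q3 = out[float_col].quantile([0.25, 0.75])
    iqr = q3 - q1
    is_outlier = (out[float_col] < q1 - 1.5 * iqr) | (out[float_col] > q3 + 1.5 * iqr)
    out.loc[is_outlier, float_col] = out.loc[~is_outlier, float_col].mean()

    # 4. NaN: forward fill from the previous row.
    out[float_col] = out[float_col].ffill()
    return out


# Hypothetical sample data.
raw = pd.DataFrame({
    "Month": ["FeB", "Mar", "MAY", "June", "jul", "Aug", "FeB"],
    "Duration": ["10.0", 12.0, "abc", 11.0, 1000.0, 9.0, 13.0],
})
cleaned = clean(raw, "Month", "Duration")
print(cleaned)
```

Note that a forward fill cannot fill a NaN in the first row, since there is no earlier value to copy; such rows would need a separate decision.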