How to detect outliers in categorical data?




I am not sure what you mean by outliers in categorical data.

If you mean values with low frequency, the best way to detect them is to look at the frequency distribution of each column, and the best way to treat them is to combine them with similar values.
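As a rough illustration of the frequency approach, here is a minimal sketch in plain Python. The function names, the 5% cutoff (`min_share`) and the catch-all label `"Other"` are my own assumptions for the example, not a standard rule; lumping everything rare into one label is also a simplification of "combining with similar values", which usually needs some domain knowledge.

```python
from collections import Counter

def rare_categories(values, min_share=0.05):
    """Return the categories whose relative frequency is below min_share.

    min_share is an assumed cutoff for illustration; pick it for your data.
    """
    counts = Counter(values)
    total = len(values)
    return {cat for cat, n in counts.items() if n / total < min_share}

def combine_rare(values, min_share=0.05, other_label="Other"):
    """Replace every rare category with one catch-all label (hypothetical)."""
    rare = rare_categories(values, min_share)
    return [other_label if v in rare else v for v in values]

# Hypothetical column: 'teal' and 'mauve' each appear once in 40 rows.
colors = ["red"] * 20 + ["blue"] * 18 + ["teal", "mauve"]

# Frequency distribution, most common first.
print(Counter(colors).most_common())

# Categories below the cutoff, then the column with them merged.
print(rare_categories(colors))
cleaned = combine_rare(colors)
print(Counter(cleaned).most_common())
```

With this made-up column, the two one-off values fall below the cutoff and get merged, while "red" and "blue" are left alone.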

If you mean something else, let me know.


I have a dataset with 5 continuous columns and 7 categorical columns.
What should be the process for removing outliers?

  1. Should the z-score or IQR methods be applied to both the continuous and the categorical columns, or only to the categorical ones?
  2. If we are removing outliers from a categorical column on the basis of frequency, what is the minimum frequency a value should have for its records to be retained?