How do I encode ordered categorical data? Does encoding the order improve model accuracy? What are some ways to encode categorical data other than one-hot encoding?


sambid,

Yes, it can help improve model accuracy if the categorical feature has a natural order.

There are multiple ways to encode a categorical feature:

  1. One-Hot-Encoding
  2. Label Encoding
  3. Mean response
  4. Out of fold mean response
  5. Take its interaction with other variables.
  6. Convert it into a frequency feature.
  7. Hashing trick
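
For an ordered feature, label encoding and frequency encoding from the list above could look roughly like this in pandas. This is only a sketch: the column name `size` and its levels are made up for illustration, and the order has to come from your own knowledge of the data.

```python
import pandas as pd

# Hypothetical toy data; "size" and its levels are assumptions for illustration.
df = pd.DataFrame({"size": ["small", "large", "medium", "small", "large"]})

# Label encoding that respects the order: small < medium < large.
# Passing the levels explicitly avoids the default alphabetical ordering.
order = ["small", "medium", "large"]
df["size_label"] = pd.Categorical(df["size"], categories=order, ordered=True).codes

# Frequency encoding: replace each level with the share of rows it covers.
freq = df["size"].value_counts(normalize=True)
df["size_freq"] = df["size"].map(freq)

print(df)
```

Integer codes like these tend to suit tree-based models; for linear models the equal spacing between levels is an extra assumption you may not want.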

By ordered categorical variable, do you mean an ordinal variable?
Ankit Gupta


Thanks. Yes, I mean an ordinal variable. Can you give me a link or an example for label encoding and out-of-fold mean response?
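
A minimal sketch of out-of-fold mean response, assuming a pandas DataFrame with one categorical column and a numeric target. The function name `oof_mean_encode` and the columns `city` and `y` are hypothetical, not from any library. The idea is that each row gets the mean target of its category computed only from the other folds, so a row's own target never leaks into its feature.

```python
import numpy as np
import pandas as pd

def oof_mean_encode(df, col, target, n_splits=5, seed=0):
    """Out-of-fold mean response encoding for one categorical column (sketch)."""
    rng = np.random.default_rng(seed)
    # Assign every row to one of n_splits random folds of near-equal size.
    fold_id = rng.permutation(len(df)) % n_splits
    encoded = pd.Series(np.nan, index=df.index, dtype=float)
    for k in range(n_splits):
        held_out = fold_id == k
        train = df.loc[~held_out]
        # Category means computed only from rows outside the current fold.
        means = train.groupby(col)[target].mean()
        values = df.loc[held_out, col].map(means)
        # Levels unseen in the other folds fall back to those folds' overall mean.
        values = values.fillna(train[target].mean())
        encoded.loc[held_out] = values.to_numpy()
    return encoded

# Hypothetical toy data for illustration only.
data = pd.DataFrame({"city": list("AABBBCAB"), "y": [1, 0, 1, 1, 0, 1, 0, 1]})
data["city_oof"] = oof_mean_encode(data, "city", "y", n_splits=4)
print(data)
```

For test data you would normally encode with category means from the full training set, since there is no target to leak there.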