Handling encoding of a dataset which has more than total 2000 columns

Whenever we have a dataset for converting the categorical values to numerical values we generally use LabelEncoding, One Hot encoding etc techniques but all these are done manually going through each column. But what if our dataset is huge in terms of columns, here it wont be possible to go through each column manually, in such cases how do we handle encoding?

Are there any specific libraries available which deal with automatic encoding of variable?


Get the categorical variables first by finding the columns which have dtype == ‘O’
Then you write a function to encode these categorical variables.

© Copyright 2013-2020 Analytics Vidhya