Hi

My dataset have some variables with factor level more than 30. When running my model for prediction I am getting the following error in R.

modelRF1 <- mlr::train(tunedRFmodel, trainTask)

#Prediction on test data

predictTest <- predict(modelRF1, testTask)

Error in predict.randomForest(.model$learner.model, newdata = .newdata, :

New factor levels not present in the training data

I am using mlr package in R.

My question is that is there a proper way to split our data so that we get same level of factors in both train and test data.

Thanks