Hello,
While using random forests I came across a function in the randomForest package in R called combine.This combines the results from 2 or more random forest models.
randomForest_1 = randomForest(label ~ ., data=combi, nodesize=5,ntree=500,do.trace = T)
randomForest_2 = randomForest(label ~ ., data=combi, nodesize=5,ntree=500,do.trace = T)
randomForest_3 = randomForest(label ~ ., data=combi, nodesize=5,ntree=500,do.trace = T)
randomForest_4 = randomForest(label ~ ., data=combi, nodesize=5,ntree=500,do.trace = T)
randomForest_5 = randomForest(label ~ ., data=combi, nodesize=5,ntree=500,do.trace = T)
rf.all <- combine(randomForest_1,randomForest_2,randomForest_3,randomForest_4,randomForest_5)
However,having so many random forests might give memory issues and also might be time extensive.Is this something that is generally done to improve random forests performance?Also how might we decide how many randomForests to generate??