I have a dataset with 1200+ variables. I want to find the variable importance using random forest for this dataset. I am facing memory issues / system gets hanged completely when i run the predictor importance.
Total number of records is 120000. I could not sample the data for two reasons. 1. No of columns and records proportionate seems to be less. 2. Huge variation of pattern with in the dataset.
My system has 8 gb ram. I cannot go for cloud machine as it is a client confidential data. How can I overcome this issue and still find the variable importance?