How to identify which variables are important in a large dataset?

data_wrangling

#1

Hello,

I am exploring the Human Activity Recognition Using Smartphones dataset, and the training set contains 561 variables.
http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones
How can I identify the important variables among so many?

Thanks.


#2

Hi Aditya,

One way is to fit a model to the dataset and look at the model summary, which lists the important variables. Tree-based models such as random forests, for example, report an importance score for each variable.
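To make the idea concrete, here is a minimal sketch in plain Python. It is only an illustration under assumptions of my own: the data is synthetic (one informative column named "signal" and two noise columns, all hypothetical), the model is a simple nearest-centroid classifier, and importance is measured by permutation (shuffle one column and see how much accuracy drops). None of this is specific to the smartphone dataset.

```python
import random


def fit_centroids(X, y):
    """Return {label: mean feature vector} for a nearest-centroid classifier."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def predict(centroids, row):
    """Predict the label whose centroid is closest in squared Euclidean distance."""
    def sq_dist(center):
        return sum((a - b) ** 2 for a, b in zip(row, center))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))


def accuracy(centroids, X, y):
    hits = sum(predict(centroids, row) == label for row, label in zip(X, y))
    return hits / len(y)


def permutation_importance(centroids, X, y, names, rng, repeats=5):
    """Mean drop in accuracy when one column is shuffled, largest drop first."""
    baseline = accuracy(centroids, X, y)
    scores = {}
    for j, name in enumerate(names):
        drops = []
        for _ in range(repeats):
            column = [row[j] for row in X]
            rng.shuffle(column)  # break the link between this column and y
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            drops.append(baseline - accuracy(centroids, X_perm, y))
        scores[name] = sum(drops) / repeats
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Synthetic stand-in data (an assumption for illustration, not the real dataset):
# "signal" depends on the class label, the two noise columns do not.
rng = random.Random(0)
names = ["signal", "noise_a", "noise_b"]
X, y = [], []
for i in range(400):
    label = i % 2
    X.append([rng.gauss(5.0 * label, 1.0), rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)])
    y.append(label)

# Fit on the first half, measure importance on the held-out second half.
model = fit_centroids(X[:200], y[:200])
ranked = permutation_importance(model, X[200:], y[200:], names, rng)

for name, score in ranked:
    print(f"{name}: {score:.3f}")
```

Variables whose shuffling barely changes the accuracy are candidates for dropping. In practice you would more likely use a library model. In scikit-learn, for instance, a fitted `RandomForestClassifier` exposes `feature_importances_`, and there is also a `permutation_importance` function in `sklearn.inspection`.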

Regards,
Karthikeyan P