Is it required to remove correlated variables before performing PCA?

pca

#1

Is it required to remove correlated variables before performing PCA, I don’t think it is necessary but it is mentioned as necessary in question10 of "40 Interview Questions asked at Startups in Machine Learning / Data Science"


#2

PCA is used to remove multicollinearity from the data. As far as I know there is no point in removing correlated variables. If there are correlated variables, then PCA replaces them with a principle component which can explain max variance.


#3

yes, if you have several highly correlated variables then choose between them. This leads to differentially weight several eigenvectors
If you have some correlation between variables it is ok, proceed with PCA


#4

Todo lo contrario, para utilizar PCA es un requisito que las variables tengan correlación. Debes de hacer una serie de pruebas que demuestren correlación entre ellas, no solo la correlación de Pearson.