[scikit-learn] Should we standardize data before PCA?

Shiheng Duan shiduan at ucdavis.edu
Thu May 24 19:39:42 EDT 2018


Hello all,

I wonder is it necessary or correct to do z score transformation before
PCA? I didn't see any preprocessing for face image in the example of Faces
recognition example using eigenfaces and SVMs, link:
http://scikit-learn.org/stable/auto_examples/applications/plot_face_recognition.html#sphx-glr-auto-examples-applications-plot-face-recognition-py

I am doing on a similar dataset and got a weird result if I standardized
data before PCA. The components figure will have a strong gradient and it
doesn't make any sense. Any ideas about the reason?

Thanks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20180524/39617e43/attachment-0001.html>


More information about the scikit-learn mailing list