[scikit-learn] scikit-learn Digest, Vol 19, Issue 37

Andreas Mueller t3kcit at gmail.com
Wed Oct 18 15:01:26 EDT 2017



On 10/17/2017 11:18 PM, Ismael Lemhadri wrote:
> How about editing the various chunks of code concerned to add the 
> option to scale the parameters, and set it by default to NOT scale? 
> This would make what happens clear without the redundancy Andreas 
> mentioned, and would add more convenience to the user shall they want 
> to scale their data.
>
I don't feel that would add a lot, and it would still requires the users 
to read the docs.
There are many ways to scale, and applying any of them is very easy with 
scikit-learn.

The main source of confusion seems to be that you expected the PCA to 
scale, and it doesn't.
It doesn't say anywhere that it scales, and scaling is not part of the 
definition of PCA
(in contrast to subtracting the mean).

I guess part of the confusion came from the somewhat cryptic docstring 
about SVD, but you fixed that.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20171018/850b7938/attachment-0001.html>


More information about the scikit-learn mailing list