[scikit-learn] How to determine suitable cluster algo

lampahome pahome.chen at mirlab.org
Thu Jan 24 20:40:41 EST 2019


Maybe the suitable way is trial and error?

My concern is that my dataset is very large, so I can't try every
number of clusters from 1 to N when I have N samples.
That would take too much time.

Maybe I should choose the initial number of clusters based on execution time?

Then, as the next step, analyze whether to increase or decrease the number of clusters?
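
One way to avoid trying every value from 1 to N might be a coarse search over a
few candidate values on a random subsample. The sketch below is only an
illustration under assumptions: MiniBatchKMeans and silhouette_score are
existing scikit-learn names, but the helper coarse_k_search, the candidate
list, and the sample size are hypothetical choices, and the silhouette score is
just one possible quality measure.

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans
from sklearn.metrics import silhouette_score


def coarse_k_search(X, candidates, sample_size=5000, random_state=0):
    """Score a few candidate cluster counts on a random subsample.

    Hypothetical helper: returns (best_k, {k: silhouette score}).
    """
    X = np.asarray(X)
    rng = np.random.default_rng(random_state)

    # Work on a subsample so each fit and each score stays cheap.
    n = min(sample_size, X.shape[0])
    idx = rng.choice(X.shape[0], size=n, replace=False)
    X_sub = X[idx]

    scores = {}
    for k in candidates:
        # The silhouette score is only defined for 2 <= k <= n - 1.
        if not 2 <= k < n:
            continue
        model = MiniBatchKMeans(n_clusters=k, n_init=3, random_state=random_state)
        labels = model.fit_predict(X_sub)
        # Skip degenerate fits where all points ended up in one cluster.
        if len(np.unique(labels)) < 2:
            continue
        scores[k] = silhouette_score(X_sub, labels)

    if not scores:
        raise ValueError("no usable candidate value for the number of clusters")

    # Higher silhouette means tighter, better separated clusters.
    best_k = max(scores, key=scores.get)
    return best_k, scores
```

A call such as coarse_k_search(X, candidates=[2, 4, 8, 16, 32]) gives a rough
starting point. The neighbourhood of the best candidate could then be searched
more finely, increasing or decreasing the number of clusters from there.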

thx

