[SciPy-User] kmeans

Keith Goodman kwgoodman at gmail.com
Fri Jul 23 14:54:27 EDT 2010


On Fri, Jul 23, 2010 at 11:39 AM, Lutz Maibaum <lutz.maibaum at gmail.com> wrote:
> On Fri, Jul 23, 2010 at 11:33 AM, Keith Goodman <kwgoodman at gmail.com> wrote:
>> What needs to be fixed?
>>
>> - Switch code and doc to use rmse
>
> To be compatible with the (at least to me!) standard use of k-means, I
> think both code and doc should use the sum of squared distances as the
> cost function in the optimization, and also as the return value.

What about the thresh (threshold) input parameter? If the sum of
squares were used then the user would have to adjust the threshold for
the number of data points.



More information about the SciPy-User mailing list