clusters of numbers
shakti.shrivastava13 at gmail.com
Sat Dec 15 23:56:11 EST 2018
On Sun, 16 Dec 2018 at 09:49, Vincent Davis <vincent at vincentdavis.net> wrote:
> Why not start with a histogram.
> On Sat, Dec 15, 2018 at 6:46 PM Marc Lucke <marc at marcsnet.com> wrote:
> > hey guys,
> > I have a hobby project that sorts my email automatically for me & I want
> > to improve it. There's data science and statistical info that I'm
> > missing, & I always enjoy reading about the pythonic way to do things too.
> > I have a list of percentage scores:
[clipped for brevity]
> > That algorithm is entirely untested & I think it could work, it's just I
> > don't want to reinvent the wheel. Any ideas kindly appreciated.
> > --
> > https://mail.python.org/mailman/listinfo/python-list
+1 for k-means, certainly.
Also, k-means in 1D comes down to a simple distance comparison and
assignment: each score goes to the nearest cluster center, and each
center is then recomputed as the mean of the scores assigned to it.
A quick Google search will turn up code for doing so.
You will have to decide for yourself how many clusters you want, as
Avi has rightly pointed out.
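
For what it's worth, here is a rough sketch of that distance comparison and assignment loop in plain Python. The original scores were clipped from this thread, so the `scores` list below is made up purely for illustration, and the names `assign` and `kmeans_1d` as well as the evenly spaced starting centers are my own assumptions, not anything from Marc's code. Treat it as a starting point rather than a tuned implementation.

```python
def assign(data, centers):
    """Put each value into the cluster of its nearest center."""
    clusters = [[] for _ in centers]
    for x in data:
        nearest = min(range(len(centers)), key=lambda i: abs(x - centers[i]))
        clusters[nearest].append(x)
    return clusters


def kmeans_1d(values, k, max_iter=100):
    """Basic k-means on a list of numbers (illustrative sketch)."""
    if not 1 <= k <= len(values):
        raise ValueError("k must be between 1 and len(values)")
    data = sorted(values)
    # Starting centers: evenly spaced picks from the sorted data.
    # This is an arbitrary choice; other initialisations are possible.
    if k == 1:
        centers = [data[len(data) // 2]]
    else:
        centers = [data[i * (len(data) - 1) // (k - 1)] for i in range(k)]
    for _ in range(max_iter):
        clusters = assign(data, centers)
        # Move each center to the mean of its cluster; keep it if empty.
        new_centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
        if new_centers == centers:  # nothing moved, so we are done
            break
        centers = new_centers
    return centers, assign(data, centers)


# Hypothetical percentage scores, NOT the data from the original post.
scores = [2.0, 3.5, 5.0, 48.0, 50.0, 52.5, 95.0, 97.0, 99.5]
centers, clusters = kmeans_1d(scores, k=3)
for center, members in zip(centers, clusters):
    print(round(center, 2), members)
```

The number of clusters `k` is still something you pick yourself; trying a few values and looking at how tight the resulting groups are is a reasonable way to start.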