[scikit-learn] How to deal with hierarchical and real-time analysis in machine learning?

lampahome pahome.chen at mirlab.org
Tue Feb 12 21:04:18 EST 2019


For example, I may have huge different regions and every regions have many
or less points.

And I also want to real-time to analyze the newest data and older data, but
I don't want to put data into memory cuz I don't have enough memory.

What I thought I can use is partial_fit to accept streaming data when new
data comes in.

But the incoming data has hierarchical, it's hard to cluster them cuz I
don't have older and newer data together to cluster.

How to design the system better?

thx
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20190213/226c2d4c/attachment.html>


More information about the scikit-learn mailing list