[scikit-learn] Construct the microclusters using a CF-Tree

Sema Atasever s.atasever at gmail.com
Mon Jul 3 10:09:44 EDT 2017


Dear Roman,

When I try the code with the original data (*data.dat*) as you suggested, I
get the following error : *Memory Error* --> (*error.png*), how can i
overcome this problem, thank you so much in advance.
​
 data.dat
<https://drive.google.com/file/d/0B4rY6f4kvHeCYlpZOURKNnR0Q1k/view?usp=drive_web>
​

On Fri, Jun 30, 2017 at 5:42 PM, Roman Yurchak <rth.yurchak at gmail.com>
wrote:

> Hello Sema,
>
> On 30/06/17 17:14, Sema Atasever wrote:
>
>> I want to cluster them using Birch clustering algorithm.
>> Does this method have 'precomputed' option.
>>
>
> No it doesn't, see http://scikit-learn.org/stable
> /modules/generated/sklearn.cluster.Birch.html so you would need to
> provide it with the original features matrix (not the precomputed distance
> matrix). Since your dataset is fairly small, there is no reason in
> precomputing it anyway.
>
> I needed train an SVM on the centroids of the microclusters so
>> *How can i get the centroids of the microclusters?*
>>
>
> By "microclusters" do you mean sub-clusters? If you are interested in the
> leaves subclusters see the Birch.subcluster_centers_ parameter.
>
> Otherwise if you want all the centroids in the hierarchy of subclusters,
> you can browse the hierarchical tree via the  Birch.root_ attribute then
> look at _CFSubcluster.centroid_ for each subcluster.
>
> Hope this helps,
> --
> Roman
> _______________________________________________
> scikit-learn mailing list
> scikit-learn at python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20170703/1874bc1c/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: error.png
Type: image/png
Size: 74377 bytes
Desc: not available
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20170703/1874bc1c/attachment-0001.png>


More information about the scikit-learn mailing list