Is there any general way to make clustering huge time-series dataset better?
June 20, 2019
2:33 p.m.
I have a huge time-series dataset and should load batch by batch. My procedures like below: Scale to (0~1) Shuffle (because I use Birch not MiniBatchKMeans) Train Birch model with partial_fit Evaluate with silhouette_score (large is better) Why I use Birch is because it have partial_fit and no need to specify the cluster number But...I found evaluting by silhouette_score and db score, it will cluster with fewer cluster numbers. When I look into the data, it should cluster more than the clustering results. Should I change the evaluating way? or else? thx
2428
Age (days ago)
2428
Last active (days ago)
0 comments
1 participants
participants (1)
-
lampahome