Clustering 4 dimensional data
I'm having trouble understanding how to cluster multidimensional data. Specifically, a 4 dimensional array. test = [[[[3,10],[1,5],[3,18]],[[3,1],[0,0],[0,0]],[[3,3],[1,5],[0,0]]],[[[1,5],[2,7],[0,0]],[[1,7],[0,0],[0,0]],[[0,0],[0,0],[0,0]]]] from sklearn import mixture gmm = mixture.GMM() gmm.fit(test) The code returns the following error: "Found array with dim 4. GMM expected <= 2." Do I need to change the way my data is formatted? Is there a way of doing clustering on 4 dimensional data?
What do your four dimensions mean? Can you reshape your data such that it can be seen as a collection of 1d vectors drawn independently from some distribution? On 28 February 2017 at 14:43, Rohan Koodli <rovik05@gmail.com> wrote:
I'm having trouble understanding how to cluster multidimensional data. Specifically, a 4 dimensional array.
test = [[[[3,10],[1,5],[3,18]],[[3,1],[0,0],[0,0]],[[3,3],[1,5],[0,0]]],[[[1,5],[2,7],[0,0]],[[1,7],[0,0],[0,0]],[[0,0],[0,0],[0,0]]]]
from sklearn import mixture gmm = mixture.GMM() gmm.fit(test)
The code returns the following error:
"Found array with dim 4. GMM expected <= 2."
Do I need to change the way my data is formatted? Is there a way of doing clustering on 4 dimensional data?
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn
Sometimes, when you need to find homogeneous subtensors, you can refer to it as multimodal clustering, an extension of biclustering. I cannot see clearly whether this is the case here. 28 февр. 2017 г. 6:54 пользователь "Joel Nothman" <joel.nothman@gmail.com> написал: What do your four dimensions mean? Can you reshape your data such that it can be seen as a collection of 1d vectors drawn independently from some distribution? On 28 February 2017 at 14:43, Rohan Koodli <rovik05@gmail.com> wrote:
I'm having trouble understanding how to cluster multidimensional data. Specifically, a 4 dimensional array.
test = [[[[3,10],[1,5],[3,18]],[[3,1],[0,0],[0,0]],[[3,3],[1,5],[0,0]]],[[[1,5],[2,7],[0,0]],[[1,7],[0,0],[0,0]],[[0,0],[0,0],[0,0]]]]
from sklearn import mixture gmm = mixture.GMM() gmm.fit(test)
The code returns the following error:
"Found array with dim 4. GMM expected <= 2."
Do I need to change the way my data is formatted? Is there a way of doing clustering on 4 dimensional data?
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn
Use whitespace and carriage returns to reformat your data. It’s not clear what you are doing. Also, put it into a Pandas dataframe and make a few plots. The Visualization page is very helpful, along with the Seaborn examples. ____________________________________________________________________ Dale T. Smith | Macy's Systems and Technology | IFS eCom CSE Data Science 5985 State Bridge Road, Johns Creek, GA 30097 | dale.t.smith@macys.com From: scikit-learn [mailto:scikit-learn-bounces+dale.t.smith=macys.com@python.org] On Behalf Of Rohan Koodli Sent: Monday, February 27, 2017 10:43 PM To: scikit-learn@python.org Subject: [scikit-learn] Clustering 4 dimensional data ⚠ EXT MSG: I'm having trouble understanding how to cluster multidimensional data. Specifically, a 4 dimensional array. test = [[[[3,10],[1,5],[3,18]],[[3,1],[0,0],[0,0]],[[3,3],[1,5],[0,0]]],[[[1,5],[2,7],[0,0]],[[1,7],[0,0],[0,0]],[[0,0],[0,0],[0,0]]]] from sklearn import mixture gmm = mixture.GMM() gmm.fit(test) The code returns the following error: "Found array with dim 4. GMM expected <= 2." Do I need to change the way my data is formatted? Is there a way of doing clustering on 4 dimensional data? * This is an EXTERNAL EMAIL. Stop and think before clicking a link or opening attachments.
participants (5)
-
Dale T Smith -
Dmitry Ignatov -
Joel Nothman -
John Ladasky -
Rohan Koodli