> Ben's suggestion is identical to:
>
> A = numpy.tensordot(P, C, axes=(1, 0))

Yes, that does the trick! Thanks, very good idea.

Since I built ATLAS with threading support, all four CPUs go to 100% while the dot product is computed, which makes it quite fast. I'm starting to love NumPy's array facilities...

Ciao,
Davide
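
For the archive, here is a minimal sketch of what the quoted tensordot call computes. The shapes of P and C are not given in this thread, so the ones below are made up purely for illustration: P is assumed 2-D and C is assumed 3-D. The call contracts axis 1 of P with axis 0 of C, which should match a single matrix product on a flattened C, and that is presumably why a threaded BLAS such as ATLAS can keep all cores busy.

```python
import numpy as np

# Hypothetical shapes, chosen only for this example:
# P is (m, k) and C is (k, n, p), with m=3, k=4, n=5, p=6.
rng = np.random.default_rng(0)
P = rng.standard_normal((3, 4))
C = rng.standard_normal((4, 5, 6))

# Contract axis 1 of P with axis 0 of C.
# Result axes: the remaining axis of P, then the remaining axes of C.
A = np.tensordot(P, C, axes=(1, 0))

# The same contraction as one matrix product on C flattened to 2-D.
A_dot = np.dot(P, C.reshape(4, -1)).reshape(3, 5, 6)

# The same contraction in explicit index notation.
A_einsum = np.einsum("ik,knp->inp", P, C)

print(A.shape)
print(np.allclose(A, A_dot), np.allclose(A, A_einsum))
```

Whether the multiplication actually runs on several cores depends on the BLAS library NumPy was linked against, not on tensordot itself.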