[Numpy-discussion] Using numpy on hadoop streaming: ImportError: cannot import name multiarray
Daπid
davidmenhur at gmail.com
Wed Feb 11 07:17:35 EST 2015
On 11 February 2015 at 08:06, Kartik Kumar Perisetla
<kartik.peri at gmail.com> wrote:
> Thanks David. But do I need to install virtualenv on every node in hadoop
> cluster? Actually I am not very sure whether same namenodes are assigned for
> my every hadoop job. So how shall I proceed on such scenario.
I have never used hadoop, but in the clusters I have used, you have a
home folder on the central node, and each and every computing node has
access to it. You can then install Python in your home folder and make
every node run that, or pull a local copy.
Probably the cluster support can clear this up further and adapt it to
your particular case.
/David.
More information about the NumPy-Discussion
mailing list