Ah, I see. It works for me because all the processors crash at the same time on mine. On your error they must not...
Yes, in fact, I can see that after the first MPI error, other threads can still go and try to get spheres before they get killed by task manager.
Do we think it's a memory issue? You said you ssh'd in and memory growth was zilch. Is that in fact the case? Is it a crash, somewhere else? Does it dump core?
There are no core dumps. I ssh-ed in and ran 'top' and looked at the memory per process and total usage on the node, and it wasn't approaching the limits of the machine when it crashed. I think the fact that it crashes at different places in the run cycle means something else, but I don't know what. _______________________________________________________ sskory@physics.ucsd.edu o__ Stephen Skory http://physics.ucsd.edu/~sskory/ _.>/ _Graduate Student ________________________________(_)_\(_)_______________