Question on Using Parallel Functionality
Hi everyone,
I'm working on an analysis of the ~500 outputs I have from a simulation run, so naturally I want to do it in parallel. The data live on Pleiades, where a single node has 32 or 64 GB depending on the machine you pick.
The general code structure is to take a dataset and compute the fluxes of multiple quantities binned in radius. Because the outputs are large, I'd like to load one dataset per node but then use all 16 cores on the node for the radial flux calculations.
To test my code, I'm using 2 smaller outputs of ~4.5 GB each, so they should easily fit on one node, but I keep getting memory errors from Pleiades. The code does run correctly on my laptop. I'm fairly certain I'm not setting up the code correctly with the different num_proc keywords, so that it's trying to do the calculation on a single core instead of half the node.
I've posted a pared-down example of my code to pastebin that uses two outputs from the enzo_cosmology_plus dataset. The code is named "flux_test_parallel.py" and should run if put inside that dataset directory. The parallel portion of the code is preceded by a line of #'s. ( http://paste.yt-project.org/show/21/ )
Any advice on how to get the parallel structure to use the machine's memory correctly, or general pointers for this kind of script, would be really appreciated!
Thanks!
Lauren
Hi Lauren,
How are you running this test script? With 16 MPI tasks or just 2?
-Nathan
Hey Nathan,
On Pleiades, I've run it with mpiexec -np 16
On my laptop, I've been trying mpirun -np 4
--Lauren
Ah OK, so in your code you're using the following parallel_objects construction:
yt.parallel_objects(outs,2,storage=storage)
You're right that, as you say in the comment, this splits the work up into two groups of workers. I suspect what's happening is that, on Pleiades for example, each of the 8 MPI processes per group is doing a substantial amount of I/O, creating arrays, etc., in a manner that is not parallelized. That means you've effectively octupled the per-node memory requirements for running your script.
Instead, it might be better to, e.g., do the I/O and set up the arrays for the radial flux measurement only on one MPI process in each group of processes, then *broadcast* the data to the other processes, and do the flux calculation in parallel. Be sure that the data you're broadcasting can fit in memory on the node, though, since again all 8 processes are going to have local copies of the data. It might also be possible to write a Cython program that uses OpenMP to process the data in parallel using hardware threads and avoid dealing with MPI at all; in that case I'd only launch 2 MPI processes, if you want to process the flux calculations on 8 threads.
Also, I'm not sure offhand whether the nested parallel_objects calls you're using will work without explicitly setting up two sets of MPI communicators. It looks like multilevel parallelism isn't documented, so I'd probably need to dive into the parallel analysis framework in yt to see how it handles what you're trying to do to be sure. We should probably ultimately add an example of the sort of multilevel parallelism you're trying to do to the docs...
-Nathan
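[Editor's note: to make the broadcast idea a bit more concrete, here is a minimal mpi4py sketch of that pattern for the flat (non-nested) case. The file name and the per-rank work are placeholders rather than anything from the actual script, and in the nested setup you would broadcast over the group's sub-communicator instead of COMM_WORLD.]

from mpi4py import MPI
import numpy as np

comm = MPI.COMM_WORLD              # with nesting, swap in the group's sub-communicator
rank = comm.Get_rank()
size = comm.Get_size()

if rank == 0:
    # only one process does the I/O and builds the radial arrays
    data = np.load("radial_arrays.npy")   # placeholder for the yt I/O / array setup
else:
    data = None

# every rank now gets its own in-memory copy, so each copy must fit per-process
data = comm.bcast(data, root=0)

# each rank then works on its slice of the flux calculation
my_chunk = data[rank::size]

Note that the lowercase bcast pickles the object; for very large numpy arrays the uppercase Bcast into a preallocated buffer is usually more memory-friendly.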
Thanks for the suggestion, Nathan! I'll try broadcasting the data for the radial fluxes. As for the nested parallel_objects calls, whenever you get the chance to add an example, that would be great. I've gotten this far using the docs and they've been really helpful in general!
But for now, maybe I'll remove the nested structure, since it's not working the way we think it should, and just submit multiple jobs with the flux measurements done in parallel.
--Lauren
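[Editor's note: for reference, a minimal sketch of that single-level fallback, assuming one output per submitted job; the output name, the radii, and the total_quantity call are placeholders standing in for the real flux measurement.]

import yt
yt.enable_parallelism()

ds = yt.load("DD0010/DD0010")                      # placeholder: the one output handled by this job
radii = ds.arr([10.0, 20.0, 40.0, 80.0], "kpc")    # placeholder radial bins

storage = {}
for sto, radius in yt.parallel_objects(radii, storage=storage):
    sp = ds.sphere(ds.domain_center, radius)
    # stand-in for the actual flux calculation at this radius
    sto.result_id = float(radius)
    sto.result = sp.quantities.total_quantity(("gas", "cell_mass"))

if yt.is_root():
    for r, val in sorted(storage.items()):
        print(r, val)

Run with, e.g., mpirun -np 4 python flux_test_single_level.py (a hypothetical name); the storage dictionary is gathered across tasks once the loop finishes.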
Hi Lauren and Nathan,
I have a couple of scripts with nested parallel_objects calls that I can share:
http://paste.yt-project.org/show/22/
http://paste.yt-project.org/show/23/
I wanted to calculate the column density in multiple directions from each star particle to the halo's surface, in order to calculate the ionizing radiation escape fraction. I first (dynamically) parallelized over halos and then over rays (192 of them).
Paste #23 is my testing code to get situated with nested parallelism in yt, doing what I described, and it should be clearer than my production code (#22).
But when I started the actual analysis code (#22), I needed to write the results to file. I couldn't let just the root processor write to disk because of the multi-level parallelism; instead I needed the root processor of each sub-communicator (see the is_group_root() call) to write to disk. I ended up writing one file per sub-communicator and then letting the root processor combine all of the files at the end. I could have used parallel HDF5 or something else more clever, but this did the job.
Also, unrelated to these scripts but related to parallel loops over many large datasets (>100k AMR grids): I've found it necessary to manually clean up the hierarchy metadata after finishing an iteration with the following commands to keep memory usage in check.
ds.index.clear_all_data()
del ds.index.grid_dimensions
del ds.index.grid_left_edge
del ds.index.grid_right_edge
del ds.index.grid_levels
del ds.index.grid_particle_count
del ds.index.grids
I hope this helps!
Thanks,
John
--
John Wise
Associate Professor of Physics
Center for Relativistic Astrophysics, Georgia Tech
http://cosmo.gatech.edu
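[Editor's note: as a rough illustration of the write-then-combine step John describes (the file naming here is hypothetical, not taken from his pastes): each sub-communicator's group root writes its own partial file during the loop, and at the end the global root stitches them together.]

import glob
import yt

# ... nested analysis runs here; each group root writes "fluxes_group_NN.txt" ...

if yt.is_root():
    with open("fluxes_all.txt", "w") as combined:
        for part in sorted(glob.glob("fluxes_group_*.txt")):
            with open(part) as f:
                combined.write(f.read())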
I've created an issue to track adding a multilevel parallelism example to the docs:
https://github.com/yt-project/yt/issues/1830
participants (3)
- John Wise
- Lauren Corlies
- Nathan Goldbaum