Folks,

I would like to test the integrity of a large dataset, for example by finding the maximum value of some data field. The dataset is too large to be fully loaded into memory, so I would like to read it slice by slice, compute the maximum in each slice, and then combine the results. To that end I wrote the following script:

import sys
import numpy as np
import yt
import yt.funcs

f = sys.argv[1]
ds = yt.load(f)
n = ds.domain_dimensions[0]

dmax = np.zeros((n,))
for i in range(n):
    d = ds.r[i:(i + 1), :, :][("artio", "HVAR_GAS_DENSITY")]
    dmax[i] = np.amax(d)
    print(i, dmax[i], yt.funcs.get_memory_usage())
    del d

print(np.amax(dmax))

However, when I run it, I get:

yt : [INFO ] 2018-08-28 10:52:58,075 Created 4945 chunks for ARTIO
0 16945762.0 202.5546875
yt : [INFO ] 2018-08-28 10:52:59,388 Created 1232 chunks for ARTIO
1 2416576.25 221.36328125
yt : [INFO ] 2018-08-28 10:53:01,599 Created 9635 chunks for ARTIO
2 10419311.0 269.06640625
...
yt : [INFO ] 2018-08-28 10:53:23,397 Created 1474 chunks for ARTIO
13 2395590.5 698.91015625
yt : [INFO ] 2018-08-28 10:53:25,594 Created 9747 chunks for ARTIO
14 11139424.0 739.16015625

The number of chunks created varies from iteration to iteration, but the total memory usage keeps climbing steadily.

Could you advise what I am doing wrong?

Many thanks,

Nick
Hi Nick,

Sorry this took me so long to reply to. I don't know *precisely* what is going on, but I can tell you some caveats of how the ARTIO frontend works that may be relevant. As far as I recall, a few things come into play. I will also note that Doug Rudd was the original author of most of this, so if I paraphrase or misstate something, it is not intentional.

* The ARTIO frontend is *very* neutral with respect to which data exists where. As you probably know, when it computes the SFC values that intersect with a selector, it tries to optimize the ordering and collection of those values based on what it knows about their distribution across files.

* The ARTIO frontend manages much of the data reading and selection *internally*, inside _artio_reader. In practice this means there may be memory allocations (we have tried to be very judicious in our deallocations) that persist as we traverse. If I remember correctly, once a file that holds an SFC range has been identified, the reader allocates enough memory (internal to the artio reader) to store the sub-chunks that live there, and you can then iterate over the oct values internal to each SFC value. I believe these allocations should be freed when the SFC is released, but it is possible there is a leak there.

* Another possibility, which may not pan out, is that the reported memory usage simply precedes Python's internal garbage collection. You may see different results if you import gc at the top and call gc.collect() manually at the end of each loop iteration.

One additional thing I would check, which may result in more careful internal-to-yt allocation, is how this proceeds:

with yt.memory_checker(5):
    dd = ds.all_data()
    dmax = dd.max(("artio", "HVAR_GAS_DENSITY"))

This sets up a memory checker that reports at 5-second intervals and shows the result of the dd.max() call. dd.max() should do an IO-aware iteration, which *should* proceed in such a way that no more than one file is open at a time -- it should order the SFCs to minimize this.

I hope this helps.

-Matt
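For reference, here is a minimal sketch of the gc-based variant Matt suggests above: Nick's original loop, unchanged except for importing gc and calling gc.collect() at the end of each iteration, so that the reported memory usage reflects what Python can actually free.

import gc
import sys

import numpy as np
import yt
import yt.funcs

f = sys.argv[1]
ds = yt.load(f)
n = ds.domain_dimensions[0]

dmax = np.zeros((n,))
for i in range(n):
    # Read one slab of the domain and take the maximum of the density field.
    d = ds.r[i:(i + 1), :, :][("artio", "HVAR_GAS_DENSITY")]
    dmax[i] = np.amax(d)
    del d
    # Force a collection pass so any reference cycles still holding chunk
    # data are released before the next slab is read.
    gc.collect()
    print(i, dmax[i], yt.funcs.get_memory_usage())

print(np.amax(dmax))

If memory usage still grows with gc.collect() in place, that would point back at allocations internal to the artio reader rather than at Python's garbage collector.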