It's relatively easy to test replacing our custom allocators with the
system ones, yes? Can we try those to see whether they have the same
characteristic?

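Yes. Setting PYTHONMALLOC=malloc bypasses pymalloc and sends every
allocation to the C library's malloc; LD_PRELOAD then swaps in jemalloc
(drop it to test the plain system malloc):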

PYTHONMALLOC=malloc LD_PRELOAD=/path/to/jemalloc.so python script.py

I will try it tomorrow.
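
For reference, a minimal sketch of what script.py could contain for this
comparison. It is only a stand-in workload, not the real reproducer: the
names churn and peak_rss, the object size, and the counts are all
assumptions picked for illustration. It allocates many small objects,
keeps a small fraction alive, and prints the peak RSS.

```python
import resource


def peak_rss():
    """Peak resident set size reported by getrusage.

    Units are platform-dependent: kilobytes on Linux, bytes on macOS.
    """
    return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss


def churn(rounds=5, count=100_000):
    """Allocate and drop many small objects (hypothetical workload).

    Each round builds `count` 64-byte bytes objects, keeps every 100th
    one alive, and releases the rest, so freed blocks end up
    interleaved with live ones.
    """
    survivors = []
    for _ in range(rounds):
        batch = [bytes(64) for _ in range(count)]
        survivors.append(batch[::100])  # keep 1% of the batch alive
        del batch
    return survivors


if __name__ == "__main__":
    before = peak_rss()
    kept = churn()
    after = peak_rss()
    print(f"kept {sum(len(b) for b in kept)} objects")
    print(f"peak RSS before={before} after={after}")
```

Running it once with no environment variables and once with the command
above should give two peak RSS figures to compare. Peak RSS is a coarse
measure, so treat any difference as a hint rather than a conclusion.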