[Python-Dev] iterzip()
Barry A. Warsaw
barry@zope.com
Mon, 29 Apr 2002 18:26:11 -0400
>>>>> "GvR" == Guido van Rossum <guido@python.org> writes:
GvR> The current thresholds are used to count the number of
GvR> allocations. Adding the number of deallocations to mix seems
GvR> dangerous: when (nearly) all data is tied up in cycles, there
GvR> may not be any deallocations. It seems hard to distinguish
GvR> between these two cases:
GvR> (a) lots of memory is allocated and kept alive for real by
GvR> containers
GvR> (b) lots of memory is allocated and kept alive accidentally
GvR> by cycles
GvR> The zip example is a case of (a), but the same allocation
GvR> behavior could ensue from case (b). Only running the
GvR> allocator can determine which case we're seeing. I like
GvR> Tim's idea of adjusting the thresholds base upon the
GvR> effectiveness of recent collections.
Isn't this a case of "knowing your application"? IOW, you're doing
something that the gc isn't well-tuned to handle, by default. That's
why we expose the its operation through the gc module -- so you can
take explicit steps for the hotspots in your application.
Not to say we can't improve the tuning, but it'll never be perfect, so
just try to make it good enough for the most common types of
programs. Then document situations where it might not do so well.
-Barry