Re: [Python-Dev] cpython: Implement PEP 412: Key-sharing dictionaries (closes #13903)

On Mon, 23 Apr 2012 17:24:57 +0200 benjamin.peterson <python-checkins@python.org> wrote:
I hope someone can measure the results of this change on real-world code. Benchmark results with http://hg.python.org/benchmarks/ are not overly promising. Regards, Antoine.

On Mon, 23 Apr 2012 22:22:18 +0200, Antoine Pitrou <solipsis@pitrou.net> wrote:
I'm pretty sure that anything heavily using sqlalchemy will benefit, so that would be a good place to look for a real-world benchmark. --David

Probably any benchmark involving a large number of object instances with non-trivial dictionaries. Benchmarks should measure memory usage too, of course. Sadly that is not possible in standard CPython. Our 2.7 branch has extensive patching to allow custom memory allocators to be used (it even eliminates the explicit "malloc" calls used here and there in the code) and exposes some functions, such as sys.getpymalloced(), useful for memory benchmarking. Perhaps I should write about this on my blog. Updating the memory allocation macro layer in CPython for embedding is something I'd be inclined to contribute, but it will involve a large amount of bikeshedding, I'm sure :) Btw, this is of great interest to me at the moment; our Shanghai engineers are screaming about the memory waste incurred by dictionaries. A 10-item dictionary consumes half a kilobyte on a 32-bit build, did you know that? K
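For a rough sense of what stock CPython can and cannot see here, a sketch like the following sums sys.getsizeof() over many instance dicts. It counts only the objects themselves, not allocator bookkeeping or block rounding, which is exactly the gap the custom-allocator patching addresses (the Point class is just an illustration):

    import sys

    class Point(object):
        def __init__(self, x, y):
            self.x = x
            self.y = y

    points = [Point(i, i) for i in range(10000)]
    # Counts only the dict objects themselves; allocator overhead
    # and rounding inside the block allocator remain invisible.
    total = sum(sys.getsizeof(p.__dict__) for p in points)
    print("10000 instance dicts: %d bytes" % total)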

On Tue, 24 Apr 2012 10:24:16 +0000 Kristján Valur Jónsson <kristjan@ccpgames.com> wrote:
The sparseness of hash tables is a well-known time/space tradeoff. See e.g. http://bugs.python.org/issue10408 Regards, Antoine.
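The sparseness is easy to observe from Python; a small sketch like this prints the size jumps as the table is resized (exact figures vary by version and platform):

    import sys

    d = {}
    last = sys.getsizeof(d)
    print("%3d items: %d bytes" % (0, last))
    for i in range(100):
        d[i] = None
        size = sys.getsizeof(d)
        if size != last:
            # each jump is the hash table growing to stay sparse
            print("%3d items: %d bytes" % (len(d), size))
            last = size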

On Tue, Apr 24, 2012 at 8:24 PM, Kristján Valur Jónsson <kristjan@ccpgames.com> wrote:
Trawl the tracker before you do - I'm pretty sure there's a patch (from the Nokia S60 port, IIRC) that adds a couple of macro definitions so that platform ports and embedding applications can intercept malloc() and free() calls. It would be way out of date by now, but I seem to recall thinking it looked reasonable at a quick glance. Cheers, Nick. -- Nick Coghlan | ncoghlan@gmail.com | Brisbane, Australia

Thanks. Meanwhile, I blogged about tuning the dict implementation. Preliminary testing seems to indicate that tuning it to conserve memory saves us 2 MB of wasted slots on the login screen. No small thing on a PS3 system. http://blog.ccpgames.com/kristjan/2012/04/25/optimizing-the-dict/ I wonder if we shouldn't make those factors into #defines, as I did in my 2.7 modifications, and even provide a "memory saving" predefine for embedders. (Believe it or not, sometimes Python performance is not an issue at all, but memory usage is.) K
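To make the "wasted slots" figure concrete, here is a back-of-the-envelope estimate assuming 2.7-era rules (8-slot minimum table, 12-byte entries on 32 bits, resize when more than two-thirds full). It gives a lower bound only, since the real resize overshoots by growing the table to four times the used count:

    def min_wasted_bytes(n_items, entry_size=12):
        # smallest power-of-two table keeping the load factor
        # under 2/3 -- an approximation of CPython 2.7's rule,
        # not a reimplementation of dictresize()
        slots = 8
        while n_items * 3 >= slots * 2:
            slots *= 2
        return (slots - n_items) * entry_size

    print("10-item dict wastes >= %d bytes in empty slots"
          % min_wasted_bytes(10))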

Benchmarks should measure memory usage too, of course. Sadly that is not possible in standard CPython.
It's actually very easy in standard CPython, using sys.getsizeof.
I did. In Python 3.3, this now goes down to 248 bytes (on a 32-bit build). Regards, Martin
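Those figures are easy to reproduce; something like the following prints the size on the current interpreter (exact numbers depend on version and pointer width, so expect roughly 500 bytes on a 32-bit 2.7 and 248 bytes on a 32-bit 3.3, per the figures above):

    import sys

    d = dict((i, None) for i in range(10))
    bits = sys.maxsize.bit_length() + 1
    print("%d-bit build, %d-item dict: %d bytes"
          % (bits, len(d), sys.getsizeof(d)))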

Yes, you can query each Python object about how big it thinks it is. What I'm speaking of is more like:

    start_allocs, start_mem = allocator.get_current()
    allocator.reset_limits()
    run_complicated_tests()
    end_allocs, end_mem = allocator.get_current()
    print "delta blocks: %d, delta mem: %d" % (end_allocs - start_allocs, end_mem - start_mem)
    print "peak blocks: %d, peak mem: %d" % allocator.peak()
I'm going to experiment with tunable parameters in 2.7 to trade performance for memory. In some applications, memory trumps performance. K

Kristján Valur Jónsson wrote:
Take a look at the benchmark suite at http://hg.python.org/benchmarks/ The test runner has an -m option that profiles memory usage; you could take a look at how that is implemented. Cheers, Mark.

Yes, out-of-process monitoring of memory as reported by the OS. We do gather those counters as well on clients and servers, but they don't give you the granularity you want when checking for memory leaks and memory usage by certain algorithms. In the same way that the unittests have reference-leak reports, they could just have memory usage reports, if the underlying allocator supported that. FYI, the current state of affairs of the CPython 2.7 branch we use is as follows:

1) We allow the API user to specify the base allocator Python uses, both for regular allocs and for allocating blocks for the obmalloc one, using:

    /* Support for custom allocators */
    typedef void *(*PyCCP_Malloc_t)(size_t size, void *arg,
                                    const char *file, int line, const char *msg);
    typedef void *(*PyCCP_Realloc_t)(void *ptr, size_t size, void *arg,
                                     const char *file, int line, const char *msg);
    typedef void (*PyCCP_Free_t)(void *ptr, void *arg,
                                 const char *file, int line, const char *msg);
    typedef size_t (*PyCCP_Msize_t)(void *ptr, void *arg);

    typedef struct PyCCP_CustomAllocator_t {
        PyCCP_Malloc_t  pMalloc;
        PyCCP_Realloc_t pRealloc;
        PyCCP_Free_t    pFree;
        PyCCP_Msize_t   pMsize; /* can be NULL, or return -1 if no size info is avail. */
        void *arg;              /* opaque argument for the functions */
    } PyCCP_CustomAllocator_t;

    /* To set an allocator, use 0 for the regular allocator, 1 for the block
     * allocator. Pass a null pointer to reset to the internal default. */
    PyAPI_FUNC(void) PyCCP_SetAllocator(int which, const PyCCP_CustomAllocator_t *);

    /* for BLUE to set the current context */
    /* internal data member */
    extern PyCCP_CustomAllocator_t _PyCCP_CustomAllocator[];

2) Using ifdefs, the macros delegate all final allocations through these allocators. This includes all the "naked" malloc calls scattered about; they are patched up using #defines.

3) Additionally, there is an internal layer of management before delegating to the external allocators. This internal manager provides statistics, exposed through the "sys" module.

The layering is something like this, all more or less definable by pre-processor macros (raw malloc() is turned into something else via pre-processor magic and a special "patch_malloc.h" file added to the modules which use raw malloc()):

    PyMem_Malloc()            PyObject_Malloc()
          |                         |
          v                         v
    Mem bookkeeping           obj bookkeeping
       |      |                  |      |
       |      v                  |      v
       |   malloc()              |  obmallocator
       |                         |      |
       v                         v      v
    PyMem_MALLOC_RAW()      PyObject_MALLOC_RAW()
          |                         |
          v                         v
    malloc() or vectored allocator specified through API function

Cheers, K
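As a conceptual model of the bookkeeping layer described above (not the actual C API), a counting wrapper in front of a raw allocator might look like this in Python; all names here are illustrative:

    class BookkeepingAllocator(object):
        """Toy model of the stats layer: wraps a raw malloc/free
        pair and tracks block count, current and peak bytes."""

        def __init__(self, raw_malloc, raw_free):
            self._malloc = raw_malloc
            self._free = raw_free
            self._sizes = {}          # id(block) -> requested size
            self.blocks = self.bytes = self.peak_bytes = 0

        def malloc(self, size):
            block = self._malloc(size)
            self._sizes[id(block)] = size
            self.blocks += 1
            self.bytes += size
            self.peak_bytes = max(self.peak_bytes, self.bytes)
            return block

        def free(self, block):
            self.bytes -= self._sizes.pop(id(block))
            self.blocks -= 1
            self._free(block)

        def get_current(self):
            return self.blocks, self.bytes

    # e.g. wrap bytearray as a stand-in "raw" allocator:
    alloc = BookkeepingAllocator(bytearray, lambda block: None)
    buf = alloc.malloc(1024)
    print(alloc.get_current())    # (1, 1024)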

This is easy in a debug build, using sys.getobjects(). In a release build, you can use pympler:

    import pympler.muppy

    start = pympler.muppy.get_size(pympler.muppy.get_objects())
    run_complicated_tests()
    end = pympler.muppy.get_size(pympler.muppy.get_objects())
    print "delta mem: %d" % (end - start)

Regards, Martin

Thanks for pointing out pympler to me. Sounds like fun, I'll try it out. I should point out that gc.get_objects() also works, if you don't care about stuff like ints and floats. Another reason why I like the runtime stats we have built in, however, is that querying them has no overhead: you can query the current resource usage as often as you like, and this is important in a running app. We log Python memory usage every second or so. Cheers, K
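For completeness, the gc.get_objects() approach mentioned above can be sketched like so; note the caveat that only container objects tracked by the cycle collector are counted:

    import gc
    import sys

    def tracked_bytes():
        # only containers tracked by the cycle collector;
        # ints, floats and most strings are invisible here
        return sum(sys.getsizeof(o) for o in gc.get_objects())

    before = tracked_bytes()
    junk = [{"a": i} for i in range(1000)]
    after = tracked_bytes()
    print("delta mem: %d bytes" % (after - before))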
