Looking for master thesis ideas involving Python
Today I got the wheels turning on my master's thesis by getting an adviser. Now I just need a topic. =) The big goal is to do something involving Python for a thesis to be finished by fall of next year (about October) so as to have it done, hopefully published (getting into LL4 would be cool), and ready to be used for doctoral applications come January 2005.

So, anyone have any ideas? The best one that I can think of is optional type-checking. I am fairly open to ideas, though, in almost any area involving language design.

There is no deadline on this, so if an idea strikes you a while from now, still let me know. I suspect I won't settle on an idea any sooner than December, and that is only if the idea just smacks me in the face and says, "DO THIS!" Otherwise it might be a while, since I don't want to take up a topic that won't interest me or is not helpful in some way.

-Brett
"Brett C." <bac@OCF.Berkeley.EDU> writes:
So, anyone have any ideas? The best one that I can think of is optional type-checking. I am fairly open to ideas, though, in almost any area involving language design.
Did you explicitly mean language *design*? Because there might be areas of research relevant to language implementation, in terms of efficiency, portability, etc. Here are some suggestions:

- memory management: attempt to replace reference counting by "true" garbage collection

- threading: attempt to provide free threading efficiently

- typing: attempt to provide run-time or static type inference, and see whether this could be used to implement some byte codes more efficiently (although there is probably overlap with the specializing compilers)

- floating point: provide IEEE-754 (or some such) in a portable yet efficient way

- persistency: provide a mechanism to save the interpreter state to disk, with the possibility to restart it later (similar to Smalltalk images)

On language design, I don't have that many suggestions, as I think the language itself should evolve slowly if at all:

- deterministic finalization: provide a way to get objects destroyed implicitly at certain points in control flow; a use case would be thread-safety/critical regions

- attributes: provide syntax to put arbitrary annotations on functions, classes, and class members, similar to .NET attributes. Use that facility to implement static and class methods, synchronized methods, final methods, web methods, transactional methods, etc. (yes, there is a proposal, but nobody knows whether it meets all requirements - nobody knows what the requirements are)

- interfaces (this may go along with optional static typing)

Regards,
Martin
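The persistency idea can be sketched very roughly: snapshot some interpreter state to disk and restore it later. A real Smalltalk-style image captures far more (frames, code objects, threads); the sketch below uses `pickle` on plain data purely as an illustration of the save/restore cycle.

```python
# Rough sketch of the "persistency" idea: save state, restart later.
# pickle only handles plain objects; a true image would need to capture
# execution state as well.
import io
import pickle

state = {"counter": 42, "names": ["a", "b"]}

# "Save the interpreter state to disk" (here: an in-memory buffer).
buf = io.BytesIO()
pickle.dump(state, buf)

# "Restart it later": load the snapshot back into a fresh object.
buf.seek(0)
restored = pickle.load(buf)
print(restored == state)  # True
```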
Martin v. Löwis wrote:
"Brett C." <bac@OCF.Berkeley.EDU> writes:
So, anyone have any ideas? The best one that I can think of is optional type-checking. I am fairly open to ideas, though, in almost any area involving language design.
Did you explicitly mean language *design*?
Design/implementation. Basically something involving how a language either works or is created.
Because there might be areas of research relevant to language implementation, in terms of efficiency, portability, etc.
Here are some suggestions:

- memory management: attempt to replace reference counting by "true" garbage collection
Maybe. Kind of happy with the way things work now, though. =)
- threading: attempt to provide free threading efficiently
Wow, that would be a challenge, to say the least. Might be too much for just a master's thesis.
- typing: attempt to provide run-time or static type inference, and see whether this could be used to implement some byte codes more efficiently (although there is probably overlap with the specializing compilers)
I was actually thinking of type-inference since I am planning on learning (or at least starting to learn) Standard ML next month.
- floating point: provide IEEE-754 (or some such) in a portable yet efficient way
You mean like how we have longs? So code up in C our own way of storing 754 independent of the CPU?
- persistency: provide a mechanism to save the interpreter state to disk, with the possibility to restart it later (similar to Smalltalk images)
Hmm. Interesting. Could be the start of continuations.
On language design, I don't have that many suggestions, as I think the language itself should evolve slowly if at all:

- deterministic finalization: provide a way to get objects destroyed implicitly at certain points in control flow; a use case would be thread-safety/critical regions
I think I get what you mean by this, but I am not totally sure, since I can't come up with a use beyond threads killing themselves properly when the whole program is shutting down.
- attributes: provide syntax to put arbitrary annotations to functions, classes, and class members, similar to .NET attributes. Use that facility to implement static and class methods, synchronized methods, final methods, web methods, transactional methods, etc (yes, there is a proposal, but nobody knows whether it meets all requirements - nobody knows what the requirements are)
Have no clue what this is since I don't know C#. Almost sounds like Michael's def func() [] proposal at the method level. Or just a lot of descriptors. =) Time to do some Googling.
- interfaces (this may go along with optional static typing)
Yeah, but that is Alex's baby. Thanks for the suggestions, Martin. -Brett
Brett C. wrote:
- floating point: provide IEEE-754 (or some such) in a portable yet efficient way
You mean like how we have longs? So code up in C our own way of storing 754 independent of the CPU?
Not longs, but floats. And you would not attempt to store it independent of the CPU, but instead, you would make as much use of the CPU as possible, and only implement things in C that the CPU gets wrong. The portion of emulation would vary from CPU to CPU. As a starting point, you might look at the Java strictfp mode (which was only added after the initial Java release). Java 1.0 was where Python is today: expose whatever the platform provides. In Java, they have the much stronger desire to provide bit-for-bit reproducibility on all systems, so they added strictfp as a trade-off of performance vs. write-once-run-anywhere.
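A minimal sketch of what "bit-for-bit reproducibility" means, using the `struct` module to expose the raw IEEE-754 bits of a Python float; a strict mode pins down exactly these bits on every platform.

```python
# Minimal sketch: inspect the 64-bit IEEE-754 representation of a float.
# Bit-for-bit reproducibility means every conforming platform must
# produce exactly this bit pattern for the same value.
import struct

def double_bits(x):
    """Return the raw 64-bit IEEE-754 representation of a float."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]

print(hex(double_bits(0.1)))  # 0x3fb999999999999a
print(hex(double_bits(0.5)))  # 0x3fe0000000000000
```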
- deterministic finalization: provide a way to get objects destroyed implicitly at certain points in control flow; a use case would be thread-safety/critical regions
I think I get what you mean by this, but I am not totally sure, since I can't come up with a use beyond threads killing themselves properly when the whole program is shutting down.
Not at all. In Python, you currently do

    def bump_counter(self):
        self.mutex.acquire()
        try:
            self.counter = self.counter + 1
            more_actions()
        finally:
            self.mutex.release()

In C++, you do

    void bump_counter() {
        MutexAcquisition acquire(this);
        this->counter += 1;
        more_actions();
    }

I.e. you can acquire the mutex at the beginning (as a local object), and it gets destroyed automatically at the end of the function. So they have the "resource acquisition is construction, resource release is destruction" design pattern. This is very powerful and convenient, and works almost in CPython, but not in Python - as there is no guarantee when objects get destroyed.
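Python did eventually grow a deterministic-finalization construct of its own, the `with` statement (PEP 343, Python 2.5); the mutex example above then reads as a minimal sketch like this:

```python
import threading

class Counter:
    def __init__(self):
        self.mutex = threading.Lock()
        self.counter = 0

    def bump_counter(self):
        # The lock is released deterministically when the block exits,
        # whether we leave normally or via an exception - no reliance
        # on when the lock object itself gets destroyed.
        with self.mutex:
            self.counter += 1

c = Counter()
c.bump_counter()
print(c.counter)  # 1
```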
Have no clue what this is since I don't know C#. Almost sounds like Michael's def func() [] proposal at the method level. Or just a lot of descriptors. =)
Yes, the func()[] proposal might do most of it. However, I'm uncertain whether it puts in place all pieces of the puzzle - one would actually have to try to use that stuff to see whether it really works sufficiently. You would have to set goals first (what is it supposed to do), and then investigate whether these things can actually be done with it. As I said: static, class, synchronized, final methods might all be candidates; perhaps along with some of the .NET features, like security evidence check (caller must have permission to write files in order to call this method), webmethod (method is automagically exposed as a SOAP/XML-RPC method), etc. Regards, Martin
Martin v. Löwis wrote:
Brett C. wrote:
- floating point: provide IEEE-754 (or some such) in a portable yet efficient way
You mean like how we have longs? So code up in C our own way of storing 754 independent of the CPU?
Not longs, but floats. And you would not attempt to store it independent of the CPU, but instead, you would make as much use of the CPU as possible, and only implement things in C that the CPU gets wrong. The portion of emulation would vary from CPU to CPU.
OK, so in other words play cleanup for how the CPU handles floating point by having custom code that deals with its mix-ups.
As a starting point, you might look at the Java strictfp mode (which was only added after the initial Java release). Java 1.0 was where Python is today: expose whatever the platform provides. In Java, they have the much stronger desire to provide bit-for-bit reproducibility on all systems, so they added strictfp as a trade-off of performance vs. write-once-run-anywhere.
Remembrances of Tim mentioning FPU exceptions start to flood back into my mind. =)
- deterministic finalization: provide a way to get objects destroyed implicitly at certain points in control flow; a use case would be thread-safety/critical regions
I think I get what you mean by this, but I am not totally sure, since I can't come up with a use beyond threads killing themselves properly when the whole program is shutting down.
Not at all. In Python, you currently do
    def bump_counter(self):
        self.mutex.acquire()
        try:
            self.counter = self.counter + 1
            more_actions()
        finally:
            self.mutex.release()
In C++, you do
    void bump_counter() {
        MutexAcquisition acquire(this);
        this->counter += 1;
        more_actions();
    }
I.e. you can acquire the mutex at the beginning (as a local object), and it gets destroyed automatically at the end of the function. So they have the "resource acquisition is construction, resource release is destruction" design pattern. This is very powerful and convenient, and works almost in CPython, but not in Python - as there is no guarantee when objects get destroyed.
Ah, OK.
Have no clue what this is since I don't know C#. Almost sounds like Michael's def func() [] proposal at the method level. Or just a lot of descriptors. =)
Yes, the func()[] proposal might do most of it. However, I'm uncertain whether it puts in place all pieces of the puzzle - one would actually have to try to use that stuff to see whether it really works sufficiently. You would have to set goals first (what is it supposed to do), and then investigate, whether these things can actually be done with it. As I said: static, class, synchronized, final methods might all be candidates; perhaps along with some of the .NET features, like security evidence check (caller must have permission to write files in order to call this method), webmethod (method is automagically exposed as a SOAP/XML-RPC method), etc.
I remember that static and classmethod were reasons cited why the func()[] proposal was desired. It's an idea. -Brett
Hi Brett,

Brett C. wrote:
Today I got the wheels turning on my masters thesis by getting an adviser. Now I just need a topic. =) The big goal is to do something involving Python for a thesis to be finished by fall of next year (about October) so as to have it done, hopefully published (getting into LL4 would be cool), and ready to be used for doctoral applications come January 2005.
So, anyone have any ideas? The best one that I can think of is optional type-checking. I am fairly open to ideas, though, in almost any area involving language design.
Maybe you have heard of PyPy, a reimplementation of Python in Python. We are employing quite a few innovative approaches to language design and implementation, and there are certainly a lot of open research areas. See our OSCON 2003 paper

    http://codespeak.net/pypy/index.cgi?doc/oscon2003-paper.html

or two interesting chapters out of our European Union proposal

    http://codespeak.net/pypy/index.cgi?doc/funding/B1.0
    http://codespeak.net/pypy/index.cgi?doc/funding/B6.0

You are welcome to discuss stuff on e.g. the IRC channel #pypy on freenode or on the mailing list

    http://codespeak.net/mailman/listinfo/pypy-dev

to find out if you'd like to join us and/or do an interesting thesis.

have fun,

holger
Holger Krekel wrote:
Hi Brett,
Brett C. wrote:
Today I got the wheels turning on my masters thesis by getting an adviser. Now I just need a topic. =) The big goal is to do something involving Python for a thesis to be finished by fall of next year (about October) so as to have it done, hopefully published (getting into LL4 would be cool), and ready to be used for doctoral applications come January 2005.
So, anyone have any ideas? The best one that I can think of is optional type-checking. I am fairly open to ideas, though, in almost any area involving language design.
Maybe you have heard of PyPy, a reimplementation of Python in Python. We are employing quite some innovative approaches to language design and implementation and there are certainly a lot of open research areas. See our OSCON 2003 paper
http://codespeak.net/pypy/index.cgi?doc/oscon2003-paper.html
Read a while back. I keep an eye on PyPy from a distance by reading the stuff you guys put out.
or two interesting chapters out of our European Union proposal
http://codespeak.net/pypy/index.cgi?doc/funding/B1.0 http://codespeak.net/pypy/index.cgi?doc/funding/B6.0
I will have a read.
You are welcome to discuss stuff on e.g. the IRC channel #pypy on freenode
Nuts. Guess I can't keep my use of IRC down to PyCon discussions. =)
or on the mailing list
http://codespeak.net/mailman/listinfo/pypy-dev
in order to find out, if you'd like to join us and/or do some interesting thesis.
Will do. Thanks, Holger. -Brett
Hi Brett,

Some ideas:

* Finish off the AST compiler. Make it possible to manipulate ASTs from Python and allow them to be fed to the compiler to generate code. This is one half of macros for Python. The other half is harder.

* Build a refactoring code editor that works using the AST.

* Implement an object system that supports multiple dispatch. You can look at Dylan and Goo for ideas.

* Optimize access to global variables and builtins. See PEP 267 for some ideas. If we can disallow inter-module shadowing of names the job becomes easier. Measure the performance difference.

* Look at making the GC mark-and-sweep. You will need to provide it explicit roots. Is it worth doing? Mark-and-sweep would require changes to extension modules since they don't expose roots to the interpreter.

* More radically, look at Chicken [1] and its GC. Henry Baker's "Cheney on the M.T.A." [2] is very clever, IMHO, and could be used instead of Python's reference counting. Build a limited Python interpreter based on this idea and evaluate it.

1. http://www.call-with-current-continuation.org/chicken.html
2. http://citeseer.nj.nec.com/baker94cons.html
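The AST-manipulation idea can be sketched with the standard-library `ast` module (a later addition to Python, used here purely as illustration of "manipulate ASTs from Python and feed them back to the compiler"):

```python
# Minimal sketch: parse source to an AST, rewrite it, compile and run it.
import ast

tree = ast.parse("result = 2 + 3")

# Rewrite every addition into a multiplication.
for node in ast.walk(tree):
    if isinstance(node, ast.BinOp) and isinstance(node.op, ast.Add):
        node.op = ast.Mult()

ast.fix_missing_locations(tree)
ns = {}
exec(compile(tree, "<ast>", "exec"), ns)
print(ns["result"])  # 6
```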
Hi Brett,

I don't know how interested you are in scientific computing. But Pat Miller from Lawrence Livermore Lab (http://www.llnl.gov/CASC/people/pmiller/) presented at SciPy'03 some very interesting stuff for on-the-fly compilation of Python code into C for numerical work. None of this has been publicly released yet, but if that kind of thing sounds interesting to you, you might want to contact him. Just an idea.

Best,

f
Neil Schemenauer wrote:
Hi Brett,
Some ideas:
* Finish off the AST compiler. Make it possible to manipulate ASTs from Python and allow them to be fed to the compiler to generate code. This is one half of macros for Python. The other half is harder.
I actually wanted to originally do that, but there is no real research involved; it's just coding at this point, right?
* Build a refactoring code editor that works using the AST.
Would probably require the AST to be done.
* Implement an object system that supports multiple dispatch. You can look at Dylan and Goo for ideas.
Huh, cool. Just looked at Dylan quickly.
* Optimize access to global variables and builtins. See PEP 267 for some ideas. If we can disallow inter-module shadowing of names the job becomes easier. Measure the performance difference.
... and watch my head explode from reading the latest threads. =) Maybe, though.
* Look at making the GC mark-and-sweep. You will need to provide it explicit roots. Is it worth doing? Mark-and-sweep would require changes to extension modules since they don't expose roots to the interpreter.
I don't know if it is worth it, although having so far two people suggest changing the GC to something else is interesting.
* More radically, look at Chicken [1] and its GC. Henry Baker's "Cheney on the M.T.A." [2] is very clever, IMHO, and could be used instead of Python's reference counting. Build a limited Python interpreter based on this idea and evaluate it.
1. http://www.call-with-current-continuation.org/chicken.html 2. http://citeseer.nj.nec.com/baker94cons.html
I will have a read. Thanks, Neil. -Brett
On Wed, Oct 29, 2003 at 05:28:13PM -0800, Brett C. wrote:
Neil Schemenauer wrote:
* Finish off the AST compiler.
I actually wanted to originally do that, but there is no real research involved; it's just coding at this point, right?
Right. It's a prerequisite to doing real research. See Jeremy's web log. If you don't want to finish the AST compiler you could just use the Python implementation. It would be slow but good enough for testing ideas.
Huh, cool. Just looked at Dylan quickly.
The reference manual is good reading:

    http://www.gwydiondylan.org/drm/drm_1.htm

Some of the parts I like are the builtin classes (numbers and sealing especially) and the collection protocols. The module and library system is also interesting (although overkill for many programs).
* Look at making the GC mark-and-sweep.
I don't know if it is worth it, although having so far two people suggest changing the GC to something else is interesting.
Implementing yet another M&S GC is not research, IMHO. What _would_ be interesting is comparing the performance of reference counting and a mark-and-sweep collector. CPU, cache and memory speeds have changed quite dramatically. Comparing how easily the runtime can be integrated with the rest of the world (e.g. C libraries) would also be valuable. That said, I'm not sure it's worth it either. I find the Chicken GC more interesting and would look into that further if I had the time. Neil
Neil Schemenauer wrote:
On Wed, Oct 29, 2003 at 05:28:13PM -0800, Brett C. wrote:
Neil Schemenauer wrote:
* Finish off the AST compiler.
I actually wanted to originally do that, but there is no real research involved; it's just coding at this point, right?
Right. It's a prerequisite to doing real research. See Jeremy's web log. If you don't want to finish the AST compiler you could just use the Python implementation. It would be slow but good enough for testing ideas.
Yeah, I read that. Too bad I can't finish the AST branch *and* do something with it.
Huh, cool. Just looked at Dylan quickly.
The reference manual is good reading:
http://www.gwydiondylan.org/drm/drm_1.htm
Some of the parts I like are the builtin classes (numbers and sealing especially) and the collection protocols. The module and library system is also interesting (although overkill for many programs).
So many languages to learn! Happen to have a book recommendation?
* Look at making the GC mark-and-sweep.
I don't know if it is worth it, although having so far two people suggest changing the GC to something else is interesting.
Implementing yet another M&S GC is not research, IMHO. What _would_ be interesting is comparing the performance of reference counting and a mark-and-sweep collector. CPU, cache and memory speeds have changed quite dramatically. Comparing how easily the runtime can be integrated with the rest of the world (e.g. C libraries) would also be valuable.
That is a possibility. Depends if anyone else has done a comparison lately. Seems like this may have been done to death, though.
That said, I'm not sure it's worth it either. I find the Chicken GC more interesting and would look into that further if I had the time.
I just like the name. =) That and the title of that paper, "Cheney on the M.T.A.", cause the humorist in me to want to look at this further, so I will definitely be reading that paper. -Brett
On Thursday 30 October 2003 06:39 am, Brett C. wrote: ...
Huh, cool. Just looked at Dylan quickly. ... So many languages to learn! Happen to have a book recommendation?
Besides the reference manual, which is also available as a book, there's a good book called "Dylan Programming", Addison-Wesley, Feinberg et al. There's a firm somewhat misleadingly called something like "functional programming" (misleadingly because Dylan's not an FP language...) which focuses on Dylan and used to have both books (reference and Feinberg) in stock and available for decently discounted prices, too. Alex
Alex Martelli <aleaxit@yahoo.com> writes:
On Thursday 30 October 2003 06:39 am, Brett C. wrote: ...
Huh, cool. Just looked at Dylan quickly. ... So many languages to learn! Happen to have a book recommendation?
Besides the reference manual, which is also available as a book, there's a good book called "Dylan Programming", Addison-Wesley, Feinberg et al.
There's a firm somewhat misleadingly called something like "functional programming" (misleadingly because Dylan's not a FP language...) which focuses on Dylan and used to have both books (reference and Feinberg) in stock and available for decently discounted prices, too.
It was called "Functional Objects" -- and still is (I thought it was defunct). http://www.functionalobject.com Cheers, mwh -- This is an off-the-top-of-the-head-and-not-quite-sober suggestion, so is probably technically laughable. I'll see how embarassed I feel tomorrow morning. -- Patrick Gosling, ucam.comp.misc
At 17:28 29.10.2003 -0800, Brett C. wrote:
* Implement an object system that supports multiple dispatch. You can look at Dylan and Goo for ideas.
Huh, cool. Just looked at Dylan quickly.
some bits on this: implementing one is probably not too hard apart from optimization, but possible/relevant directions then are also:

- integration with the preexisting Python semantics

- reflection. All of CLOS, Dylan, and Goo come with a rather low-level flavor of reflection; in contrast, Python has a rather natural one. Once you have multimethods, what kind of idioms using reflection can you think of? How best to offer/package reflection for the language user?

- multimethods cover some ground also covered by interfaces and adaptation:
  *) a generic function/multimethod is also an interface
  *) some of the things you can achieve with adaptation can be done with multimethods
  Once you have multimethods, do you still need adaptation in some cases, or could one obtain the functionality otherwise? Or do you need dispatch on interfaces (not just classes)? How would interfaces then look, and the dispatch on them? (The Cecil type system and predicate dispatch would be things to look at, for example.)

Samuele
At 01:43 PM 10/30/03 +0100, Samuele Pedroni wrote:
- multimethods cover some ground also covered by interfaces and adaptation:
  *) a generic function/multimethod is also an interface
  *) some of the things you can achieve with adaptation can be done with multimethods
  Once you have multimethods, do you still need adaptation in some cases, or could one obtain the functionality otherwise? Or do you need dispatch on interfaces (not just classes)? How would interfaces then look, and the dispatch on them?
With a sufficiently powerful predicate dispatch system, you could do away with adaptation entirely, since you can simulate interfaces by implementing a generic function that indicates whether a type supports the interface, and then defining a predicate type that calls the generic function. That is, I define a predicate type IFoo such that ob is of type IFoo if 'implementsIFoo(ob)'. Then, for any type that implements the interface, I define a multimethod saying that implementsIFoo() is true for objects of that type. Then, I can declare multimethod implementations for the IFoo predicate type.

What I'm curious about is: is there any way to do it *without* predicate types? Could you have an "open-ended union" type, that you can declare other types to be of, without having to inherit from a base type?
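A minimal sketch of the predicate-type idea (all names here are invented for illustration; this is not an existing API): a registry plays the role of the generic function `implementsIFoo`, and dispatch tests that predicate instead of a base class.

```python
# Hypothetical sketch: simulate an interface "IFoo" with a predicate.
# Types opt in via registration - no inheritance from a base required.
_ifoo_types = set()

def declare_implements_ifoo(cls):
    """Register cls as implementing IFoo (the per-type 'multimethod')."""
    _ifoo_types.add(cls)
    return cls

def implements_ifoo(ob):
    """The generic function: does ob's type support the interface?"""
    return isinstance(ob, tuple(_ifoo_types))

def frob(ob):
    # A two-case "multimethod": the IFoo predicate type, then a default.
    if implements_ifoo(ob):
        return "IFoo implementation"
    return "default implementation"

@declare_implements_ifoo
class Thing:
    pass

print(frob(Thing()))  # IFoo implementation
print(frob(42))       # default implementation
```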
How about re-engineering the interpreter to make it more MP friendly? (This is probably a bigger task than a master's thesis.) The current interpreter serializes on the global interpreter lock (GIL) and blocks everything. Is there another approach which would allow processing to continue? Guido said once that there was an attempt to change the granularity of the locking, but that it quickly became overly complex and unstable. Perhaps some of Maurice Herlihy's ideas may be adapted to the problem. Moreover, it may not be necessary that the interpreter state be consistent and deterministic all the time as long as it eventually produces the same answer as a deterministic equivalent. There may be interpreter organizations which move forward optimistically, ignoring potential locking problems and then (if necessary) recovering, and these may have better performance than the more conservative ones. Or they may not. Some kind of performance tests and evaluations would need to be part of any such study.

On Tue, 28 Oct 2003, Brett C. wrote:
Today I got the wheels turning on my masters thesis by getting an adviser. Now I just need a topic. =) The big goal is to do something involving Python for a thesis to be finished by fall of next year (about October) so as to have it done, hopefully published (getting into LL4 would be cool), and ready to be used for doctoral applications come January 2005.
So, anyone have any ideas? The best one that I can think of is optional type-checking. I am fairly open to ideas, though, in almost any area involving language design.
Dennis Allison wrote:
How about re-engineering the interpreter to make it more MP friendly? (This is probably a bigger task than a master's thesis.) The current interpreter serializes on the global interpreter lock (GIL) and blocks everything. Is there another approach which would allow processing to continue? Guido said once that there was an attempt to change the granularity of the locking, but that it quickly became overly complex and unstable. Perhaps some of Maurice Herlihy's ideas may be adapted to the problem. Moreover, it may not be necessary that the interpreter state be consistent and deterministic all the time as long as it eventually produces the same answer as a deterministic equivalent. There may be interpreter organizations which move forward optimistically, ignoring potential locking problems and then (if necessary) recovering, and these may have better performance than the more conservative ones. Or they may not. Some kind of performance tests and evaluations would need to be part of any such study.
As you said, Dennis, this might be too big for a master's thesis. But it would definitely be nice to have solved. I will think about it. -Brett
At 06:01 PM 10/28/03 -0800, Brett C. wrote:
Today I got the wheels turning on my masters thesis by getting an adviser. Now I just need a topic. =) The big goal is to do something involving Python for a thesis to be finished by fall of next year (about October) so as to have it done, hopefully published (getting into LL4 would be cool), and ready to be used for doctoral applications come January 2005.
So, anyone have any ideas? The best one that I can think of is optional type-checking. I am fairly open to ideas, though, in almost any area involving language design.
Throwing another Python-specific implementation issue into the ring... how about performance of Python function calls? Specifically, the current Python interpreter has a high overhead for argument passing and frame setup that dominates performance of simple functions. One strategy I've been thinking about for a little while is replacing the per-frame variable-size stacks (e.g. argument and block stacks) with per-thread stacks. In principle, this would allow a few things to happen:

* Fixed-size "miniframe" workspace objects allocated on the C stack (with lazy creation of heap-allocated "real" frame objects when needed for an exception or a sys._getframe() call)

* Direct use of positional arguments on the stack as the "locals" of the next function called, without creating (and then unpacking) an argument tuple, in the case where there are no */** arguments provided by the caller.

This would be a pretty sizeable change to Python's internals (especially the core interpreter's handling of "call" operations), but could possibly produce double-digit percentage speedups for function calls in tight loops. (I base this hypothesis on the speed difference between a function call and resuming a generator, and the general observation that the runtime of certain classes of Python programs is almost directly proportional to the number of function calls occurring.)
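The call-vs.-generator-resumption comparison mentioned above is easy to reproduce as a rough sketch; absolute numbers vary widely by machine and interpreter version, so none are asserted here.

```python
# Rough sketch: compare the cost of calling an empty function with the
# cost of resuming an already-running generator, per the hypothesis that
# frame setup dominates the cost of simple calls.
import timeit

def empty():
    pass

def running():
    while True:
        yield None

gen = running()
next(gen)  # prime the generator so later next() calls just resume it

call_time = timeit.timeit(empty, number=100000)
resume_time = timeit.timeit(gen.__next__, number=100000)
print("call: %.4fs  resume: %.4fs" % (call_time, resume_time))
```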
"Phillip J. Eby" <pje@telecommunity.com> writes:
* Direct use of positional arguments on the stack as the "locals" of the next function called, without creating (and then unpacking) an argument tuple, in the case where there are no */** arguments provided by the caller.
Already done, unless I misunderstand your idea. Well, the arguments might still get copied into the new frame's locals area but I'm pretty sure no tuple is involved. Cheers, mwh -- That being done, all you have to do next is call free() slightly less often than malloc(). You may want to examine the Solaris system libraries for a particularly ambitious implementation of this technique. -- Eric O'Dell, comp.lang.dylan (& x-posts)
At 06:33 PM 10/29/03 +0000, Michael Hudson wrote:
"Phillip J. Eby" <pje@telecommunity.com> writes:
* Direct use of positional arguments on the stack as the "locals" of the next function called, without creating (and then unpacking) an argument tuple, in the case where there are no */** arguments provided by the caller.
Already done, unless I misunderstand your idea. Well, the arguments might still get copied into the new frame's locals area but I'm pretty sure no tuple is involved.
Hm. I thought that particular optimization could only take place when the function lacks default arguments. But maybe I've misread that part. If it's true in all cases, then argument tuple creation isn't where the overhead is coming from. Anyway... it wouldn't be a good thesis idea if the answer were as obvious as my speculations, would it? ;)
On Wed, 2003-10-29 at 13:48, Phillip J. Eby wrote:
At 06:33 PM 10/29/03 +0000, Michael Hudson wrote:
"Phillip J. Eby" <pje@telecommunity.com> writes:
* Direct use of positional arguments on the stack as the "locals" of the next function called, without creating (and then unpacking) an argument tuple, in the case where there are no */** arguments provided by the caller.
Already done, unless I misunderstand your idea. Well, the arguments might still get copied into the new frame's locals area but I'm pretty sure no tuple is involved.
Hm. I thought that particular optimization only could take place when the function lacks default arguments. But maybe I've misread that part. If it's true in all cases, then argument tuple creation isn't where the overhead is coming from.
There is an optimization that depends on having no default arguments (or keyword arguments or free variables). It copies the arguments directly from the caller's frame into the callee's frame without creating an argument tuple.

It's interesting to avoid the copy from caller to callee, but I don't think it's a big cost relative to everything else we're doing to set up a frame for calling. (I expect the number of arguments is usually small.) You would need some way to encode what variables are loaded from the caller stack and what variables are loaded from the current frame. Either a different opcode or some kind of flag in the current LOAD/STORE argument.

One other possibility for optimization is this XXX comment in fast_function():

    /* XXX Perhaps we should create a specialized PyFrame_New() that doesn't
       take locals, but does take builtins without sanity checking them. */
    f = PyFrame_New(tstate, co, globals, NULL);

PyFrame_New() does a fair amount of work that is unnecessary in the common case.

Jeremy
On Thursday, Oct 30, 2003, at 05:24 Europe/Stockholm, Jeremy Hylton wrote: [function calls]
There is an optimization that depends on having no default arguments (or keyword arguments or free variables).
Why does it depend on not having default arguments? If you supply the right number of arguments (something that's obviously already checked) why does the function having defaults make a jot of difference?
It copies the arguments directly from the caller's frame into the callee's frame without creating an argument tuple.
It's interesting to avoid the copy from caller to callee, but I don't think it's a big cost relative to everything else we're doing to set up a frame for calling. (I expect the number of arguments is usually small.) You would need some way to encode what variables are loaded from the caller stack and what variables are loaded from the current frame. Either a different opcode or some kind of flag in the current LOAD/STORE argument.
As I think Phillip managed to convince himself recently, some kind of JIT functionality seems to be needed to do function calls really efficiently. I wonder if libffi does enough... it would be nice if the body of CALL_FUNCTION could look a bit like this:

    x = POP()
    PUSH(((some_cast_or_other)x)(f, stack_pointer, oparg))

Gah, this doesn't really seem to work out, on thinking about it. Wins seem more likely to come from knowing with some certainty at the call site that you've not messed the arguments up (and so we're back to wanting a JIT, it seems to me).
One other possibility for optimization is this XXX comment in fast_function():

    /* XXX Perhaps we should create a specialized
       PyFrame_New() that doesn't take locals, but does
       take builtins without sanity checking them. */
    f = PyFrame_New(tstate, co, globals, NULL);
PyFrame_New() does a fair amount of work that is unnecessary in the common case.
Fair amount? I have a patch that gets ~1.5% on pystone along these lines, but it's a bit scary (makes a "lightweight-frame" subclass that assumes more about things on its freelist, and so on). I'm not sure the modest gains are worth the complexity, but I'll post it to SF... Cheers, mwh
"Phillip J. Eby" <pje@telecommunity.com> writes:
At 06:33 PM 10/29/03 +0000, Michael Hudson wrote:
"Phillip J. Eby" <pje@telecommunity.com> writes:
* Direct use of positional arguments on the stack as the "locals" of the next function called, without creating (and then unpacking) an argument tuple, in the case where there are no */** arguments provided by the caller.
Already done, unless I misunderstand your idea. Well, the arguments might still get copied into the new frame's locals area but I'm pretty sure no tuple is involved.
Hm. I thought that particular optimization only could take place when the function lacks default arguments. But maybe I've misread that part. If it's true in all cases, then argument tuple creation isn't where the overhead is coming from.
I hadn't realized/had forgotten that this optimization depended on the lack of default arguments. Instinct would say that it shouldn't be *too* hard to extend to that case (hardly a thesis topic, at any rate :-). Cheers, mwh -- The only problem with Microsoft is they just have no taste. -- Steve Jobs, (From _Triumph of the Nerds_ PBS special) and quoted by Aahz Maruch on comp.lang.python
<SNIP - my request for ideas for a master thesis> Just a quick "thank you!" to everyone who has emailed me, personally or publicly, with ideas. There have been a ton of great suggestions and I am going to seriously consider all of them. And this thanks stands indefinitely for any and all future emails on this subject. And please keep sending ideas! Even if I don't pick up on a certain idea, maybe someone else will be inspired and decide to run with it, or at least start a discussion on possible future improvements (there is always my doctoral thesis in a few years =). I can't believe I just said that more discussion on this list was good, when I know these threads will most likely take on a life of their own. I guess I really do want to lose my 20/20 vision. =) I also think this thread is a testament to this community in general, and this list specifically, in how we help others when we can and in the nicest way possible. I have to admit with great pride that I am a part of this wonderful community. -Brett
Brett -- You might put together a list of all the ideas (maybe even a ranked list) and post it as a unit to the list for archival purposes. Thanks. On Wed, 29 Oct 2003, Brett C. wrote:
<SNIP - my request for ideas for a master thesis>
Just a quick "thank you!" to everyone who has emailed me, personally or publicly, with ideas. There have been a ton of great suggestions and I am going to seriously consider all of them. And this thanks stands indefinitely for any and all future emails on this subject.
And please keep sending ideas! Even if I don't pick up on a certain idea, maybe someone else will be inspired and decide to run with it, or at least start a discussion on possible future improvements (there is always my doctoral thesis in a few years =). I can't believe I just said that more discussion on this list was good, when I know these threads will most likely take on a life of their own. I guess I really do want to lose my 20/20 vision. =)
I also think this thread is a testament to this community in general, and this list specifically, in how we help others when we can and in the nicest way possible. I have to admit with great pride that I am a part of this wonderful community.
-Brett
Dennis Allison wrote:
Brett --
You might put together a list of all the ideas (maybe even a ranked list) and post it as a unit to the list for archival purposes. Thanks.
Way ahead of you, Dennis. I have already started a reST doc writing up all of these suggestions. It just might be a little while before I get it up, since I will need to do some preliminary research on each idea to gauge how much work it would be. -Brett
"Brett C." <bac@OCF.Berkeley.EDU> writes:
Dennis Allison wrote:
Brett -- You might put together a list of all the ideas (maybe even a ranked list) and post it as a unit to the list for archival purposes. Thanks.
Way ahead of you, Dennis. I have already started a reST doc writing up all of these suggestions. It just might be a little while before I get it up, since I will need to do some preliminary research on each idea to gauge how much work it would be.
Could go on the Python Wiki? I take it from your posting of last week that you've thought about other ways of implementing exception handling? I guess a non-reference count based GC is a prerequisite for that... Cheers, mwh -- >> REVIEW OF THE YEAR, 2000 << It was shit. Give us another one. -- NTK Now, 2000-12-29, http://www.ntk.net/
Michael Hudson wrote:
"Brett C." <bac@OCF.Berkeley.EDU> writes:
Dennis Allison wrote:
Brett -- You might put together a list of all the ideas (maybe even a ranked list) and post it as a unit to the list for archival purposes. Thanks.
Way ahead of you, Dennis. I have already started a reST doc writing up all of these suggestions. It just might be a little while before I get it up, since I will need to do some preliminary research on each idea to gauge how much work it would be.
Could go on the Python Wiki?
Could. Let me get it done in reST locally, then I can look at adding it to the wiki.
I take it from your posting of last week that you've thought about other ways of implementing exception handling? I guess a non-reference count based GC is a prerequisite for that...
Yeah, I have tossed the exception handling idea around in my head a little, but the culmination was what I posted. And a non-refcount GC would definitely help, even if the exception handling wasn't changed. More places where you could just return NULL instead of having to deal with DECREFing objects. -Brett
"Brett C." <bac@OCF.Berkeley.EDU> writes:
I take it from your posting of last week that you've thought about other ways of implementing exception handling? I guess a non-reference count based GC is a prerequisite for that...
Yeah, I have tossed the exception handling idea around in my head a little, but the culmination was what I posted.
And a non-refcount GC would definitely help, even if the exception handling wasn't changed. More places where you could just return NULL instead of having to deal with DECREFing objects.
And reducing the memory overhead of objects. Here's my crazy idea that's been knocking around my head for a while. I wonder if anyone can shoot it down in flames. Remove the ob_type field from all PyObjects. Make pymalloc mandatory, make it use type-specific pools, and store a pointer to the type object at the start of each pool. So instead of p->ob_type it's *(p&MASK). I think having each type in its own pools would also let you lose the gc_next & gc_prev fields. Combined with a non-refcount GC, you could hammer sizeof(PyIntObject) down to sizeof(long)! (Actually, a potential killer is assigning to __class__ -- maybe you could only do this for heaptypes.) Cheers, mwh -- To summarise the summary of the summary:- people are a problem. -- The Hitch-Hikers Guide to the Galaxy, Episode 12
At 11:42 AM 10/31/03 +0000, Michael Hudson wrote:
"Brett C." <bac@OCF.Berkeley.EDU> writes:
I take it from your posting of last week that you've thought about other ways of implementing exception handling? I guess a non-reference count based GC is a prerequisite for that...
Yeah, I have tossed the exception handling idea around in my head a little, but the culmination was what I posted.
And a non-refcount GC would definitely help, even if the exception handling wasn't changed. More places where you could just return NULL instead of having to deal with DECREFing objects.
And reducing the memory overhead of objects.
OTOH, maybe you could see whether INCREF/DECREF can be used to control synchronization of objects between threads, and thus get a multiprocessor Python. Note that if an object's refcount is 1, it's not being shared between threads. INCREF could be looked at as, "I'm about to use this object", so if the object isn't "owned" by the current thread, then lock it and increment an ownership count. Or was that how the experimental free-threading Python worked?
Here's my crazy idea that's been knocking around my head for a while. I wonder if anyone can shoot it down in flames.
Remove the ob_type field from all PyObjects. Make pymalloc mandatory, make it use type specific pools and store a pointer to the type object at the start of each pool.
How would you get from the pointer to the pool head?
"Phillip J. Eby" <pje@telecommunity.com> writes:
Here's my crazy idea that's been knocking around my head for a while. I wonder if anyone can shoot it down in flames.
Remove the ob_type field from all PyObjects. Make pymalloc mandatory, make it use type specific pools and store a pointer to the type object at the start of each pool.
How would you get from the pointer to the pool head?
Did you read the rest of my mail? Maybe I was too terse, but my thinking was that the pools are aligned on a known size boundary (e.g. 4K) so to get to the head you just mask off the 12 (or whatever) least significant bits. Wouldn't work for zeta-c[1], I'd have to admit, but do we care? Cheers, mwh [1] http://www.cliki.net/Zeta-C -- SPIDER: 'Scuse me. [scuttles off] ZAPHOD: One huge spider. FORD: Polite though. -- The Hitch-Hikers Guide to the Galaxy, Episode 11
At 02:10 PM 10/31/03 +0000, Michael Hudson wrote:
"Phillip J. Eby" <pje@telecommunity.com> writes:
Here's my crazy idea that's been knocking around my head for a while. I wonder if anyone can shoot it down in flames.
Remove the ob_type field from all PyObjects. Make pymalloc mandatory, make it use type specific pools and store a pointer to the type object at the start of each pool.
How would you get from the pointer to the pool head?
Did you read the rest of my mail? Maybe I was too terse, but my
Yes, and yes. :)
thinking was that the pools are aligned on a known size boundary (e.g. 4K) so to get to the head you just mask off the 12 (or whatever) least significant bits.
Ah. But since even the most trivial of Python operations requires access to the type, wouldn't this take longer? I mean, for every ob->ob_type->tp_whatever you'll now have something like *(ob & mask)->tp_whatever. So there are still two memory accesses, but now there's a bitmasking operation added in. I suppose that for some object types you could be getting a 12-25% decrease in memory use for the base object, though.
"Phillip J. Eby" <pje@telecommunity.com> writes:
thinking was that the pools are aligned on a known size boundary (e.g. 4K) so to get to the head you just mask off the 12 (or whatever) least significant bits.
Ah. But since even the most trivial of Python operations require access to the type, wouldn't this take longer? I mean, for every ob->ob_type->tp_whatever you'll now have something like *(ob & mask)->tp_whatever.
Well, I dunno. I doubt the masking would add significant overhead -- it'd only be one instruction, after all -- but the fact that you'd have to haul the start of the pool into the cache to get the pointer to the type object might hurt. You'd have to try it and measure, I guess.
So there are still two memory acesses, but now there's a bitmasking operation added in. I suppose that for some object types you could be getting a 12-25% decrease in memory use for the base object, though.
More than that in the good cases. Something I forgot was that you'd probably have to knock variable length types on the head. Cheers, mwh -- I would hereby duly point you at the website for the current pedal powered submarine world underwater speed record, except I've lost the URL. -- Callas, cam.misc
Hello Michael, On Fri, Oct 31, 2003 at 05:08:36PM +0000, Michael Hudson wrote:
be getting a 12-25% decrease in memory use for the base object, though.
More than that in the good cases. Something I forgot was that you'd probably have to knock variable length types on the head.
Why? Armin
Armin Rigo <arigo@tunes.org> writes:
On Fri, Oct 31, 2003 at 05:08:36PM +0000, Michael Hudson wrote:
be getting a 12-25% decrease in memory use for the base object, though.
More than that in the good cases. Something I forgot was that you'd probably have to knock variable length types on the head.
Why?
Assuming "to knock on the head" means "to put an end to": If you put all objects of the same type into a pool, you really want all objects to have the same size inside a pool. With that assumption, garbage objects can be reallocated without causing fragmentation. If objects in a pool have different sizes, it is not possible to have an efficient reallocation strategy. Of course, you could try to make a compacting garbage collector, but that would break the current programming model even more (as object references would stop being pointers). Regards, Martin
Hello Martin, On Sun, Nov 02, 2003 at 08:05:55PM +0100, Martin v. Löwis wrote:
More than that in the good cases. Something I forgot was that you'd probably have to knock variable length types on the head.
Why?
Assuming "to knock on the head" means "to put an end to":
If you put all objects of the same type into a pool, you really want all objects to have the same size inside a pool. With that assumption, garbage objects can be reallocated without causing fragmentation. If objects in a pool have different sizes, it is not possible to have an efficient reallocation strategy.
"Not easy" would have been more appropriate. It is still basically what malloc() does. One way would be to use Python's current memory allocator, by adapting it to sort objects into pools not only according to size but also according to type. What seems to me like a good solution would be to use one relatively large "arena" per type and Python's memory allocator to subdivide each arena. If each arena starts at a pointer address which is properly aligned, then *(p&MASK) gives you the type of any object, and possibly even without much cache-miss overhead because there are not so many arenas in total (probably only 1-2 per type in common cases, and arenas can be large). A bientot, Armin.
Armin Rigo wrote:
If you put all objects of the same type into a pool, you really want all objects to have the same size inside a pool. With that assumption, garbage objects can be reallocated without causing fragmentation. If objects in a pool have different sizes, it is not possible to have an efficient reallocation strategy.
"Not easy" would have been more appropriate. It is still basically what malloc() does.
That's why I said "efficient". What malloc basically does is not efficient. It gets worse if, at reallocation time, you are not only bound by size, but also by type. E.g. if you have deallocated a tuple of 10 elements, and then reallocate a tuple of 6, the wasted space can only hold a tuple of 1 element, nothing else.
One way would be to use Python's current memory allocator, by adapting it to sort objects into pools not only according to size but also according to type. What seems to me like a good solution would be to use one relatively large "arena" per type and Python's memory allocator to subdivide each arena. If each arena starts at a pointer address which is properly aligned, then *(p&MASK) gives you the type of any object, and possibly even without much cache-miss overhead because there are not so many arenas in total (probably only 1-2 per type in common cases, and arenas can be large).
So where do you put strings with 100,000 elements (characters)? Or any other object that exceeds an arena in size? Regards, Martin
[Martin v. Löwis, on schemes to segregate object memory by type, with the type pointer shared at a calculated address]
... So where do you put strings with 100,000 elements (characters)? Or any other object that exceeds an arena in size?
You allocate enough extra memory so that there's room to stick a type pointer at the calculated address; or, IOW, it becomes a one-object pool, but of unusually large size. Some bytes may be lost at the start of the allocated region to allow planting a type pointer at a pool-aligned address; but by assumption the object is "very large", so the wastage can be small in percentage terms. That said, the current pymalloc is relentless about speeding alloc/free of exactly-the-same-size small blocks -- there's not much code that could be reused in a type-segregated scheme (the debug pymalloc wrapper is a different story -- it can wrap any malloc/free).
Hello Martin, On Mon, Nov 03, 2003 at 01:18:53AM +0100, "Martin v. Löwis" wrote:
"Not easy" would have been more appropriate. It is still basically what malloc() does.
That's why I said "efficient". What malloc basically does is not efficient. It gets worse if, at reallocation time, you are not only bound by size, but also by type. E.g. if you have deallocated a tuple of 10 elements, and then reallocate a tuple of 6, the wasted space can only hold a tuple of 1 element, nothing else.
That's why we have a custom allocator in Python, to minimize this kind of impact by subdividing arenas into pools of objects grouped by size. I admit that adding the type constraint adds burden to the allocator, though.
So where do you put strings with 100,000 elements (characters)? Or any other object that exceeds an arena in size?
These ones are not a problem, because objects and arenas can be larger than the MASK. You get to the start of the arena by masking bits away from the address of the *beginning* of the object. An arena can be of any size, as long as all the objects it contains start in the first MASK bytes. For a very large object, the arena would contain only this object, which would then start at the beginning of the arena. I'm more concerned about medium-sized objects, e.g. the ones whose size is 51% of MASK. At the moment I don't see a good solution for these. A bientot, Armin.
Armin Rigo <arigo@tunes.org> writes:
Hello Martin,
On Sun, Nov 02, 2003 at 08:05:55PM +0100, Martin v. Löwis wrote:
More than that in the good cases. Something I forgot was that you'd probably have to knock variable length types on the head.
Why?
Assuming "to knock on the head" means "to put an end to":
If you put all objects of the same type into a pool, you really want all objects to have the same size inside a pool. With that assumption, garbage objects can be reallocated without causing fragmentation. If objects in a pool have different sizes, it is not possible to have an efficient reallocation strategy.
"Not easy" would have been more appropriate. It is still basically what malloc() does.
Well, yeah, but as Tim said pymalloc gets its wins from assuming that each allocation is the same size. You could combine my idea with some other allocation scheme, certainly, but given the relative paucity of variable length types and the reduction in allocator overhead using something like pymalloc gives us, I think it might just be easier to not do them any more. Of course, I don't see myself having any time to play with this idea any time soon, and it's probably not really beefy enough to get a masters thesis from, so maybe we'll never know.
One way would be to use Python's current memory allocator, by adapting it to sort objects into pools not only according to size but also according to type.
That's pretty much what I was suggesting.
What seems to me like a good solution would be to use one relatively large "arena" per type and Python's memory allocator to subdivide each arena. If each arena starts at a pointer address which is properly aligned, then *(p&MASK) gives you the type of any object, and possibly even without much cache-miss overhead because there are not so many arenas in total (probably only 1-2 per type in common cases, and arenas can be large).
Hmm, maybe. I'm not going to make guesses about that one :-) Cheers, mwh -- ... Windows proponents tell you that it will solve things that your Unix system people keep telling you are hard. The Unix people are right: they are hard, and Windows does not solve them, ... -- Tim Bradshaw, comp.lang.lisp
Hello Michael, On Mon, Nov 03, 2003 at 11:35:05AM +0000, Michael Hudson wrote:
"Not easy" would have been more appropriate. It is still basically what malloc() does.
Well, yeah, but as Tim said pymalloc gets its wins from assuming that each allocation is the same size. You could combine my idea with some other allocation scheme, certainly, but given the relative paucity of variable length types and the reduction in allocator overhead using something like pymalloc gives us, I think it might just be easier to not do them any more. Of course, I don't see myself having any time to play with this idea any time soon, and it's probably not really beefy enough to get a masters thesis from, so maybe we'll never know.
Ok. I expect it to be much easier to experiment with in PyPy anyway. Armin
Armin Rigo <arigo@tunes.org> writes:
Hello Michael,
On Mon, Nov 03, 2003 at 11:35:05AM +0000, Michael Hudson wrote:
"Not easy" would have been more appropriate. It is still basically what malloc() does.
Well, yeah, but as Tim said pymalloc gets its wins from assuming that each allocation is the same size. You could combine my idea with some other allocation scheme, certainly, but given the relative paucity of variable length types and the reduction in allocator overhead using something like pymalloc gives us, I think it might just be easier to not do them any more. Of course, I don't see myself having any time to play with this idea any time soon, and it's probably not really beefy enough to get a masters thesis from, so maybe we'll never know.
Ok. I expect it to be much easier to experiment with in PyPy anyway.
This had occured to me too :-) Cheers, mwh -- Never meddle in the affairs of NT. It is slow to boot and quick to crash. -- Stephen Harris -- http://home.xnet.com/~raven/Sysadmin/ASR.Quotes.html
At 11:35 AM 11/3/03 +0000, Michael Hudson wrote:
Armin Rigo <arigo@tunes.org> writes:
What seems to me like a good solution would be to use one relatively large "arena" per type and Python's memory allocator to subdivide each arena. If each arena starts at a pointer address which is properly aligned, then *(p&MASK) gives you the type of any object, and possibly even without much cache-miss overhead because there are not so many arenas in total (probably only 1-2 per type in common cases, and arenas can be large).
Hmm, maybe. I'm not going to make guesses about that one :-)
You guys do realize that this scheme would make it impossible to change an object's type, right? Unless of course you have some way to "search and replace" all references to an object. And if you were to say, "well, we'll only use this trick for non-heap types", my question would be, how's the code doing *(p&MASK) going to know how *not* to do that? If heap types have a different layout, how can you inherit from a builtin type in pure Python? And so on.
"Phillip J. Eby" <pje@telecommunity.com> writes:
At 11:35 AM 11/3/03 +0000, Michael Hudson wrote:
Armin Rigo <arigo@tunes.org> writes:
What seems to me like a good solution would be to use one relatively large "arena" per type and Python's memory allocator to subdivide each arena. If each arena starts at a pointer address which is properly aligned, then *(p&MASK) gives you the type of any object, and possibly even without much cache-miss overhead because there are not so many arenas in total (probably only 1-2 per type in common cases, and arenas can be large).
Hmm, maybe. I'm not going to make guesses about that one :-)
You guys do realize that this scheme would make it impossible to change an object's type, right? Unless of course you have some way to "search and replace" all references to an object.
I'd got this far...
And if you were to say, "well, we'll only use this trick for non-heap types", my question would be, how's the code doing *(p&MASK) going to know how *not* to do that? If heap types have a different layout, how can you inherit from a builtin type in pure Python? And so on.
... but somehow this point had escaped me. Well, you could do something like having a cookie[1] at the start of heap type pools that says "the pointer to the type object is actually at *(p-4)", but that's pretty sick (and puts branches in every type access). Darn. Oh well, I suspected my idea had to have some large problem; it just took longer than I expected for someone to spot it :-) Cheers, mwh [1] e.g. NULL... -- Presumably pronging in the wrong place zogs it. -- Aldabra Stoddart, ucam.chat
On Friday 31 October 2003 12:08 pm, Michael Hudson wrote:
More than that in the good cases. Something I forgot was that you'd probably have to knock variable length types on the head.
That's something I've always wondered about -- what exactly is a "variable length type" and why are they special? From what I gather, they're types (long, str, and tuple are the main ones I know of) whose struct is actually of variable size -- rather than contain a pointer to a variable-size thing, they contain the variable-size thing themselves. What do we gain from them? (if there's some documentation I overlooked, feel free to point me to it.) Thanks, Jeremy
Jeremy Fincher wrote:
That's something I've always wondered about -- what exactly is a "variable length type" and why are they special? From what I gather, they're types (long, str, and tuple are the main ones I know of) whose struct is actually of variable size -- rather than contain a pointer to a variable-size thing, they contain the variable-size thing themselves.
Correct. Examples include strings and tuples, but not lists and dictionaries.
What do we gain from them?
Speed, by saving an extra allocation upon creation; also some speed by saving an indirection upon access. It only works if the number of items in the object is not going to change over the lifetime of the object - in particular, for immutable objects. There is actually an exception to this rule: If you own the only reference to the object, you can afford to change its size (available for strings only). Regards, Martin
participants (15)
- "Martin v. Löwis"
- Alex Martelli
- Armin Rigo
- Brett C.
- Dennis Allison
- Fernando Perez
- Holger Krekel
- Jeremy Fincher
- Jeremy Hylton
- martin@v.loewis.de
- Michael Hudson
- Neil Schemenauer
- Phillip J. Eby
- Samuele Pedroni
- Tim Peters