From tomerfiliba at gmail.com  Tue Aug  1 19:27:58 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Tue, 1 Aug 2006 19:27:58 +0200
Subject: [Python-3000] gettype
In-Reply-To: <44CC6A3E.8000003@v.loewis.de>
References: <1d85506f0607061119w1c3cab60o6f762a8e3849e45c@mail.gmail.com>
	<44CC6A3E.8000003@v.loewis.de>
Message-ID: <1d85506f0608011027v4402f905ge6bc18e25ef0aa9e@mail.gmail.com>

that's surely an anachronism :)

o.__class__ is a little more typing and will surely scare newbies.
moreover, type(x) and x.__class__ can return different things
(you can fool __class__, but not type()).

for my part, i'm fine with any form that makes a distinction between
the metaclass "type" and the inquire-type "type".
call it o.__class__, gettype() or typeof(), just don't mix that with
the metaclass
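to make the __class__-can-lie point concrete, a minimal sketch (modern syntax):

```python
# __class__ can be shadowed by a property, but type() reads the
# object's real type slot directly
class Impostor:
    @property
    def __class__(self):
        return int

x = Impostor()
assert x.__class__ is int    # the lie
assert type(x) is Impostor   # the truth
```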


-tomer

On 7/30/06, "Martin v. Löwis" <martin at v.loewis.de> wrote:
> tomer filiba schrieb:
> > so why not choose the "get%s()" notation?
>
> Why not o.__class__?
>
> Regards,
> Martin
>

From talin at acm.org  Wed Aug  2 04:29:51 2006
From: talin at acm.org (Talin)
Date: Tue, 01 Aug 2006 19:29:51 -0700
Subject: [Python-3000] gettype
In-Reply-To: <1d85506f0608011027v4402f905ge6bc18e25ef0aa9e@mail.gmail.com>
References: <1d85506f0607061119w1c3cab60o6f762a8e3849e45c@mail.gmail.com>	<44CC6A3E.8000003@v.loewis.de>
	<1d85506f0608011027v4402f905ge6bc18e25ef0aa9e@mail.gmail.com>
Message-ID: <44D00E1F.2040209@acm.org>

tomer filiba wrote:
> that's surely an anachronism :)
> 
> o.__class__ is a little more typing and will surely scare newbies.
> moreover, type(x) and x.__class__ can return different things
> (you can fool __class__, but not type()).
> 
> for my part, i'm fine with any form that makes a distinction between
> the metaclass "type" and the inquire-type "type".
> call it o.__class__, gettype() or typeof(), just don't mix that with
> the metaclass

 From a code style perspective, I've always felt that the magical 
__underscore__ names should not be referred to outside of the class 
implementing those names. The double underscores are an indication that 
this method or property is in most normal use cases referred to 
implicitly by use rather than explicitly by name; Thus str() invokes 
__str__ and so on.
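A minimal illustration of that implicit dispatch:

```python
# str() invokes the type's __str__ implicitly, so callers never need
# to spell the underscore name themselves
class Point:
    def __str__(self):
        return "Point(1, 2)"

assert str(Point()) == "Point(1, 2)"
```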

-- Talin

From jack at psynchronous.com  Wed Aug  2 05:14:37 2006
From: jack at psynchronous.com (Jack Diederich)
Date: Tue, 1 Aug 2006 23:14:37 -0400
Subject: [Python-3000] gettype
In-Reply-To: <44D00E1F.2040209@acm.org>
References: <1d85506f0607061119w1c3cab60o6f762a8e3849e45c@mail.gmail.com>
	<44CC6A3E.8000003@v.loewis.de>
	<1d85506f0608011027v4402f905ge6bc18e25ef0aa9e@mail.gmail.com>
	<44D00E1F.2040209@acm.org>
Message-ID: <20060802031437.GJ25353@performancedrivers.com>

On Tue, Aug 01, 2006 at 07:29:51PM -0700, Talin wrote:
> tomer filiba wrote:
> > that's surely an anachronism :)
> > 
> > o.__class__ is a little more typing and will surely scare newbies.
> > moreover, type(x) and x.__class__ can return different things
> > (you can fool __class__, but not type()).
> > 
> > for my part, i'm fine with any form that makes a distinction between
> > the metaclass "type" and the inquire-type "type".
> > call it o.__class__, gettype() or typeof(), just don't mix that with
> > the metaclass
> 
>  From a code style perspective, I've always felt that the magical 
> __underscore__ names should not be referred to outside of the class 
> implementing those names. The double underscores are an indication that 
> this method or property is in most normal use cases referred to 
> implicitly by use rather than explicitly by name; Thus str() invokes 
> __str__ and so on.

The paired double underscores indicate that the function is special to 
the instance's class.  C++ converts understand this just fine until you
mention that classes are themselves instances, at which point the grey
matter takes a while to settle again [guilty].  After that reshuffling
you are again assaulted because the stack stops.  The class of a class
is a type, but the class of a class of a class is still a type.  Turtles
all the way down.

See the recent thread on python-checkins for some discussion on why
"isinstance(ob, type(type))" isn't just legal -- it's backwards compatible!
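A sketch of that stack bottoming out:

```python
# instance -> class -> metaclass, and then the turtles stop
class C:
    pass

assert type(C()) is C             # the instance's class
assert type(C) is type            # the class's class is type
assert type(type(C)) is type      # ...and it stays type from here on
assert isinstance(C, type(type))  # the spelling mentioned above
```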

-Jack


From ark-mlist at att.net  Wed Aug  2 06:56:18 2006
From: ark-mlist at att.net (Andrew Koenig)
Date: Wed, 2 Aug 2006 00:56:18 -0400
Subject: [Python-3000] gettype
In-Reply-To: <44D00E1F.2040209@acm.org>
Message-ID: <001001c6b5ef$fdd258f0$6402a8c0@arkdesktop>

>  From a code style perspective, I've always felt that the magical
> __underscore__ names should not be referred to outside of the class
> implementing those names. The double underscores are an indication that
> this method or property is in most normal use cases referred to
> implicitly by use rather than explicitly by name; Thus str() invokes
> __str__ and so on.

Haven't we seen this argument somewhere before?  :-)

(needless to say, I'm in agreement with it in this context too)



From ncoghlan at iinet.net.au  Thu Aug  3 14:58:44 2006
From: ncoghlan at iinet.net.au (Nick Coghlan)
Date: Thu, 03 Aug 2006 22:58:44 +1000
Subject: [Python-3000] Rounding in Py3k
Message-ID: <44D1F304.4020700@iinet.net.au>

Some musings inspired by the rounding discussion on python-dev.

The Decimal module provides all of the rounding modes from the general decimal 
arithmetic specification [1].

Both Decimal rounding methods (quantize() and to_integral()) return Decimal 
instances - a subsequent explicit conversion to int() is needed if you want a 
real integer (just like the builtin round()).

Normal floats, OTOH, only have easy access to truncate (through int()) and 
round-half-up (through round()).

Additionally, the Decimal 'quantize' method signature is fine if you have 
decimal literals, but not so good for Python where you have to write 
"n.quantize(d('1e-2'))" to round to two decimal places.
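For instance, with today's API (writing d for decimal.Decimal, a local alias as above):

```python
from decimal import Decimal as d

n = d("1.2345")
# rounding to two decimal places means constructing a whole Decimal
# just to carry the target exponent
assert str(n.quantize(d("1e-2"))) == "1.23"
```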

The implicit Decimal->float conversion also allows Decimals to be rounded with 
the round() builtin, but that can lead to errors in rounding near the limits 
of floating point precision due to the use of an imprecise conversion in 
Decimal.__float__():

 >>> n = (1 + d("5e-16"))
 >>> n
Decimal("1.0000000000000005")
 >>> float(n.quantize(d('1e-15')))
1.0
 >>> round(n, 15)
1.0000000000000011

Would it be worthwhile to design a common rounding mechanism that can be used 
to cleanly round values to the built in floating point type, as well as being 
able to access the different rounding modes for decimal instances?

For example, replace the builtin function round() with a non-instantiable 
class like the following:

   _TEN = decimal.Decimal(10)
   class round(object):

     @staticmethod
     def half_up(num, ndigits=0):
         if isinstance(num, decimal.Decimal):
             return float(num.quantize(_TEN**(-ndigits),
                          rounding=decimal.ROUND_HALF_UP))
         return float(num)._round_half_up()


     __call__ = half_up

     @staticmethod
     def down(num, ndigits=0):
         if isinstance(num, decimal.Decimal):
             return float(num.quantize(_TEN**(-ndigits),
                          rounding=decimal.ROUND_DOWN))
         return float(num)._round_down()

     # etc for the other 5 rounding modes

Cheers,
Nick.

[1] The 7 decimal rounding modes:

round-down (truncate; round towards 0)
round-half-up (school rounding)
round-half-even (bankers' rounding)
round-ceiling (round towards positive infinity)
round-floor (round towards negative infinity)
round-half-down (WTF rounding :)
round-up (round away from zero)


-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From greg.ewing at canterbury.ac.nz  Fri Aug  4 03:51:19 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 04 Aug 2006 13:51:19 +1200
Subject: [Python-3000] Rounding in Py3k
In-Reply-To: <44D1F304.4020700@iinet.net.au>
References: <44D1F304.4020700@iinet.net.au>
Message-ID: <44D2A817.8040303@canterbury.ac.nz>

Nick Coghlan wrote:

> The implicit Decimal->float conversion

Hang on, I thought there weren't supposed to be any
implicit conversions between Decimal and float.

--
Greg

From greg.ewing at canterbury.ac.nz  Fri Aug  4 03:51:25 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 04 Aug 2006 13:51:25 +1200
Subject: [Python-3000] Rounding in Py3k
In-Reply-To: <44D1F304.4020700@iinet.net.au>
References: <44D1F304.4020700@iinet.net.au>
Message-ID: <44D2A81D.2050204@canterbury.ac.nz>

Nick Coghlan wrote:

> Would it be worthwhile to design a common rounding mechanism that can be used 
> to cleanly round values to the built in floating point type, as well as being 
> able to access the different rounding modes for decimal instances?

Sounds like a job for a new protocol, such as __round__(self, mode, places).
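A rough sketch of what such a protocol could look like (all names hypothetical, and only one mode implemented for illustration):

```python
# a round() builtin that simply defers to the argument's __round__
# protocol method; named round_ here to avoid shadowing the builtin
def round_(obj, places=0, mode="half-even"):
    return obj.__round__(mode, places)

class MyNum:
    def __init__(self, v):
        self.v = v
    def __round__(self, mode, places):
        assert mode == "half-even"  # sketch: single mode only
        factor = 10 ** places
        return MyNum(round(self.v * factor) / factor)

assert round_(MyNum(2.5)).v == 2.0  # half-even: 2.5 rounds to 2
```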

--
Greg

From rrr at ronadam.com  Fri Aug  4 07:33:01 2006
From: rrr at ronadam.com (Ron Adam)
Date: Fri, 04 Aug 2006 00:33:01 -0500
Subject: [Python-3000] Rounding in Py3k
In-Reply-To: <44D2A81D.2050204@canterbury.ac.nz>
References: <44D1F304.4020700@iinet.net.au> <44D2A81D.2050204@canterbury.ac.nz>
Message-ID: <eaumbe$b3m$1@sea.gmane.org>

Greg Ewing wrote:
> Nick Coghlan wrote:
> 
>> Would it be worthwhile to design a common rounding mechanism that can be used 
>> to cleanly round values to the built in floating point type, as well as being 
>> able to access the different rounding modes for decimal instances?
> 
> Sounds like a job for a new protocol, such as __round__(self, mode, places).
> 
> --
> Greg


Yes I agree.  And viewing this in the larger sense of how it works with 
all numeric types is better, I think, than just sticking a function into 
the math module.  (Although that might end up being the way to do it.)

Nick's proposal adds a private method to each of the types for each mode, 
which I think clutters things up a bit, but his method does create a 
single interface to them, which is nice.

I'm still not sure why "__round__" should be preferred in place of 
"round" as a method name.  There isn't an operator associated with 
rounding, so wouldn't the method name not have underscores?


I think rounding any type should return that same type.  For example:


     def round(n, places=0, mode='half-down'):
        return n.round(places, mode)


     round(i, 2)    -> integer, unchanged value
     round(i)       -> integer, precision == 0
     round(i, -2)   -> integer

     round(f, 2)    -> float
     round(f)       -> float,   precision == 0
     round(f, -2)   -> float

     round(d, 2)    -> decimal
     round(d)       -> decimal,  precision == max (*)
     round(d, -2)   -> decimal


(*) The default decimal rounding behavior is not the same as the default 
builtin round behavior.  Should one be changed to match the other?


Calling the desired type's method directly could be a good way to handle 
getting an integer when a float is given.  It's explicit.

     int.round(f, 2)  ->  integer
     int.round(f)     ->  integer
     int.round(f, -2)  ->  integer

Or if you prefer...

     int.__round__(f)

Having modes seems to me the best way to avoid cluttering the namespace, 
although sometimes that seems like it's not an issue, and sometimes it 
seems like it is.

Here's the list of Java rounding modes for comparison. It's nearly 
identical to the ones in Decimal.

     http://java.sun.com/j2se/1.5.0/docs/api/java/math/RoundingMode.html



Cheers,
   Ron


From greg.ewing at canterbury.ac.nz  Fri Aug  4 11:24:26 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 04 Aug 2006 21:24:26 +1200
Subject: [Python-3000] Rounding in Py3k
In-Reply-To: <eaumbe$b3m$1@sea.gmane.org>
References: <44D1F304.4020700@iinet.net.au> <44D2A81D.2050204@canterbury.ac.nz>
	<eaumbe$b3m$1@sea.gmane.org>
Message-ID: <44D3124A.6010300@canterbury.ac.nz>

Ron Adam wrote:

> I'm still not sure why "__round__" should be preferred in place of 
> "round" as a method name.  There isn't an operator associated with 
> rounding, so wouldn't the method name not have underscores?

I was thinking there would be functions such as round(),
trunc(), etc. that use __round__ to do their work. That's
why I called it a protocol and not just a method.

--
Greg

From rrr at ronadam.com  Fri Aug  4 12:46:42 2006
From: rrr at ronadam.com (Ron Adam)
Date: Fri, 04 Aug 2006 05:46:42 -0500
Subject: [Python-3000] Rounding in Py3k
In-Reply-To: <44D3124A.6010300@canterbury.ac.nz>
References: <44D1F304.4020700@iinet.net.au>
	<44D2A81D.2050204@canterbury.ac.nz>	<eaumbe$b3m$1@sea.gmane.org>
	<44D3124A.6010300@canterbury.ac.nz>
Message-ID: <eav8nk$vji$1@sea.gmane.org>

Greg Ewing wrote:
> Ron Adam wrote:
> 
>> I'm still not sure why "__round__" should be preferred in place of 
>> "round" as a method name.  There isn't an operator associated with 
>> rounding, so wouldn't the method name not have underscores?
> 
> I was thinking there would be functions such as round(),
> trunc(), etc. that use __round__ to do their work. That's
> why I called it a protocol and not just a method.
> 
> --
> Greg

I understood your point. :-)

If you look at the methods in int, long, and float, there are no methods 
that do not have double underscores, while there are many that don't in 
unicode and string.  There are also many methods in Decimal that do not 
use the double underscore naming convention.

I am just curious why not in general for the builtin numeric types.

The style guide says...

> - __double_leading_and_trailing_underscore__: "magic" objects or
>       attributes that live in user-controlled namespaces.  E.g. __init__,
>       __import__ or __file__.  Never invent such names; only use them
>       as documented.

So would __round__ interact with the interpreter in some "magic" way?  I 
take "magic" to mean the interpreter calls the method directly at times 
without having Python-coded instructions to do so, such as when we 
create an object from a class and __init__ gets called by the 
interpreter directly.  The same goes for methods like __add__ and 
__repr__, etc...

But that doesn't explain why int, long, and float, don't have other 
non-magic methods.

I'm not attempting to take sides for or against either way; I just want 
to understand the reasons, since knowing that would make the correct way 
to do it clear, instead of trying to wag the dog by the tail, if you 
know what I mean.

Cheers,
    Ron


From tomerfiliba at gmail.com  Fri Aug  4 17:36:40 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Fri, 4 Aug 2006 17:36:40 +0200
Subject: [Python-3000] improved threading in py3k
Message-ID: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>

python's threading model seems too weak imo. i'm not talking about
the GIL and the fact threads run one at a time -- i'm talking about the
incompleteness of the API of thread module.

once a thread is created, there is no way to kill it *externally* -- which
is a pity, since the thread must be "willing" to die. for example:

def threadfunc():
   while i_am_alive:
      ....
i_am_alive = True
thread.start_new_thread(threadfunc, ())
i_am_alive = False

but of course you can't trust that all threads work this way. moreover,
if the thread calls an internal function that blocks but doesn't check
i_am_alive, it will never exit. not to mention messing around with
globals, etc.

the proposed solution is introducing thread.kill, for example:
>>> import time
>>> import thread
>>> thread.start_new_thread(time.sleep, (10,))
476
>>> thread.kill(476)

thread.kill() would raise the ThreadExit exception in the context of the
given thread, which, unless caught, causes the thread to exit silently.
if it is the last thread of the process, ThreadExit is equivalent to
SystemExit.

another issue is sys.exit()/SystemExit -- suppose a thread wants to
cause the interpreter to exit. calling sys.exit in any thread but the main
one will simply kill the *calling* thread. the only way around it is calling
os.abort or os._exit(*)... but these functions do not perform cleanups.

i would suggest that when SystemExit is raised in any thread and not
caught there, it should be re-raised in the context of the main thread,
where it can be re-caught or the interpreter would exit.

and of course, once the functionality of the thread module is extended,
the threading module must be extended to support it as well.

- - - -

(*) about os._exit -- how about introducing os.exit, which would serve
as the "nicer" version of os._exit? os.exit would kill the process in
the same way SystemExit kills it (performing cleanups and all).
in fact, the interpreter would just call os.exit() when catching SystemExit.

it would also allow you to ensure the interpreter is killed, as SystemExit
can be caught by external code against your will.


-tomer

From jcarlson at uci.edu  Fri Aug  4 20:17:49 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 04 Aug 2006 11:17:49 -0700
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>
References: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>
Message-ID: <20060804105349.E6C3.JCARLSON@uci.edu>


"tomer filiba" <tomerfiliba at gmail.com> wrote:
> python's threading model seems too weak imo. i'm not talking about
> the GIL and the fact threads run one at a time -- i'm talking about the
> incompleteness of the API of thread module.

I could have sworn that it could be implemented as a debugging trace
function [1], but my tests [2] seem to imply that non-mainthread code
doesn't actually have the trace function called.

 - Josiah

[1]

>>> import sys
>>> import threading
>>>
>>> kill_these = {}
>>>
>>> def killthread(thread):
...     kill_these[thread] = None
...
>>> def trace(*args):
...     del args
...     if threading.currentThread() in kill_these:
...         #pick some exception unlikely/impossible to catch
...         raise MemoryError
...     return trace
...
>>> sys.settrace(trace)
>>> def waster():
...     while 1:
...             a = 1
...             b = 2
...             c = 3
...
>>> x = threading.Thread(target=waster)
>>> x.start()
>>> killthread(x)
>>> kill_these
{<Thread(Thread-1, started)>: None}
>>> x in kill_these
True
>>> x in threading.enumerate()
True
>>> threading.enumerate()
[<Thread(Thread-1, started)>, <_MainThread(MainThread, started)>]
>>> 


[2]
>>> import threading
>>> import sys
>>> seen = {}
>>> def trace(*args):
...     x = threading.currentThread()
...     if x not in seen:
...             print x
...     seen[x] = None
...     return trace
...
>>> sys.settrace(trace)
>>> def waster():
<_MainThread(MainThread, started)>
...     while 1:
...             a = 1
...             b = 2
...             c = 3
...
>>> x = threading.Thread(target=waster)
>>> x.start()
>>>

This is in Python 2.4.3 on Windows.
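For what it's worth, the threading module exposes its own hook for this: threading.settrace() installs the trace function into threads started through threading.Thread, before their run() is invoked -- a sketch:

```python
import threading

seen = set()

def trace(frame, event, arg):
    # record which threads the trace function actually fires in
    seen.add(threading.current_thread().name)
    return trace

# unlike sys.settrace(), this propagates to new threading.Thread threads
threading.settrace(trace)

def waster():
    a = 1
    b = 2

t = threading.Thread(target=waster)
t.start()
t.join()
threading.settrace(None)  # clean up the hook afterwards

# the worker thread shows up in `seen` now
assert any(name.startswith("Thread") for name in seen)
```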

> - - - -
> 
> (*) about os._exit -- how about introducing os.exit, which would serve
> as the "nicer" version of os._exit? os.exit would kill the process in
> the same way SystemExit kills it (performing cleanups and all).
> in fact, the interpreter would just call os.exit() when catching SystemExit.

Already exists as sys.exit()

 - Josiah


From tomerfiliba at gmail.com  Fri Aug  4 20:55:55 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Fri, 4 Aug 2006 20:55:55 +0200
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <20060804105349.E6C3.JCARLSON@uci.edu>
References: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>
	<20060804105349.E6C3.JCARLSON@uci.edu>
Message-ID: <1d85506f0608041155rbf7b38egbae39f521a6f8a2a@mail.gmail.com>

> [...] it could be implemented as a debugging trace function

even if it could be, *why*? you can't really suggest that from now on,
every multithreaded app must run in trace mode, right? it's a performance
penalty for no good reason -- it's a question of API.

just as the API lets you *create* threads, it should allow you to *kill* them,
once you decide so. your code shouldn't rely on the "cooperativeness" of
other functions (i.e., the thread does blocking IO using some external
library, but you wish to stop it after some timeout, etc.).

all i was talking about was adding a new function to the thread module,
as well as a new builtin exception to complement it. it's not such a big
change that you should work extra hours inventing creative workarounds
for.

- - - -

you said:
> Already exists as sys.exit()

but i said:
>> it would also allow you to ensure the interpreter is killed, as SystemExit
>> can be caught by external code against your will.

please take the time to read my post before you reply.
here is what i mean by "against your will":

>>> import sys
>>>
>>> try:
...     sys.exit()
... except:
...     print "fooled you"
...
fooled you
>>>

if my library raises SystemExit, but the user is not aware of that, he/she
can block it [un]intentionally, causing undefined behavior in my library.
os.exit() would really just perform cleanup and exit (not by means
of exceptions)... just like os._exit(), but not as crude.


-tomer

On 8/4/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> "tomer filiba" <tomerfiliba at gmail.com> wrote:
> > python's threading model seems too weak imo. i'm not talking about
> > the GIL and the fact threads run one at a time -- i'm talking about the
> > incompleteness of the API of thread module.
>
> I could have sworn that it could be implemented as a debugging trace
> function [1], but my tests [2] seem to imply that non-mainthread code
> doesn't actually have the trace function called.
>
>  - Josiah
>
> [1]
>
> >>> import sys
> >>> import threading
> >>>
> >>> kill_these = {}
> >>>
> >>> def killthread(thread):
> ...     kill_these[thread] = None
> ...
> >>> def trace(*args):
> ...     del args
> ...     if threading.currentThread() in kill_these:
> ...         #pick some exception unlikely/impossible to catch
> ...         raise MemoryError
> ...     return trace
> ...
> >>> sys.settrace(trace)
> >>> def waster():
> ...     while 1:
> ...             a = 1
> ...             b = 2
> ...             c = 3
> ...
> >>> x = threading.Thread(target=waster)
> >>> x.start()
> >>> killthread(x)
> >>> kill_these
> {<Thread(Thread-1, started)>: None}
> >>> x in kill_these
> True
> >>> x in threading.enumerate()
> True
> >>> threading.enumerate()
> [<Thread(Thread-1, started)>, <_MainThread(MainThread, started)>]
> >>>
>
>
> [2]
> >>> import threading
> >>> import sys
> >>> seen = {}
> >>> def trace(*args):
> ...     x = threading.currentThread()
> ...     if x not in seen:
> ...             print x
> ...     seen[x] = None
> ...     return trace
> ...
> >>> sys.settrace(trace)
> >>> def waster():
> <_MainThread(MainThread, started)>
> ...     while 1:
> ...             a = 1
> ...             b = 2
> ...             c = 3
> ...
> >>> x = threading.Thread(target=waster)
> >>> x.start()
> >>>
>
> This is in Python 2.4.3 on Windows.
>
> > - - - -
> >
> > (*) about os._exit -- how about introducing os.exit, which would serve
> > as the "nicer" version of os._exit? os.exit would kill the process in
> > the same way SystemExit kills it (performing cleanups and all).
> > in fact, the interpreter would just call os.exit() when catching SystemExit.
>
> Already exists as sys.exit()
>
>  - Josiah
>
>

From jcarlson at uci.edu  Fri Aug  4 21:29:09 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 04 Aug 2006 12:29:09 -0700
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <1d85506f0608041155rbf7b38egbae39f521a6f8a2a@mail.gmail.com>
References: <20060804105349.E6C3.JCARLSON@uci.edu>
	<1d85506f0608041155rbf7b38egbae39f521a6f8a2a@mail.gmail.com>
Message-ID: <20060804121614.E6D4.JCARLSON@uci.edu>


"tomer filiba" <tomerfiliba at gmail.com> wrote:
> 
> > [...] it could be implemented as a debugging trace function
> 
> even if it could be, *why*? you can't really suggest that from now on,
> every multithreaded app must run in trace mode, right? it's a performance
> penalty for no good reason -- it's a question of API.

You can remove the performance penalty by resetting the trace function
to None.


> just as the API lets you *create* threads, it should allow you to *kill* them,
> once you decide so. your code shouldn't rely on the "cooperativeness" of
> other functions (i.e., the thread does blocking IO using some external
> library, but you wish to stop it after some timeout, etc.).

According to recent unrelated research with regard to the Win32 API,
most thread-killing methods (if not all?) leave the thread state broken
in such a way that the only way to fix it is to close down the process. 
Then again, I could be misremembering; the Win32 API is huge.


> all i was talking about was adding a new function to the thread module,
> as well as a new builtin exception to complement it. it's not such a big
> change that you should work extra hours inventing creative workarounds
> for.

It took me 5 minutes to generate that possible solution and a test for
it.  I wasn't saying that the functionality was generally undesirable,
just that I believed it should be possible in pure Python today (rather
than waiting for Py3k, as your posting to the Py3k mailing list
implies), and I was showing why it couldn't be done today.  It also
brings up the implied question as to whether non-mainthreads should
actually execute trace functions.


> you said:
> > Already exists as sys.exit()
> 
> but i said:
> >> it would also allow you to ensure the interpreter is killed, as SystemExit
> >> can be caught by external code against your will.
> 
> please take the time to read my post before you reply.
> here is what i mean by "against your will":

I wasn't aware that sys.exit() raised SystemExit, as I tend to not use
bare excepts or sys.exit() in my code (I prefer os._exit(), because when
I want to quit, cleanup is the least of my worries).  You could have
said "sys.exit() raises SystemExit" and I would have understood my
mistake.


I'm curious as to what I have done to deserve the rudeness of your reply.
 - Josiah


From tomerfiliba at gmail.com  Fri Aug  4 22:21:54 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Fri, 4 Aug 2006 22:21:54 +0200
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <20060804121614.E6D4.JCARLSON@uci.edu>
References: <20060804105349.E6C3.JCARLSON@uci.edu>
	<1d85506f0608041155rbf7b38egbae39f521a6f8a2a@mail.gmail.com>
	<20060804121614.E6D4.JCARLSON@uci.edu>
Message-ID: <1d85506f0608041321h5a3b1d76gfae5bca45c37ff7e@mail.gmail.com>

> I'm curious as to what I have done to deserve the rudeness of your reply.
well, i'm kinda pissed off by rockets flying over my house, svn giving me a
hard life, and what not. but what you have done was dismissing my post on
shaky grounds.

if all you meant was adding this support for the 2.x branch as a *workaround*,
i truly apologize.

> According to recent unrelated research with regard to the Win32 API,
> most thread-killing methods (if not all?) leave the thread state broken
> in such a way that the only way to fix it is to close down the process.
> Then again, I could be misremembering; the Win32 API is huge.

that may be so, but my suggestion wasn't *killing* the thread directly -
i'm sure one can use win32api to forcefully kill threads.
my idea, which is loosely based on dotNET (perhaps also applicable in java),
was raising a ThreadExit exception in the context of the given thread.
that way, the exception propagates up normally, and will eventually cause
the thread's main function to exit silently, unless caught (just as it works
today).

the issue here is raising the exception in *another* thread (externally);
this could only be done from a builtin function (AFAIK); the rest of the
mechanisms are already in place.
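CPython does in fact expose such a hook at the C level, PyThreadState_SetAsyncExc, reachable from Python via ctypes. A sketch of the mechanism, with ThreadExit and async_raise as locally defined stand-ins for the proposed names:

```python
import ctypes
import threading
import time

class ThreadExit(Exception):
    """local stand-in for the proposed builtin ThreadExit"""

def async_raise(tid, exctype):
    # schedule `exctype` to be raised in the thread with id `tid`;
    # it is delivered the next time that thread executes bytecode
    res = ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_ulong(tid), ctypes.py_object(exctype))
    if res == 0:
        raise ValueError("invalid thread id")
    elif res > 1:
        # more than one thread state was affected: roll it back
        ctypes.pythonapi.PyThreadState_SetAsyncExc(ctypes.c_ulong(tid), None)

result = {}

def worker():
    try:
        while True:
            time.sleep(0.01)
    except ThreadExit:
        result["exited"] = True  # the exception propagated normally

t = threading.Thread(target=worker)
t.start()
time.sleep(0.05)
async_raise(t.ident, ThreadExit)
t.join(5)
assert result.get("exited")
```

Note this is CPython-specific and the exception cannot interrupt a thread blocked inside a C call, which is the hard part of the problem being discussed.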

- - -

sorry for bursting out.


-tomer

On 8/4/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> "tomer filiba" <tomerfiliba at gmail.com> wrote:
> >
> > > [...] it could be implemented as a debugging trace function
> >
> > even if it could be, *why*? you can't really suggest that from now on,
> > every multithreaded app must run in trace mode, right? it's a performance
> > penalty for no good reason -- it's a question of API.
>
> You can remove the performance penalty by resetting the trace function
> to None.
>
>
> > just as the API lets you *create* threads, it should allow you to *kill* them,
> > once you decide so. your code shouldn't rely on the "cooperativeness" of
> > other functions (i.e., the thread does blocking IO using some external
> > library, but you wish to stop it after some timeout, etc.).
>
> According to recent unrelated research with regard to the Win32 API,
> most thread-killing methods (if not all?) leave the thread state broken
> in such a way that the only way to fix it is to close down the process.
> Then again, I could be misremembering; the Win32 API is huge.
>
>
> > all i was talking about was adding a new function to the thread module,
> > as well as a new builtin exception to complement it. it's not such a big
> > change that you should work extra hours inventing creative workarounds
> > for.
>
> It took me 5 minutes to generate that possible solution and a test for
> it.  I wasn't saying that the functionality was generally undesirable,
> just that I believed it should be possible in pure Python today (rather
> than waiting for Py3k, as your posting to the Py3k mailing list
> implies), and I was showing why it couldn't be done today.  It also
> brings up the implied question as to whether non-mainthreads should
> actually execute trace functions.
>
>
> > you said:
> > > Already exists as sys.exit()
> >
> > but i said:
> > >> it would also allow you to ensure the interpreter is killed, as SystemExit
> > >> can be caught by external code against your will.
> >
> > please take the time to read my post before you reply.
> > here is what i mean by "against your will":
>
> I wasn't aware that sys.exit() raised SystemExit, as I tend to not use
> bare excepts or sys.exit() in my code (I prefer os._exit(), because when
> I want to quit, cleanup is the least of my worries).  You could have
> said "sys.exit() raises SystemExit" and I would have understood my
> mistake.
>
>
> I'm curious as to what I have done to deserve the rudeness of your reply.
>  - Josiah
>
>

From jcarlson at uci.edu  Fri Aug  4 23:02:28 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 04 Aug 2006 14:02:28 -0700
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <1d85506f0608041321h5a3b1d76gfae5bca45c37ff7e@mail.gmail.com>
References: <20060804121614.E6D4.JCARLSON@uci.edu>
	<1d85506f0608041321h5a3b1d76gfae5bca45c37ff7e@mail.gmail.com>
Message-ID: <20060804134148.E6D7.JCARLSON@uci.edu>


"tomer filiba" <tomerfiliba at gmail.com> wrote:
> 
> > I'm curious as to what I have done to deserve the rudeness of your reply.
> well, i'm kinda pissed off by rockets flying over my house, svn giving me a
> hard life, and what not. but what you have done was dismissing my post on
> shaky grounds.

Ick.  I can understand how you are frustrated.

> > According to recent unrelated research with regard to the Win32 API,
> > most thread-killing methods (if not all?) leave the thread state broken
> > in such a way that the only way to fix it is to close down the process.
> > Then again, I could be misremembering; the Win32 API is huge.
> 
> that may be so, but my suggestion wasn't *killing* the thread directly -
> i'm sure one can use win32api to forcefully kill threads.
> my idea, which is loosely based on dotNET (perhaps also applicable in java),
> was raising a ThreadExit exception in the context of the given thread.
> that way, the exception propagates up normally, and will eventually cause
> the thread's main function to exit silently, unless caught (just as it works
> today).
> 
> the issue here is raising the exception in *another* thread (externally);
> this could only be done from a builtin-function (AFAIK); the rest of the
> mechanisms are already in place.

One of the use-cases you specified was that C calls could perhaps be
aborted (an artificial timeout).

Does there exist a mechanism that is able to abort the execution of C
code from another C thread without killing the process?  If so, then
given that the C could be aborted at literally any point of execution,
how could any cleanup be done?


 - Josiah


From qrczak at knm.org.pl  Fri Aug  4 23:42:07 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Fri, 04 Aug 2006 23:42:07 +0200
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>
	(tomer filiba's message of "Fri, 4 Aug 2006 17:36:40 +0200")
References: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>
Message-ID: <87r6zwp49c.fsf@qrnik.zagroda>

"tomer filiba" <tomerfiliba at gmail.com> writes:

> once a thread is created, there is no way to kill it *externally*.
> which is a pity, since the thread must be "willing" to die,

Doing that unconditionally is impractical: the thread has no way
to protect itself from being killed at moments when invariants of
shared data are temporarily violated.

I agree that it should not require continuous checking for a
thread-local "ask to terminate" flag spread into all potentially
long-running loops, i.e. it requires a language mechanism. But it
must be temporarily blockable and catchable.

Here is how I think the design should look:
http://www.cs.ioc.ee/tfp-icfp-gpce05/tfp-proc/06num.pdf

This is the same issue as with other asynchronous exceptions like ^C.
What has happened to Freund's & Mitchell's "Safe Asynchronous Exceptions
For Python" <http://www.cs.williams.edu/~freund/papers/02-lwl2.ps>?
My design is an extension of that.

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From jcarlson at uci.edu  Sat Aug  5 00:16:33 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 04 Aug 2006 15:16:33 -0700
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <87r6zwp49c.fsf@qrnik.zagroda>
References: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>
	<87r6zwp49c.fsf@qrnik.zagroda>
Message-ID: <20060804150338.E6DA.JCARLSON@uci.edu>


"Marcin 'Qrczak' Kowalczyk" <qrczak at knm.org.pl> wrote:
> "tomer filiba" <tomerfiliba at gmail.com> writes:
> 
> > once a thread is created, there is no way to kill it *externally*.
> > which is a pity, since the thread must be "willing" to die,
> 
> Doing that unconditionally is impractical: the thread has no way
> to protect itself from being killed at moments it has invariants of
> shared data temporarily violated.
> 
> I agree that it should not require continuous checking for a
> thread-local "ask to terminate" flag spread into all potentially
> long-running loops, i.e. it requires a language mechanism. But it
> must be temporarily blockable and catchable.
> 
> Here is how I think the design should look like:
> http://www.cs.ioc.ee/tfp-icfp-gpce05/tfp-proc/06num.pdf

I did not read all of that paper, but it seems to rely on the
(un)masking of signals in threads, as well as the sending of signals to
'kill' a thread.  One problem is that Windows doesn't really allow the
sending/receiving of any non-process-killing signals, so it would be a
platform-specific feature.

If you want a sample implementation of that kind of thing, SAGE (http://modular.math.washington.edu/sage/) performs signal
masking/unmasking to stop the execution of underlying computation
threads.

 - Josiah


From qrczak at knm.org.pl  Sat Aug  5 12:29:59 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Sat, 05 Aug 2006 12:29:59 +0200
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <20060804150338.E6DA.JCARLSON@uci.edu> (Josiah Carlson's
	message of "Fri, 04 Aug 2006 15:16:33 -0700")
References: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>
	<87r6zwp49c.fsf@qrnik.zagroda> <20060804150338.E6DA.JCARLSON@uci.edu>
Message-ID: <87ac6j326w.fsf@qrnik.zagroda>

Josiah Carlson <jcarlson at uci.edu> writes:

> I did not read all of that paper, but it seems to rely on the
> (un)masking of signals in threads, as well as the sending of signals
> to 'kill' a thread.

They are not OS signals: it's entirely the matter of the language's
runtime system.

(But Unix signals can be nicely exposed as these signals for the
programmer.)

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From robinbryce at gmail.com  Mon Aug  7 17:11:22 2006
From: robinbryce at gmail.com (Robin Bryce)
Date: Mon, 7 Aug 2006 16:11:22 +0100
Subject: [Python-3000] improved threading in py3k
In-Reply-To: <87ac6j326w.fsf@qrnik.zagroda>
References: <1d85506f0608040836g1ccd894ck6b4a7b0607e7cd36@mail.gmail.com>
	<87r6zwp49c.fsf@qrnik.zagroda> <20060804150338.E6DA.JCARLSON@uci.edu>
	<87ac6j326w.fsf@qrnik.zagroda>
Message-ID: <bcf87d920608070811h37e37047r2f0fe49710956303@mail.gmail.com>

On 05/08/06, Marcin 'Qrczak' Kowalczyk <qrczak at knm.org.pl> wrote:
> Josiah Carlson <jcarlson at uci.edu> writes:
>
> > I did not read all of that paper, but it seems to rely on the
> > (un)masking of signals in threads, as well as the sending of signals
> > to 'kill' a thread.
>
> They are not OS signals: it's entirely the matter of the language's
> runtime system.
>

Have you come across the Pi-Calculus ? Every time I see this topic
come up (GIL, threads, concurrency) it seems to founder on the fact[1]
that this can not be solved without language support. This is not
unique to python[2].

The thing that caught my attention with the Pi-Calculus is that it
does not draw artificial lines between os processes, threads,
functional program units or data parameters and it starts out by
demonstrating very clearly why language equivalence (deterministic
automata a == DAb) does not prevent *very* annoying behavioural
differences.

A result of the work (as far as I understood it) is that all can be
treated as equivalent and strong formal tools are given for both
modeling the interactions and proving things like behavioral
equivalence. The book[4] references work done to show this is viable
in interpreted/objecty languages as well as functional ones. Coming
back a little way towards planet earth I remember the last time this
sort of thing came up, someone half-heartedly suggested "active objects
with messaging"[3] and things died off. Python has always struck me as
a language for pragmatists, rather than a place to play about with
esoteric academic curiosities. Maybe someone on this list can pick
something useful to py3k out of Pi-calculus ?

<bait-mode>quoting:http://www.python.org/dev/summary/2005-09-16_2005-09-30.html#concurrency-in-python

Guido threw down the gauntlet: rather than the endless discussion
about this topic, someone should come up with a GIL-free Python (not
necessarily CPython) and demonstrate its worth.


[1]  err, ok I can't locate the paper that shows this but I *swear*
someone far better qualified than me has written one to this effect.
[2] http://www.decadentplace.org.uk/pipermail/cpp-threads/2005-October/000715.html

[3] http://www.python.org/dev/summary/2005-09-16_2005-09-30.html#concurrency-in-python

also, http://aspn.activestate.com/ASPN/Cookbook/Python/Recipe/365292

[4] http://www.amazon.com/gp/product/0521658691/ref=si3_rdr_err_product/002-5641420-6196034?ie=UTF8

Cheers,

Robin

From talin at acm.org  Tue Aug  8 18:49:08 2006
From: talin at acm.org (Talin)
Date: Tue, 08 Aug 2006 09:49:08 -0700
Subject: [Python-3000] Set literals - another try
Message-ID: <44D8C084.8090503@acm.org>

Part 1: The concrete proposal part.

I noticed that a lot of folks seemed to like the idea of making the 
empty set resemble the greek letter Phi, using a combination of 
parentheses and the vertical bar or forward slash character.

So let's expand on this: slice Phi in half and say that (| and |) are 
delimiters for a set literal, as follows:

    (|)     # Empty set

    (|a|)   # Set with 1 item

    (|a,b|) # Set with 2 items

The advantage of this proposal is that it maintains visual consistency 
between the 0, 1, and N element cases.
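For reference, here is how the proposed forms line up with the spellings
available in then-current Python (the (|...|) syntax itself is hypothetical
and does not parse):

```python
# Hypothetical proposed form  vs.  today's spelling
empty = set()              # (|)
single = set(['a'])        # (|a|)
pair = set(['a', 'b'])     # (|a,b|)

print(sorted(pair))   # -> ['a', 'b']
```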


Part 2: The idle speculation part, not to be considered as an actual 
proposal.

I've often said that "whenever a programmer has the urge to invent a new 
programming language, that they should lie down on the couch until the 
feeling passes".

One of the reasons for this is that many times, a programmer's 
motivation in creating a new language is not that they actually need a 
new language, but rather as a means of *criticising* an existing 
language. Inventing their own language gives them the opportunity to 
show how they would have done it.

I think that kind of criticism can be valid, and that languages invented 
for this purpose can be useful, as long as you don't actually sit down 
and try to implement the thing.

As a thought experiment, I decided to apply this idea to the Python set 
literal case - i.e. if we were going to do a massive "do over" of 
Python, how would we approach the problem of set literals?

The syntax that comes to mind is something like this:

    a = b|c

Where the vertical bar character means "forms a set with". Larger sets 
could be made using the same syntax:

    a = b|c|c|d

You can also wrap parens around the set if you want:

    a = (b|c)

Like tuples, a set with a single member still requires at least one 
delimiter:

    a = (b|)

And the for the empty set, we're back to phi again:

    a = (|)

However, the parens aren't generally required - the rules are pretty 
much the same as for tuples and the comma operator. Thus, passing sets 
as arguments:

    index = s.find_first_of( 'a'|'b'|'c'|'d' )

Of course, by doing this, we're re-assigning the meaning of the '|' 
operator from 'bitwise or' to 'set construction'. This only makes sense 
if you assume that either (a) set construction is more common than 
bitwise-or operations or (b) you provide some reasonable alternative way 
to express bitwise-or operations. Let's assume that we create some 
reasonable replacement and move on.

Another thing to note is that the set construction operator resembles in 
some ways the "alternative" operator of BNF notation. In the previous 
example, 'find_first_of' looks for the first of the given alternatives.

Since dictionaries are similar to sets, we can represent a dictionary as 
a set of keys and associated values. Dictionary literals already use the 
':' operator to indicate a key - we can continue that with:

    a = ('Monday':1 | 'Tuesday':2 | 'Wednesday':3)

Unlike the current language, however, you can omit the parens:

    a = 'Monday':1 | 'Tuesday':2 | 'Wednesday':3

(This creates a syntax ambiguity with colon, but let's move on :)

One of the fun things about this line of speculation is watching how 
such a tiny change ripples outward, affecting the entire language 
definition. In this case, the change to set construction has much 
farther-reaching effects than what I have described here, assuming that 
you take each effect to its logical conclusion. I find it an enjoyable 
mental exercise :)

-- Talin

From talin at acm.org  Tue Aug  8 18:52:36 2006
From: talin at acm.org (Talin)
Date: Tue, 08 Aug 2006 09:52:36 -0700
Subject: [Python-3000] Range literals
Message-ID: <44D8C154.9020406@acm.org>

I've seen some languages that use a double-dot (..) to mean a range of 
items. This could be syntactic sugar for range(), like so:


    for x in 1..10:
       ...

-- Talin

From jcarlson at uci.edu  Tue Aug  8 19:36:40 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 08 Aug 2006 10:36:40 -0700
Subject: [Python-3000] Set literals - another try
In-Reply-To: <44D8C084.8090503@acm.org>
References: <44D8C084.8090503@acm.org>
Message-ID: <20060808100536.E706.JCARLSON@uci.edu>


Talin <talin at acm.org> wrote:
> 
> Part 1: The concrete proposal part.
> 
> I noticed that a lot of folks seemed to like the idea of making the 
> empty set resemble the greek letter Phi, using a combination of 
> parentheses and the vertical bar or forward slash character.
> 
> So lets expand on this: slice Phi in half and say that (| and |) are 
> delimiters for a set literal, as follows:
> 
>     (|)     # Empty set
> 
>     (|a|)   # Set with 1 item
> 
>     (|a,b|) # Set with 2 items
> 
> The advantage of this proposal is that it maintains visual consistency 
> between the 0, 1, and N element cases.

That's quite a bit of punctuation to define a set literal.  In fact, for
1+ element sets, it's only 1 character shy of the set() punctuation,
while also being more difficult to type on at least US keyboards.

And if I remember my set math correctly, phi wasn't the character
generally used; it was usually a zero with a diagonal cross through it,
making (/) a better empty set literal.  But from there, the notation
devolves into a place I don't want to go.


> Part 2: The idle speculation part, not to be considered as a actual 
> proposal.
> 
> I've often said that "whenever a programmer has the urge to invent a new 
> programming language, that they should lie down on the couch until the 
> feeling passes".

Presumably you again don't remember the source of this quote, but it is
still applicable.


> As a thought experiment, I decided to apply this idea to the Python set 
> literal case - i.e. if we were going to do a massive "do over" of 
> Python, how would we approach the problem of set literals?
> 
> The syntax that comes to mind is something like this:
> 
>     a = b|c

The pipe character/bitwise or operator doesn't say to me "make a set".

Knowing what I do about set math, the only literal that really makes
sense to me is:
    {a,b,c,...}

With the empty set being:
    {/}

Interestingly enough, the non-empty set case has already been proposed,
and if I remember correctly, was generally liked, except for some
ambiguity with regard to dictionary literals.

I personally don't see much of a use for set literals, considering that
there is a non-ambiguous spelling of it currently; set(...), whose only
cost above and beyond that of a set literal is a global name lookup.  It
is 'different' from some other first-class objects (tuple, list,
dictionary, string, unicode, ...), but other first-class objects also
require such spelling: bool, enumerate, iter, len, property, reduce.
Each of those may be used often enough to justify syntax of its own,
though perhaps only len has an obvious candidate: |obj| -> len(obj).
Then again, |obj| could also mean abs(obj), though presumably an object
would only ever define __len__ or __abs__, not both.  I digress.


-.5 for a set literal syntax at all, -1 for offering your particular set
literal variant, -2 for your change propagating to dictionaries and
beyond.

 - Josiah


From jcarlson at uci.edu  Tue Aug  8 19:44:17 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 08 Aug 2006 10:44:17 -0700
Subject: [Python-3000] Range literals
In-Reply-To: <44D8C154.9020406@acm.org>
References: <44D8C154.9020406@acm.org>
Message-ID: <20060808104049.E709.JCARLSON@uci.edu>


Talin <talin at acm.org> wrote:
> 
> I've seen some languages that use a double-dot (..) to mean a range of 
> items. This could be syntactic sugar for range(), like so:
> 
> 
>     for x in 1..10:
>        ...

In the pronouncement on PEP 284: http://www.python.org/dev/peps/pep-0284/

    Guido did not buy the premise that the range() format needed fixing,
    "The whole point (15 years ago) of range() was to *avoid* needing syntax
    to specify a loop over numbers. I think it's worked out well and there's
    nothing that needs to be fixed (except range() needs to become an
    iterator, which it will in Python 3.0)."

Unless Guido has decided that range/xrange are the wrong way to do
things, I don't think there is much discussion here.

 - Josiah


From tomerfiliba at gmail.com  Tue Aug  8 20:22:24 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Tue, 8 Aug 2006 20:22:24 +0200
Subject: [Python-3000] threading, part 2
Message-ID: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>

let me bring this up anew, as the previous discussion has gone quite off
track.
i know there are many theories/paradigms concerning parallel execution,
some requiring language-level constructs, others being external, and let's
not ever start talking about the GIL.

(on a side note, if i may add my opinion on the subject matter, stackless
python has the best approach to concurrency -- don't lock, yield!)

my previous suggestion asked for a means to raise exceptions in the
context of *other* threads. all it calls for is a new builtin function
that would raise a given exception in the context of a given thread.

there are some points to address:
* native calls -- well, calls into builtin functions can't be interrupted
that way, which is problematic, but not directly related to this proposal.
that's a problem of machine code.

* breaking the thread's state -- that's not really an issue. i'm not talking
about *forcefully* killing the thread, without cleanup.

after all, exceptions can occur anywhere in the code, and at any time...
your code should always be aware of that, with no regard to being
thread-safe.

for example:
def f(a, b):
    return a + b

an innocent function, but now suppose i pass two huge strings... bad input
can cause MemoryError, although unlikely. you can't take care of
*everything*,
you must learn to live with the occasional unexpected exception.

so it may seem brutal to suggest a mechanism that raises exceptions
at arbitrary points in your code-flow, but:
* cleanup will be performed (objects will be reclaimed)
* you can handle it anywhere in the call chain (just as any other exception)
* most of the time, i'd use that to *kill* threads (the ThreadExit exception),
so i don't expect the thread to recover. it should just die silently.
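for what it's worth, CPython already has a C-level function with exactly
this behavior, PyThreadState_SetAsyncExc; a sketch of exposing it through
ctypes (ThreadExit and async_raise are illustrative names, not an existing
API, and this is CPython-specific):

```python
import ctypes
import threading
import time

class ThreadExit(Exception):
    """Illustrative exception asking a thread to unwind and exit."""

def async_raise(tid, exc_type):
    # CPython-only: schedule exc_type to be raised in the thread whose
    # ident is tid, the next time that thread executes bytecode.
    n = ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_ulong(tid), ctypes.py_object(exc_type))
    if n == 0:
        raise ValueError("invalid thread id")
    if n > 1:
        # more than one thread state matched; clear and complain
        ctypes.pythonapi.PyThreadState_SetAsyncExc(ctypes.c_ulong(tid), None)
        raise SystemError("set async exc affected %d threads" % n)

log = []

def worker():
    try:
        while True:               # pure-Python loop, so the exception can land
            time.sleep(0.001)
    except ThreadExit:
        log.append("cleanup ran")  # handlers and finally clauses do run

t = threading.Thread(target=worker)
t.start()
time.sleep(0.1)
async_raise(t.ident, ThreadExit)
t.join(5)
print(log)   # -> ['cleanup ran']
```

note that the exception is only delivered between bytecodes, so a thread
blocked inside a native call won't see it until the call returns -- the
same "native calls" caveat as above.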


sounds better now?


-tomer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060808/ee3aa259/attachment.htm 

From qrczak at knm.org.pl  Tue Aug  8 21:05:24 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Tue, 08 Aug 2006 21:05:24 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	(tomer filiba's message of "Tue, 8 Aug 2006 20:22:24 +0200")
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
Message-ID: <877j1jf3pn.fsf@qrnik.zagroda>

"tomer filiba" <tomerfiliba at gmail.com> writes:

> after all, exceptions can occur anywhere in the code, and at any time...

It's impossible to write safe code when exceptions can occur at any
time, except when you already happen have the needed atomic primitives
available.

Let's say we have a mutable doubly linked list (the list has first
and last pointers, each node has next and prev pointers). Please show
how to append a first node if exceptions can occur at any time. Not
adding the element at all if an asynchronous exception is coming is
acceptable, but corrupting the list structure is not.
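The challenge can be made concrete. A minimal doubly linked list
(illustrative code, not from any message) has exactly the windows Marcin
describes: between the two pointer updates, an asynchronous exception
would leave the structure corrupt, and no try/finally in the appending
code helps, because the exception can arrive between any two bytecodes.

```python
class Node:
    def __init__(self, value):
        self.value = value
        self.prev = None
        self.next = None

class DList:
    def __init__(self):
        self.first = None
        self.last = None

    def append(self, value):
        node = Node(value)
        if self.last is None:
            self.first = node
            # an async exception HERE leaves first set but last None
            self.last = node
        else:
            node.prev = self.last
            self.last.next = node
            # an async exception HERE leaves last pointing at the old tail
            self.last = node

    def values(self):
        out, node = [], self.first
        while node is not None:
            out.append(node.value)
            node = node.next
        return out

d = DList()
d.append(1)
d.append(2)
print(d.values())   # -> [1, 2]
```

Making append safe against an exception arriving between any two lines
requires either masking asynchronous exceptions around the update or an
atomic primitive -- which is exactly the point of the challenge.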

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From pje at telecommunity.com  Tue Aug  8 21:30:30 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Tue, 08 Aug 2006 15:30:30 -0400
Subject: [Python-3000] Cross-interpreter FFI for Python 3000?
In-Reply-To: <A1C50561B5543D45A91A0E5505A79B98058FE731@luke.radius.ad>
Message-ID: <5.1.1.6.0.20060808151352.02604b30@sparrow.telecommunity.com>

[Note: Discussion should move to the python-3000 list]

At 11:28 AM 8/8/2006 -0700, Paul Prescod wrote:
>I'll use up a little bit of my post-conference goodwill to push a
>long-term obsession of mine...using a Python variant as the "standard"
>extension/FFI model for Python (3000). I've heard variants of this idea
>from many people I respect, some of who are cc:ed.
>
>I want to guage interest before doing any next steps. If it's
>preemptively -1 then I won't bother. Therefore I would like to poll the
>assembled brains about the feasibility of using something like
>RPython/Pyrex as an abstraction layer to be compiled to Py2.5 PyObjects,
>Py3000 PyObjects, JNI, .NET, ...
>
>Rationale:
>
>Each Python implementation needs an FFI. Any Python without a C-oriented
>FFI lacks compatibility with C modules like Numeric and PIL. For this
>reason, PyPy re-invented something like Pyrex as RPython.

Just FYI, but if I understand correctly, PyPy is now using the ctypes API 
for its FFI.  Also, RPython is entirely unrelated to Pyrex.  RPython is 
Python with restrictions on how it's used, and doesn't include an FFI of 
its own.

I would suggest that PyPy's use of ctypes, coupled with the inclusion of 
ctypes in the Python 2.5 stdlib, means that ctypes could reasonably be 
considered a de facto standard for a C FFI in Python at this point.  While I 
*like* Pyrex a lot and use it for most extension modules I write, it is 
currently heavily tied to the CPython API, lacks many Python features that 
even RPython allows, invents its own object model for C inheritance and 
imports, and has a lot of quirks due to being "not quite Python" in syntax 
or semantics.  These characteristics are undesirable for a 
cross-interpreter FFI, IMO.

A major advantage of using ctypes as the FFI, however, is that ctypes is a 
library, and thus does not require language or interpreter changes.  This 
means, for example, that a third party could implement a ctypes clone for 
Jython or IronPython without burdening the core developers of those 
interpreters.
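For readers unfamiliar with ctypes' style of FFI, here is a minimal
example of the library-not-language approach described above, calling the
C runtime's strlen (library lookup is platform-dependent; this assumes a
Unix-like system where libc can be located):

```python
import ctypes
import ctypes.util

# Locate and load the C runtime.  If find_library returns None, CDLL(None)
# falls back to the process's global namespace, which also works on Unix.
libc = ctypes.CDLL(ctypes.util.find_library("c"))

# Declare the foreign function's signature before calling it.
libc.strlen.argtypes = [ctypes.c_char_p]
libc.strlen.restype = ctypes.c_size_t

print(libc.strlen(b"hello"))   # -> 5
```

No compiler, wrapper module, or interpreter change is involved, which is
the "dynamic advantage" at issue in this thread.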


>  The two are
>obviously not identical but I'm looking at the core idea of a language
>that merges Python and C concepts to achieve a usable extension
>mechanism. I overheard Jim musing about something similar for
>IronPython.
>
>But most important: Python 3000 needs something like Pyrex. Python 3000
>and Python 2.6, 2.7, 2.8 may be arbitrarily different internally. If the
>goal is for it to be "just a bit" incompatible then Guido's design space
>is quite constrained. If it is allowed to be massively incompatible then
>extension authors will scream. The Python 2.x line will co-exist with
>the Python 3000 line for a while, and both with co-exist with
>IronPython, Jython, PyPy and others.

It would probably be best if you catch up on the current work by the PyPy 
team in this area, since my understanding is that PyPy is now able to 
compile "RPython+ctypes" code to create CPython extensions in C.  This 
suggests that it should be possible to build backends for C# and Java, because 
(again, if I understand correctly) the ctypes handling is done at a 
relatively high level of the translation tool chain, such that the backend 
code generators don't need to know anything about ctypes.  Hopefully Armin 
or somebody else will jump in on this point if I'm getting something wrong 
about all that.


>  * it would be simpler to write competitive Python interpreters to test
>out different design ideas...one wouldn't have to worry that such an
>interpreter would be inherently a toy because of the unavailability of
>third-party software

Note that this is also a goal of the PyPy project, and they have many such 
options now, such as "pure" GC and refcounted variants, even if you 
entirely ignore the part where backends can generate code for a variety of 
languages.


From jimjjewett at gmail.com  Tue Aug  8 21:31:37 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Tue, 8 Aug 2006 15:31:37 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
Message-ID: <fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>

On 8/8/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> my previous suggestion asked for is a means to raise exceptions in the
> context of *other* threads.

...

> * breaking the thread's state -- that's not really an issue. i'm not talking
> about *forcefully* killing the thread, without cleanup.

This has the same inherent problem as Java's Thread.stop -- that data
shared beyond the thread may be left in an inconsistent state because
the cleanup wasn't done, perhaps because a lock was held.

https://java.sun.com/j2se/1.4.2/docs/guide/misc/threadPrimitiveDeprecation.html

> so it's may seem brute to suggest a mechanism that raises exceptions
> at arbitrary points in your code-flow, but:

If you're willing to forget about native code (and you suggested that
you were), then you could just check[*] every N bytecodes, the way the
interpreter already checks to decide whether it should switch
threads.  Whether the performance overhead is worthwhile is a
different question.

It might be better to just add an example thread to threading.py (or
Queue) that does its processing in a loop, and checks its own stop
variable every time through the loop.
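That cooperative pattern can be sketched with the threading module
(written in present-day spelling; the class and names are illustrative,
not an actual stdlib addition):

```python
import queue
import threading
import time

class StoppableWorker(threading.Thread):
    """Loops over a work queue, re-checking a stop flag each iteration."""

    def __init__(self, work_queue):
        super().__init__()
        self.work = work_queue
        self.results = []
        self._stop_requested = threading.Event()

    def stop(self):
        self._stop_requested.set()

    def run(self):
        while not self._stop_requested.is_set():
            try:
                # Bounded wait, so the stop flag is re-checked even
                # when the queue stays empty.
                item = self.work.get(timeout=0.1)
            except queue.Empty:
                continue
            self.results.append(item * 2)

q = queue.Queue()
w = StoppableWorker(q)
w.start()
for i in range(3):
    q.put(i)
time.sleep(0.5)
w.stop()
w.join()
print(w.results)   # -> [0, 2, 4]
```

The thread only dies at iteration boundaries, so shared data is never
left half-updated -- at the cost of the stop being best-effort rather
than immediate.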

[*] What to do in case of a raise is a bit trickier, of course --
basically, replace the next bytecode with a RAISE_VARARGS bytecode,
but that might violate some current try-except assumptions.

-jJ

From collinw at gmail.com  Tue Aug  8 21:50:20 2006
From: collinw at gmail.com (Collin Winter)
Date: Tue, 8 Aug 2006 15:50:20 -0400
Subject: [Python-3000] Set literals - another try
In-Reply-To: <20060808100536.E706.JCARLSON@uci.edu>
References: <44D8C084.8090503@acm.org> <20060808100536.E706.JCARLSON@uci.edu>
Message-ID: <43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>

On 8/8/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> I personally don't see much of a use for set literals, considering that
> there is a non-ambiguous spelling of it currently; set(...), whose only
> cost above and beyond that of a set literal is a global name lookup.

I thought one of the main arguments in favor of set literals is that a
literal form would allow the compiler to perform optimisations that
the set(...) spelling doesn't allow.

Collin Winter

From jcarlson at uci.edu  Tue Aug  8 22:21:39 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 08 Aug 2006 13:21:39 -0700
Subject: [Python-3000] Set literals - another try
In-Reply-To: <43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>
References: <20060808100536.E706.JCARLSON@uci.edu>
	<43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>
Message-ID: <20060808131458.E70C.JCARLSON@uci.edu>


"Collin Winter" <collinw at gmail.com> wrote:
> 
> On 8/8/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > I personally don't see much of a use for set literals, considering that
> > there is a non-ambiguous spelling of it currently; set(...), whose only
> > cost above and beyond that of a set literal is a global name lookup.
> 
> I thought one of the main arguments in favor of set literals is that a
> literal form would allow the compiler to perform optimisations that
> the set(...) spelling doesn't allow.

The optimization argument used to define language syntax seems a bit
like the "tail wagging the dog" cliche.  For immutable literals that are
used a huge number of times (int, tuple, and other immutables), a
literal syntax for compiler optimization makes sense.  But for mutables
(list, dict, etc.), literal syntax is more a convenience than an
optimization: the compiler hasn't historically created the object once and
copied it for re-use, but pushed values onto the stack and called the
relevant build-list bytecode. [1]

 - Josiah


[1]
Python 2.4.3 (#69, Mar 29 2006, 17:35:34) [MSC v.1310 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import dis
>>> def foo():
...     return [1,2,3]
...
>>> def goo():
...     return (1,2,3)
...
>>> dis.dis(foo)
  2           0 LOAD_CONST               1 (1)
              3 LOAD_CONST               2 (2)
              6 LOAD_CONST               3 (3)
              9 BUILD_LIST               3
             12 RETURN_VALUE
>>> dis.dis(goo)
  2           0 LOAD_CONST               4 ((1, 2, 3))
              3 RETURN_VALUE
>>>


From tjreedy at udel.edu  Tue Aug  8 23:12:25 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Tue, 8 Aug 2006 17:12:25 -0400
Subject: [Python-3000] Cross-interpreter FFI for Python 3000?
References: <A1C50561B5543D45A91A0E5505A79B98058FE731@luke.radius.ad>
	<5.1.1.6.0.20060808151352.02604b30@sparrow.telecommunity.com>
Message-ID: <ebaunp$k54$1@sea.gmane.org>

For those as ignorant as I was, FFI does not here mean
Friendly File Interface
Fauna and Flora International
Family Firm Institute
Forsvarets forskningsinstitutt
Film Finances, Inc.
Financial Freedom Institute
Focus on the Family Institute
...
(all but the first from Google)

but Foreign Function Interface
(from the PHP FFI package).

> I would suggest that PyPy's use of ctypes, coupled with the inclusion of
> ctypes in the Python 2.5 stdlib, means that ctypes could reasonably be
> considered a defacto standard for a C FFI in Python at this point.

Intriguing idea.  I know that the Pygame folks, for example, are 
experimenting with rewrapping the SDL (Simple Directmedia Library, the core 
of Pygame) in ctypes.

Terry Jan Reedy




From guido at python.org  Tue Aug  8 23:31:59 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 8 Aug 2006 14:31:59 -0700
Subject: [Python-3000] Cross-interpreter FFI for Python 3000?
In-Reply-To: <ebaunp$k54$1@sea.gmane.org>
References: <A1C50561B5543D45A91A0E5505A79B98058FE731@luke.radius.ad>
	<5.1.1.6.0.20060808151352.02604b30@sparrow.telecommunity.com>
	<ebaunp$k54$1@sea.gmane.org>
Message-ID: <ca471dc20608081431p22afb2d4td15121228db9c1a@mail.gmail.com>

On 8/8/06, Terry Reedy <tjreedy at udel.edu> wrote:
> > I would suggest that PyPy's use of ctypes, coupled with the inclusion of
> > ctypes in the Python 2.5 stdlib, means that ctypes could reasonably be
> > considered a defacto standard for a C FFI in Python at this point.
>
> Intriguing idea.  I know that the Pygame folks, for example, are
> experimenting with rewrapping the SDL (Simple Directmedia Library, the core
> of Pygame) in ctypes.

Isn't a problem with ctypes that such extensions can no longer
guarantee "no segfaults"? This pretty much completely rules them out
for use in sandboxes such as what Brett Cannon is currently working
on. With hand-written extensions at least you can audit them to decide
whether they are safe enough.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul.prescod at xmetal.com  Tue Aug  8 23:45:18 2006
From: paul.prescod at xmetal.com (Paul Prescod)
Date: Tue, 8 Aug 2006 14:45:18 -0700
Subject: [Python-3000] Cross-interpreter FFI for Python 3000?
Message-ID: <A1C50561B5543D45A91A0E5505A79B98058FE9D6@luke.radius.ad>

>...
> 
> Just FYI, but if I understand correctly, PyPy is now using 
> the ctypes API for its FFI.  Also, RPython is entirely 
> unrelated to Pyrex.  RPython is Python with restrictions on 
> how it's used, and doesn't include an FFI of its own.

As you said elsewhere, PyPy can compile an Rpython+rctypes program to a
C file, just as Pyrex does. So I don't understand why you see them as
"entirely unrelated". There are different syntaxes, but the goals are
very similar. Pyrex uses optional type declarations (which are planned
for Python 3000). RPython infers types from rctypes API calls (which
will also be available in Python 3000). Perhaps it would be better if I
dropped the reference to Rpython and merely talked about "extcompiler"
their tool, which is very parallel to the Pyrex compiler?

You make some good points about Pyrex and ctypes. I'd rather explore the
design space after I've heard whether this design direction has the
potential to be fruitful. I infer that you think "yes".

 Paul Prescod

From pje at telecommunity.com  Wed Aug  9 00:40:15 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Tue, 08 Aug 2006 18:40:15 -0400
Subject: [Python-3000] Cross-interpreter FFI for Python 3000?
In-Reply-To: <A1C50561B5543D45A91A0E5505A79B98058FE9D6@luke.radius.ad>
Message-ID: <5.1.1.6.0.20060808180036.03b61bd8@sparrow.telecommunity.com>

At 02:45 PM 8/8/2006 -0700, Paul Prescod wrote:
>As you said elsewhere, PyPy can compile an Rpython+rctypes program to a
>C file, just as Pyrex does. So I don't understand why you see them as
>"entirely unrelated".

Disclaimer again: I like and use Pyrex; I even built additional support for 
it into setuptools.  Conversely, I've only used ctypes once and am not sure 
I care for its API.  But as a practical matter, these preferences are 
irrelevant; I will end up learning to use ctypes and liking it, and so will 
everybody else, because ctypes' *dynamic* advantage will clean Pyrex's 
clock at the very moment that extcompiler is as easy to use as Pyrex is now.

To summarize the differences, Pyrex is:

* A *Python-like* language, rather than Python
* Invents new inheritance/import facilities
* Imports various bits of syntax from C, including operators, pointers, etc.
* Inherently tied to the CPython API in its implementation
* Has its own system of "header" files for compile-time import/include
* Generates C code directly from Pyrex
* Cannot be executed by standard Python

PyPy's RPython+rctypes is:

* 100% Python, with certain dynamicity constraints
* Is not tied to any particular back end -- it can be translated to C, LLVM 
code, or even JavaScript if you like, as the type inference, annotation, 
and optimization machinery is backend-independent
* Code can be run in a normal Python interpreter if a ctypes library is 
available

The only relationship I see between the two are some overlap in use cases, 
and the letters "R", "P" and "Y" in the names.  :)  In particular, Pyrex 
cannot be used in the interpreter, and I can't see Guido allowing Pyrex's C 
syntax to infect Python-the-language, so this is likely to be a stable 
barrier keeping Pyrex from having this feature, unless Greg or somebody 
else decides to create a Pyrex interpreter, or perhaps an import hook that
translates Pyrex source code to Python bytecode that invokes the ctypes
API.  :)

(Note, by the way, that such an import hook/translator would be equally 
usable in PyPy, instantly making it possible to compile Pyrex to any 
backend supported by PyPy!  I suggest you let that idea sink in for a 
little bit, as it helps to illustrate why making ctypes the standard FFI is 
the One Obvious Way To Do It.)


>There are different syntaxes, but the goals are very similar.

Well, you could say that about Python and Ruby, to name just two.  Syntax 
is important.  :)

But that's also entirely ignoring the wide range of practical issues 
alluded to above, and some more I'll dig into below.


>Pyrex uses optional type declarations (which are planned
>for Python 3000). RPython infers types from rctypes API calls (which
>will also be available in Python 3000).

They're available in Python 2.5, which means code can be written for them 
today.  The dynamic usability of ctypes from interpreted Python means that 
Pyrex will become a historical footnote as soon as the 
RPython+rctypes->CPython translator is practically usable; i.e., when it 
can compete with Pyrex for code generation speed (and speed of generated 
code), installability, documentation, and user community.

At that point, the advantage of being able to debug your C interface using 
the interpreter's ctypes library, and then to compile the code only if/when 
you need to, will be a killer advantage.

IMO, it doesn't make sense to fight that now-inevitable future, either on 
behalf of Pyrex or some imagined "better" alternative; instead, we might as 
well hasten that future's arrival.  We can always provide better syntax for 
ctypes at a later date, the way 'classmethod' and friends arrived in Python 
2.2 but didn't get syntax until 2.4.  If you can't wait that long, write 
that import hook to turn Pyrex source into Python bytecode.  :)


>  Perhaps it would be better if I
>dropped the reference to Rpython and merely talked about "extcompiler"
>their tool, which is very parallel to the Pyrex compiler?

I'm at a bit of a loss as to how to explain how very not-useful that 
comparison is.  I would suggest reading up on PyPy architecture and Pyrex 
architecture a bit.  From an end-user perspective you can compare them as 
things that take Python-looking stuff in and spit C code out, but the devil 
is definitely in the details.  See also the lists I gave above.


>You make some good points about Pyrex and ctypes. I'd rather explore the
>design space after I've heard whether this design direction has the
>potential to be fruitful. I infer that you think "yes".

See http://dirtsimple.org/2005/10/children-of-lesser-python.html for what I 
think.  :)

In that article, I highlighted the absence of a standard Python FFI as 
being a stumbling block to the future evolution of the language, but noted 
that PyPy would likely end up with a solution.  The subsequent emergence of 
ctypes as an FFI shared by CPython and PyPy has already solved this 
problem; it is merely a question of recognizing the fact.

As of Python 2.5, anything else is going to have a serious uphill battle to 
fight -- even if it's something like Pyrex, that at least already *exists* 
and has at least *one* part-time maintainer.  A brand-new FFI invented by 
committee and with nobody yet stepping up to implement or maintain it, 
really has no chance at all.

(This is all IMO, of course, but I find it hard to imagine how anything 
else could succeed.)


From 2006a at usenet.alexanderweb.de  Wed Aug  9 01:08:50 2006
From: 2006a at usenet.alexanderweb.de (Alexander Schremmer)
Date: Wed, 9 Aug 2006 01:08:50 +0200
Subject: [Python-3000] Cross-interpreter FFI for Python 3000?
References: <A1C50561B5543D45A91A0E5505A79B98058FE731@luke.radius.ad>
	<5.1.1.6.0.20060808151352.02604b30@sparrow.telecommunity.com>
	<ebaunp$k54$1@sea.gmane.org>
	<ca471dc20608081431p22afb2d4td15121228db9c1a@mail.gmail.com>
Message-ID: <1ci9un3z1n806.dlg@usenet.alexanderweb.de>

On Tue, 8 Aug 2006 14:31:59 -0700, Guido van Rossum wrote:

> Isn't a problem with ctypes that such extensions can no longer
> guarantee "no segfaults"? 

How would you guarantee the "no segfaults" policy for any other bindings
involved?  In either case, auditing an extension written using ctypes or
rctypes is potentially simpler than looking at Pyrex or C code. (Think of
memory management, ref counting etc.)

> This pretty much completely rules them out for use in sandboxes such 
> as what Brett Cannon is currently working on.

Of course you will have severe problems if you allow somebody to do
unprotected calls to dynamic libraries.  But I am not sure that this is
a problem of using ctypes ... it should be possible to e.g. flag the code
using ctypes classes as being in a different security class than the
user-sandboxed code. Building the barrier on the C level might be too
restrictive in real world applications.

> With hand-written extensions at least you can audit them to decide
> whether they are safe enough.

Please elaborate on that point - why isn't a ctypes extension
"hand-written"?

Kind regards,
Alexander


From pedronis at strakt.com  Wed Aug  9 01:18:06 2006
From: pedronis at strakt.com (Samuele Pedroni)
Date: Wed, 09 Aug 2006 01:18:06 +0200
Subject: [Python-3000] Cross-interpreter FFI for Python 3000?
In-Reply-To: <ca471dc20608081431p22afb2d4td15121228db9c1a@mail.gmail.com>
References: <A1C50561B5543D45A91A0E5505A79B98058FE731@luke.radius.ad>	<5.1.1.6.0.20060808151352.02604b30@sparrow.telecommunity.com>	<ebaunp$k54$1@sea.gmane.org>
	<ca471dc20608081431p22afb2d4td15121228db9c1a@mail.gmail.com>
Message-ID: <44D91BAE.5040507@strakt.com>

Guido van Rossum wrote:
> On 8/8/06, Terry Reedy <tjreedy at udel.edu> wrote:
> 
>>>I would suggest that PyPy's use of ctypes, coupled with the inclusion of
>>>ctypes in the Python 2.5 stdlib, means that ctypes could reasonably be
>>>considered a defacto standard for a C FFI in Python at this point.
>>
>>Intriguing idea.  I know that the Pygame folks, for example, are
>>experimenting with rewrapping the SDL (Simple Directmedia Library, the core
>>of Pygame) in ctypes.
> 
> 
> Isn't a problem with ctypes that such extensions can no longer
> guarantee "no segfaults"? This pretty much completely rules them out
> for use in sandboxes such as what Brett Cannon is currently working
> on. With hand-written extensions at least you can audit them to decide
> whether they are safe enough.

In PyPy's rctypes approach the extensions still get compiled to C code,
and ctypes-like calls get resolved to normal C calls. Although at some
point a ctypes module is going to be exposed by PyPy, such an exposed
ctypes is not a requirement of the rctypes approach at all.
Rctypes gives ctypes-like C gluing to RPython, a different level
from normal application-level full Python. And indeed (although with
rough edges and some missing features at the moment) the PyPy tool-chain can
produce CPython extensions from such RPython+rctypes extension code.


From ncoghlan at gmail.com  Wed Aug  9 12:17:25 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 09 Aug 2006 20:17:25 +1000
Subject: [Python-3000] Set literals - another try
In-Reply-To: <43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>
References: <44D8C084.8090503@acm.org> <20060808100536.E706.JCARLSON@uci.edu>
	<43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>
Message-ID: <44D9B635.9010200@gmail.com>

Collin Winter wrote:
> On 8/8/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>> I personally don't see much of a use for set literals, considering that
>> there is a non-ambiguous spelling of it currently; set(...), whose only
>> cost above and beyond that of a set literal is a global name lookup.

> I thought one of the main arguments in favor of set literals is that a
> literal form would allow the compiler to perform optimisations that
> the set(...) spelling doesn't allow.

A different way to enable that would be to include a set of non-keyword names 
(a subset of the default builtin namespace) in the language definition that 
the compiler is explicitly permitted to treat as constants if they are not 
otherwise defined in the current lexical scope.

Then constant-folding could turn "len('abcde')" into 5, and "str(3+2)" into 
'5' and "set((1, 2, 3))" into the corresponding set object.
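[Illustrative sketch, not part of the original message: with today's ast module, this kind of builtin folding can be prototyped as a source-to-source pass. The name BuiltinFolder and the FOLDABLE whitelist are mine, purely for illustration.]

```python
import ast

# Hypothetical whitelist of builtin names the compiler may treat as constants.
FOLDABLE = {"len": len, "str": str}

class BuiltinFolder(ast.NodeTransformer):
    """Fold calls like len('abcde') whose arguments are all literals."""
    def visit_Call(self, node):
        self.generic_visit(node)
        if (isinstance(node.func, ast.Name)
                and node.func.id in FOLDABLE
                and not node.keywords
                and all(isinstance(a, ast.Constant) for a in node.args)):
            value = FOLDABLE[node.func.id](*[a.value for a in node.args])
            return ast.copy_location(ast.Constant(value), node)
        return node

def fold(expr):
    tree = BuiltinFolder().visit(ast.parse(expr, mode="eval"))
    return ast.unparse(ast.fix_missing_locations(tree))

print(fold("len('abcde')"))  # 5
print(fold("len(unknown)"))  # len(unknown) -- a non-literal can't be folded
```

A real compiler would additionally have to verify that the name is not shadowed in the current lexical scope, which is exactly the language-definition change proposed above.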

The only thing that would break is hacks like poking an alternate 
implementation of str or set or len into the global namespace from somewhere 
outside the module.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Wed Aug  9 12:45:08 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 09 Aug 2006 20:45:08 +1000
Subject: [Python-3000] threading, part 2
In-Reply-To: <fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
Message-ID: <44D9BCB4.5010404@gmail.com>

Jim Jewett wrote:
> On 8/8/06, tomer filiba <tomerfiliba at gmail.com> wrote:
>> my previous suggestion asked for is a means to raise exceptions in the
>> context of *other* threads.
> 
> ...
> 
>> * breaking the thread's state -- that's not really an issue. i'm not talking
>> about *forcefully* killing the thread, without cleanup.
> 
> This has the same inherent problem as Java's Thread.stop -- that data
> shared beyond the thread may be left in an inconsistent state because
> the cleanup wasn't done, perhaps because a lock was held.
> 
> https://java.sun.com/j2se/1.4.2/docs/guide/misc/threadPrimitiveDeprecation.html
> 
>> so it's may seem brute to suggest a mechanism that raises exceptions
>> at arbitrary points in your code-flow, but:
> 
> If you're willing to forget about native code (and you suggested that
> you were), then you could just check[*] every N bytecodes, the way the
> interpreters already checks to decide whether it should switch
> threads.  Whether the performance overhead is worthwhile is a
> different question.

That check is already there:

int PyThreadState_SetAsyncExc(	long id, PyObject *exc)
     Asynchronously raise an exception in a thread. The id argument is the 
thread id of the target thread; exc is the exception object to be raised. This 
function does not steal any references to exc. To prevent naive misuse, you 
must write your own C extension to call this. Must be called with the GIL 
held. Returns the number of thread states modified; if it returns a number 
greater than one, you're in trouble, and you should call it again with exc set 
to NULL to revert the effect. This raises no exceptions. New in version 2.3.

In Python 2.5, you can use ctypes to get at the whole C API from Python code, 
and calling thread.get_ident() in the run() method will allow you to find out 
the thread id of your thread (you'll need to save that value somewhere so 
other code can get at it).

All tomer is really asking for is a method on threading.Thread objects that
uses this existing API to set a builtin ThreadExit exception. The thread 
module would consider a thread finishing with ThreadExit to be 
non-exceptional, so you could easily do:

   th.terminate() # Raise ThreadExit in th's thread of control
   th.join() # Should finish up pretty quickly

Proper resource cleanup would be reliant on correct use of try/finally or with 
statements, but that's the case regardless of whether or not asynchronous 
exceptions are allowed.
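[Illustrative sketch, not part of the original message: the C API described above really is reachable from pure Python via ctypes on CPython. ThreadExit and async_raise are my names, not anything in the stdlib; note that on recent CPython versions the id argument is an unsigned long, and ctypes.pythonapi exists on CPython only.]

```python
import ctypes
import threading
import time

class ThreadExit(Exception):
    """Hypothetical exception a terminate() method might raise."""

def async_raise(tid, exc_type):
    # Wraps PyThreadState_SetAsyncExc as documented above (CPython only).
    n = ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_ulong(tid), ctypes.py_object(exc_type))
    if n > 1:
        # Per the docs: more than one thread state modified -- revert.
        ctypes.pythonapi.PyThreadState_SetAsyncExc(ctypes.c_ulong(tid), None)
        raise SystemError("PyThreadState_SetAsyncExc affected %d states" % n)

def worker(state):
    try:
        state["tid"] = threading.get_ident()  # save the id for other code
        while True:
            time.sleep(0.01)  # the exception arrives between bytecodes
    except ThreadExit:
        state["exited"] = True  # real cleanup belongs in finally: blocks

state = {}
th = threading.Thread(target=worker, args=(state,))
th.start()
while "tid" not in state:
    time.sleep(0.01)
async_raise(state["tid"], ThreadExit)  # th.terminate(), in effect
th.join(timeout=5)                     # should finish up pretty quickly
print(state.get("exited"))             # True
```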

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Wed Aug  9 12:57:12 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 09 Aug 2006 20:57:12 +1000
Subject: [Python-3000] Cross-interpreter FFI for Python 3000?
In-Reply-To: <5.1.1.6.0.20060808180036.03b61bd8@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060808180036.03b61bd8@sparrow.telecommunity.com>
Message-ID: <44D9BF88.6080705@gmail.com>

Phillip J. Eby wrote:
> (This is all IMO, of course, but I find it hard to imagine how anything 
> else could succeed.)

Having just made the point in another thread that it is possible to use ctypes 
to access the CPython API functions like PyThreadState_SetAsyncExc that have 
been designated "extension module only", I'm one who agrees with you - adding 
ctypes to the standard library effectively adopted it as Python's foreign 
function interface.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From jimjjewett at gmail.com  Wed Aug  9 18:36:24 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Wed, 9 Aug 2006 12:36:24 -0400
Subject: [Python-3000] Set literals - another try
In-Reply-To: <44D9B635.9010200@gmail.com>
References: <44D8C084.8090503@acm.org> <20060808100536.E706.JCARLSON@uci.edu>
	<43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>
	<44D9B635.9010200@gmail.com>
Message-ID: <fb6fbf560608090936q2df9d04dje5ea25ba835a0dbe@mail.gmail.com>

On 8/9/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
> A different way to enable that would be to include a set of non-keyword names
> (a subset of the default builtin namespace) in the language definition that
> the compiler is explicitly permitted to treat as constants if they are not
> otherwise defined in the current lexical scope.

Realistically, I want my own functions and class definitions to be
treated that way (inlinable) most of the time.  I don't want to start
marking them with "stable".

> The only thing that would break is hacks like poking an alternate
> implementation of str or set or len into the global namespace from somewhere
> outside the module.

So what we need is a module that either rejects changes (after it is
sealed) or at least provides notification (so things can be
recompiled).  In theory, this could even go into python 2.x (though
not as the default), though it is a bit difficult in practice.  (By
the time you can specify an alternative dict factory, it is too late.)
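[Illustrative sketch, not part of the original message: with today's CPython, a "sealed" module can be prototyped without touching the dict factory, by giving the module a subclass of types.ModuleType that rejects rebinding. SealedModule is my name, purely for illustration.]

```python
import types

class SealedModule(types.ModuleType):
    """A module that rejects rebinding of public names once sealed."""
    _sealed = False

    def __setattr__(self, name, value):
        if self._sealed and not name.startswith("_"):
            raise AttributeError(
                "module %r is sealed; cannot rebind %r" % (self.__name__, name))
        super().__setattr__(name, value)

mod = SealedModule("example")
mod.len = len        # fine: not sealed yet
mod._sealed = True   # underscore names stay writable, so we can seal
try:
    mod.len = str    # poking an alternate implementation in from outside
except AttributeError:
    print("rejected")
```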

-jJ

From guido at python.org  Wed Aug  9 20:36:32 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 9 Aug 2006 11:36:32 -0700
Subject: [Python-3000] Set literals - another try
In-Reply-To: <44D8C084.8090503@acm.org>
References: <44D8C084.8090503@acm.org>
Message-ID: <ca471dc20608091136p3d6ad23fi1b7fab8cf4bdcfde@mail.gmail.com>

On 8/8/06, Talin <talin at acm.org> wrote:
> Part 1: The concrete proposal part.
>
> I noticed that a lot of folks seemed to like the idea of making the
> empty set resemble the greek letter Phi, using a combination of
> parentheses and the vertical bar or forward slash character.
>
> So lets expand on this: slice Phi in half and say that (| and |) are
> delimiters for a set literal, as follows:
>
>     (|)     # Empty set
>
>     (|a|)   # Set with 1 item
>
>     (|a,b|) # Set with 2 items
>
> The advantage of this proposal is that it maintains visual consistency
> between the 0, 1, and N element cases.

-1.

This attempts to solve the lack of an empty set literal in the current
best proposal, which is set(), {1}, {1, 2}, {1, 2, 3} etc. But it does
so at the tremendous cost of inventing new unfamiliar brackets.

> Part 2: The idle speculation part, not to be considered as a actual
> proposal.
[...]
> The syntax that comes to mind is something like this:
>
>     a = b|c

This would be ambiguous since b|c also means set union.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug  9 20:43:50 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 9 Aug 2006 11:43:50 -0700
Subject: [Python-3000] Set literals - another try
In-Reply-To: <fb6fbf560608090936q2df9d04dje5ea25ba835a0dbe@mail.gmail.com>
References: <44D8C084.8090503@acm.org> <20060808100536.E706.JCARLSON@uci.edu>
	<43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>
	<44D9B635.9010200@gmail.com>
	<fb6fbf560608090936q2df9d04dje5ea25ba835a0dbe@mail.gmail.com>
Message-ID: <ca471dc20608091143jb4fa170pe6262da44c8165be@mail.gmail.com>

> On 8/9/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
> > A different way to enable that would be to include a set of non-keyword names
> > (a subset of the default builtin namespace) in the language definition that
> > the compiler is explicitly permitted to treat as constants if they are not
> > otherwise defined in the current lexical scope.

Right. This has been considered many times. I would love it if someone
wrote up a PEP for this.

On 8/9/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> Realistically, I want my own functions and class definitions to be
> treated that way (inlinable) most of the time.  I don't want to start
> marking them with "stable".

I'm not sure what you mean here. Inlining user code really isn't on
the table; it's unrealistic to expect this to happen any time soon
(especially since you're likely to want to inline things imported from
other modules too, and methods, etc.).

> > The only thing that would break is hacks like poking an alternate
> > implementation of str or set or len into the global namespace from somewhere
> > outside the module.

The PEP should consider this use case and propose a solution. I'm fine
with requiring a module to write

  len = len

near the top to declare that it wants len patchable.

OTOH for open I think the compiler should *not* inline this as it is
fairly common to monkey-patch it.

> So what we need is a module that either rejects changes (after it is
> sealed) or at least provides notification (so things can be
> recompiled).  In theory, this could even go into python 2.x (though
> not as the default), though it is a bit difficult in practice.  (By
> the time you can specify an alternative dict factory, it is too late.)

Recompilation upon notification seems way over the top; it's not like
anything we currently do or are even considering.

I'd much rather pick one of the following:

(a) if the module doesn't have a global named 'len' and you add one
(e.g. by "m.len = ...") the behavior is undefined

(b) module objects actively reject attempts to inject new globals that
would shadow built-ins in the list that Nick proposes. (BTW having
such a list is a good idea. Requiring the compiler to know about *all*
built-ins is not realistic since some frameworks patch the __builtin__
module.)

PS. Nick, how's the book coming along?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug  9 20:45:34 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 9 Aug 2006 11:45:34 -0700
Subject: [Python-3000] Set literals - another try
In-Reply-To: <43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>
References: <44D8C084.8090503@acm.org> <20060808100536.E706.JCARLSON@uci.edu>
	<43aa6ff70608081250m5e2f9b6fm547fe0b0fd265a48@mail.gmail.com>
Message-ID: <ca471dc20608091145v58a57d02y6e483db4b35ed029@mail.gmail.com>

On 8/8/06, Collin Winter <collinw at gmail.com> wrote:
> I thought one of the main arguments in favor of set literals is that a
> literal form would allow the compiler to perform optimisations that
> the set(...) spelling doesn't allow.

Let me clear up this misunderstanding. Optimizations have nothing to
do with it (they would be invalid anyway since sets are mutable). It's
a matter of writing more readable code.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug  9 20:53:31 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 9 Aug 2006 11:53:31 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <44D9BCB4.5010404@gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
	<44D9BCB4.5010404@gmail.com>
Message-ID: <ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>

On 8/9/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
> That check is already there:
>
> int PyThreadState_SetAsyncExc(  long id, PyObject *exc)
>      Asynchronously raise an exception in a thread. The id argument is the
> thread id of the target thread; exc is the exception object to be raised. This
> function does not steal any references to exc. To prevent naive misuse, you
> must write your own C extension to call this. Must be called with the GIL
> held. Returns the number of thread states modified; if it returns a number
> greater than one, you're in trouble, and you should call it again with exc set
> to NULL to revert the effect. This raises no exceptions. New in version 2.3.

Note that it is intentionally not directly accessible from Python --
but this can be revised.

> In Python 2.5, you can use ctypes to get at the whole C API from Python code,
> and calling thread.get_ident() in the run() method will allow you to find out
> the thread id of your thread (you'll need to save that value somewhere so
> other code can get at it).
>
> All Tober is really asking for is a method on threading.Thread objects that
> uses this existing API to set a builtin ThreadExit exception. The thread
> module would consider a thread finishing with ThreadExit to be
> non-exceptional, so you could easily do:
>
>    th.terminate() # Raise ThreadExit in th's thread of control
>    th.join() # Should finish up pretty quickly
>
> Proper resource cleanup would be reliant on correct use of try/finally or with
> statements, but that's the case regardless of whether or not asynchronous
> exceptions are allowed.

I'm +0 on this.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From tim.peters at gmail.com  Wed Aug  9 21:48:58 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Wed, 9 Aug 2006 15:48:58 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
	<44D9BCB4.5010404@gmail.com>
	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>
Message-ID: <1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>

[Nick Coghlan]
>> That check is already there:
>>
>> int PyThreadState_SetAsyncExc(  long id, PyObject *exc)
>>      Asynchronously raise an exception in a thread. The id argument is the
>> thread id of the target thread; exc is the exception object to be
raised. This
>> function does not steal any references to exc. To prevent naive misuse, you
>> must write your own C extension to call this. Must be called with the GIL
>> held. Returns the number of thread states modified; if it returns a number
>> greater than one, you're in trouble, and you should call it again
with exc set
>> to NULL to revert the effect. This raises no exceptions. New in version 2.3.

Guido, do you have any idea now what the "number greater than one"
business is about?  That would happen if and only if we found more
than one thread state with the given thread id in the interpreter's
list of thread states, but we're counting those with both the GIL and
the global head_mutex lock held.  My impression has been that it would
be an internal logic error if we ever saw this count exceed 1.

While I'm at it, I expect:

		Py_CLEAR(p->async_exc);
		Py_XINCREF(exc);
		p->async_exc = exc;

is better written:

		Py_XINCREF(exc);
		Py_CLEAR(p->async_exc);
		p->async_exc = exc;

for the same reason one should always incref B before decrefing A in

    A = B

...

>> All Tober is really asking for is a method on threading.Thread objects that
>> uses this existing API to set a builtin ThreadExit exception. The thread
>> module would consider a thread finishing with ThreadExit to be
>> non-exceptional, so you could easily do:
>>
>>    th.terminate() # Raise ThreadExit in th's thread of control
>>    th.join() # Should finish up pretty quickly
>>
>> Proper resource cleanup would be reliant on correct use of
try/finally or with
>> statements, but that's the case regardless of whether or not asynchronous
>> exceptions are allowed.

[Guido]
> I'm +0 on this.

Me too, although it won't stay that simple, and I'm clear as mud on
how implementations other than CPython could implement this.

From guido at python.org  Wed Aug  9 22:39:25 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 9 Aug 2006 13:39:25 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
	<44D9BCB4.5010404@gmail.com>
	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>
	<1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>
Message-ID: <ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>

On 8/9/06, Tim Peters <tim.peters at gmail.com> wrote:
> [Nick Coghlan]
> >> That check is already there:
> >>
> >> int PyThreadState_SetAsyncExc(  long id, PyObject *exc)
> >>      Asynchronously raise an exception in a thread. The id argument is the
> >> thread id of the target thread; exc is the exception object to be raised. This
> >> function does not steal any references to exc. To prevent naive misuse, you
> >> must write your own C extension to call this. Must be called with the GIL
> >> held. Returns the number of thread states modified; if it returns a number
> >> greater than one, you're in trouble, and you should call it again with exc set
> >> to NULL to revert the effect. This raises no exceptions. New in version 2.3.
>
> Guido, do you have any idea now what the "number greater than one"
> business is about?  That would happen if and only if we found more
> than one thread state with the given thread id in the interpreter's
> list of thread states, but we're counting those with both the GIL and
> the global head_mutex lock held.  My impression has been that it would
> be an internal logic error if we ever saw this count exceed 1.

Right, I think that's it. I guess I was in a grumpy mood when I wrote
this (and Just & Alex never ended up using it!).

> While I'm at it, I expect:
>
>                 Py_CLEAR(p->async_exc);
>                 Py_XINCREF(exc);
>                 p->async_exc = exc;
>
> is better written:
>
>                 Py_XINCREF(exc);
>                 Py_CLEAR(p->async_exc);
>                 p->async_exc = exc;
>
> for the same reason one should always incref B before decrefing A in
>
>     A = B
>
> ...

The reason being that A and B might already be the same object, right?

> >> All Tober is really asking for is a method on threading.Thread objects that
> >> uses this existing API to set a builtin ThreadExit exception. The thread
> >> module would consider a thread finishing with ThreadExit to be
> >> non-exceptional, so you could easily do:
> >>
> >>    th.terminate() # Raise ThreadExit in th's thread of control
> >>    th.join() # Should finish up pretty quickly
> >>
> >> Proper resource cleanup would be reliant on correct use of try/finally or with
> >> statements, but that's the case regardless of whether or not asynchronous
> >> exceptions are allowed.
>
> [Guido]
> > I'm +0 on this.
>
> Me too, although it won't stay that simple, and I'm clear as mud on
> how implementations other than CPython could implement this.

Another good reason to keep it accessible from the C API only. Now I'm
-0 on adding it. I suggest that if someone really wants this
accessible from Python, they should research how Jython, IronPython,
PyPy and Stackless could handle this, and report their research in a
PEP.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From qrczak at knm.org.pl  Thu Aug 10 00:27:16 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Thu, 10 Aug 2006 00:27:16 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>
	(Guido van Rossum's message of "Wed, 9 Aug 2006 13:39:25 -0700")
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
	<44D9BCB4.5010404@gmail.com>
	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>
	<1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>
	<ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>
Message-ID: <871wrp8rzv.fsf@qrnik.zagroda>

"Guido van Rossum" <guido at python.org> writes:

>> for the same reason one should always incref B before decrefing A in
>>
>>     A = B
>>
>> ...
>
> That reason that A and B might already be the same object, right?

Or B might be a subobject of A, not referenced elsewhere.

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From talin at acm.org  Thu Aug 10 01:13:05 2006
From: talin at acm.org (Talin)
Date: Wed, 09 Aug 2006 16:13:05 -0700
Subject: [Python-3000] Python/C++ question
Message-ID: <44DA6C01.2040904@acm.org>

A while back someone proposed switching to C++ as the implementation 
language for CPython, and the response was that this would make ABI 
compatibility too difficult, since the different C++ compilers don't 
have a common way to represent things like vtables and such.

However, I was thinking - if you remove all of the ABI-breaking features 
of C++, such as virtual functions, name mangling, RTTI, exceptions, and 
so on, it's still a pretty nice language compared to C - you still have
things like namespaces, constructors/destructors (especially nice for 
stack-local objects), overloadable type conversion, automatic 
upcasting/downcasting, references, plus you don't have to keep repeating 
the word 'struct' everywhere.

Think how much cleaner the Python source would be if just one C++ 
feature - namespaces - could be used. Imagine being able to put all of 
your enumeration values in their own namespace, instead of mixing them 
in with all the other global symbols.

Think of the gazillions of cast operators you could get rid of if you 
could assign from PyString* to PyObject*, without having to explicitly 
cast between pointer types.

My question is, however - would this even work? That is, if you wrapped 
all the source files in 'extern "C"', turned off the exception and RTTI 
compiler switches, suppressed the use of the C++ runtime libs and 
forbade use of the word 'virtual', would that effectively avoid the ABI 
compatibility issues? Would you be able to produce, on all supported 
platforms, a binary executable that was interoperable with ones produced 
by straight C?

I actually have a personal motivation in asking this - it has been so 
many years since I've written in C, that I've actually *forgotten how*. 
Despite the fact that my very first C program, written in 1982, was a C 
compiler, today I find writing C programs a considerable challenge, 
because I don't remember exactly where the dividing line between C and 
C++ is - and I will either end up accidentally using a C++-specific 
language feature, or worse, I'll unconsciously avoid a valid C language 
feature because I don't remember whether it's C++ specific or not. (For 
example, I don't remember whether it's valid to define an enumeration
within a struct, which is something that I do all the time in C++.)

-- Talin

From guido at python.org  Thu Aug 10 01:18:02 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 9 Aug 2006 16:18:02 -0700
Subject: [Python-3000] Python/C++ question
In-Reply-To: <44DA6C01.2040904@acm.org>
References: <44DA6C01.2040904@acm.org>
Message-ID: <ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>

On 8/9/06, Talin <talin at acm.org> wrote:
> A while back someone proposed switching to C++ as the implementation
> language for CPython, and the response was that this would make ABI
> compatibility too difficult, since the different C++ compilers don't
> have a common way to represent things like vtables and such.
>
> However, I was thinking - if you remove all of the ABI-breaking features
> of C++, such as virtual functions, name mangling, RTTI, exceptions, and
> so on, it's still a pretty nice language compared to C - you still have
> things like namespaces, constructors/destructors (especially nice for
> stack-local objects), overloadable type conversion, automatic
> upcasting/downcasting, references, plus you don't have to keep repeating
> the word 'struct' everywhere.
>
> Think how much cleaner the Python source would be if just one C++
> feature - namespaces - could be used. Imagine being able to put all of
> your enumeration values in their own namespace, instead of mixing them
> in with all the other global symbols.
>
> Think of the gazillions of cast operators you could get rid of if you
> could assign from PyString* to PyObject*, without having to explicitly
> cast between pointer types.
>
> My question is, however - would this even work? That is, if you wrapped
> all the source files in 'extern "C"', turned off the exception and RTTI
> compiler switches, suppressed the use of the C++ runtime libs and
> forbade use of the word 'virtual', would that effectively avoid the ABI
> compatibility issues? Would you be able to produce, on all supported
> platforms, a binary executable that was interoperable with ones produced
> by straight C?
>
> I actually have a personal motivation in asking this - it has been so
> many years since I've written in C, that I've actually *forgotten how*.
> Despite the fact that my very first C program, written in 1982, was a C
> compiler, today I find writing C programs a considerable challenge,
> because I don't remember exactly where the dividing line between C and
> C++ is - and I will either end up accidentally using a C++-specific
> language feature, or worse, I'll unconsciously avoid a valid C language
> feature because I don't remember whether it's C++ specific or not. (For
> example, I don't remember whether it's valid to define an enumeration
> within a struct, which is something that I do all the time in C++.)

For the majority of Python developers it's probably the other way
around. It's been 15 years since I wrote C++, and unlike C, that
language has changed a lot since then...

It would be a complete rewrite; I prefer doing a gradual
transmogrification of the current codebase into Py3k rather than
starting from scratch (read Joel Spolsky on why).

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From collinw at gmail.com  Thu Aug 10 02:32:19 2006
From: collinw at gmail.com (Collin Winter)
Date: Wed, 9 Aug 2006 20:32:19 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
Message-ID: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>

After letting the discussions from the Spring stew in my head for a
few months, here's my first draft of the proto-PEP for function
annotations. This is intended to lay out in a single document the
basic ideas for function annotations, to get community feedback on the
fundamentals before proceeding to the nitty-gritty. As such, the
implementation section isn't filled out; that's still in progress.
Also, the list of references is incomplete. Both of these will be
completed before the initial submission to the PEP editors.

Without further ado...


PEP: 3XXX
Title: Function Annotations
Version: $Revision: 43251 $
Last-Modified: $Date: 2006-03-23 09:28:55 -0500 (Thu, 23 Mar 2006) $
Author: Collin Winter <collinw at gmail period com>
Discussions-To: python-3000 at python.org
Status: Draft
Type: Standards Track
Requires: 3XXX (Brett Cannon's __signature__ PEP)
Content-Type: text/x-rst
Created: 03-Aug-2006
Python-Version: 3.0
Post-History:


Abstract
========

This PEP introduces a syntax for adding annotations to Python
functions [#func-term#]_.  In addition to annotations for function
parameters, the syntax includes support for annotating a function's
return value(s).

In section one, I outline the "philosophy" and fundamentals needed
to understand function annotations before launching into an
in-depth discussion.

In section two, the syntax for function annotations is presented,
including a full explanation of the changes needed in Python's
grammar.

In section three, I discuss how user code will be able to access
the annotation information.

Section four describes a possible implementation of function
annotations for Python 3.0.

In section five, a C-language API for use by extension modules is
discussed.

Lastly, section six lists a number of ideas that were considered for
inclusion but were ultimately rejected.


Rationale
=========

Because Python's 2.x series lacks a standard way of annotating a
function's parameters and return values (e.g., with information about
what type a function's return value should be), a variety
of tools and libraries have appeared to fill this gap [#tail-examp#]_.
Some utilise the decorators introduced in "PEP 318", while others
parse a function's docstrings, looking for annotations
there.
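The decorator-based approach enabled by PEP 318 can be sketched in a few
lines.  This toy ``expects`` decorator is purely illustrative (all names
here are invented) and merely stands in for the real 2.x-era libraries
mentioned above:

```python
# Toy sketch of a PEP 318-style typechecking decorator, of the kind
# the 2.x-era libraries mentioned above provide. Names are invented.
def expects(*types):
    def decorate(func):
        def wrapper(*args):
            for arg, typ in zip(args, types):
                if not isinstance(arg, typ):
                    raise TypeError("%r is not a %s" % (arg, typ.__name__))
            return func(*args)
        return wrapper
    return decorate

@expects(int, int)
def add(a, b):
    return a + b

print(add(2, 3))  # -> 5
```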

This PEP aims to provide a single, standard way of specifying this
information, reducing the confusion caused by the wide variation in
mechanism and syntax that has existed until this point.


Fundamentals of Function Annotations
====================================

Before launching into a discussion of the precise ins and outs of
Python 3.0's function annotations, let's first talk broadly about
what annotations are and are not:


1. Function annotations, both for parameters and return values, are
   completely optional.


2. Function annotations are nothing more than a way of associating
   arbitrary Python expressions with various parts of a function at
   compile-time.

   Re-read that. Once more.

   By itself, Python does not attach any particular meaning or
   significance to annotations.  Left to its own, Python simply
   takes these expressions and uses them as the values in some
   theoretical parameter-name-to-annotation-expression mapping.

   The only way that annotations take on meaning is when they
   are interpreted by third-party libraries.  These third-party,
   annotation-interpreting libraries (TAILs, for short) can do
   anything they want with a function's annotations.  For
   example, one library might use string-based annotations to provide
   improved help messages, like so:

   ::
        def compile(source: "something compilable",
                    filename: "where the compilable thing comes from",
                    mode: "is this a single statement or a suite?"):
            ...

   Another library might be used to provide typechecking for Python
   functions and methods.  This library could use annotations to
   indicate the function's expected input and return types, possibly
   something like

   ::
        def sum(*vargs: Number) -> Number:
            ...

   where ``Number`` is some description of the protocol for numeric
   types.

   However, neither the strings in the first example nor the
   type information in the second example have any meaning on their
   own;  meaning comes from third-party libraries alone.
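   As a concrete illustration of this "mapping of parameter names to
   evaluated expressions" view, here is a toy TAIL (every name below is
   invented for illustration) that turns the string annotations from
   the ``compile()`` example into a help message:

```python
# Toy TAIL: the annotations are just a name -> expression-value mapping,
# as for the compile() example above; the "meaning" lives entirely here.
annotations = {
    "source": "something compilable",
    "filename": "where the compilable thing comes from",
    "mode": "is this a single statement or a suite?",
}

def help_message(funcname, annotations):
    # Render each (parameter, annotation) pair as one help line.
    lines = ["%s:" % funcname]
    for name, note in sorted(annotations.items()):
        lines.append("  %s -- %s" % (name, note))
    return "\n".join(lines)

print(help_message("compile", annotations))
```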


3. Following from point 2, this PEP makes no attempt to introduce
   any kind of standard semantics, even for the built-in types.
   This work will be left to third-party libraries.

   There is no worry that these libraries will assign semantics at
   random, or that a variety of libraries will appear, each with varying
   semantics and interpretations of what, say, a tuple of strings
   means. The difficulty inherent in writing annotation-interpreting
   libraries will keep their number low and their authorship in the
   hands of people who, frankly, know what they're doing.


Syntax
======

Parameters
----------

Annotations for parameters take the form of optional expressions
that follow the parameter name.  This example indicates that
parameters 'a' and 'c' should both be a ``Number``, while parameter
'b' should be a ``Mapping``:

::
    def foo(a: Number, b: Mapping, c: Number = 5):
        ...

In pseudo-grammar, parameters now look like
``identifier [: expression] [= expression]``.  That is, type
annotations always precede a parameter's default value and both type
annotations and default values are optional.  Just as equal
signs are used to indicate a default value, colons are used to mark
annotations.  All annotation expressions are evaluated at the time
the function is compiled.

Annotations for excess parameters (i.e., *vargs and **kwargs)
are indicated similarly.  In the following function definition,
``*vargs`` is flagged as a list of ``Number``s, and ``**kwargs`` is
marked as a dict whose keys are strings and whose values are
``Sequence``s.

::
    def foo(*vargs: Number, **kwargs: Sequence):
        ...

Note that, depending on what annotation-interpreting library you're
using, the following might also be a valid spelling of the above:

::
    def foo(*vargs: [Number], **kwargs: {str: Sequence}):
        ...

Only the first, however, has the BDFL's blessing [#blessed-excess#]_
as the One Obvious Way.
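Under the blessed spelling, it falls to the annotation-interpreting
library to apply the single annotation element-wise.  A minimal sketch
(using the stdlib's ``numbers.Number`` as a stand-in for the PEP's
hypothetical ``Number`` protocol):

```python
import numbers

def check_vargs(annotation, vargs):
    # This toy TAIL reads a single annotation on *vargs as applying
    # to each excess positional argument individually.
    return all(isinstance(v, annotation) for v in vargs)

print(check_vargs(numbers.Number, (1, 2.5, 3)))  # -> True
print(check_vargs(numbers.Number, (1, "two")))   # -> False
```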


Return Values
-------------

The examples thus far have omitted examples of how to annotate the
type of a function's return value. This is done like so:

::
    def sum(*vargs: Number) -> Number:
        ...


The parameter list can now be followed by a literal ``->`` and
a Python expression.  Like the annotations for parameters, this
expression will be evaluated when the function is compiled.

The pseudo-grammar for function definition is now something like

::
    vargs     = '*'  identifier [':' expression]
    kwargs    = '**' identifier [':' expression]
    parameter = identifier [':' expression] ['=' expression]

    funcdef = 'def' identifier '(' [parameter ',']*
                                   [vargs ',']
                                   [kwargs]
                               ')' ['->' expression] ':' suite


For a complete discussion of the changes to Python's grammar, see
the section `Grammar Changes`_.


Accessing Function Annotations
==============================

Once compiled, a function's annotations are available via the
function's ``__signature__`` attribute, introduced by PEP 3XXX.
Signature objects include an attribute just for annotations,
appropriately called ``annotations``.  This attribute is a
dictionary, mapping parameter names to an object representing
the evaluated annotation expression.

There is a special key in the ``annotations`` mapping, ``"return"``.
This key is present only if an annotation was supplied for the
function's return value.

For example, the following annotation:

::
    def foo(a: Number, b: 5 + 6, c: list) -> String:
        ...

would result in a ``__signature__.annotations`` mapping of

::
    {'a': Number,
     'b': 11,
     'c': list,
     'return': String}


The ``return`` key was chosen because it cannot conflict with
the name of a parameter;  any attempt to use ``return`` as a
parameter name would result in a ``SyntaxError``.
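As a historical note, the ``__signature__`` mechanism described here was
never implemented as specified; in Python 3 as eventually released, the
same mapping is exposed directly as the function's ``__annotations__``
attribute, which lets the example above be checked directly (string
placeholders stand in for the undefined ``Number`` and ``String`` names):

```python
# In released Python 3 the mapping lives on func.__annotations__ rather
# than on a __signature__ object; the contents match the example above.
# Note 'b': the expression 5 + 6 is evaluated, yielding 11.
def foo(a: "Number", b: 5 + 6, c: list) -> "String":
    ...

print(foo.__annotations__)
```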


Implementation
==============

XXX This is all very much TODO.  Beyond the obvious changes to Python's
grammar, the eventual implementation will probably involve a change to
the MAKE_FUNCTION opcode, though the details haven't been fully worked
out yet.

I'm still working on a sample implementation that works separately from
the __signature__ mechanism.


API for Annotations in C-language Extension Modules
===================================================

XXX TODO

This will probably involve macros around CPython API calls to set
and fetch the annotation expression for a given parameter.


Rejected Proposals
==================

+ The BDFL rejected the author's idea for a special syntax for adding
  annotations to generators as being "too ugly" [#reject-gen-syn#]_.

+ Though discussed early on ([#thread-gen#]_, [#thread-hof#]_),
  including special objects in the stdlib for annotating generator
  functions and higher-order functions was ultimately rejected as
  being more appropriate for third-party libraries:  including them
  in the standard library raised too many thorny issues.

+ Despite considerable discussion about a standard type parameterisation
  syntax, it was decided that this should also be left to third-party
  libraries. ([#thread-imm-list#]_, [#thread-mixing#]_,
  [#emphasis-tpls#]_)


Footnotes
=========

.. _[#func-term#] - Unless specifically stated, "function" is
   generally used as a synonym for "callable" throughout this
   document.

.. _[#tail-examp#] - The author's typecheck_ library makes use of
   decorators, while `Maxime Bourget's own typechecker`_ utilises parsed
   docstrings.


References
##########

.. _[#blessed-excess#] -
        http://mail.python.org/pipermail/python-3000/2006-May/002173.html

.. _[#reject-gen-syn#] -
        http://mail.python.org/pipermail/python-3000/2006-May/002103.html

.. _typecheck -
        http://oakwinter.com/code/typecheck/

.. _Maxime Bourget's own typechecker -
        http://maxrepo.info/taxonomy/term/3,6/all

.. _[#thread-gen#] -
        http://mail.python.org/pipermail/python-3000/2006-May/002091.html

.. _[#thread-hof#] -
        http://mail.python.org/pipermail/python-3000/2006-May/001972.html

.. _[#thread-imm-list#] -
        http://mail.python.org/pipermail/python-3000/2006-May/002105.html

.. _[#thread-mixing#] -
        http://mail.python.org/pipermail/python-3000/2006-May/002209.html

.. _[#emphasis-tpls#] -
        http://mail.python.org/pipermail/python-3000/2006-June/002438.html

From talin at acm.org  Thu Aug 10 02:51:02 2006
From: talin at acm.org (Talin)
Date: Wed, 09 Aug 2006 17:51:02 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
Message-ID: <44DA82F6.5030907@acm.org>

Collin Winter wrote:
>    There is no worry that these libraries will assign semantics at
>    random, or that a variety of libraries will appear, each with varying
>    semantics and interpretations of what, say, a tuple of strings
>    means. The difficulty inherent in writing annotation interpreting
>    libraries will keep their number low and their authorship in the
>    hands of people who, frankly, know what they're doing.

I find this assumption extremely dubious.

> In pseudo-grammar, parameters now look like
> ``identifier [: expression] [= expression]``.  That is, type
> annotations always precede a parameter's default value and both type
> annotations and default values are optional.  Just like how equal
> signs are used to indicate a default value, colons are used to mark
> annotations.  All annotation expressions are evaluated at the time
> the function is compiled.

Only one annotation per parameter? What if I want to specify both a 
docstring *and* a type constraint?

-- Talin

From collinw at gmail.com  Thu Aug 10 03:02:08 2006
From: collinw at gmail.com (Collin Winter)
Date: Wed, 9 Aug 2006 21:02:08 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DA82F6.5030907@acm.org>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
	<44DA82F6.5030907@acm.org>
Message-ID: <43aa6ff70608091802sc2cd03bg9c43a237bcf13d8@mail.gmail.com>

On 8/9/06, Talin <talin at acm.org> wrote:
> Collin Winter wrote:
> >    There is no worry that these libraries will assign semantics at
> >    random, or that a variety of libraries will appear, each with varying
> >    semantics and interpretations of what, say, a tuple of strings
> >    means. The difficulty inherent in writing annotation interpreting
> >    libraries will keep their number low and their authorship in the
> >    hands of people who, frankly, know what they're doing.
>
> I find this assumption extremely dubious.

Why? This is something Guido and I have discussed and agreed on.

What's your reasoning?

> > In pseudo-grammar, parameters now look like
> > ``identifier [: expression] [= expression]``.  That is, type
> > annotations always precede a parameter's default value and both type
> > annotations and default values are optional.  Just like how equal
> > signs are used to indicate a default value, colons are used to mark
> > annotations.  All annotation expressions are evaluated at the time
> > the function is compiled.
>
> Only one annotation per parameter? What if I want to specify both a
> docstring *and* a type constraint?

If the grammar were something like ``identifier [: expression]* [=
expression]`` instead, it would be possible to add multiple
annotations to parameters. But what of the return value? Would you
want to write

def foo() -> Number -> "total number of frobnications":
...

I wouldn't.

The way to make this explicit, if you need it, would be something like this:

def bar(a: ("number of whatzits", Number)) -> ("frobnication count", Number):

then use a decorator to determine which annotation-interpreting
libraries are assigned which annotations, something like this,
perhaps:

@chain(annotation_as_docstring, annotation_as_type)
def bar(a: ("number of whatzits", Number)) -> ("frobnication count", Number):
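The ``chain()`` helper here is hypothetical; a runnable sketch of the
idea (every name below is invented for illustration) might hand each
interpreter its slot of every tuple annotation:

```python
# Hypothetical sketch of chain(): slot i of each tuple annotation goes
# to annotation interpreter i. All names are invented for illustration.
def chain(*interpreters):
    def decorate(func):
        for i, interp in enumerate(interpreters):
            # Give each interpreter its slot of every tuple annotation.
            sliced = {name: ann[i]
                      for name, ann in func.__annotations__.items()}
            func = interp(func, sliced)
        return func
    return decorate

def annotation_as_docstring(func, notes):
    func.__doc__ = "; ".join("%s: %s" % kv for kv in sorted(notes.items()))
    return func

def annotation_as_type(func, types):
    func.type_notes = types  # a real TAIL might wrap func with checks
    return func

@chain(annotation_as_docstring, annotation_as_type)
def bar(a: ("number of whatzits", int)) -> ("frobnication count", int):
    return a + 1

print(bar.__doc__)
```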


Collin Winter

From tim.peters at gmail.com  Thu Aug 10 03:38:20 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Wed, 9 Aug 2006 21:38:20 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
	<44D9BCB4.5010404@gmail.com>
	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>
	<1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>
	<ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>
Message-ID: <1f7befae0608091838u594de27ctb83dd0845ccaa0@mail.gmail.com>

[back and forth on PyThreadState_SetAsyncExc()]

[Tim]
>> Guido, do you have any idea now what the "number greater than one"
>> business is about?
>> ...
>> My impression has been that it would be an internal logic error if we
>> ever saw this count exceed 1.

[Guido]
> Right, I think that's it. I guess I was in a grumpy mood when I wrote
> this

I forgot that we talked about this close to two years ago:

    http://www.python.org/sf/1069160

As comments there say, it's still the case that it's clearly possible
to provoke this into deadlocking (but unlikely if you're not
deliberately trying to).

> (and Just & Alex never ended up using it!).

They spoke for themselves on this matter in that bug report ;-)

>> While I'm at it, I expect:
>>
>>                 Py_CLEAR(p->async_exc);
>>                 Py_XINCREF(exc);
>>                 p->async_exc = exc;
>>
>> is better written:
>>
>>                 Py_XINCREF(exc);
>>                 Py_CLEAR(p->async_exc);
>>                 p->async_exc = exc;
>>
>> for the same reason one should always incref B before decrefing A in
>>
>>     A = B
>>
>> ...

> The reason is that A and B might already be the same object, right?

Right, or that B's only owned reference is on a chain only reachable
from A, and in either case A's incoming refcount is 1.  The suggested
deadlock-avoiding rewrite in the patch comment addresses that too.

...

>>> I'm +0 on [exposing] this [from Python].

>> Me too, although it won't stay that simple, and I'm clear as mud on
>> how implementations other than CPython could implement this.

> Another good reason to keep it accessible from the C API only. Now I'm
> -0 on adding it. I suggest that if someone really wants this
> accessible from Python, they should research how Jython, IronPython,
> PyPy and Stackless could handle this, and report their research in a
> PEP.

As a full-blown language feature, I'm -1 unless that work is done
first.  I'm still +0 on adding it to CPython if it's given a
leading-underscore name and docs to make clear that it's a
CPython-specific hack that may never work under any other
implementation.

From greg.ewing at canterbury.ac.nz  Thu Aug 10 04:47:48 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Thu, 10 Aug 2006 14:47:48 +1200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
Message-ID: <44DA9E54.5020105@canterbury.ac.nz>

Collin Winter wrote:
>    one library might use string-based annotations to provide
>    improved help messages, like so:
>
>         def compile(source: "something compilable",
>                     filename: "where the compilable thing comes from",
>                     mode: "is this a single statement or a suite?"):
> 
>    Another library might be used to provide typechecking for Python
>    functions and methods.
>
>         def sum(*vargs: Number) -> Number:
>             ...

And what are you supposed to do if you want to write
a function that has improved help messages *and*
type checking?

>    The difficulty inherent in writing annotation interpreting
>    libraries will keep their number low and their authorship in the
>    hands of people who, frankly, know what they're doing.

Even if there are only two of them, they can still
conflict.

I think the idea of having totally undefined
annotations is fundamentally flawed.

--
Greg

From greg.ewing at canterbury.ac.nz  Thu Aug 10 04:49:55 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Thu, 10 Aug 2006 14:49:55 +1200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608091802sc2cd03bg9c43a237bcf13d8@mail.gmail.com>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
	<44DA82F6.5030907@acm.org>
	<43aa6ff70608091802sc2cd03bg9c43a237bcf13d8@mail.gmail.com>
Message-ID: <44DA9ED3.3040304@canterbury.ac.nz>

Collin Winter wrote:
> On 8/9/06, Talin <talin at acm.org> wrote:
> 
>>Collin Winter wrote:
>>
>>>   The difficulty inherent in writing annotation interpreting
>>>   libraries will keep their number low and their authorship in the
>>>   hands of people who, frankly, know what they're doing.
>>
>>I find this assumption extremely dubious.
> 
> Why? This is something Guido and I have discussed and agreed on.

It smells like something akin to security by
obscurity to me.

--
Greg

From collinw at gmail.com  Thu Aug 10 04:58:55 2006
From: collinw at gmail.com (Collin Winter)
Date: Wed, 9 Aug 2006 22:58:55 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DA9E54.5020105@canterbury.ac.nz>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
	<44DA9E54.5020105@canterbury.ac.nz>
Message-ID: <43aa6ff70608091958u2d00db76s48260853942bed32@mail.gmail.com>

On 8/9/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Collin Winter wrote:
> >    one library might use string-based annotations to provide
> >    improved help messages, like so:
> >
> >         def compile(source: "something compilable",
> >                     filename: "where the compilable thing comes from",
> >                     mode: "is this a single statement or a suite?"):
> >
> >    Another library might be used to provide typechecking for Python
> >    functions and methods.
> >
> >         def sum(*vargs: Number) -> Number:
> >             ...
>
> And what are you supposed to do if you want to write
> a function that has improved help messages *and*
> type checking?

I already answered this in my response to Talin.

The next draft will address this directly.

> >    The difficulty inherent in writing annotation interpreting
> >    libraries will keep their number low and their authorship in the
> >    hands of people who, frankly, know what they're doing.
>
> Even if there are only two of them, they can still
> conflict.

No-one is arguing that there won't be conflicting ideas about how to
spell different annotations; just look at the number of
interface/role/typeclass/whatever implementations.

The idea is that each developer can pick the notation/semantics that's
most natural to them. I'll go even further: say one library offers a
semantics you find handy for task A, while another library's ideas
about type annotations are best suited for task B. Without a single
standard, you're free to mix and match these libraries to give you a
combination that allows you to best express the ideas you're going
for.

Collin Winter

From guido at python.org  Thu Aug 10 06:14:03 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 9 Aug 2006 21:14:03 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <1f7befae0608091838u594de27ctb83dd0845ccaa0@mail.gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
	<44D9BCB4.5010404@gmail.com>
	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>
	<1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>
	<ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>
	<1f7befae0608091838u594de27ctb83dd0845ccaa0@mail.gmail.com>
Message-ID: <ca471dc20608092114j43071728nbd660d182d065316@mail.gmail.com>

On 8/9/06, Tim Peters <tim.peters at gmail.com> wrote:
> [back and forth on PyThreadState_SetAsyncExc()]
>
> [Tim]
> >> Guido, do you have any idea now what the "number greater than one"
> >> business is about?
> >> ...
> >> My impression has been that it would be an internal logic error if we
> >> ever saw this count exceed 1.
>
> [Guido]
> > Right, I think that's it. I guess I was in a grumpy mood when I wrote
> > this
>
> I forgot that we talked about this close to two years ago:
>
>     http://www.python.org/sf/1069160
>
> As comments there say, it's still the case that it's clearly possible
> to provoke this into deadlocking (but unlikely if you're not
> deliberately trying to).
>
> > (and Just & Alex never ended up using it!).
>
> They spoke for themselves on this matter in that bug report ;-)
>
> >> While I'm at it, I expect:
> >>
> >>                 Py_CLEAR(p->async_exc);
> >>                 Py_XINCREF(exc);
> >>                 p->async_exc = exc;
> >>
> >> is better written:
> >>
> >>                 Py_XINCREF(exc);
> >>                 Py_CLEAR(p->async_exc);
> >>                 p->async_exc = exc;
> >>
> >> for the same reason one should always incref B before decrefing A in
> >>
> >>     A = B
> >>
> >> ...
>
> > The reason is that A and B might already be the same object, right?
>
> Right, or that B's only owned reference is on a chain only reachable
> from A, and in either case A's incoming refcount is 1.  The suggested
> deadlock-avoiding rewrite in the patch comment addresses that too.

So why didn't we check that in?

> ...
>
> >>> I'm +0 on [exposing] this [from Python].
>
> >> Me too, although it won't stay that simple, and I'm clear as mud on
> >> how implementations other than CPython could implement this.
>
> > Another good reason to keep it accessible from the C API only. Now I'm
> > -0 on adding it. I suggest that if someone really wants this
> > accessible from Python, they should research how Jython, IronPython,
> > PyPy and Stackless could handle this, and report their research in a
> > PEP.
>
> As a full-blown language feature, I'm -1 unless that work is done
> first.  I'm still +0 on adding it to CPython if it's given a
> leading-underscore name and docs to make clear that it's a
> CPython-specific hack that may never work under any other
> implementation.

Fine with me then. In 2.5? 2.6? Or py3k? (This is the py3k list.)

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul at prescod.net  Thu Aug 10 07:19:03 2006
From: paul at prescod.net (Paul Prescod)
Date: Wed, 9 Aug 2006 22:19:03 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
Message-ID: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>

Thanks to everyone who contributed. It seems that the emerging consensus
(bar a security question from Guido) is that ctypes is the way forward for
calling C code in Python 3000. I'd like to clarify what this might mean:

1. Is ctypes and pure python fast enough for most real-world extension
modules like PyOpenGL, PyExpat, Tkinter, and socket programming? I know that
experimentation is ongoing. Are any results in?

 2. If not, will Python 3000's build or runtime system use some kind of
optimization technique such as static compilation (e.g. extcompiler[1]) or
JIT compilation to allow parts of its library (especially new parts) to be
written using ctypes instead of C?

 3. Presuming that the performance issue can be worked out one way or
another, are there arguments in favour of interpreter-specific C-coded
extensions other than those doing explicitly interpreter-specific stuff (e.g.
tweaking the GC)?

 4. Will the Python 3000 standard library start to migrate towards ctypes
(for new extensions)?

 Paul Prescod

[1] http://codespeak.net/pypy/dist/pypy/doc/extcompiler.html
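For reference, the flavour of binding under discussion looks like this
with ctypes (calling libc's ``strlen``; this assumes a Unix-like
platform where ``find_library("c")`` can locate the C library):

```python
import ctypes
import ctypes.util

# Minimal ctypes sketch: bind and call libc's strlen from pure Python.
# Assumes a Unix-like platform where find_library("c") succeeds.
libc = ctypes.CDLL(ctypes.util.find_library("c"))
libc.strlen.argtypes = [ctypes.c_char_p]
libc.strlen.restype = ctypes.c_size_t

print(libc.strlen(b"hello"))  # -> 5
```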
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060809/3b2d1dd1/attachment.html 

From krstic at solarsail.hcs.harvard.edu  Thu Aug 10 07:32:38 2006
From: krstic at solarsail.hcs.harvard.edu (Ivan Krstic)
Date: Thu, 10 Aug 2006 01:32:38 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608092114j43071728nbd660d182d065316@mail.gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>	<44D9BCB4.5010404@gmail.com>	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>	<1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>	<ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>	<1f7befae0608091838u594de27ctb83dd0845ccaa0@mail.gmail.com>
	<ca471dc20608092114j43071728nbd660d182d065316@mail.gmail.com>
Message-ID: <44DAC4F6.3010002@solarsail.hcs.harvard.edu>

Guido van Rossum wrote:
> Fine with me then. In 2.5? 2.6? Or py3k? (This is the py3k list.)

FWIW, we'll ship 2.5 on the OLPC (laptop.org) machines, and it looks
like we'll need this. It'd be useful to have it directly in CPython, so
people running our software outside the laptops don't have to fuss with
an extension.

-- 
Ivan Krstic <krstic at solarsail.hcs.harvard.edu> | GPG: 0x147C722D

From pje at telecommunity.com  Thu Aug 10 08:28:23 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Thu, 10 Aug 2006 02:28:23 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <mailman.33865.1155187962.27774.python-3000@python.org>
Message-ID: <5.1.1.6.0.20060810021302.0262fcd0@sparrow.telecommunity.com>

At 14:47 8/10/2006 +1200, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
>And what are you supposed to do if you want to write
>a function that has improved help messages *and*
>type checking?

Create a type annotation object that wraps multiple objects -- or better 
yet, use a list or tuple of annotations.  (See below.)


> >    The difficulty inherent in writing annotation interpreting
> >    libraries will keep their number low and their authorship in the
> >    hands of people who, frankly, know what they're doing.
>
>Even if there are only two of them, they can still
>conflict.
>
>I think the idea of having totally undefined
>annotations is fundamentally flawed.

No, your assumption is fundamentally flawed.  ;-)  This is a trivial 
application of overloaded functions.

In PEAK, there is a similar concept called "attribute metadata" that can be 
applied to the attributes of a class.  A single overloaded function called 
"declareAttribute" is used to "declare" the metadata.  These metadata 
annotations can be anything you want.  Certain PEAK frameworks use them for 
security declarations.  Others use them to mark an attribute as providing a 
certain interface for child components, to describe the attribute's syntax 
for parsing or formatting, and so on.

There is no predefined semantics for these metadata objects -- none 
whatsoever.  Each framework that needs a new kind of metadata object simply 
defines a class that holds whatever metadata is desired, and adds a method 
to the "declareAttribute" function to handle objects of that type.  The 
added method can do anything: modify the class or descriptor in some way, 
register something in a registry, or whatever else you want it to do.

In addition, the declareAttribute function comes with predefined methods 
for processing tuples and lists by iterating over them and calling 
declareAttribute recursively.  This makes it easy to combine groups of 
metadata objects and reuse them.

So I see no problems with this concept that overloaded functions don't 
trivially solve.  Any operation you want to perform on function annotations 
need only be implemented as an overloaded function, and there is then no 
conflict to worry about.

For example, if you are writing a documentation tool that needs to generate 
a short HTML string for an annotation, you just create an overloaded 
function for that.  Then somebody using the documentation tool with 
arbitrary type annotation frameworks (e.g. their own) can just add methods 
to the documentation tool's overloaded functions to support that.

Indeed, many a time I've wished that epydoc was written using overloaded 
functions, as it then would've been easy to extend it to gracefully handle 
PEAK's more esoteric descriptors and metaclasses.
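
A rough sketch of the dispatch pattern described above, using
functools.singledispatch from today's standard library as a stand-in for
PEAK's overloaded functions (SecurityTag and the registry argument are
invented for illustration, not PEAK APIs):

```python
from functools import singledispatch

class SecurityTag:
    """Hypothetical metadata object, analogous to a PEAK security declaration."""
    def __init__(self, level):
        self.level = level

@singledispatch
def declare_attribute(metadata, registry):
    # No predefined semantics: unknown metadata types are an error.
    raise TypeError("no handler registered for %r" % type(metadata))

@declare_attribute.register(SecurityTag)
def _(metadata, registry):
    registry.append(("security", metadata.level))

# Predefined handling for tuples/lists: recurse, so metadata can be grouped.
@declare_attribute.register(tuple)
@declare_attribute.register(list)
def _(metadata, registry):
    for item in metadata:
        declare_attribute(item, registry)

registry = []
declare_attribute([SecurityTag("admin"), (SecurityTag("user"),)], registry)
print(registry)  # [('security', 'admin'), ('security', 'user')]
```

Each new framework adds a method for its own metadata class; no two
frameworks' annotations can conflict, because each is handled by its own
registered implementation.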


From behnel_ml at gkec.informatik.tu-darmstadt.de  Thu Aug 10 09:28:12 2006
From: behnel_ml at gkec.informatik.tu-darmstadt.de (Stefan Behnel)
Date: Thu, 10 Aug 2006 09:28:12 +0200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
Message-ID: <44DAE00C.400@gkec.informatik.tu-darmstadt.de>

Hi,

Collin Winter wrote:
>         def compile(source: "something compilable",
>                     filename: "where the compilable thing comes from",
>                     mode: "is this a single statement or a suite?"):
>             ...
> 
>         def sum(*vargs: Number) -> Number:
>             ...

Admittedly, I'm not so much in the "Spring stew" discussion, but I'm not a big
fan of cluttering up my function signature with "make them short to make them
fit" comments.

What would be wrong with adding a standard decorator for this purpose? Something
like:

   @type_annotate("This is a filename passed as string", filename = str)
   @type_annotate(source = str)
   def compile(source, filename, mode):
      ...

or, more explicitly:

   @arg_docstring(filename = "This is a filename passed as string")
   @arg_type(filename = str)
   @arg_type(source = str)
   def compile(source, filename, mode):
      ...
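
A minimal sketch of what such an arg_type decorator could look like (the
_arg_types attribute name is invented here purely for illustration; nothing
below is an existing API):

```python
def arg_type(**types):
    """Attach per-argument type annotations to a function."""
    def decorate(func):
        anns = getattr(func, "_arg_types", {})
        anns.update(types)
        func._arg_types = anns   # invented attribute, purely illustrative
        return func
    return decorate

@arg_type(filename=str)
@arg_type(source=str)
def compile_(source, filename, mode):
    ...

print(compile_._arg_types)  # {'source': <class 'str'>, 'filename': <class 'str'>}
```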

Stefan

From behnel_ml at gkec.informatik.tu-darmstadt.de  Thu Aug 10 09:31:24 2006
From: behnel_ml at gkec.informatik.tu-darmstadt.de (Stefan Behnel)
Date: Thu, 10 Aug 2006 09:31:24 +0200
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
Message-ID: <44DAE0CC.8040909@gkec.informatik.tu-darmstadt.de>


Paul Prescod wrote:
>  2. If not, will Python 3000's build or runtime system use some kind of
> optimization technique such as static compilation ( e.g. extcompiler[1])
> or JIT compilation to allow parts of its library (especially new parts)
> to be written using ctypes instead of C?

What's the problem? Just take PyPy and brand it as Python 3000.

Stefan


From l.oluyede at gmail.com  Thu Aug 10 10:15:10 2006
From: l.oluyede at gmail.com (Lawrence Oluyede)
Date: Thu, 10 Aug 2006 10:15:10 +0200
Subject: [Python-3000] Changing behavior of sequence multiplication by
	negative integer
Message-ID: <9eebf5740608100115g1fa7a861rd0b9a84a7b64d4be@mail.gmail.com>

I've never seen bugs caused by operations such as:

"foobar" * -1

and to be honest I've never seen code like that, because the semantics
seems somewhat senseless to me. Still, I think the behavior of evaluating
"sequence * negative integer" should be changed from:

>>> "foobar" * -1
''
>>> ["foobar"] * -1
[]
>>> ("foobar") * -1
''

to something throwing an exception, like when you try to multiply
the sequence by a floating point number:

>>> "foobar" * 1.0
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
TypeError: can't multiply sequence by non-int

It's not a big deal to me, but maybe this can be addressed in the
python3000 branch.

-- 
Lawrence
http://www.oluyede.org/blog

From ncoghlan at gmail.com  Thu Aug 10 13:19:55 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 10 Aug 2006 21:19:55 +1000
Subject: [Python-3000] threading, part 2
In-Reply-To: <44DAC4F6.3010002@solarsail.hcs.harvard.edu>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>	<44D9BCB4.5010404@gmail.com>	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>	<1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>	<ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>	<1f7befae0608091838u594de27ctb83dd0845ccaa0@mail.gmail.com>	<ca471dc20608092114j43071728nbd660d182d065316@mail.gmail.com>
	<44DAC4F6.3010002@solarsail.hcs.harvard.edu>
Message-ID: <44DB165B.2040901@gmail.com>

Ivan Krstic wrote:
> Guido van Rossum wrote:
>> Fine with me then. In 2.5? 2.6? Or py3k? (This is the py3k list.)
> 
> FWIW, we'll ship 2.5 on the OLPC (laptop.org) machines, and it looks
> like we'll need this. It'd be useful to have it directly in CPython, so
> people running our software outside the laptops don't have to fuss with
> an extension.

Given the time frame, I think you might be stuck with using ctypes to get at 
the functionality for Python 2.5. Now that Guido & Tim have mentioned it, I 
also vaguely recall portability to GIL-free implementations being one of the 
problems with the idea back when the C API function was added, so exposing 
this officially to Python code should probably wait until 2.6.

Peter Hansen worked out the necessary incantations to invoke it through ctypes 
back in 2004 [1]. The difference now is that "import ctypes" will work on a 
vanilla 2.5 installation.

Cheers,
Nick.

[1]
http://groups.google.com/group/comp.lang.python/msg/d310502f7c7133a9

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Thu Aug 10 13:40:32 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 10 Aug 2006 21:40:32 +1000
Subject: [Python-3000] Changing behavior of sequence multiplication by
 negative integer
In-Reply-To: <9eebf5740608100115g1fa7a861rd0b9a84a7b64d4be@mail.gmail.com>
References: <9eebf5740608100115g1fa7a861rd0b9a84a7b64d4be@mail.gmail.com>
Message-ID: <44DB1B30.1030200@gmail.com>

Lawrence Oluyede wrote:
> I've never seen bugs determined by operations such as:
> 
> "foobar" * -1
> 
> and to be honest I've never seen code like that because the semantics
> is somewhat senseless to me but I think the behavior of the expression
> evaluation of "Sequence * negative integer" should be changed from:
> 
>>>> "foobar" * -1
> ''
>>>> ["foobar"] * -1
> []
>>>> ("foobar") * -1
> ''
> 
> to something throwing an exception like when you try to multiplicate
> the sequence by a floating point number:
> 
>>>> "foobar" * 1.0
> Traceback (most recent call last):
>   File "<stdin>", line 1, in ?
> TypeError: can't multiply sequence by non-int
> 
> It's not a big deal to me but maybe this can be addressed in the
> python3000 branch
> 

The "negative coerced to 0" behaviour is to make it easy to do things like 
padding a sequence to a minimum length:

   seq = seq + pad * (min_length - len(seq))

Without the current behaviour, all such operations would need to be rewritten as:

   seq = seq + pad * max((min_length - len(seq)), 0)

Gratuitous breakage that leads to a more verbose result gets a solid -1 from me :)
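
To make the clipping behaviour concrete:

```python
def pad_to_min(seq, pad, min_length):
    # A negative repeat count clips to the empty sequence, so sequences
    # that are already long enough pass through untouched.
    return seq + pad * (min_length - len(seq))

print(pad_to_min("ab", " ", 5))      # 'ab   '
print(pad_to_min("abcdef", " ", 5))  # 'abcdef' -- already long enough
```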

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From l.oluyede at gmail.com  Thu Aug 10 14:03:28 2006
From: l.oluyede at gmail.com (Lawrence Oluyede)
Date: Thu, 10 Aug 2006 14:03:28 +0200
Subject: [Python-3000] Changing behavior of sequence multiplication by
	negative integer
In-Reply-To: <44DB1B30.1030200@gmail.com>
References: <9eebf5740608100115g1fa7a861rd0b9a84a7b64d4be@mail.gmail.com>
	<44DB1B30.1030200@gmail.com>
Message-ID: <9eebf5740608100503l16238585yf1f2c38b1a4a4142@mail.gmail.com>

> The "negative coerced to 0" behaviour is to make it easy to do things like
> padding a sequence to a minimum length:
>
>    seq = seq + pad * (min_length- len(seq))
>
> Without the current behaviour, all such operations would need to be rewritten as:
>
>    seq = seq + pad * max((min_length- len(seq)), 0)
>
> Gratuitous breakage that leads to a more verbose result gets a solid -1 from me :)

That sounds like a -1 to me too. Thanks for the explanation. I was sure
there was a reason for that kind of behavior.


-- 
Lawrence
http://www.oluyede.org/blog

From behnel_ml at gkec.informatik.tu-darmstadt.de  Thu Aug 10 15:00:30 2006
From: behnel_ml at gkec.informatik.tu-darmstadt.de (Stefan Behnel)
Date: Thu, 10 Aug 2006 15:00:30 +0200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DAE00C.400@gkec.informatik.tu-darmstadt.de>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
	<44DAE00C.400@gkec.informatik.tu-darmstadt.de>
Message-ID: <44DB2DEE.7020601@gkec.informatik.tu-darmstadt.de>



Stefan Behnel wrote:
> Collin Winter wrote:
>>         def compile(source: "something compilable",
>>                     filename: "where the compilable thing comes from",
>>                     mode: "is this a single statement or a suite?"):
>>             ...
>>
>>         def sum(*vargs: Number) -> Number:
>>             ...
> 
> Admittedly, I'm not so much in the "Spring stew" discussion, but I'm not a big
> fan of cluttering up my function signature with "make them short to make them
> fit" comments.
> 
> What would be wrong in adding a standard decorator for this purpose? Something
> like:
> 
>    @type_annotate("This is a filename passed as string", filename = str)
>    @type_annotate(source = str)
>    def compile(source, filename, mode):
>       ...
> 
> or, more explicitly:
> 
>    @arg_docstring(filename = "This is a filename passed as string")
>    @arg_type(filename = str)
>    @arg_type(source = str)
>    def compile(source, filename, mode):
>       ...

Ah, never mind, that only applies to docstrings. The type annotation would not
be available to the compiler...

So, it would be a good idea to split the two: docstrings and types. While a
decorator provides a readable (and extensible) solution for the former, type
annotations should be part of the signature IMHO.

Stefan


From jimjjewett at gmail.com  Thu Aug 10 16:13:14 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Thu, 10 Aug 2006 10:13:14 -0400
Subject: [Python-3000] Changing behavior of sequence multiplication by
	negative integer
In-Reply-To: <44DB1B30.1030200@gmail.com>
References: <9eebf5740608100115g1fa7a861rd0b9a84a7b64d4be@mail.gmail.com>
	<44DB1B30.1030200@gmail.com>
Message-ID: <fb6fbf560608100713i55839644mcf6806ca4b6a6e98@mail.gmail.com>

Lawrence Oluyede wrote:

> seq * -5
> and to be honest I've never seen code like that because the semantics
> is somewhat senseless to me

To be honest, I would almost expect the negative to mean "count from
the end", so that it also reversed the sequence.  It doesn't, but ...
it does make for a hard-to-explain case.

> ... evaluation of "Sequence * negative integer" should be changed from:

> >>> "foobar" * -1
>  ''

> > ... to something throwing an exception like when you try to multiplicate
> > the sequence by a floating point number:

Agreed.

On 8/10/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
> The "negative coerced to 0" behaviour is to make it easy to do things like
> padding a sequence to a minimum length:

>    seq = seq + pad * (min_length- len(seq))

Typically, if I need to pad a sequence to a minimum length, I really
need it to be a specific length.  Having it already be too long is
likely to cause problems later.  So I really do prefer the explicit
version.

Also compare this to the recent decision that __index__ should *not*
silently clip to a C long.

> Without the current behaviour, all such operations would need to be rewritten as:

>    seq = seq + pad * max((min_length- len(seq)), 0)

I would write it as

# Create a record-size pad outside the loop
pad = " "*length
...
    seq = (seq+pad)[:length]

-jJ

From ncoghlan at gmail.com  Thu Aug 10 16:33:27 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 11 Aug 2006 00:33:27 +1000
Subject: [Python-3000] Changing behavior of sequence multiplication by
 negative integer
In-Reply-To: <fb6fbf560608100713i55839644mcf6806ca4b6a6e98@mail.gmail.com>
References: <9eebf5740608100115g1fa7a861rd0b9a84a7b64d4be@mail.gmail.com>	
	<44DB1B30.1030200@gmail.com>
	<fb6fbf560608100713i55839644mcf6806ca4b6a6e98@mail.gmail.com>
Message-ID: <44DB43B7.2060608@gmail.com>

Jim Jewett wrote:
> I would write it as
> 
> # Create a record-size pad outside the loop
> pad = " "*length
> ...
>    seq = (seq+pad)[:length]

I'd generally do padding to a fixed length that way as well, but any code 
relying on the current 'clip to 0' behaviour would break if this changed. 
Without a really compelling reason to change it, it's hard to justify any 
breakage at all (even if there may be better ways of doing things).

While I take your point about the comparison to __index__, the difference is 
that clipping sequence repetition to 0 has been the expected behaviour for 
many releases, whereas in the __index__ overflow case the expected behaviour 
was for the code to raise an exception.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From slawek at cs.lth.se  Thu Aug 10 16:49:07 2006
From: slawek at cs.lth.se (Slawomir Nowaczyk)
Date: Thu, 10 Aug 2006 16:49:07 +0200
Subject: [Python-3000] Changing behavior of sequence multiplication by
	negative integer
In-Reply-To: <fb6fbf560608100713i55839644mcf6806ca4b6a6e98@mail.gmail.com>
References: <44DB1B30.1030200@gmail.com>
	<fb6fbf560608100713i55839644mcf6806ca4b6a6e98@mail.gmail.com>
Message-ID: <20060810164518.EF5E.SLAWEK@cs.lth.se>

On Thu, 10 Aug 2006 10:13:14 -0400
Jim Jewett <jimjjewett at gmail.com> wrote:

#> >    seq = seq + pad * (min_length- len(seq))
#> 
#> Typically, if I need to pad a sequence to a minimum length, I really
#> need it to be a specific length.  Having it already be too long is
#> likely to cause problems later.  So I really do prefer the explicit
#> version.

Well, for whatever it is worth, if I pad the data to present it in a
readable form I *most definitely* do not want values to become
truncated just because they turn out to be bigger than I originally
expected.

An ugly result is worse than a nice result, but still better than a
wrong result.

-- 
 Best wishes,
   Slawomir Nowaczyk
     ( Slawomir.Nowaczyk at cs.lth.se )

All I want is a warm bed, and a kind word, and unlimited power.


From guido at python.org  Thu Aug 10 19:50:24 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 10 Aug 2006 10:50:24 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
Message-ID: <ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>

I worry that this may be too ambitious to add to the already
significant load for the Py3k project. You've seen my timeline --
alpha in early 07, final a year later.

Don't get me wrong! I think that completely changing the FFI paradigm
(as opposed to evolutionary changes to the existing C API, which py3k
is doing) is a very worthy project, but I'd rather conceive it as
something orthogonal to the py3k transition. It doesn't have to wait
for py3k, nor should py3k have to wait for it. Tying too many projects
together in terms of mutual dependencies is a great way to cause total
paralysis.

--Guido

On 8/9/06, Paul Prescod <paul at prescod.net> wrote:
> Thanks for everyone who contributed. It seems that the emerging consensus
> (bar a security question from Guido) is that ctypes it the way forward for
> calling C code in Python 3000. I'd like to clarify what this might mean:
>
> 1. Is ctypes and pure python fast enough for most real-world extension
> modules like PyOpenGL, PyExpat, Tkinter, and socket programming? I know that
> experimentation is ongoing. Are any results in?
>
>  2. If not, will Python 3000's build or runtime system use some kind of
> optimization technique such as static compilation ( e.g. extcompiler[1]) or
> JIT compilation to allow parts of its library (especially new parts) to be
> written using ctypes instead of C?
>
>  3. Presuming that the performance issue can be worked out one way or
> another, are there arguments in favour of interpreter-specific C-coded
> extensions other than those doing explicitly interpreter-specific stuff (
> e.g. tweaking the GC).
>
>  4. Will the Python 3000 standard library start to migrate towards ctypes
> (for new extensions)?
>
>  Paul Prescod
>
> [1]
> http://codespeak.net/pypy/dist/pypy/doc/extcompiler.html
>
>
>
>
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe:
> http://mail.python.org/mailman/options/python-3000/guido%40python.org
>
>
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Thu Aug 10 20:05:51 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 10 Aug 2006 11:05:51 -0700
Subject: [Python-3000] Range literals
In-Reply-To: <20060808104049.E709.JCARLSON@uci.edu>
References: <44D8C154.9020406@acm.org> <20060808104049.E709.JCARLSON@uci.edu>
Message-ID: <ca471dc20608101105o5f4c7f00w3144e1a1d23b33fe@mail.gmail.com>

I haven't changed my mind. Do you really want to add atrocities such
as having both .. and ... in the language where one includes the end
point and the other excludes it? How would a casual user remember
which is which?

--Guido

On 8/8/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> Talin <talin at acm.org> wrote:
> >
> > I've seen some languages that use a double-dot (..) to mean a range of
> > items. This could be syntactic sugar for range(), like so:
> >
> >
> >     for x in 1..10:
> >        ...
>
> In the pronouncement on PEP 284: http://www.python.org/dev/peps/pep-0284/
>
>     Guido did not buy the premise that the range() format needed fixing,
>     "The whole point (15 years ago) of range() was to *avoid* needing syntax
>     to specify a loop over numbers. I think it's worked out well and there's
>     nothing that needs to be fixed (except range() needs to become an
>     iterator, which it will in Python 3.0)."
>
> Unless Guido has decided that range/xrange are the wrong way to do
> things, I don't think there is much discussion here.
>
>  - Josiah
>
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Thu Aug 10 20:40:46 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 10 Aug 2006 11:40:46 -0700
Subject: [Python-3000] Rounding in Py3k
In-Reply-To: <eav8nk$vji$1@sea.gmane.org>
References: <44D1F304.4020700@iinet.net.au> <44D2A81D.2050204@canterbury.ac.nz>
	<eaumbe$b3m$1@sea.gmane.org> <44D3124A.6010300@canterbury.ac.nz>
	<eav8nk$vji$1@sea.gmane.org>
Message-ID: <ca471dc20608101140s55c6de25vcf538a669a9d3232@mail.gmail.com>

On 8/4/06, Ron Adam <rrr at ronadam.com> wrote:
> But that doesn't explain why int, long, and float, don't have other
> non-magic methods.
>
> I'm not attempting taking sides for or against either way, I just want
> to understand the reasons as it seems like by knowing that, the correct
> way to do it would be clear, instead of trying to wag the dog by the
> tail if you know what I mean.

I'm probably the source of this convention. For numbers, I find foo(x)
more readable than x.foo(), mostly because of the longstanding
tradition in mathematics to write things like f(x) and sin(x).

Originally I had extended the same convention to strings; but over
time it became clear that there was a common set of operations on
strings that were so fundamental that having to import a module to use
them was a mistake, and there were too many to make them all
built-ins. (I didn't insist on not using methods/attributes for
complex, since I was already used to seeing z.re and z.im in
Algol-68).

I'm not convinced that there are enough common operations on the
standard numbers to change my mind now. I'd rather see the built-in
round() use a new protocol __round__() than switching to a round()
method on various numbers; this should hopefully make it possible to
use round() on Decimal instances.

A question is what the API for __round__() should be. It seems Decimal
uses a different API than round(). Can someone think about this more
and propose a unified and backwards compatible solution?
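
One possible shape: a round() built on a __round__ slot, sketched below.
(The dispatching round_ here is hypothetical, not an existing API at the
time of this discussion; it relies on Decimal implementing __round__, which
modern Python does provide.)

```python
from decimal import Decimal

def round_(x, ndigits=None):
    """A round() that dispatches through a hypothetical __round__ protocol."""
    rounder = getattr(type(x), "__round__", None)
    if rounder is None:
        raise TypeError("type %s does not define __round__" % type(x).__name__)
    return rounder(x) if ndigits is None else rounder(x, ndigits)

# Decimal rounds decimally (half-even); floats keep their binary caveats.
print(round_(Decimal("2.675"), 2))  # 2.68
print(round_(2.675, 2))             # 2.67 -- 2.675 isn't exactly representable
```

With such a protocol, round() works on Decimal without the built-in having
to know anything about the decimal module.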

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From tomerfiliba at gmail.com  Thu Aug 10 21:14:27 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Thu, 10 Aug 2006 21:14:27 +0200
Subject: [Python-3000] threading, part 2
Message-ID: <1d85506f0608101214g594d2dal282ab2ae60f29f11@mail.gmail.com>

[Tim]
> Me too, although it won't stay that simple, and I'm clear as mud on
> how implementations other than CPython could implement this.

[Guido]
> Another good reason to keep it accessible from the C API only. Now I'm
> -0 on adding it. I suggest that if someone really wants this
> accessible from Python, they should research how Jython, IronPython,
> PyPy and Stackless could handle this, and report their research in a
> PEP.

then how does interrupt_main work? is it implementation-agnostic?

-----
>>> import thread
>>> help(thread.interrupt_main)
Help on built-in function interrupt_main in module thread:

interrupt_main(...)
    interrupt_main()

    Raise a KeyboardInterrupt in the main thread.
    A subthread can use this function to interrupt the main thread.
-----

just let me raise arbitrary exceptions (don't limit it to KeyboardInterrupt)
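
for reference, interrupt_main() is easy to exercise in CPython (the module
is spelled _thread in Python 3; the interrupt is only delivered when the
main thread is between bytecodes):

```python
import _thread
import threading
import time

def interrupter():
    time.sleep(0.2)
    _thread.interrupt_main()   # asks CPython to raise KeyboardInterrupt in main

threading.Thread(target=interrupter).start()

interrupted = False
deadline = time.time() + 5     # bound the wait so the demo can't hang
try:
    while time.time() < deadline:
        time.sleep(0.05)       # the interrupt lands between bytecodes
except KeyboardInterrupt:
    interrupted = True
print(interrupted)  # True
```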



-tomer

From tim.peters at gmail.com  Thu Aug 10 23:40:59 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Thu, 10 Aug 2006 17:40:59 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608092114j43071728nbd660d182d065316@mail.gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>
	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>
	<44D9BCB4.5010404@gmail.com>
	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>
	<1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>
	<ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>
	<1f7befae0608091838u594de27ctb83dd0845ccaa0@mail.gmail.com>
	<ca471dc20608092114j43071728nbd660d182d065316@mail.gmail.com>
Message-ID: <1f7befae0608101440i30590f4dv2740e584f801881c@mail.gmail.com>

[back and forth on PyThreadState_SetAsyncExc(), and
 the 2-year old discussion in

 http://www.python.org/sf/1069160
]

[Tim]
>> [still-current deadlock & refcount issues not fixed at the time]

[Guido]
> So why didn't we check that in?

The shallow answer is that you closed the report without checking it
in, so ask a mirror ;-)  The real answer seems to be that nobody
(including me) really cared about this function, since it's both
unused and untested in the core, and there were no known uses from
anyone's C extensions either.

[on adding it to the language]

>>>>> +0

>>>> Me too, although ... I'm clear as mud on how implementations other
>>>> than CPython could implement this.

>>> Now I'm -0 on adding it. I suggest that if someone really wants this
>>> accessible from Python, they should research how Jython, IronPython,
>>> PyPy and Stackless could handle this, and report their research in a
>>> PEP.

>> As a full-blown language feature, I'm -1 unless that work is done
>> first.  I'm still +0 on adding it to CPython if it's given a
>> leading-underscore name and docs to make clear that it's a
>> CPython-specific hack that may never work under any other
>> implementation.

> Fine with me then. In 2.5? 2.6? Or py3k? (This is the py3k list.)

Since the 2.5 beta series is supposedly done with, I strongly doubt
Anthony wants to see a new feature snuck into 2.5c1.  Someone who
wants it enough could target 2.6.  I'm only +0, so I'd do that only if
someone wants it enough to pay for it.  For 2.5, I'll check in the
anal correctness changes, add a ctypes-based test case, and reword the
docs to stop warning about a return value > 1 (all those are just
fixing what's going to be in 2.5 anyway).

From guido at python.org  Fri Aug 11 01:17:52 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 10 Aug 2006 16:17:52 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <1cb725390608101319j19731f91vfc472d9113a03ccf@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
	<ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>
	<1cb725390608101319j19731f91vfc472d9113a03ccf@mail.gmail.com>
Message-ID: <ca471dc20608101617p2e6d13e7k2b1e96c7f23746e2@mail.gmail.com>

(Adding python-3000 back to the CC: list.)

On 8/10/06, Paul Prescod <paul at prescod.net> wrote:
> The only reason to tie it to Py3K is because Py3K is breaking APIs anyhow.
> It will be in the overlap period between Py3K and Py2x that the need for an
> abstraction will be most acute. Otherwise extensions will probably end up
> with a lot of #ifdef py3k #else etc.
>
> It isn't clear how ambitious or not this is until we drill in. For example,
> if pure "ctypes" is sufficiently efficient for 90% of all extensions, then
> moving in this direction for Py3K might require nothing more than a
> declaration from you that new extensions should be written using ctypes
> instead of the PyObject APIs unless there is a very good reason. After all,
> people will take their cue from you as to what sort of coding convention is
> appropriate for the standard library. Is this first step doable? Just a
> declaration that (with a few exceptions) ctypes is preferable to C code for
> new extensions?
>
> But if that's totally unreasonable because ctypes is seldom performant
> enough then the project gets more ambitious because it would have to pull in
> extcompiler...

I don't know enough about ctypes, but assuming I have a reason to
write an extension in C (e.g. Tkinter, which uses the Tcl/Tk API), how
do I use ctypes to call things like PyDict_GetItem() or
PyErr_SetString()?
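
For what it's worth, ctypes can already call into the hosting CPython's C
API via ctypes.pythonapi. A minimal illustration (PyLong_FromLong is the
Python 3 spelling of PyInt_FromLong; declaring restype as py_object makes
ctypes hand the new reference back as an ordinary Python object):

```python
import ctypes

api = ctypes.pythonapi   # the C API of the running CPython interpreter

# PyLong_FromLong(long) -> new reference.
api.PyLong_FromLong.restype = ctypes.py_object
api.PyLong_FromLong.argtypes = [ctypes.c_long]

print(api.PyLong_FromLong(42))  # 42
```

Borrowed-reference functions like PyDict_GetItem() need more care, since
ctypes assumes a py_object result is a new reference.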

--Guido

> On 8/10/06, Guido van Rossum <guido at python.org> wrote:
> > I worry that this may be too ambitious to add to the already
> > significant load for the Py3k project. You've seen my timeline --
> > alpha in early 07, final a year later.
> >
> > Don't get me wrong! I think that completely changing the FFI paradigm
> > (as opposed to evolutionary changes to the existing C API, which py3k
> > is doing) is a very worthy project, but I'd rather conceive it as
> > something orthogonal to the py3k transition. It doesn't have to wait
> > for py3k, nor should py3k have to wait for it. Tying too many projects
> > together in terms of mutual dependencies is a great way to cause total
> > paralysis.
> >
> > --Guido
> >
> > On 8/9/06, Paul Prescod <paul at prescod.net> wrote:
> > > Thanks for everyone who contributed. It seems that the emerging
> consensus
> > > (bar a security question from Guido) is that ctypes it the way forward
> for
> > > calling C code in Python 3000. I'd like to clarify what this might mean:
> > >
> > > 1. Is ctypes and pure python fast enough for most real-world extension
> > > modules like PyOpenGL, PyExpat, Tkinter, and socket programming? I know
> that
> > > experimentation is ongoing. Are any results in?
> > >
> > >  2. If not, will Python 3000's build or runtime system use some kind of
> > > optimization technique such as static compilation ( e.g. extcompiler[1])
> or
> > > JIT compilation to allow parts of its library (especially new parts) to
> be
> > > written using ctypes instead of C?
> > >
> > >  3. Presuming that the performance issue can be worked out one way or
> > > another, are there arguments in favour of interpreter-specific C-coded
> > > extensions other than those doing explicitly interpreter-specific stuff
> (
> > > e.g. tweaking the GC).
> > >
> > >  4. Will the Python 3000 standard library start to migrate towards
> ctypes
> > > (for new extensions)?
> > >
> > >  Paul Prescod
> > >
> > > [1]
> > >
> http://codespeak.net/pypy/dist/pypy/doc/extcompiler.html
> > >
> > >
> > >
> > >
> > > _______________________________________________
> > > Python-3000 mailing list
> > > Python-3000 at python.org
> > > http://mail.python.org/mailman/listinfo/python-3000
> > > Unsubscribe:
> > >
> http://mail.python.org/mailman/options/python-3000/guido%40python.org
> > >
> > >
> > >
> >
> >
> > --
> > --Guido van Rossum (home page: http://www.python.org/~guido/)
> >
>
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Fri Aug 11 01:21:02 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 10 Aug 2006 16:21:02 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <1d85506f0608101214g594d2dal282ab2ae60f29f11@mail.gmail.com>
References: <1d85506f0608101214g594d2dal282ab2ae60f29f11@mail.gmail.com>
Message-ID: <ca471dc20608101621j557f735cs10f4f491eb3b2ee5@mail.gmail.com>

On 8/10/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> [Tim]
> > Me too, although it won't stay that simple, and I'm clear as mud on
> > how implementations other than CPython could implement this.
>
> [Guido]
> > Another good reason to keep it accessible from the C API only. Now I'm
> > -0 on adding it. I suggest that if someone really wants this
> > accessible from Python, they should research how Jython, IronPython,
> > PyPy and Stackless could handle this, and report their research in a
> > PEP.
>
> then how does interrupt_main work? is it implementation-agnostic?

I expect that Jython doesn't implement this; it doesn't handle ^C either AFAIK.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul at prescod.net  Fri Aug 11 01:45:00 2006
From: paul at prescod.net (Paul Prescod)
Date: Thu, 10 Aug 2006 16:45:00 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <ca471dc20608101617p2e6d13e7k2b1e96c7f23746e2@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
	<ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>
	<1cb725390608101319j19731f91vfc472d9113a03ccf@mail.gmail.com>
	<ca471dc20608101617p2e6d13e7k2b1e96c7f23746e2@mail.gmail.com>
Message-ID: <1cb725390608101645g3a9db04dhcf76cfd03e3a15fc@mail.gmail.com>

Sorry for the cc mistake.


I don't know enough about ctypes, but assuming I have a reason to
> write an extension in C (e.g. Tkinter, which uses the Tcl/Tk API), how
> to I use ctypes to call things like PyDict_GetItem() or
> PyErr_SetString()?


There are two answers to your question. The simplest is that if you have a
dict object called "foo", you just write 'foo["abc"]'. It's just Python. Same
for the other one: you'd just use 'raise'.

Ctypes inverts the model of the standard extension API. You're writing
in Python, so the Python stuff is straightforward (it's just Python) and the
C stuff is a bit weird. So if you had to populate a Python dictionary from a
C struct, it is reading from the C struct that takes a bit of doing;
writing the Python dictionary is straightforward.

If there were a reason to call PyDict_GetItem directly (performance,
maybe?), that's possible. You need to set up the function prototype (which
you would probably do in a helper library) and then you just call
PyDict_GetItem. Ctypes would coerce the types; py_object is a native ctypes
data type.

So I think it ends up looking like

from PythonConvenienceFunctions import PyDict_GetItem

obj = {}
key = "Guido"

rc = PyDict_GetItem(obj, key)

I'm sure an expert will correct me if I'm wrong...
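
As a concrete sketch (guesswork on my part, so the usual caveats apply):
ctypes exposes the interpreter's own C API as ctypes.pythonapi, so the
convenience module might boil down to something like:

```python
import ctypes
from ctypes import py_object

# Declare the prototype once; a PythonConvenienceFunctions module
# would hide this part.
PyDict_GetItem = ctypes.pythonapi.PyDict_GetItem
PyDict_GetItem.restype = py_object   # NB: the C API returns a *borrowed* reference
PyDict_GetItem.argtypes = [py_object, py_object]

obj = {"Guido": "BDFL"}
rc = PyDict_GetItem(obj, "Guido")
print(rc)
```

Note the borrowed-reference caveat: ctypes knows nothing about the C API's
reference-counting conventions, so anything beyond a quick lookup needs care.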

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060810/d4b50a3e/attachment.html 

From paul at prescod.net  Fri Aug 11 01:57:59 2006
From: paul at prescod.net (Paul Prescod)
Date: Thu, 10 Aug 2006 16:57:59 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <1cb725390608101645g3a9db04dhcf76cfd03e3a15fc@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
	<ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>
	<1cb725390608101319j19731f91vfc472d9113a03ccf@mail.gmail.com>
	<ca471dc20608101617p2e6d13e7k2b1e96c7f23746e2@mail.gmail.com>
	<1cb725390608101645g3a9db04dhcf76cfd03e3a15fc@mail.gmail.com>
Message-ID: <1cb725390608101657x447df09cm888228b31e424a87@mail.gmail.com>

And if you're curious about how to use ctypes without all of the helper
functions set up for you, then I guess it is easiest to poke around the
documentation for code samples.


>>> from ctypes import cdll, c_char_p, c_int, c_double
>>> printf = cdll.msvcrt.printf  # the tutorial's example uses msvcrt's printf
>>> printf.argtypes = [c_char_p, c_char_p, c_int, c_double]
>>> printf("String '%s', Int %d, Double %f\n", "Hi", 10, 2.2)
String 'Hi', Int 10, Double 2.200000
37
>>>


>>> from ctypes import c_int, WINFUNCTYPE, windll
>>> from ctypes.wintypes import HWND, LPCSTR, UINT
>>> prototype = WINFUNCTYPE(c_int, HWND, LPCSTR, LPCSTR, UINT)
>>> paramflags = ((1, "hwnd", 0), (1, "text", "Hi"),
...               (1, "caption", None), (1, "flags", 0))
>>> MessageBox = prototype(("MessageBoxA", windll.user32), paramflags)

It's ugly but in the typical case you would hide all of the declarations in
a module (maybe an auto-generated module) and just focus on your logic:

>>> MessageBox()
>>> MessageBox(text="Spam, spam, spam")
>>> MessageBox(flags=2, text="foo bar")


 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060810/07616e7c/attachment.htm 

From guido at python.org  Fri Aug 11 02:56:47 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 10 Aug 2006 17:56:47 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <1cb725390608101645g3a9db04dhcf76cfd03e3a15fc@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
	<ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>
	<1cb725390608101319j19731f91vfc472d9113a03ccf@mail.gmail.com>
	<ca471dc20608101617p2e6d13e7k2b1e96c7f23746e2@mail.gmail.com>
	<1cb725390608101645g3a9db04dhcf76cfd03e3a15fc@mail.gmail.com>
Message-ID: <ca471dc20608101756y65048ee8v501ee8955d1f70dc@mail.gmail.com>

On 8/10/06, Paul Prescod <paul at prescod.net> wrote:
> > I don't know enough about ctypes, but assuming I have a reason to
> > write an extension in C (e.g. Tkinter, which uses the Tcl/Tk API), how
> > do I use ctypes to call things like PyDict_GetItem() or
> > PyErr_SetString()?
>
> There are two answers to your question. The simplest is that if you have a
> dict object called "foo" you just call 'foo["abc"]'. It's just Python. Same
> for the other one: you'd just call 'raise'.

That doesn't make sense if you want to write your extension in C.
Surely you don't propose to rewrite all of tkinter.c in Python? That
would be insane. Or Numeric? That would kill performance.

> Ctypes is the opposite model of the standard extension stuff. You're writing
> in Python so Python stuff is straightforward (just Python) and C stuff is a
> bit weird. So if you had to populate a Python dictionary from a C struct
> then it is the reading from the C struct that takes a bit of doing. The
> writing the Python dictionary is straightforward.
>
> If there was a reason to call PyDict_GetItem directly (performance maybe???)
> then that's possible. You need to set up the function prototype (which you
> would probably do in a helper library) and then you just call
> PyDict_GetItem. CTypes would coerce the types. py_object is a native data
> type.
>
> So I think it ends up looking like
>
> from PythonConvenienceFunctions import PyDict_GetItem
>
> obj = {}
> key = "Guido"
>
> rc = PyDict_GetItem(obj, key)
>
> I'm sure an expert will correct me if I'm wrong...

I guess I object against the idea that we have to write all extensions
in Python using ctypes for all C calls. This is okay if there's
relatively little interaction with C code. It's insane if you're doing
serious C code. And what about C++ extensions?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From greg.ewing at canterbury.ac.nz  Fri Aug 11 03:47:38 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 11 Aug 2006 13:47:38 +1200
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
Message-ID: <44DBE1BA.6000204@canterbury.ac.nz>

Paul Prescod wrote:
> It seems that the emerging 
> consensus (bar a security question from Guido) is that ctypes is the way 
> forward for calling C code in Python 3000. I'd like to clarify what this 
> might mean:

What's the state of play concerning ctypes support
on non-x86 platforms?

Until ctypes is uniformly supported on all platforms,
it can't be considered a complete replacement for
C-coded extensions (whether handwritten or generated
by something else).

--
Greg

From lcaamano at gmail.com  Fri Aug 11 05:01:45 2006
From: lcaamano at gmail.com (Luis P Caamano)
Date: Thu, 10 Aug 2006 23:01:45 -0400
Subject: [Python-3000] threading, part 2
Message-ID: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>

Yes, I also wonder how non-CPython implementations would handle this,
but I'd just like to say that this feature, making a thread raise a
specific exception in another thread asynchronously, is a very useful
one.

We have a subsystem that schedules requests, each dispatched in its own
thread.  The only way to cancel one of those requests right now is via
a cooperative checking method in which we explicitly make calls
throughout the code to see if the request has been canceled, in which
case the check raises an exception that triggers cleanup and
cancellation.

Problem is, we have to spread check calls all over the place.  All this
would be a lot easier if we could do thread.terminate() as proposed,
especially for new code.


On 8/9/06, "Guido van Rossum" wrote:
> On 8/9/06, Tim Peters <tim.peters at gmail.com> wrote:
> > [Nick Coghlan]
> > >> That check is already there:
> > >>
> > >> int PyThreadState_SetAsyncExc(  long id, PyObject *exc)
> > >>      Asynchronously raise an exception in a thread. The id argument is the
> > >> thread id of the target thread; exc is the exception object to be raised. This
> > >> function does not steal any references to exc. To prevent naive misuse, you
> > >> must write your own C extension to call this. Must be called with the GIL
> > >> held. Returns the number of thread states modified; if it returns a number
> > >> greater than one, you're in trouble, and you should call it again with exc set
> > >> to NULL to revert the effect. This raises no exceptions. New in version 2.3.
> >
> > Guido, do you have any idea now what the "number greater than one"
> > business is about?  That would happen if and only if we found more
> > than one thread state with the given thread id in the interpreter's
> > list of thread states, but we're counting those with both the GIL and
> > the global head_mutex lock held.  My impression has been that it would
> > be an internal logic error if we ever saw this count exceed 1.
>
> Right, I think that's it. I guess I was in a grumpy mood when I wrote
> this (and Just & Alex never ended up using it!).
>
> > While I'm at it, I expect:
> >
> >                 Py_CLEAR(p->async_exc);
> >                 Py_XINCREF(exc);
> >                 p->async_exc = exc;
> >
> > is better written:
> >
> >                 Py_XINCREF(exc);
> >                 Py_CLEAR(p->async_exc);
> >                 p->async_exc = exc;
> >
> > for the same reason one should always incref B before decrefing A in
> >
> >     A = B
> >
> > ...
>
> That reason that A and B might already be the same object, right?
>
> > >> All Tober is really asking for is a method on threading.Thread objects that
> > >> uses this existing API to set a builtin ThreadExit exception. The thread
> > >> module would consider a thread finishing with ThreadExit to be
> > >> non-exceptional, so you could easily do:
> > >>
> > >>    th.terminate() # Raise ThreadExit in th's thread of control
> > >>    th.join() # Should finish up pretty quickly
> > >>
> > >> Proper resource cleanup would be reliant on correct use of try/finally or with
> > >> statements, but that's the case regardless of whether or not asynchronous
> > >> exceptions are allowed.
> >
> > [Guido]
> > > I'm +0 on this.
> >
> > Me too, although it won't stay that simple, and I'm clear as mud on
> > how implementations other than CPython could implement this.
>
> Another good reason to keep it accessible from the C API only. Now I'm
> -0 on adding it. I suggest that if someone really wants this
> accessible from Python, they should research how Jython, IronPython,
> PyPy and Stackless could handle this, and report their research in a
> PEP.
>
> --
> --Guido van Rossum (home page: http://www.python.org/~guido/)
>
>


-- 
Luis P Caamano
Atlanta, GA USA

From greg.ewing at canterbury.ac.nz  Fri Aug 11 05:48:48 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 11 Aug 2006 15:48:48 +1200
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <1cb725390608101645g3a9db04dhcf76cfd03e3a15fc@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
	<ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>
	<1cb725390608101319j19731f91vfc472d9113a03ccf@mail.gmail.com>
	<ca471dc20608101617p2e6d13e7k2b1e96c7f23746e2@mail.gmail.com>
	<1cb725390608101645g3a9db04dhcf76cfd03e3a15fc@mail.gmail.com>
Message-ID: <44DBFE20.7040900@canterbury.ac.nz>

Another thought about ctypes: What if you want to pass
a Python function into C as a callback? Does ctypes
have a way of handling that?

-- 
Greg Ewing, Computer Science Dept, +--------------------------------------+
University of Canterbury,	   | Carpe post meridiem!          	  |
Christchurch, New Zealand	   | (I'm not a morning person.)          |
greg.ewing at canterbury.ac.nz	   +--------------------------------------+

From talin at acm.org  Fri Aug 11 15:10:31 2006
From: talin at acm.org (Talin)
Date: Fri, 11 Aug 2006 06:10:31 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608091958u2d00db76s48260853942bed32@mail.gmail.com>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>	<44DA9E54.5020105@canterbury.ac.nz>
	<43aa6ff70608091958u2d00db76s48260853942bed32@mail.gmail.com>
Message-ID: <44DC81C7.1070905@acm.org>

Collin Winter wrote:
> The idea is that each developer can pick the notation/semantics that's
> most natural to them. I'll go even further: say one library offers a
> semantics you find handy for task A, while another library's ideas
> about type annotations are best suited for task B. Without a single
> standard, you're free to mix and match these libraries to give you a
> combination that allows you to best express the ideas you're going
> for.

Let me tell you a story.

Once upon a time, there was a little standard called Midi (Musical
Instrument Digital Interface). The Midi standard was small and
lightweight, containing less than a dozen commands of 2-3 bytes each.
However, they realized that they needed a way to allow hardware vendors
to add their own custom message types, so they created a special message
type called "System Exclusive Message" or SysEx for short. The idea is
that you would send a 3-byte manufacturer ID, and then any subsequent
bytes would be considered to be in a vendor-specific format. The MMA
(Midi Manufacturers Association) did not provide any guidelines or
suggestions as to what the format of those bytes should be - it would be
completely up to the vendors to decide what the format of their system
exclusive message would be.

Since the Midi standard did not define a way to save and load the
instrument's memory, vendors typically would use the SysEx message to
allow a "bulk dump" of patch information - essentially it was a way to
access the instrument's internal state of sounds, programs, sequences,
and so on.

This would have worked fine, except for the fact that the vendors and
the MMA were not the only stakeholders. Just about this time (mid-80s)
there began to rise a new type of music company: companies like Mark of
the Unicorn, Steinberg Audio and Blue Ribbon Soundworks that created
professional music software for personal computers. Some companies made
sequencer programs that would allow you to enter musical scores on the
computer screen and play them back through your Midi instrument. Other
companies worked on a different type of product - a "Universal
Librarian", essentially a computer program which would store all of your
patches and sound programs for all your different instruments.

In 1987 I created a program for the Amiga called Music-X, which was a
combination of sequencer and Universal Librarian. In order to create the
librarian module, I needed to get information about all of the various
vendor-specific protocols

    Interrupt - as I was typing this last sentence, I knocked over my
    glass of ice water onto my Powerbook G4, completely toasting the
    motherboard and damaging the display. 24 hours, and $2700 later, I
    have completed my "forced upgrade" and can now continue this posting.
    Lesson to be learned: Internet rants and prescription pain meds do
    not mix! Be warned!

...which was not that difficult, since most of the vendors would include 
an appendix in the back of the user's manual (generally written in very 
bad English) describing the SysEx protocol for that device. I was also 
able to get my hands on "The Big Midi Book of SysEx protocols", which 
was essentially a xerox of all of these various appendices, bound up 
in book form and sold commercially.

At the time there were approximately 150 registered vendor IDs, but my 
idea was that I wouldn't have to implement every protocol - I figured, 
since all I wanted to do was load and store the resulting information, I 
didn't really need to *interpret* the data, I just needed to store it. 
Of course, I would need to interpret any transport-layer instructions 
(commands, block headers, checksums and so on), since a lot of 
instruments sent their "data dumps" as multiple SysEx messages which 
would need to be stored together.

But I figured, since I was only supporting two vendor-specific commands 
for each vendor - bulk dump and bulk load - how different can they all 
be? Sure, there were likely to be individual variations on how things 
were done, but I could solve that by creating a per-instrument 
"personality file" - essentially a set of parameters which would tweak 
the behavior of my transport module. So for example, one parameter would 
indicate the type of checksum algorithm to be used, the second would 
indicate the number of checksum bytes, and so on.

For instruments that I couldn't borrow to test, I would rely on my users 
to fill in the holes (Ah, the heady optimism of the early days of the 
computer revolution!) and I would then add the user-contributed 
parameters to each update of the product.

I think by now you can start to see where this all goes wrong.

I started with a small set of 3 instruments, each from a different 
manufacturer. I analyzed their bulk data protocols, and came up with an 
abstract model that encompassed all of them as a superset. Then I added 
a 4th synth, only to discover that its bulk dump protocol was completely 
different than the previous three, and so my model had to be rebuilt 
from scratch. No problem, I thought, 3 is too small a sample size 
anyway. Then I added a 5th synth, and the same thing happened. And a 
6th. And so on.

For example, every vendor I investigated used a *completely different* 
algorithm for computing checksums. Some used CRCs, some did simple 
addition, others used XOR - and some had odd ideas of *which* bytes 
should be checksummed. Some of the algorithms were really bad too.

Different vendors also used different byte encodings. Because Midi is 
designed to work in an environment where cables can be unplugged at any 
moment, and because all other Midi messages (other than SysEx) were at 
most 3 bytes long, the Midi standard required that only 7 bits of each 
byte could be used to carry data, the 8th bit was reserved for a "start 
of new message" flag.

Different vendors adapted to this challenge with surprising creativity. 
Some would simply slice the whole dump into units of 7 bits each, 
crossing the normal byte boundaries. Some would only send 4 bits per 
Midi Byte. Some did things like: For each 7 bytes of input data, send 
the bottom 7 bits of each input byte as the first 7 bytes, and then send 
an 8th byte containing the missing top-bits from the first seven. And 
then there were those clever manufacturers who simply decided to design 
their instruments so that no control parameter could have a magnitude 
greater than 127.
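
(That last 7-bytes-plus-top-bits scheme, at least, is easy enough to state 
in code; here's a from-memory sketch, so take the details with a grain of 
salt:)

```python
def pack_7in8(data):
    # For each group of seven input bytes, send their low 7 bits,
    # followed by one byte collecting the seven stripped high bits.
    out = []
    for i in range(0, len(data), 7):
        chunk = data[i:i + 7]
        tops = 0
        for j, b in enumerate(chunk):
            out.append(b & 0x7F)         # keep the data in 7-bit Midi bytes
            tops |= ((b >> 7) & 1) << j  # remember the stripped high bit
        out.append(tops)
    return out

print(pack_7in8([0x80, 0x01]))  # [0, 1, 1]
```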

Another example of variation was in timing. Roland machines (of certain 
models) were notorious for rejecting messages if they were sent too fast 
- you had to wait at least 20 ms from the time you received a message to 
the time you sent the response. Others would "time out" if you waited 
too long.

There were half-duplex and full-duplex, stateless and stateful 
protocols, and I could go on. The point is, that there was no way for me 
to come up with some sort of algorithmic way to describe all of these 
protocols - the only way to do was in code, with a separate 
implementation for each and every protocol. Nowadays, I'd simply embed 
Python into the program and make each personality file a Python script, 
but I didn't have that option back then. I toyed around with the idea of 
inventing a custom scripting language specifically for representing dump 
protocols, but the idea was infeasible at the time.

So, if you have had the patience to read through this long-winded 
anecdote and are wondering how in the hell this relates to Collin's 
question, I can sum it up in a very short motto (and potential QOTW):

   "Never question the creative power of an infinite number of monkeys."

Or to put it another way: If you create a tool, and you assume that tool 
will only be used in certain specific ways, but you fail to enforce that 
limitation, then your assumption will be dead wrong. The idea that there 
will only be a few type annotation providers who will all nicely 
cooperate with one another is just as naive as I was in the SysEx debacle.

I'll have more focused things to say about this later, but I need to 
rest. (Had to get that out before all the rant energy dissipated.)

-- Talin

From krstic at solarsail.hcs.harvard.edu  Fri Aug 11 08:44:56 2006
From: krstic at solarsail.hcs.harvard.edu (Ivan Krstic)
Date: Fri, 11 Aug 2006 02:44:56 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <44DB165B.2040901@gmail.com>
References: <1d85506f0608081122r30f89973paf450514b00dcc92@mail.gmail.com>	<fb6fbf560608081231x179bbbd1y9fd06af48cc64e84@mail.gmail.com>	<44D9BCB4.5010404@gmail.com>	<ca471dc20608091153m7cc60a68yfc3f002519e93795@mail.gmail.com>	<1f7befae0608091248q7f328875x7c2d03723acbf8d2@mail.gmail.com>	<ca471dc20608091339g46ebced6y18c7a218678fb6d0@mail.gmail.com>	<1f7befae0608091838u594de27ctb83dd0845ccaa0@mail.gmail.com>	<ca471dc20608092114j43071728nbd660d182d065316@mail.gmail.com>
	<44DAC4F6.3010002@solarsail.hcs.harvard.edu>
	<44DB165B.2040901@gmail.com>
Message-ID: <44DC2768.7060009@solarsail.hcs.harvard.edu>

Nick Coghlan wrote:
> Given the time frame, I think you might be stuck with using ctypes to
> get at the functionality for Python 2.5. 

That's probably no worse a way to do it than calling an underscored
CPython function; I keep forgetting we're getting out-of-the-box ctypes
goodness in 2.5.
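
Something along these lines, presumably (a sketch only; the name and the
error handling are mine, following the C API docs quoted earlier in the
thread):

```python
import ctypes
import threading

def async_raise(tid, exc_type):
    # Sketch: asynchronously raise exc_type in the thread with id tid.
    res = ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_ulong(tid), ctypes.py_object(exc_type))
    if res == 0:
        raise ValueError("invalid thread id")
    elif res > 1:
        # More than one thread state modified: revert, per the docs.
        ctypes.pythonapi.PyThreadState_SetAsyncExc(ctypes.c_ulong(tid), None)
        raise SystemError("PyThreadState_SetAsyncExc affected %d threads" % res)
```

The exception only shows up the next time the target thread executes
bytecode, so a thread blocked in a long C call won't notice it until the
call returns.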

-- 
Ivan Krstic <krstic at solarsail.hcs.harvard.edu> | GPG: 0x147C722D

From theller at python.net  Fri Aug 11 08:58:51 2006
From: theller at python.net (Thomas Heller)
Date: Fri, 11 Aug 2006 08:58:51 +0200
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <44DBFE20.7040900@canterbury.ac.nz>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>	<ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>	<1cb725390608101319j19731f91vfc472d9113a03ccf@mail.gmail.com>	<ca471dc20608101617p2e6d13e7k2b1e96c7f23746e2@mail.gmail.com>	<1cb725390608101645g3a9db04dhcf76cfd03e3a15fc@mail.gmail.com>
	<44DBFE20.7040900@canterbury.ac.nz>
Message-ID: <ebh9rb$sr$1@sea.gmane.org>

Greg Ewing schrieb:
> Another thought about ctypes: What if you want to pass
> a Python function into C as a callback? Does ctypes
> have a way of handling that?
> 
Sure.  The tutorial has an example that calls qsort with a Python
comparison function:

http://docs.python.org/dev/lib/ctypes-callback-functions.html
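
In short, roughly like this (a sketch from memory, assuming a POSIX libc;
check the tutorial for the authoritative version):

```python
import ctypes

libc = ctypes.CDLL(None)  # POSIX; the tutorial uses cdll.msvcrt on Windows

# qsort wants int (*compar)(const void *, const void *);
# here the elements being compared are C ints.
CMPFUNC = ctypes.CFUNCTYPE(ctypes.c_int,
                           ctypes.POINTER(ctypes.c_int),
                           ctypes.POINTER(ctypes.c_int))

def py_cmp(a, b):
    return a[0] - b[0]    # dereference the element pointers

values = (ctypes.c_int * 5)(5, 1, 7, 33, 99)
libc.qsort(values, len(values), ctypes.sizeof(ctypes.c_int), CMPFUNC(py_cmp))
print(list(values))   # [1, 5, 7, 33, 99]
```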

Thomas


From theller at python.net  Fri Aug 11 09:10:01 2006
From: theller at python.net (Thomas Heller)
Date: Fri, 11 Aug 2006 09:10:01 +0200
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <44DBE1BA.6000204@canterbury.ac.nz>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
	<44DBE1BA.6000204@canterbury.ac.nz>
Message-ID: <ebhag8$4tu$1@sea.gmane.org>

Greg Ewing schrieb:
> Paul Prescod wrote:
>> It seems that the emerging 
>> consensus (bar a security question from Guido) is that ctypes is the way 
>> forward for calling C code in Python 3000. I'd like to clarify what this 
>> might mean:
> 
> What's the state of play concerning ctypes support
> on non-x86 platforms?

Pretty good, I would say.  Look, for example, at the buildbots.
Major architectures that are currently *not* supported:

- Linux/BSD/arm (because the libffi/arm doesn't support closures,
  although ctypes on WindowsCE/arm works)
- Windows/AMD64 (This is probably currently not a major platform.
  Sometimes I'm working on a port for this)
- I know that there are some problems on Solaris, although the solaris10/sparc
  buildbot does not report problems.

> Until ctypes is uniformly supported on all platforms,
> it can't be considered a complete replacement for
> C-coded extensions (whether handwritten or generated
> by something else).
> 
> --
> Greg

Thomas


From tomerfiliba at gmail.com  Fri Aug 11 09:33:00 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Fri, 11 Aug 2006 09:33:00 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608101621j557f735cs10f4f491eb3b2ee5@mail.gmail.com>
References: <1d85506f0608101214g594d2dal282ab2ae60f29f11@mail.gmail.com>
	<ca471dc20608101621j557f735cs10f4f491eb3b2ee5@mail.gmail.com>
Message-ID: <1d85506f0608110033k2eac1f9h10908ddbef5db8c3@mail.gmail.com>

[Guido]
> I expect that Jython doesn't implement this; it doesn't handle ^C either AFAIK.

thread support is platform-dependent to begin with (old unices, embedded
systems, etc. are not likely to have threads at all)

so keeping this in mind, and given that interrupt_main is part of the
standard thread API, which as you say may not be implementation-agnostic,
why is thread.raise_exc(id, excobj) a bad API?

and as i recall, dotNET's Thread.Abort (or whatever it's called)
works that way (raising an exception in the other thread), so IronPython,
for one, should be happy with it.

by the way, is the GIL part of the python standard? i.e., does IronPython
implement it, although it shouldn't be necessary in dotNET?


-tomer

From slawomir.nowaczyk.847 at student.lu.se  Fri Aug 11 12:48:32 2006
From: slawomir.nowaczyk.847 at student.lu.se (Slawomir Nowaczyk)
Date: Fri, 11 Aug 2006 12:48:32 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>
References: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>
Message-ID: <20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>

On Thu, 10 Aug 2006 23:01:45 -0400
Luis P Caamano <lcaamano at gmail.com> wrote:

#> Yes, I also wonder about how non-CPython implementations would handle
#> this but I'd just like to say that this feature, making a thread raise
#> a specific exception from another thread asynchronously is a very
#> useful feature.
#> 
#> We have a subsystem that schedules requests that are dispatched in a
#> thread each.  The only way to cancel one of those requests right now
#> is via a cooperative checking method in which we explicitly make calls
#> through out the code to see if the request has been canceled, and in
#> such case, the check raises an exception that triggers clean up and
#> cancellation.
#> 
#> Problem is we have to spread check calls all over the place.  All this
#> would be a lot easier if we could do thread.terminate() as proposed,
#> especially for new code.

"All over the place"? Literally? In other words, how likely is it that
your code would still be correct if you had this check after *every*
single statement? Or even more often -- every N bytecodes?

I believe that if asynchronous exception raising ever gets officially
approved, there absolutely *needs* to be a way to block it for a piece
of code that should execute atomically.

It is (more or less) OK to have an unofficial way to terminate the
thread, with "use at your own risk", because there are situations
where it is useful and (in a cooperative environment) a reasonably safe
thing to do. 

But it should not be done lightly and never when the code is not
specifically expecting it.

-- 
 Best wishes,
   Slawomir Nowaczyk
     ( Slawomir.Nowaczyk at cs.lth.se )

Live in the past and future only.


From pje at telecommunity.com  Fri Aug 11 17:32:55 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Fri, 11 Aug 2006 11:32:55 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <mailman.34014.1155280218.27774.python-3000@python.org>
Message-ID: <5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>

At 06:10 AM 8/11/2006 -0700, Talin <talin at acm.org> wrote:
>Or to put it another way: If you create a tool, and you assume that tool
>will only be used in certain specific ways, but you fail to enforce that
>limitation, then your assumption will be dead wrong. The idea that there
>will only be a few type annotation providers who will all nicely
>cooperate with one another is just as naive as I was in the SysEx debacle.

Are you saying that function annotations are a bad idea because we won't be 
able to pickle them?

If not, your entire argument seems specious.  Actually, even if that *is* 
your argument, it's specious, since all that's needed to support pickling 
is to support pickling.  All that's needed to support printing is to 
support printing (via __str__), and so on.

Thus, by a similar process of analogy, all that's needed to support any 
operation is to have an extensible mechanism by which the operation is 
defined, so that the operation can be extended to include new types -- 
i.e., an overloadable function, like pickle.dump.

Conversely, using your analogy, one could say that the iteration protocol 
is a bad idea because lots of people might then have to implement their own 
__iter__ methods.  We should thus only have a fixed set of sequence types!

In short, your argument is based on a false analogy and is nonsensical when 
moved out of the realm of on-the-wire protocols and into the realm of a 
programming language.


From jcarlson at uci.edu  Fri Aug 11 17:45:54 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 11 Aug 2006 08:45:54 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>
References: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>
	<20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>
Message-ID: <20060811082620.192E.JCARLSON@uci.edu>


Slawomir Nowaczyk <slawomir.nowaczyk.847 at student.lu.se> wrote:
> I believe that if asynchronous exception raising ever gets officially
> approved, there absolutely *needs* to be a way to block it for a piece
> of code that should execute atomically.

There is already a way of making Python source execution atomic with
respect to other Python code [1].


> But it should not be done lightly and never when the code is not
> specifically expecting it.

If you don't want random exceptions being raised in your threads, then
don't use this method that is capable of raising exceptions somewhat
randomly.

 - Josiah

[1]
Remove the two sys.setcheckinterval calls to verify this works.
"proper" use should probably use try/finally wrapping.

>>> import sys
>>> import threading
>>> import time
>>>
>>> x = 0
>>>
>>>
>>> def thr(n):
...     global x
...     while not x:
...         time.sleep(.01)
...     for i in xrange(n):
...         sys.setcheckinterval(sys.maxint)
...         _x = x + 1
...         x, _x = _x, x
...         sys.setcheckinterval(100)
...
>>>
>>> for i in xrange(10):
...     threading.Thread(target=thr, args=(1000000,)).start()
...
>>> x += 1
>>> while threading.activeCount() > 1:
...     time.sleep(.1)
...
>>> print x
10000001
>>>


From jason.orendorff at gmail.com  Fri Aug 11 17:47:39 2006
From: jason.orendorff at gmail.com (Jason Orendorff)
Date: Fri, 11 Aug 2006 11:47:39 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <1d85506f0608110033k2eac1f9h10908ddbef5db8c3@mail.gmail.com>
References: <1d85506f0608101214g594d2dal282ab2ae60f29f11@mail.gmail.com>
	<ca471dc20608101621j557f735cs10f4f491eb3b2ee5@mail.gmail.com>
	<1d85506f0608110033k2eac1f9h10908ddbef5db8c3@mail.gmail.com>
Message-ID: <bb8868b90608110847wb5465eekd13cdb454eeac4cb@mail.gmail.com>

On 8/11/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> why is thread.raise_exc(id, excobj) a bad API?

It breaks seemingly innocent code in subtle ways.  Worse, the breakage
will always be a race condition, so it'll be especially hard to
reproduce and debug.

class Foo:
    ...
    def close(self):
        self.f.close()
        self.closed = True

Any code that uses the "closed" attribute obviously depends on it
being properly set, right?  This close() method gets this right.  It
sets "closed" if and only if the self.f.close() call succeeds.  There
are circumstances where this will fail:  MemoryError,
KeyboardInterrupt, a broken trace function, a broken __setattr__(),
del __builtins__.True... but all are extreme cases.  I think
thread.raise_exc() should be considered extreme too.  Otherwise, its
existence must be considered to degrade the reliability of the above
code.

I'm not saying "don't add this".  Maybe it's useful, particularly as a
fallback mechanism for killing a runaway thread.  But it should be
documented as an extreme measure.

-j

From jcarlson at uci.edu  Fri Aug 11 18:04:54 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 11 Aug 2006 09:04:54 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
References: <mailman.34014.1155280218.27774.python-3000@python.org>
	<5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
Message-ID: <20060811084623.1931.JCARLSON@uci.edu>


"Phillip J. Eby" <pje at telecommunity.com> wrote:
> 
> At 06:10 AM 8/11/2006 -0700, Talin <talin at acm.org> wrote:
> >Or to put it another way: If you create a tool, and you assume that tool
> >will only be used in certain specific ways, but you fail to enforce that
> >limitation, then your assumption will be dead wrong. The idea that there
> >will only be a few type annotation providers who will all nicely
> >cooperate with one another is just as naive as I was in the SysEx debacle.
> 
> Are you saying that function annotations are a bad idea because we won't be 
> able to pickle them?

That is not what I got out of the message at all.

> If not, your entire argument seems specious.  Actually, even if that *is* 
> your argument, it's specious, since all that's needed to support pickling 
> is to support pickling.  All that's needed to support printing is to 
> support printing (via __str__), and so on.

I think you misunderstood Talin.  While it was a pain for him to work
his way through implementing all of the loading/etc. protocols, I
believe his point was that if we allow any and all arbitrary metadata to
be placed on arguments to and from functions, then invariably there will
be multiple methods of doing as much.  That isn't a problem unto itself,
but when there ends up being multiple metadata formats, with multiple
interpretations of them, and a user decides that they want to combine
the functionality of two metadata formats, they may be stuck due to
incompatibilities, etc.

I think that it can be fixed by defining a standard mechanism for
'metadata chaining', one involving tuples and/or dictionaries.

Say, for example, we have the following function definition:
    def foo(argn:meta=dflt):
        ...

Since meta can take on the value of a Python expression (evaluated when
the function definition is executed), a tuple-based chaining would work like so:

    @chainmetadatatuple(meta_fcn1, meta_fcn2)
    def foo(argn:(meta1, meta2)=dflt):
        ...

And a dictionary-based chaining would work like so:
    @chainmetadatadict(m1=meta_fcn1, m2=meta_fcn2)
    def foo(argn:{'m1': meta1, 'm2': meta2}=dflt):
        ...

The reason to include the dict-based option is to allow for annotations
to be optional.


This method may or may not be good.  But, if we don't define a standard
method for metadata to be combined from multiple protocols, etc., then
we could end up with incompatibilities.  However, if we do define a
standard chaining mechanism, then it can be used, and presumably
we shouldn't run into problems relating to incompatible annotations, etc.
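A sketch of how the hypothetical chainmetadatatuple decorator proposed above could be implemented against Python 3 annotations; the interpreter functions here are invented for the example and just record what they are handed:

```python
# Hypothetical implementation of the chainmetadatatuple decorator from the
# message above: the i-th metadata interpreter is applied to the i-th
# element of each argument's annotation tuple.
def chainmetadatatuple(*interpreters):
    def decorate(func):
        for name, ann in func.__annotations__.items():
            for interpret, meta in zip(interpreters, ann):
                interpret(func, name, meta)
        return func
    return decorate

seen = []

def meta_fcn1(func, name, meta):       # e.g. a type-checking interpreter
    seen.append(("type", name, meta))

def meta_fcn2(func, name, meta):       # e.g. a documentation interpreter
    seen.append(("doc", name, meta))

@chainmetadatatuple(meta_fcn1, meta_fcn2)
def foo(argn: (int, "number of retries") = 0):
    return argn
```

Each interpreter sees only the tuple slot assigned to it, so the two metadata consumers never need to know about one another.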


 - Josiah


From jason.orendorff at gmail.com  Fri Aug 11 18:04:09 2006
From: jason.orendorff at gmail.com (Jason Orendorff)
Date: Fri, 11 Aug 2006 12:04:09 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <20060811082620.192E.JCARLSON@uci.edu>
References: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>
	<20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>
	<20060811082620.192E.JCARLSON@uci.edu>
Message-ID: <bb8868b90608110904s289236d7i9b60f14969966625@mail.gmail.com>

On 8/11/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> Slawomir Nowaczyk <slawomir.nowaczyk.847 at student.lu.se> wrote:
> > But it should not be done lightly and never when the code is not
> > specifically expecting it.
>
> If you don't want random exceptions being raised in your threads, then
> don't use this method that is capable of raising exceptions somewhat
> randomly.

I agree.  The only question is how dire the warnings should be.

I'll answer that question with another question:  Are we going to make
the standard library robust against asynchronous exceptions?  For
example, class Thread has an attribute __stopped that is set using
code similar to the example code I posted.  An exception at just the
wrong time would kill the thread while leaving __stopped == False.

Maybe that particular case is worth fixing, but to find and fix them
all?  Better to put strong warnings on this one method: may cause
unpredictable brokenness.

-j

From jcarlson at uci.edu  Fri Aug 11 18:15:32 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 11 Aug 2006 09:15:32 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <bb8868b90608110904s289236d7i9b60f14969966625@mail.gmail.com>
References: <20060811082620.192E.JCARLSON@uci.edu>
	<bb8868b90608110904s289236d7i9b60f14969966625@mail.gmail.com>
Message-ID: <20060811091309.1934.JCARLSON@uci.edu>


"Jason Orendorff" <jason.orendorff at gmail.com> wrote:
> 
> On 8/11/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > Slawomir Nowaczyk <slawomir.nowaczyk.847 at student.lu.se> wrote:
> > > But it should not be done lightly and never when the code is not
> > > specifically expecting it.
> >
> > If you don't want random exceptions being raised in your threads, then
> > don't use this method that is capable of raising exceptions somewhat
> > randomly.
> 
> I agree.  The only question is how dire the warnings should be.
> 
> I'll answer that question with another question:  Are we going to make
> the standard library robust against asynchronous exceptions?  For
> example, class Thread has an attribute __stopped that is set using
> code similar to the example code I posted.  An exception at just the
> wrong time would kill the thread while leaving __stopped == False.
> 
> Maybe that particular case is worth fixing, but to find and fix them
> all?  Better to put strong warnings on this one method: may cause
> unpredictable brokenness.

Considering that it will not be accessible via standard Python, only
through a few ctypes hoops, I believe that is a fairly ready indication
that one should be wary of its use.  I also think it would make sense to
fix that particular instance (to not do so seems to be a bit foolish).
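For reference, the "ctypes hoops" amount to calling CPython's PyThreadState_SetAsyncExc; a minimal sketch follows (the raise_exc helper name is illustrative, echoing the API proposed earlier in this thread):

```python
import ctypes
import threading
import time

def raise_exc(tid, exc_type):
    """Ask CPython to raise exc_type asynchronously in thread tid.

    Returns the number of thread states modified: 0 means the thread id
    was not found; more than 1 is an error and must be undone."""
    res = ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_ulong(tid), ctypes.py_object(exc_type))
    if res > 1:
        # Undo the request by passing NULL for the exception.
        ctypes.pythonapi.PyThreadState_SetAsyncExc(ctypes.c_ulong(tid), None)
        raise SystemError("PyThreadState_SetAsyncExc failed")
    return res

result = []

def worker():
    try:
        while True:             # spin until interrupted asynchronously
            time.sleep(0.01)
    except KeyboardInterrupt:
        result.append("interrupted")

t = threading.Thread(target=worker)
t.start()
time.sleep(0.1)
raise_exc(t.ident, KeyboardInterrupt)
t.join()
```

Note the exception is only delivered the next time the target thread executes Python bytecode, so a thread blocked in C code will not see it until it returns.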


 - Josiah


From qrczak at knm.org.pl  Fri Aug 11 19:00:07 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Fri, 11 Aug 2006 19:00:07 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <20060811082620.192E.JCARLSON@uci.edu> (Josiah Carlson's
	message of "Fri, 11 Aug 2006 08:45:54 -0700")
References: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>
	<20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>
	<20060811082620.192E.JCARLSON@uci.edu>
Message-ID: <87fyg32oo8.fsf@qrnik.zagroda>

Josiah Carlson <jcarlson at uci.edu> writes:

> There is already a way of making Python source execution atomic with
> respect to other Python code [1].

It's not realistic to expect sys.setcheckinterval to be implementable on
other runtimes.

Also, it doesn't provide a way to unblock asynchronous exceptions until
a particular blocking operation completes.

> If you don't want random exceptions being raised in your threads, then
> don't use this method that is capable of raising exceptions somewhat
> randomly.

It's like saying "if you don't want integer addition overflow, then
don't do addition".

I do want asynchronous exceptions, but not anywhere, only in selected
regions (or excluding selected regions). This can be designed well.

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From jcarlson at uci.edu  Fri Aug 11 20:18:56 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 11 Aug 2006 11:18:56 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <87fyg32oo8.fsf@qrnik.zagroda>
References: <20060811082620.192E.JCARLSON@uci.edu>
	<87fyg32oo8.fsf@qrnik.zagroda>
Message-ID: <20060811105742.193A.JCARLSON@uci.edu>


"Marcin 'Qrczak' Kowalczyk" <qrczak at knm.org.pl> wrote:
> 
> Josiah Carlson <jcarlson at uci.edu> writes:
> 
> > There is already a way of making Python source execution atomic with
> > respect to other Python code [1].
> 
> It's not realistic to expect sys.setcheckinterval be implementable on
> other runtimes.

The 'raise an exception in an alternate thread' functionality is
CPython-specific.  If you believe that it could be
implemented in all other runtimes, then you missed the discussion that
stated that it would be impossible to implement in Jython.  As such,
because both are CPython specific features, I don't see a problem with
using both if you are going to be using one of them.


> Also, it doesn't provide a way to unblock asynchronous exceptions until
> a particular blocking operation completes.

I thought the point of this 'block asynchronous exceptions' business was
to block asynchronous exceptions during a particular bit of code.  Now
you are saying that there needs to be a method of bypassing such
blocking from other threads?


> > If you don't want random exceptions being raised in your threads, then
> > don't use this method that is capable of raising exceptions somewhat
> > randomly.
> 
> It's like saying "if you don't want integer addition overflow, then
> don't do addition".

No.  Integer addition is a defined feature of the language.  Raising
exceptions in an alternate thread is a generally unsupported feature
available in CPython, very likely not implementable in most other
runtimes.

It has previously been available via ctypes, but its non-use is
a function of its lack of documentation, lack of ctypes shipping with
base Python, etc.

> I do want asynchronous exceptions, but not anywhere, only in selected
> regions (or excluding selected regions). This can be designed well.

Yes, it can be.  You can add a lock to each thread (each thread gets its
own lock).  When a thread doesn't want to be interrupted, it .acquire()s
its lock.  When it is OK to interrupt it, it .release()s its lock.  When
you want to kill a thread, .acquire() its lock, and kill it.

In effect, the above would be what is necessary to give you what you
want.  It can easily be defined as a set of 3 functions, whose
implementation should be left out of the standard library.  Including it
in the standard library offers the illusion of support (in the 'this
language feature is supported' sense) for raising an exception in an
alternate thread, which is not the case (it is available, but not
supported).
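A sketch of those three functions, combined with the ctypes-based asynchronous exception raising discussed in this thread; the names (init_interruptible, no_interrupt, allow_interrupt, kill_thread) are illustrative, not a proposed API:

```python
import ctypes
import threading
import time

# Each thread owns a lock; holding it marks a region that must not be
# interrupted.  Killers take the lock before raising, so they wait.
_interrupt_locks = {}

def init_interruptible():
    """Register the calling thread as killable."""
    _interrupt_locks[threading.current_thread().ident] = threading.Lock()

def no_interrupt():
    """Enter a region in which asynchronous exceptions must not arrive."""
    _interrupt_locks[threading.current_thread().ident].acquire()

def allow_interrupt():
    """Leave the protected region."""
    _interrupt_locks[threading.current_thread().ident].release()

def kill_thread(tid, exc=KeyboardInterrupt):
    """Raise exc in thread tid, waiting until it permits interruption."""
    with _interrupt_locks[tid]:
        ctypes.pythonapi.PyThreadState_SetAsyncExc(
            ctypes.c_ulong(tid), ctypes.py_object(exc))

events = []
in_critical = threading.Event()

def worker():
    init_interruptible()
    try:
        no_interrupt()
        try:
            in_critical.set()
            time.sleep(0.2)            # work that must complete atomically
            events.append("critical done")
        finally:
            allow_interrupt()
        while True:                    # interruptible again
            time.sleep(0.01)
    except KeyboardInterrupt:
        events.append("killed")

t = threading.Thread(target=worker)
t.start()
in_critical.wait()                     # worker now holds its interrupt lock
kill_thread(t.ident)                   # blocks until allow_interrupt()
t.join()
```

The killer blocks until the protected region is left, so the critical section always completes before the exception lands.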

 - Josiah


From qrczak at knm.org.pl  Fri Aug 11 21:33:10 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Fri, 11 Aug 2006 21:33:10 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <20060811105742.193A.JCARLSON@uci.edu> (Josiah Carlson's
	message of "Fri, 11 Aug 2006 11:18:56 -0700")
References: <20060811082620.192E.JCARLSON@uci.edu>
	<87fyg32oo8.fsf@qrnik.zagroda> <20060811105742.193A.JCARLSON@uci.edu>
Message-ID: <87veozoyo9.fsf@qrnik.zagroda>

Josiah Carlson <jcarlson at uci.edu> writes:

>> It's not realistic to expect sys.setcheckinterval be implementable on
>> other runtimes.
>
> The 'raise an exception in an alternate thread' functionality is a
> CPython specific functionality.  If you believe that it could be
> implemented in all other runtimes, then you missed the discussion that
> stated that it would be impossible to implement in Jython.

Indeed both are hard to implement on some runtimes.

I believe there are runtimes where asynchronous exceptions are
practical while blocking context switching is not (e.g. POSIX threads
combined with Unix signals and C++ exceptions).

In any case, blocking context switches to any other thread is
overkill. It's hard to say how sys.setcheckinterval should behave on
truly parallel runtimes, while the semantics of blockable asynchronous
exceptions don't depend on threads being dispatched sequentially.

>> Also, it doesn't provide a way to unblock asynchronous exceptions until
>> a particular blocking operation completes.
>
> I thought the point of this 'block asynchronous exceptions' business
> was to block asynchronous exceptions during a particular bit of code.
> Now you are saying that there needs to be a method of bypassing such
> blocking from other threads?

No, I'm talking about specifying the blocking behavior by the thread
to be interrupted. It makes sense to wait for e.g. accept() such that
asynchronous exceptions are processed during the wait, but that they
are atomically blocked as soon as a connection is accepted.

Unfortunately it's yet another obstacle to some runtimes.

Yet another issue is asynchronous "signals" which don't necessarily
throw an exception but cause the computation to react and possibly
continue (e.g. suspend a thread until it's resumed).

> Yes, it can be.  You can add a lock to each thread (each thread gets its
> own lock).  When a thread doesn't want to be interrupted, it .acquire()s
> its lock.  When it is OK to interrupt it, it .release()s its lock.  When
> you want to kill a thread, .acquire() its lock, and kill it.

This works almost well. The thread sending an exception is unnecessarily
blocked; this could be solved by starting another thread to send an
exception. And it doesn't support the mentioned unblocking only while
waiting.

The problem is that there is no universally recognized convention:
I can't expect third-party libraries to protect their sensitive
regions by my mutex. Without an agreed convention they can't even
if they want to.

My design includes implicit blocking of asynchronous exceptions by
certain language constructs, e.g. by taking *any* mutex. Most cases
of taking a mutex also want to block asynchronous signals.

I'm surprised that various runtimes that I would expect to be well
designed provide mostly either unsafe or too restricted means of
asynchronous interruption.
http://java.sun.com/j2se/1.5.0/docs/guide/misc/threadPrimitiveDeprecation.html
http://www.interact-sw.co.uk/iangblog/2004/11/12/cancellation

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From pje at telecommunity.com  Fri Aug 11 21:34:01 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Fri, 11 Aug 2006 15:34:01 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <20060811084623.1931.JCARLSON@uci.edu>
References: <5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
	<mailman.34014.1155280218.27774.python-3000@python.org>
	<5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060811152032.023a8fc0@sparrow.telecommunity.com>

At 09:04 AM 8/11/2006 -0700, Josiah Carlson wrote:
>I think you misunderstood Talin.  While it was a pain for him to work
>his way through implementing all of the loading/etc. protocols, I
>believe his point was that if we allow any and all arbitrary metadata to
>be placed on arguments to and from functions, then invariably there will
>be multiple methods of doing as much.  That isn't a problem unto itself,
>but when there ends up being multiple metadata formats, with multiple
>interpretations of them, and a user decides that they want to combine
>the functionality of two metadata formats, they may be stuck due to
>incompatibilities, etc.

I was giving him the benefit of the doubt by assuming he was bringing up a 
*new* objection that I hadn't already answered.  This "incompatibility" 
argument has already been addressed; it is trivially solved by overloaded 
functions (e.g. pickle.dump(), str(), iter(), etc.).


>This method may or may not be good.  But, if we don't define a standard
>method for metadata to be combined from multiple protocols, etc., then
>we could end up with incompatibilities.

Not if you use overloaded functions to define the operations you're going 
to perform.  You and Talin are proposing a problem here that is not only 
hypothetical, it's non-existent.

Remember, PEAK already does this kind of openly-extensible metadata for 
attributes, using a single-dispatch overloaded function (analogous to 
pickle.dump).  If you want to show that it's really possible to create 
"incompatible" annotations, try creating some for attributes in PEAK.

But, you'll quickly find that the only "meaning" that metadata has is 
*operational*.  That is, either some behavior is influenced by the 
metadata, or no behavior is.  If no behavior is involved, then there can be 
no incompatibility.  If there is behavior, there is an operation to be 
performed, and that operation can be based on the type of the metadata.

Ergo, using an overloadable function for the operation to be performed 
allows a meaning to be defined for the specific combination of operation 
and type.  Therefore, there is no problem - every piece of metadata may be 
assigned a meaning that is relevant for each operation that needs to be 
performed.

Now, it is of course possible that two pieces of metadata may be 
contradictory, redundant, overlapping, etc.  However, this has nothing to 
do with whether the semantics of metadata are predefined.  Any 
sufficiently-useful annotation scheme will include these possibilities, and 
the operations to be performed are going to have to have some defined 
semantics for them.  This is entirely independent of whether there is more 
than one metadata framework in existence.


From jcarlson at uci.edu  Fri Aug 11 22:12:15 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 11 Aug 2006 13:12:15 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <87veozoyo9.fsf@qrnik.zagroda>
References: <20060811105742.193A.JCARLSON@uci.edu>
	<87veozoyo9.fsf@qrnik.zagroda>
Message-ID: <20060811125449.1940.JCARLSON@uci.edu>


Threading is already difficult enough to do 'right' (see the dozens of
threads discussing why this is really the case), and designing software
that can survive the raising of an exception at any point makes
threading even more difficult.

I believe that you are attempting to design an interface to make this
particular feature foolproof.  I think that such is a mistake; killing a
thread should be fraught with gotchas and should be documented as "may
crash the runtime".  Offering users anything more is tantamount to
encouraging its use, which is counter to the reasons why it is not
available via a standard threading.function call: because it shouldn't
be used at all, except by people who know what the heck they are doing.

I believe that if a user cannot design and implement their own system to
handle when a thread can be killed or not to their own satisfaction,
then they have no business killing threads.


 - Josiah

"Marcin 'Qrczak' Kowalczyk" <qrczak at knm.org.pl> wrote:
> Josiah Carlson <jcarlson at uci.edu> writes:
> 
> >> It's not realistic to expect sys.setcheckinterval be implementable on
> >> other runtimes.
> >
> > The 'raise an exception in an alternate thread' functionality is a
> > CPython specific functionality.  If you believe that it could be
> > implemented in all other runtimes, then you missed the discussion that
> > stated that it would be impossible to implement in Jython.
> 
> Indeed both are hard to implement on some runtimes.
> 
> I believe there are runtimes where asynchronous exceptions are
> practical while blocking context switching is not (e.g. POSIX threads
> combined with Unix signals and C++ exceptions).
> 
> In any case, blocking switching the context to any other thread is an
> overkill. It's hard to say how sys.setcheckinterval should behave on
> truly parallel runtimes, while the semantics of blockable asynchronous
> exceptions doesn't depend on threads being dispatched sequentially.
> 
> >> Also, it doesn't provide a way to unblock asynchronous exceptions until
> >> a particular blocking operation completes.
> >
> > I thought the point of this 'block asynchronous exceptions' business
> > was to block asynchronous exceptions during a particular bit of code.
> > Now you are saying that there needs to be a method of bypassing such
> > blocking from other threads?
> 
> No, I'm talking about specifying the blocking behavior by the thread
> to be interrupted. It makes sense to wait for e.g. accept() such that
> asynchronous exceptions are processed during the wait, but that they
> are atomically blocked as soon as a connection is accepted.
> 
> Unfortunately it's yet another obstacle to some runtimes.
> 
> Yet another issue is asynchronous "signals" which don't necessarily
> throw an exception but cause the computation to react and possibly
> continue (e.g. suspend a thread until it's resumed).
> 
> > Yes, it can be.  You can add a lock to each thread (each thread gets its
> > own lock).  When a thread doesn't want to be interrupted, it .acquire()s
> > its lock.  When it is OK to interrupt it, it .release()s its lock.  When
> > you want to kill a thread, .acquire() its lock, and kill it.
> 
> This works almost well. The thread sending an exception is unnecessarily
> blocked; this could be solved by starting another thread to send an
> exception. And it doesn't support the mentioned unblocking only while
> waiting.
> 
> The problem is that there is no universally recognized convention:
> I can't expect third-party libraries to protect their sensitive
> regions by my mutex. Without an agreed convention they can't even
> if they want to.
> 
> My design includes implicit blocking of asynchronous exception by
> certain language constructs, e.g. by taking *any* mutex. Most cases
> of taking a mutex also want to block asynchronous signals.
> 
> I'm surprised that various runtimes that I would expect to be well
> designed provide mostly either unsafe or too restricted means of
> asynchronous interruption.
> http://java.sun.com/j2se/1.5.0/docs/guide/misc/threadPrimitiveDeprecation.html
> http://www.interact-sw.co.uk/iangblog/2004/11/12/cancellation
> 
> -- 
>    __("<         Marcin Kowalczyk
>    \__/       qrczak at knm.org.pl
>     ^^     http://qrnik.knm.org.pl/~qrczak/


From jcarlson at uci.edu  Fri Aug 11 22:46:42 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 11 Aug 2006 13:46:42 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060811152032.023a8fc0@sparrow.telecommunity.com>
References: <20060811084623.1931.JCARLSON@uci.edu>
	<5.1.1.6.0.20060811152032.023a8fc0@sparrow.telecommunity.com>
Message-ID: <20060811131616.1943.JCARLSON@uci.edu>


"Phillip J. Eby" <pje at telecommunity.com> wrote:
> At 09:04 AM 8/11/2006 -0700, Josiah Carlson wrote:
> >I think you misunderstood Talin.  While it was a pain for him to work
> >his way through implementing all of the loading/etc. protocols, I
> >believe his point was that if we allow any and all arbitrary metadata to
> >be placed on arguments to and from functions, then invariably there will
> >be multiple methods of doing as much.  That isn't a problem unto itself,
> >but when there ends up being multiple metadata formats, with multiple
> >interpretations of them, and a user decides that they want to combine
> >the functionality of two metadata formats, they may be stuck due to
> >incompatibilities, etc.
> 
> I was giving him the benefit of the doubt by assuming he was bringing up a 
> *new* objection that I hadn't already answered.  This "incompatibility" 
> argument has already been addressed; it is trivially solved by overloaded 
> functions (e.g. pickle.dump(), str(), iter(), etc.).

In effect, you seem to be saying "when user X wants to add their own
metadata with interpretation, they need to overload the previously
existing metadata interpreter".  However, as has already been stated,
because there is no standard metadata interpreter, nor a standard method
for chaining metadata, how is user X supposed to overload the previously
existing metadata interpreter?

Since you brought up pickle.dump(), str(), iter(), etc., I'll point out
that str(), iter(), etc., call special methods on the defined object
(__str__, __iter__, etc.), and while pickle can have picklers be
registered, it also has a special method interface.  Because all of the
metadata defined is (according to the pre-PEP) attached to a single
__signature__ attribute of the function, interpretation of the metadata
isn't as easy as calling str(obj), as you claim.

Let us say that I have two metadata interpreters.  One that believes that
the metadata is types and wants to verify type on function call.  The
other believes that the metadata is documentation.  Both were written
without regards to the other.  Please describe to me (in code preferably)
how I would be able to use both of them without having a defined
metadata interpretation chaining semantic.


> >This method may or may not be good.  But, if we don't define a standard
> >method for metadata to be combined from multiple protocols, etc., then
> >we could end up with incompatibilities.
> 
> Not if you use overloaded functions to define the operations you're going 
> to perform.  You and Talin are proposing a problem here that is not only 
> hypothetical, it's non-existent.
> 
> Remember, PEAK already does this kind of openly-extensible metadata for 
> attributes, using a single-dispatch overloaded function (analogous to 
> pickle.dump).  If you want to show that it's really possible to create 
> "incompatible" annotations, try creating some for attributes in PEAK.

Could you at least provide a link to where it is documented how to
create metadata attributes in PEAK?  My attempts to delve into PEAK
documentation have thus far failed horribly.

 - Josiah


From pje at telecommunity.com  Fri Aug 11 23:11:00 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Fri, 11 Aug 2006 17:11:00 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <20060811131616.1943.JCARLSON@uci.edu>
References: <5.1.1.6.0.20060811152032.023a8fc0@sparrow.telecommunity.com>
	<20060811084623.1931.JCARLSON@uci.edu>
	<5.1.1.6.0.20060811152032.023a8fc0@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060811165113.03cabe60@sparrow.telecommunity.com>

At 01:46 PM 8/11/2006 -0700, Josiah Carlson wrote:

>"Phillip J. Eby" <pje at telecommunity.com> wrote:
> > At 09:04 AM 8/11/2006 -0700, Josiah Carlson wrote:
> > >I think you misunderstood Talin.  While it was a pain for him to work
> > >his way through implementing all of the loading/etc. protocols, I
> > >believe his point was that if we allow any and all arbitrary metadata to
> > >be placed on arguments to and from functions, then invariably there will
> > >be multiple methods of doing as much.  That isn't a problem unto itself,
> > >but when there ends up being multiple metadata formats, with multiple
> > >interpretations of them, and a user decides that they want to combine
> > >the functionality of two metadata formats, they may be stuck due to
> > >incompatibilities, etc.
> >
> > I was giving him the benefit of the doubt by assuming he was bringing up a
> > *new* objection that I hadn't already answered.  This "incompatibility"
> > argument has already been addressed; it is trivially solved by overloaded
> > functions (e.g. pickle.dump(), str(), iter(), etc.).
>
>In effect, you seem to be saying "when user X wants to add their own
>metadata with interpretation, they need to overload the previously
>existing metadata interpreter".

No, they need to overload whatever *operation* is being performed *on* the 
metadata.

For example, if I am using a decorator that adds type checking to the 
function, then that decorator is an example of an operation that should be 
overloadable.

More precisely, that decorator would probably have an operation that 
generates type checking code for an individual type annotation -- and 
*that* is the operation that would need overloading.  The 
"generate_typecheck_code()" operation would be an overloadable function.

Another possible operation: printing help for a function.  You would need a 
"format_type_annotation()" overloadable operation, and so on.

There is no *single* "metadata interpreter", in other words.  There are 
just operations you perform on metadata.

If multiple people define different variants of the same operation, let's 
say "generate_typecheck_code()" and "generate_code_for_typecheck()", and 
you have some code that defines methods for one overloadable function, but 
you have code that wants to call the other, you just write some methods for 
one that call the other, or make one be the default implementation for the 
other.

There is no need for a *single* canonical operation *or* type.  This is the 
whole point of generic functions, really.  They eliminate the need for One 
Framework To Rule Them All, and tend to dissolve the "framework"ness right 
out of frameworks.  What you end up with are extensible libraries instead 
of frameworks.
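For illustration, here is roughly what this looks like with a modern single-dispatch generic function (functools.singledispatch, which postdates this discussion); the annotation types and operation names are invented for the example:

```python
from functools import singledispatch

# Invented annotation types, written independently of each other.
class TypeAnnotation:
    def __init__(self, typ):
        self.typ = typ

class DocAnnotation:
    def __init__(self, text):
        self.text = text

# Operation 1: build a checker from one piece of metadata.  The default
# method ignores metadata it doesn't understand, so no chaining protocol
# is needed.
@singledispatch
def make_check(ann):
    return lambda value: True

@make_check.register(TypeAnnotation)
def _(ann):
    return lambda value: isinstance(value, ann.typ)

# Operation 2: format help text from one piece of metadata.
@singledispatch
def format_help(ann):
    return repr(ann)

@format_help.register(DocAnnotation)
def _(ann):
    return ann.text

# A user combines both kinds of metadata; each operation simply handles
# the annotation types it has methods for and falls back otherwise.
annotations = (TypeAnnotation(int), DocAnnotation("the number of retries"))
checks = [make_check(a) for a in annotations]
help_text = [format_help(a) for a in annotations]
```

Each operation dispatches on the metadata's type, so independently written annotation kinds coexist without a shared "metadata interpreter".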


>Since you brought up pickle.dump(), str(), iter(), etc., I'll point out
>that str(), iter(), etc., call special methods on the defined object
>(__str__, __iter__, etc.), and while pickle can have picklers be
>registered, it also has a special method interface.  Because all of the
>metadata defined is (according to the pre-PEP) attached to a single
>__signature__ attribute of the function, interpretation of the metadata
>isn't as easy as calling str(obj), as you claim.

Actually, with overloadable functions, it is, since overloadable functions 
can be extended by anybody, without needing to monkey with the 
classes.  Note that if Guido had originally created Python with 
overloadable functions, it's rather unlikely that __special__ methods would 
have arisen.  Instead, it's much more likely that there would be syntax 
sugar for easily defining overloads, like "defop str(self): ...".
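A rough sketch of what such an overload-based str() might look like, again 
with singledispatch standing in and a made-up name, render(), so as not to 
shadow the builtin:

```python
from functools import singledispatch

# If str() were an overloadable function rather than a __str__ hook,
# anyone could extend it without touching the class being printed.
@singledispatch
def render(obj):
    return repr(obj)           # fallback, much like object.__str__

class Point:
    def __init__(self, x, y):
        self.x, self.y = x, y

# "defop str(self): ..." would be sugar for an overload like this one:
@render.register(Point)
def _(p):
    return "Point(%d, %d)" % (p.x, p.y)

print(render(Point(1, 2)))     # → Point(1, 2)
print(render(42))              # → 42
```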


>Let us say that I have two metadata interpters.  One that believes that
>the metadata is types and wants to verify type on function call.  The
>other believes that the metadata is documentation.  Both were written
>without regards to the other.  Please describe to me (in code preferably)
>how I would be able to use both of them without having a defined
>metadata interpretation chaining semantic.

See explanation above.


> > Remember, PEAK already does this kind of openly-extensible metadata for
> > attributes, using a single-dispatch overloaded function (analagous to
> > pickle.dump).  If you want to show that it's really possible to create
> > "incompatible" annotations, try creating some for attributes in PEAK.
>
>Could you at least provide a link to where it is documented how to
>create metadata attributes in PEAK?  My attempts to delve into PEAK
>documentation has thus far failed horribly.

Here's the tutorial for defining new metadata (among other things):

http://svn.eby-sarna.com/PEAK/src/peak/binding/attributes.txt?view=markup

The example defines a "Message()" metadata type whose sole purpose is to 
print a message when the attribute is declared.

What's not really explained there is that all the 'addMethod' stuff is 
basically adding methods to an overloaded function.

Anyway, PEAK uses this simple metadata declaration system to implement both
security permission declarations:

http://peak.telecommunity.com/DevCenter/SecurityRules#linking-actions-to-permissions

and command-line options:

http://peak.telecommunity.com/DevCenter/OptionsHowTo#declaring-options

In PEAK's case, a single overloaded operation is invoked when the metadata 
is defined, and then that overloaded operation performs whatever actions 
are relevant for the metadata.  For function metadata, however, it's 
sufficient to use distinct overloaded functions for distinct operations and 
not actually "do" anything unless it's needed.

However, if we wanted things to be able to happen just by declaring 
metadata (without using any decorators or performing any other operations), 
then yes, the language would need some equivalent to PEAK's 
"declareAttribute()" overloaded function.  However, my understanding of the 
proposal was that annotations were intended to be inert and purely 
informational *unless* processed by a decorator or some other mechanism.


From talin at acm.org  Sat Aug 12 00:16:11 2006
From: talin at acm.org (Talin)
Date: Fri, 11 Aug 2006 15:16:11 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
Message-ID: <44DD01AB.20809@acm.org>

Phillip J. Eby wrote:
> At 06:10 AM 8/11/2006 -0700, Talin <talin at acm.org> wrote:
>> Or to put it another way: If you create a tool, and you assume that tool
>> will only be used in certain specific ways, but you fail to enforce that
>> limitation, then your assumption will be dead wrong. The idea that there
>> will only be a few type annotation providers who will all nicely
>> cooperate with one another is just as naive as I was in the SysEx 
>> debacle.
> 
> Are you saying that function annotations are a bad idea because we won't 
> be able to pickle them?

Huh? What does pickling have to do with anything I said?

-- Talin


From talin at acm.org  Sat Aug 12 00:39:56 2006
From: talin at acm.org (Talin)
Date: Fri, 11 Aug 2006 15:39:56 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <20060811084623.1931.JCARLSON@uci.edu>
References: <mailman.34014.1155280218.27774.python-3000@python.org>	<5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
	<20060811084623.1931.JCARLSON@uci.edu>
Message-ID: <44DD073C.7030305@acm.org>

Josiah Carlson wrote:
> "Phillip J. Eby" <pje at telecommunity.com> wrote:
>> At 06:10 AM 8/11/2006 -0700, Talin <talin at acm.org> wrote:
>>> Or to put it another way: If you create a tool, and you assume that tool
>>> will only be used in certain specific ways, but you fail to enforce that
>>> limitation, then your assumption will be dead wrong. The idea that there
>>> will only be a few type annotation providers who will all nicely
>>> cooperate with one another is just as naive as I was in the SysEx debacle.
>> Are you saying that function annotations are a bad idea because we won't be 
>> able to pickle them?
> 
> That is not what I got out of the message at all.
> 
>> If not, your entire argument seems specious.  Actually, even if that *is* 
>> your argument, it's specious, since all that's needed to support pickling 
>> is to support pickling.  All that's needed to support printing is to 
>> support printing (via __str__), and so on.
> 
> I think you misunderstood Talin.  While it was a pain for him to work
> his way through implementing all of the loading/etc. protocols, I
> believe his point was that if we allow any and all arbitrary metadata to
> be placed on arguments to and from functions, then invariably there will
> be multiple methods of doing as much.  That isn't a problem unto itself,
> but when there ends up being multiple metadata formats, with multiple
> interpretations of them, and a user decides that they want to combine
> the functionality of two metadata formats, they may be stuck due to
> incompatibilities, etc.
> 
> I think that it can be fixed by defining a standard mechanism for
> 'metadata chaining', one involving tuples and/or dictionaries.
> 
> Say, for example, we have the following function definition:
>     def foo(argn:meta=dflt):
>         ...
> 
> Since meta can take on the value of a Python expression (executed during
> compile-time), a tuple-based chaining would work like so:
> 
>     @chainmetadatatuple(meta_fcn1, meta_fcn2)
>     def foo(argn:(meta1, meta2)=dflt):
>         ...
> 
> And a dictionary-based chaining would work like so:
>     @chainmetadatadict(m1=meta_fcn1, m2=meta_fcn2)
>     def foo(argn:{'m1'=meta1, 'm2'=meta2}=dflt):
>         ...
> 
> The reason to include the dict-based option is to allow for annotations
> to be optional.
> 
> 
> This method may or may not be good.  But, if we don't define a standard
> method for metadata to be combined from multiple protocols, etc., then
> we could end up with incompatabilities.  However, if we do define a
> standard chaining mechanism, then it can be used, and presumably
> we shouldn't run into problems relating to incompatible annotation, etc.
> 
> 
>  - Josiah

Josiah is essentially correct in his interpretation of my views. I 
really don't understand what Phillip is talking about here.

Say I want to annotate a specific argument with two pieces of 
information, a type and a docstring. I have two metadata interpreters, 
one which uses the type information to restrict the kinds of arguments 
that can be passed in, and another which uses the docstring to enhance 
the generated documentation.

Now, let's say that these two metadata interpreters were written by two 
people, who are not in communication with each other. Each one decides 
that they would like to "play nice" with other competing metadata.

So Author A, who wrote the annotation decorator that looks for 
docstrings, decides that not only will he accept docstring annotations, 
but if the annotation is a tuple, then he will search that tuple for any 
docstrings, skipping over any annotations that he doesn't understand. 
(Although how he is supposed to manage that is unclear - since there 
could also be other annotations that are simple text strings as well.)

Author B, who wrote the type-enforcement module, also wants to play nice 
with others, but since he doesn't know A, comes up with a different 
solution. His idea is to create a system in which annotations 
automatically chain each other - so that each annotation has a "next" 
attribute referring to the next annotation.

So programmer C, who wants to incorporate both A and B's work into his 
program, has a dilemma - each has a sharing mechanism, but the sharing 
mechanisms are different and incompatible. So he is unable to apply both 
A-type and B-type metadata to any given signature.

What happens next is that C complains to both A and B (in the process 
introducing them to each other). A and B exchange emails, and reach the 
conclusion that B will modify his library to conform to the sharing 
mechanism of A.

What this means is that A and B have created a de facto standard. Anyone 
who wants to interoperate with A and B has to write their interpreter 
to conform to the sharing mechanism defined by A and B.

But it also means that anyone outside of the ABC clique will not know 
about A&B's sharing convention, which means that their metadata 
interpreter will not be able to interoperate with A&B-style metadata. So 
in essence, A&B have now "captured" the space of annotations - that is, 
anyone who conforms to the A&B protocol can combine their annotations 
together; Anyone outside that group is excluded from interoperating.

Finally, let's say that A&B eventually become well-known enough that 
their sharing convention becomes the de facto standard. Any metadata that 
wants to interoperate with other metadata-interpretation libraries will 
have to follow the A&B convention. Any metadata library that chooses a 
different convention will be at a severe disadvantage, since it can't 
be used together with other metadata interpreters.

What this means is that, despite the statements that annotations have no 
defined format or meaning, the fact is that they now do: the de facto A&B 
sharing convention. The sharing convention tells metadata interpreters 
how to distinguish the metadata they can interpret and how to skip over 
the rest.

In other words, because the original author of the annotation system 
failed to provide a convention for multiple annotations, the community 
is forced to fill in the parts of the standard that were left out.

-- Talin


From seojiwon at gmail.com  Sat Aug 12 01:20:20 2006
From: seojiwon at gmail.com (Jiwon Seo)
Date: Fri, 11 Aug 2006 16:20:20 -0700
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
Message-ID: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>

When we have keyword-only arguments, do we allow 'keyword dictionary'
argument? If that's the case, where would we want to place
keyword-only arguments?

Are we going to allow any of followings?

1. def foo(a, b,  *, key1=None, key2=None, **map)
2. def foo(a, b, *,  **map, key1=None, key2=None)
3. def foo(a, b, *, **map)
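For what it's worth, under the rule that **map must come last, only the 
first spelling is expressible; that is the form Python 3 (PEP 3102) 
ultimately allowed, and the other two are syntax errors:

```python
# Form 1: keyword-only arguments go after the bare *, **map comes last.
def foo(a, b, *, key1=None, key2=None, **map):
    return (a, b, key1, key2, map)

print(foo(1, 2, key1="x", extra=3))  # → (1, 2, 'x', None, {'extra': 3})

# key1/key2 cannot be supplied positionally:
try:
    foo(1, 2, "x")
except TypeError:
    print("keyword-only enforced")
```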

-Jiwon

From collinw at gmail.com  Sat Aug 12 01:49:32 2006
From: collinw at gmail.com (Collin Winter)
Date: Fri, 11 Aug 2006 19:49:32 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DD073C.7030305@acm.org>
References: <mailman.34014.1155280218.27774.python-3000@python.org>
	<5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
	<20060811084623.1931.JCARLSON@uci.edu> <44DD073C.7030305@acm.org>
Message-ID: <43aa6ff70608111649g54e82dd6kef19862f0c281254@mail.gmail.com>

I'll combine my replies to Josiah and Talin:

On 8/11/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> Let us say that I have two metadata interpters.  One that believes that
> the metadata is types and wants to verify type on function call.  The
> other believes that the metadata is documentation.  Both were written
> without regards to the other.  Please describe to me (in code preferably)
> how I would be able to use both of them without having a defined
> metadata interpretation chaining semantic.

On 8/11/06, Talin <talin at acm.org> wrote:
> Say I want to annotate a specific argument with two pieces of
> information, a type and a docstring. I have two metadata interpreters,
> one which uses the type information to restrict the kinds of arguments
> that can be passed in, and another which uses the docstring to enhance
> the generated documentation.

[snipped: the rise of a defacto annotation-sharing standard]

> What this means is that, despite the statements that annotations have no
> defined format or meaning, the fact is that they now do: The defacto A&B
> sharing convention. The sharing convention tells metadata interpreters
> how to distinguish between metadata that they can interpret, and how to
> skip over other metadata.

What Josiah is hinting at -- and what Talin describes more explicitly
-- is the problem of how exactly "chaining" annotation interpreters
will work.

The case I've thought out the most completely is that of using
decorators to analyse/utilise the annotations:

1) Each decorator should be written with the assumption that it is the
only decorator that will be applied to a given function (with respect
to annotations).

2) Chaining will be accomplished by maintaining this illusion for each
decorator. For example, if our annotation-sharing convention is that
annotations will be n-tuples (n == number of annotation-interpreting
decorators), where t[i] is the annotation the i-th decorator should
care about, the following chain() function will do the trick (a full
demo script is attached):

    def chain(*decorators):
        assert len(decorators) >= 2

        def decorate(function):
            sig = function.__signature__
            original = sig.annotations

            for i, dec in enumerate(decorators):
                fake = dict((p, original[p][i]) for p in original)

                function.__signature__.annotations = fake
                function = dec(function)

            function.__signature__.annotations = original
            return function
        return decorate

A similar function can be worked out for using dictionaries to specify
multiple annotations.
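For concreteness, such a dict-based variant might look like the following 
sketch. It works against the draft PEP's model, not any shipped API: 
__signature__ is simulated here by a tiny stand-in class carrying an 
'annotations' dict, and the toy decorators just record what they see.

```python
# Stand-in for the draft PEP's __signature__ object.
class Signature:
    def __init__(self, annotations):
        self.annotations = annotations

def chain_dict(**decorators):
    # Maps a key ('m1', 'm2', ...) to the decorator that should see
    # only the annotations filed under that key.
    def decorate(function):
        original = function.__signature__.annotations
        for key, dec in decorators.items():
            # Show each decorator only its own slice; parameters whose
            # annotation dict lacks this key are simply omitted, which
            # is what makes annotations optional per-decorator.
            fake = {p: ann[key] for p, ann in original.items() if key in ann}
            function.__signature__.annotations = fake
            function = dec(function)
        function.__signature__.annotations = original
        return function
    return decorate

# Toy decorators that record the annotations they were shown:
seen = {}
def record(name):
    def dec(fn):
        seen[name] = dict(fn.__signature__.annotations)
        return fn
    return dec

def foo(x):
    return x

foo.__signature__ = Signature({'x': {'m1': int, 'm2': "doc for x"}})
foo = chain_dict(m1=record('types'), m2=record('docs'))(foo)
```

Each decorator sees only its own key's annotations, and the full dict is 
restored afterwards.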

I'll update the PEP draft to include a section on guidelines for
writing such decorators.

Collin Winter
-------------- next part --------------
A non-text attachment was scrubbed...
Name: chaining_decorators.py
Type: text/x-python-script
Size: 1497 bytes
Desc: not available
Url : http://mail.python.org/pipermail/python-3000/attachments/20060811/065a0df9/attachment.bin 

From tomerfiliba at gmail.com  Sat Aug 12 02:13:24 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Sat, 12 Aug 2006 02:13:24 +0200
Subject: [Python-3000] threading, part 2
Message-ID: <1d85506f0608111713m15cf2e67v8b94f06c928e9125@mail.gmail.com>

i mailed this to several people separately, but then i thought it could
benefit the entire group:

http://sebulba.wikispaces.com/recipe+thread2

it's an implementation of the proposed "thread.raise_exc", through an
extension to the threading.Thread class. you can test it for yourself;
if it proves useful, it should be exposed as thread.raise_exc in the
stdlib (instead of the ctypes hack)... and of course it should be
reflected in threading.Thread as well.
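For readers without the recipe handy, the ctypes hack in question boils 
down to a call into CPython's PyThreadState_SetAsyncExc; a stripped-down 
sketch (CPython-only, and best-effort by nature, since the exception is 
only delivered between bytecodes):

```python
import ctypes
import threading
import time

def raise_exc(thread, exctype):
    """Asynchronously raise exctype in another thread via the C API."""
    if not thread.is_alive():
        raise ValueError("thread is not running")
    res = ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_ulong(thread.ident), ctypes.py_object(exctype))
    if res > 1:
        # We affected more than one thread state: revert with NULL.
        ctypes.pythonapi.PyThreadState_SetAsyncExc(
            ctypes.c_ulong(thread.ident), None)
        raise SystemError("PyThreadState_SetAsyncExc failed")

stopped = []

def worker():
    try:
        while True:
            time.sleep(0.01)
    except KeyboardInterrupt:
        stopped.append(True)

t = threading.Thread(target=worker)
t.start()
time.sleep(0.05)
raise_exc(t, KeyboardInterrupt)
t.join()
print(stopped)  # → [True]
```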



-tomer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060812/487eb7e6/attachment.htm 

From greg.ewing at canterbury.ac.nz  Sat Aug 12 03:06:40 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 12 Aug 2006 13:06:40 +1200
Subject: [Python-3000] threading, part 2
In-Reply-To: <20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>
References: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>
	<20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>
Message-ID: <44DD29A0.4000902@canterbury.ac.nz>

Slawomir Nowaczyk wrote:

> But it should not be done lightly and never when the code is not
> specifically expecting it.

What if, together with a way of blocking asynchronous
exceptions, threads started out by default with them
blocked? Then a thread would have to explicitly consent
to being interrupted.
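Greg's opt-in model can be approximated today with purely cooperative 
machinery; a sketch (all names hypothetical) in which an interrupt request 
is only honoured once the thread has explicitly consented:

```python
import threading

class Interruptible:
    """Interruption is blocked by default; a thread consents via 'with'."""
    def __init__(self):
        self._interrupted = threading.Event()
        self._allowed = False

    def interrupt(self):            # called from another thread
        self._interrupted.set()

    def checkpoint(self):           # raises only if consent was given
        if self._allowed and self._interrupted.is_set():
            self._interrupted.clear()
            raise KeyboardInterrupt

    def __enter__(self):
        self._allowed = True        # consent begins...
        try:
            self.checkpoint()       # ...delivering any pending request
        except BaseException:
            self._allowed = False
            raise
        return self

    def __exit__(self, *exc):
        self._allowed = False       # consent ends
        return False

ctl = Interruptible()
ctl.interrupt()     # a request arrives while exceptions are blocked...
ctl.checkpoint()    # ...and is ignored: no consent yet
try:
    with ctl:       # consenting delivers the pending interrupt
        pass
except KeyboardInterrupt:
    print("interrupted at a safe point")
```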

--
Greg

From pje at telecommunity.com  Sat Aug 12 03:32:49 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Fri, 11 Aug 2006 21:32:49 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <mailman.34095.1155334580.27774.python-3000@python.org>
Message-ID: <5.1.1.6.0.20060811211801.02287420@sparrow.telecommunity.com>

At 3:16 PM 8/12/2006 -0700, Talin <talin at acm.org> wrote:
>Phillip J. Eby wrote:
> > At 06:10 AM 8/11/2006 -0700, Talin <talin at acm.org> wrote:
> >> Or to put it another way: If you create a tool, and you assume that tool
> >> will only be used in certain specific ways, but you fail to enforce that
> >> limitation, then your assumption will be dead wrong. The idea that there
> >> will only be a few type annotation providers who will all nicely
> >> cooperate with one another is just as naive as I was in the SysEx
> >> debacle.
> >
> > Are you saying that function annotations are a bad idea because we won't
> > be able to pickle them?
>
>Huh? What does pickling have to do with anything I said?

I'll happily answer that question as soon as you explain what *function 
annotations* have to do with anything you said.  Bonus points if you can 
explain what MIDI has to do with overloaded functions.  :)

To put it another way, the only reason I asked about pickling was to try to 
find *some* meaning in your post.  If pickling doesn't relate, then your 
post has nothing to do with function annotations, because pickling is the 
most similar thing to the programming problem you actually described.

However, if pickling *does* relate, then the mere existence of Python's 
ability to do pickling proves that the MIDI issue, transferred to the 
Python sphere, doesn't actually exist.

Thus, either way, the MIDI problems you described are moot with respect to 
function annotations in Python.

Is that clearer?  (See also my replies to Greg and Josiah on this subject.)


From lcaamano at gmail.com  Sat Aug 12 03:51:25 2006
From: lcaamano at gmail.com (Luis P Caamano)
Date: Fri, 11 Aug 2006 21:51:25 -0400
Subject: [Python-3000] threading, part 2
Message-ID: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>

That's how I feel too Josiah.  In some ways, it's the same as writing
device drivers in a pre-emptable kernel.  You can get interrupted and
pre-empted by the hardware at any freaking time in any piece of code
and your memory might go away so you better pin it and deal with the
interrupts.  Forget about that and you end up with a nice kernel
panic.  Still, we have all kinds of device drivers on SMP,
pre-emptable kernels.  It can be done.

[ sarcastic mode on ]

Yes, if it gets exposed to the language it should come with a big
warning ... now, how condescending should that warning be?  "You can't
use this unless you're a good programmer!"  or "You better know what
you're doing"  or how about "A guy once pulled out all his pubic hair
trying to figure out what happened when he started using this
feature!"?

[ sarcastic mode off]

It's a gun, here's a bullet, it's a tool, go get food but try not to
shoot yourself.

I'm also -0 on this, not that I think my opinion counts though.  I'm
-0 because Tomer pointed me to a nice recipe that uses ctypes to get
to the C interface.  I'm happy with that and we can start using it
right now.  Perhaps that's as exposed as it should get, making it an
automatic skill test: if you can find it, you probably know how to use
it and the kind of problems you might run into.


On 8/11/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
>
> I believe that if a user cannot design and implement their own system to
> handle when a thread can be killed or not to their own satisfaction,
> then they have no business killing threads.
>
>
>  - Josiah
>


-- 
Luis P Caamano
Atlanta, GA USA

From talin at acm.org  Sat Aug 12 04:17:37 2006
From: talin at acm.org (Talin)
Date: Fri, 11 Aug 2006 19:17:37 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060811211801.02287420@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811211801.02287420@sparrow.telecommunity.com>
Message-ID: <44DD3A41.10507@acm.org>

Phillip J. Eby wrote:
> At 3:16 PM 8/12/2006 -0700, Talin <talin at acm.org> wrote:
>> Phillip J. Eby wrote:
>> > At 06:10 AM 8/11/2006 -0700, Talin <talin at acm.org> wrote:
>> >> Or to put it another way: If you create a tool, and you assume that 
>> tool
>> >> will only be used in certain specific ways, but you fail to enforce 
>> that
>> >> limitation, then your assumption will be dead wrong. The idea that 
>> there
>> >> will only be a few type annotation providers who will all nicely
>> >> cooperate with one another is just as naive as I was in the SysEx
>> >> debacle.
>> >
>> > Are you saying that function annotations are a bad idea because we 
>> won't
>> > be able to pickle them?
>>
>> Huh? What does pickling have to do with anything I said?
> 
> I'll happily answer that question as soon as you explain what *function 
> annotations* have to do with anything you said.  Bonus points if you can 
> explain what MIDI has to do with overloaded functions.  :)

All right. I realize that not everyone made the connection between my 
parable and the current debate, and I need to spell it out more explicitly.

The parable is essentially about standards writers who fail to do their 
job by underspecifying certain aspects of the standard, leaving the 
solution to individual implementers; and it's also about how the 
implementers who try to fill in the missing pieces of the standard do so 
in ways that are unique and incompatible with what every other 
implementer is doing.

The story also has to do with people who assume things about the 
behavior of other software developers - specifically, my assumption that 
other people, working in isolation from one another, would come up with 
the same or similar solutions to a given problem, vs. Colin's assumption 
that other creators of annotation interpreters would coordinate their 
efforts in a sensible way.

What the annotation PEP and the SysEx have in common is that they are 
both dealing with an open-ended specification - one which allows any 
provider to extend the protocol in any way they wish, without any 
knowledge or coordination from any other provider. Both specs describe a 
'container' for information, but deliberately avoid saying what's in the 
container. Both specs fail to provide any means for an external entity 
to discover what the objects in the container mean - instead, external 
entities must have a priori knowledge of the contained data.

My criticism of Colin's PEP was that it hand-waved over some fairly 
major problems, and the logic behind the hand-wave was that, well, 
developers won't do that - there's only going to be a small number of 
such developers, and they will all deal with each other. I wanted to 
illustrate how disastrous such an assumption could be.

Another lesson of the story has to do with the failure of the MMA 
committee to specify any guidelines or hints as to how their open-ended 
protocol should be used. If the MMA had simply put a paragraph in the 
original standard saying "You are free to create any protocol format you 
want, but here's an example of how a bulk dump protocol might work" 
(followed by a description of such), then what would have happened is 
that most of the instrument makers would simply have used the example as 
a starting point. This would have saved millions of man-hours of 
confusion and chaos over the last 20 years. Dozens of companies created 
Universal Librarian products, and all of them had to deal with the 
astounding diversity of protocols, which could have been avoided by one 
little non-binding paragraph in the standard.

In other words, I criticize both the MMA's spec and Colin's for the sin 
of underspecification - that is, allowing critical decisions that 
*should* have been made by the standard writer to instead be made by the 
standard implementers, with the result that each implementer comes up 
with their own unique solution to a problem which should have been 
solved in the original standard doc.

-- Talin


From collinw at gmail.com  Sat Aug 12 04:43:43 2006
From: collinw at gmail.com (Collin Winter)
Date: Fri, 11 Aug 2006 22:43:43 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DD3A41.10507@acm.org>
References: <5.1.1.6.0.20060811211801.02287420@sparrow.telecommunity.com>
	<44DD3A41.10507@acm.org>
Message-ID: <43aa6ff70608111943o1fb05d1eq753157bc4fc53ccb@mail.gmail.com>

On 8/11/06, Talin <talin at acm.org> wrote:
> The story also has to do with people who assume things about the
> behavior of other software developers - specifically, my assumption that
> other people, working in isolation from one another, would come up with
> the same or similar solutions to a given problem, vs. Colin's assumption
> that other creators of annotation interpreters would coordinate their
> efforts in a sensible way.

I make no assumptions that people writing annotation interpreters will
coordinate their efforts. My assertion that "[t]here is no worry that
these libraries will assign semantics at random, or that a variety of
libraries will appear, each with varying semantics and interpretations
of what, say, a tuple of strings means." is not based on coordination
but rather the marketplace. If someone starts assigning semantics that
aren't "pythonic", that don't fit in with how the majority of Python
programmers think, no-one will use their library and it will die.

The drive to write, release and maintain open-source software is
predicated on a desire to have people use your product, to find it
useful. To that end, I expect that the creators of annotation
interpreters will take care to maximise the utility (and hence the
audience) for their library.

> What the annotation PEP and the SysEx have in common is that they are
> both dealing with an open-ended specification - one which allows any
> provider to extend the protocol in any way they wish, without any
> knowledge or coordination from any other provider.

In your long parable, you've ignored the key difference between the
open-ended-ness of my PEP and that of SysEx: there are much greater
environmental constraints on people writing interpreters for function
annotations. The only constraint on developers using SysEx is
"anything you can turn into bytes".

> Another lesson of the story has to do with the failure of the MMA
> committee to specify any guidelines or hints as to how their open-ended
> protocol should be used.

I agree that the PEP needs to include some guidance for those writing
annotation interpreters (such as how to anticipate being used in
conjunction with other interpreters), but I see no merit in setting in
stone a list of officially endorsed uses for function annotations.

> In other words, I criticize both the MMA's spec and Colin's for the sin
> of underspecification - that is, allowing critical decisions that
> *should* have been made by the standard writer to instead be made by the
> standard implementers, with the result that each implementer comes up
> with their own unique solution to a problem which should have been
> solved in the original standard doc.

Are you referring to the fact that the PEP doesn't dictate how lists,
tuples, etc are to be interpreted, or still to the fact that I didn't
include a paragraph talking about interpreter chaining?

> -- Talin

Collin Winter

PS: My name has 2 L's in it.

From pje at telecommunity.com  Sat Aug 12 04:52:57 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Fri, 11 Aug 2006 22:52:57 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <mailman.34113.1155347488.27774.python-3000@python.org>
Message-ID: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>

At 03:39 PM 8/12/2006 -0700, Talin <talin at acm.org> wrote:
>Say I want to annotate a specific argument with two pieces of
>information, a type and a docstring. I have two metadata interpreters,
>one which uses the type information to restrict the kinds of arguments
>that can be passed in, and another which uses the docstring to enhance
>the generated documentation.
>
>Now, lets say that these two metadata interpreters were written by two
>people, who are not in communication with each other. Each one decides
>that they would like to "play nice" with other competing metadata.
>
>So Author A, who wrote the annotation decorator that looks for
>docstrings, decides that not only will he accept docstring annotations,
>but if the annotation is a tuple, then he will search that tuple for any
>docstrings, skipping over any annotations that he doesn't understand.
>(Although how he is supposed to manage that is unclear - since there
>could also be other annotations that are simple text strings as well.)
>
>Author B, who wrote the type-enforcement module, also wants to play nice
>with others, but since he doesn't know A, comes up with a different
>solution. His idea is to create a system in which annotations
>automatically chain each other - so that each annotation has a "next"
>attribute referring to the next annotation.
>
>So programmer C, who wants to incorporate both A and B's work into his
>program, has a dilemma - each has a sharing mechanism, but the sharing
>mechanisms are different and incompatible. So he is unable to apply both
>A-type and B-type metadata to any given signature.

Not at all.  A and B need only use overloadable functions, and the problem 
is trivially resolved by adding overloads.  The author of C can add an 
overload to "A" that will handle objects with 'next' attributes, or add one 
to "B" that handles tuples, or both.

I've not bothered to reply to the rest of your email, since it depends on 
assumptions that I've already shown to be invalid.


From pje at telecommunity.com  Sat Aug 12 05:01:38 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Fri, 11 Aug 2006 23:01:38 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <mailman.34113.1155347488.27774.python-3000@python.org>
Message-ID: <5.1.1.6.0.20060811225402.0228c178@sparrow.telecommunity.com>

At 07:49 PM 8/12/2006 -0400, "Collin Winter" <collinw at gmail.com> wrote:
>What Josiah is hinting at -- and what Talin describes more explicitly
>-- is the problem of how exactly "chaining" annotation interpreters
>will work.

I'd prefer we not use the word "interpreters" to describe operations that 
use annotations.  It carries a lot of excess baggage.


>The case I've thought out the most completely is that of using
>decorators to analyse/utilise the annotations:
>
>1) Each decorator should be written with the assumption that it is the
>only decorator that will be applied to a given function (with respect
>to annotations).
>
>2) Chaining will be accomplished by maintaining this illusion for each
>decorator. For example, if our annotation-sharing convention is that
>annotations will be n-tuples (n == number of annotation-interpreting
>decorators), where t[i] is the annotation the i-th decorator should
>care about, the following chain() function will do the trick (a full
>demo script is attached):

I don't see the point of this.  A decorator should be responsible for 
manipulating the signature of its return value.  Meanwhile, the semantics 
for combining annotations should be defined by an overloaded function like 
"combineAnnotations(a1,a2)" that returns a new annotation.  There is no 
need to have a special chaining decorator.
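A sketch of such a combining hook, with functools.singledispatch standing 
in for the overloaded-function machinery (the combineAnnotations name 
comes from the message above; everything else is invented):

```python
from functools import singledispatch

@singledispatch
def combine_annotations(a1, a2):
    # Default policy: collect the two annotations into a flat tuple.
    return (a1, a2)

@combine_annotations.register(tuple)
def _(a1, a2):
    # An existing tuple of annotations absorbs the new one.
    return a1 + (a2,)

anns = combine_annotations(int, "doc")
anns = combine_annotations(anns, "more")
print(anns)  # → (<class 'int'>, 'doc', 'more')
```

Anyone with a custom annotation type can register their own combining 
rule, which is the point: no chaining convention has to be fixed up front.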

May I suggest that you try using Guido's Py3K overloaded function 
prototype?  I expect you'll find that if you play around with it a bit, it 
will considerably simplify your view of what's required to do this.  It 
truly isn't necessary to predefine what an annotation is, or even any 
structural constraints on how they will be combined, since the user is able 
to define for any given type how such things will be handled.
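To make the shape of this concrete: a minimal sketch of what such an overloadable combiner could look like. This is a plain dict-dispatch stand-in, not Guido's actual overloading prototype; all names here are hypothetical:

```python
# Hypothetical sketch of an overloadable combineAnnotations, in the
# spirit of the suggestion above.  Dispatch is a plain dict keyed on
# the pair of argument types -- a stand-in for a real generic function.

_combiners = {}

def overload(t1, t2):
    """Register a combiner for a pair of annotation types."""
    def register(func):
        _combiners[t1, t2] = func
        return func
    return register

def combineAnnotations(a1, a2):
    """Combine two annotations using whichever overload was registered."""
    try:
        return _combiners[type(a1), type(a2)](a1, a2)
    except KeyError:
        raise TypeError("no combiner registered for %r + %r"
                        % (type(a1), type(a2)))

# Author C can teach A's string annotations to coexist with B's tuples
# without either library knowing about the other:
@overload(str, tuple)
def _combine_str_tuple(a1, a2):
    return (a1,) + a2
```

The point being that neither library A nor library B has to agree on a shared annotation structure in advance; the user registers the glue.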


From qrczak at knm.org.pl  Sat Aug 12 06:06:53 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Sat, 12 Aug 2006 06:06:53 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <20060811125449.1940.JCARLSON@uci.edu> (Josiah Carlson's
	message of "Fri, 11 Aug 2006 13:12:15 -0700")
References: <20060811105742.193A.JCARLSON@uci.edu>
	<87veozoyo9.fsf@qrnik.zagroda> <20060811125449.1940.JCARLSON@uci.edu>
Message-ID: <877j1emwbm.fsf@qrnik.zagroda>

Josiah Carlson <jcarlson at uci.edu> writes:

> Threading is already difficult enough to do 'right' (see the dozens
> of threads discussing why this is really the case), and designing
> software that can survive the raising of an exception at any point
> makes threading even more difficult.

That's why I'm proposing to provide ways to limit those "any points".

> I believe that you are attempting to design an interface to make
> this particular feature foolproof.

No, I'm merely attempting to make it usable.

> I think that such is a mistake; killing a thread should be frought
> with gotchas and should be documented as "may crash the runtime".

You are proposing to make it unusable?

> Offering users anything more is tantamount to encouraging its use,
> which is counter to the reasons why it is not available via a
> standard threading.function call: because it shouldn't be used at
> all, except by people who know what the heck they are doing.

Indeed, you are proposing to make it unusable.

> I believe that if a user cannot design and implement their own
> system to handle when a thread can be killed or not to their own
> satisfaction, then they have no business killing threads.

I have already implemented it. In my own language, where I have
full control over the runtime.

Some Haskell people made the first design a few years ago,
and implemented it in Glasgow Haskell Compiler.
http://citeseer.ist.psu.edu/415348.html

Some people saw that it was good, that the existing handling of
KeyboardInterrupt in Python is unsafe, and they adapted the design
for Python (without actually implementing it as far as I know).
http://www.cs.williams.edu/~freund/papers/02-lwl2.ps

I built on their experience, extended the design, and implemented it
in my language Kogut, so I can play with it and see how it works in
practice.
http://www.cs.ioc.ee/tfp-icfp-gpce05/tfp-proc/06num.pdf

I'm quite confident that something like this is the right design,
even if some details could be changed.

Now it would be nice if Python had usable asynchronous exceptions too.

If we are not brave enough, we can implement at least an equivalent
of POSIX thread cancellation. It would be better than nothing, though
not as useful, because the default mode allows interruption only at
certain blocking primitives. In this scenario Unix signals need a
different policy so a pure computation not performing I/O nor thread
synchronization can be interrupted; Unix signals usually cause the
whole process to abort so data integrity was less of a concern.

A language with GC and exceptions can do better, with a unified policy
for thread cancellation and Unix signals and other asynchronous events.
It can be done such that well-written libraries are safely interruptible
even if exceptions may occur almost anywhere. Protection should be
built into certain operations (e.g. try...finally extended with an
"initially" clause, or taking a mutex), so that there is less work
needed to make code safe to be interrupted; then quite often it's
already safe.
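A rough pure-Python sketch of the masked-region idea described above (hypothetical API; a real implementation would need runtime support to actually intercept delivery of the exception):

```python
import threading

_state = threading.local()

class masked(object):
    """Context manager sketch: defer an asynchronous exception delivered
    inside the block until the block exits -- the kind of protection an
    'initially' clause or mutex acquisition would build in."""
    def __enter__(self):
        _state.masked = True
        _state.pending = None
    def __exit__(self, *exc_info):
        _state.masked = False
        pending, _state.pending = _state.pending, None
        # Re-raise the deferred exception only if the block exited cleanly.
        if pending is not None and exc_info == (None, None, None):
            raise pending
        return False

def deliver_async(exc):
    """What the runtime would do on an async exception: raise it now,
    unless the target thread is inside a masked region."""
    if getattr(_state, 'masked', False):
        _state.pending = exc
    else:
        raise exc
```

With this, a critical section runs to completion and the interruption lands at its boundary instead of at an arbitrary bytecode.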

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From collinw at gmail.com  Sat Aug 12 06:33:28 2006
From: collinw at gmail.com (Collin Winter)
Date: Sat, 12 Aug 2006 00:33:28 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060811225402.0228c178@sparrow.telecommunity.com>
References: <mailman.34113.1155347488.27774.python-3000@python.org>
	<5.1.1.6.0.20060811225402.0228c178@sparrow.telecommunity.com>
Message-ID: <43aa6ff70608112133w7eb2d0c6x287c021b108974b@mail.gmail.com>

> I don't see the point of this.  A decorator should be responsible for
> manipulating the signature of its return value.  Meanwhile, the semantics
> for combining annotations should be defined by an overloaded function like
> "combineAnnotations(a1,a2)" that returns a new annotation.  There is no
> need to have a special chaining decorator.
>
> May I suggest that you try using Guido's Py3K overloaded function
> prototype?  I expect you'll find that if you play around with it a bit, it
> will considerably simplify your view of what's required to do this.  It
> truly isn't necessary to predefine what an annotation is, or even any
> structural constraints on how they will be combined, since the user is able
> to define for any given type how such things will be handled.

I've looked at Guido's overloaded function prototype, and while I
think I'm heading in the direction of understanding, I'm not quite there 100%.

Could you illustrate (in code) what you've got in mind for how to
apply overloaded functions to this problem space?

Collin Winter

From talin at acm.org  Sat Aug 12 06:49:52 2006
From: talin at acm.org (Talin)
Date: Fri, 11 Aug 2006 21:49:52 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
Message-ID: <44DD5DF0.40405@acm.org>

Phillip J. Eby wrote:
> Not at all.  A and B need only use overloadable functions, and the 
> problem is trivially resolved by adding overloads.  The author of C can 
> add an overload to "A" that will handle objects with 'next' attributes, 
> or add one to "B" that handles tuples, or both.


I'm still not sure what you are talking about - what is being overloaded 
here?

Let me give you a better example. Suppose I have a 'docstring' 
annotation and a 'getopt' annotation. The docstring annotation 
associates a string with each argument, which can be inspected by an 
external documentation scanner to produce documentation for that argument.

Thus:

    def myfunc( x : "The x coordinate", y : "The y coordinate" )
       ...

The 'getopt' annotation is used in conjunction with the 'getopt' 
decorator, which converts from command-line arguments to python method 
arguments. The idea is that you have a class that is acting as a back 
end to a command-line shell. Each method in the class corresponds to a 
single command. The annotations allow you to associate specific flags or 
switches with particular arguments. So:

class MyHandler( CommandLineHandler ):

    @getopt
    def list( infile:"i" = sys.stdin, outfile:"o" = sys.stdout ):
       ...

With the getopt handler in place, I can type the following shell command:

    list -i <infile> -o <outfile>

If either the -i or -o switch is omitted, then the corresponding 
argument is either stdin or stdout.

Additionally, the getopt module can generate 'usage' information for the 
function in question:

    Usage: list [-i infile] [-o outfile]

Now, what happens if I want to use both docstrings and the getopt 
decorator on the same function? They both expect to see annotations that 
are strings! How do the doc extractor and the getopt decorator know 
which strings belong to them, and which strings they should ignore?

-- Talin

From slawomir.nowaczyk.847 at student.lu.se  Sat Aug 12 08:22:17 2006
From: slawomir.nowaczyk.847 at student.lu.se (Slawomir Nowaczyk)
Date: Sat, 12 Aug 2006 08:22:17 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>
References: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>
Message-ID: <20060812082034.EFEC.SLAWOMIR.NOWACZYK.847@student.lu.se>

On Fri, 11 Aug 2006 21:51:25 -0400
Luis P Caamano <lcaamano at gmail.com> wrote:

#> That's how I feel too Josiah.  In some ways, it's the same as writing
#> device drivers in a pre-emptable kernel.  You can get interrupted and
#> pre-empted by the hardware at any freaking time in any piece of code
#> and your memory might go away so you better pin it and deal with the
#> interrupts.  Forget about that and you end up with a nice kernel
#> panic.  Still, we have all kinds of device drivers on SMP,
#> pre-emptable kernels.  It can be done.

Of course it can... but do we *really* want programming in Python3k to
be comparable in difficulty to writing device drivers?

-- 
 Best wishes,
   Slawomir Nowaczyk
     ( Slawomir.Nowaczyk at cs.lth.se )

Numeric stability is probably not all that important when you're guessing.


From ncoghlan at gmail.com  Sat Aug 12 08:58:47 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 12 Aug 2006 16:58:47 +1000
Subject: [Python-3000] threading, part 2
In-Reply-To: <20060812082034.EFEC.SLAWOMIR.NOWACZYK.847@student.lu.se>
References: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>
	<20060812082034.EFEC.SLAWOMIR.NOWACZYK.847@student.lu.se>
Message-ID: <44DD7C27.9000006@gmail.com>

Slawomir Nowaczyk wrote:
> On Fri, 11 Aug 2006 21:51:25 -0400
> Luis P Caamano <lcaamano at gmail.com> wrote:
> 
> #> That's how I feel too Josiah.  In some ways, it's the same as writing
> #> device drivers in a pre-emptable kernel.  You can get interrupted and
> #> pre-empted by the hardware at any freaking time in any piece of code
> #> and your memory might go away so you better pin it and deal with the
> #> interrupts.  Forget about that and you end up with a nice kernel
> #> panic.  Still, we have all kinds of device drivers on SMP,
> #> pre-emptable kernels.  It can be done.
> 
> Of course it can... but do we *really* want programming in Python3k to
> be comparable in difficulty to writing device drivers?
> 

No, but "programming in Py3k" and "trying to asynchronously terminate an 
active thread in Py3k without active cooperation from that thread" are not 
really the same thing. Making easy things easy and difficult things possible 
is a good goal - making difficult things appear to be deceptively easy is a 
good way to cause problems down the road :)

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Sat Aug 12 09:58:08 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 12 Aug 2006 17:58:08 +1000
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
Message-ID: <44DD8A10.1040808@gmail.com>

Phillip J. Eby wrote:
> At 03:39 PM 8/12/2006 -0700, Talin <talin at acm.org> wrote:
>> So programmer C, who wants to incorporate both A and B's work into his
>> program, has a dilemma - each has a sharing mechanism, but the sharing
>> mechanisms are different and incompatible. So he is unable to apply both
>> A-type and B-type metadata to any given signature.
> 
> Not at all.  A and B need only use overloadable functions,

Stop right there. "A and B need only use overloadable functions"? That sounds 
an awful lot like placing a constraint on the way annotation libraries are 
implemented in order to facilitate a single program using multiple annotation 
libraries - which is exactly what Talin is saying is needed!

Talin is saying "the annotation PEP needs to recommend a mechanism that allows 
a single program to use multiple annotation libraries". And you're saying "a 
good mechanism for allowing a program to use multiple annotation libraries is 
for every annotation library to expose an overloaded 'interpret_annotation' 
function that the application can hook in order to handle new annotation types".

I think you're right that overloaded functions are a possible solution to this 
problem, but that doesn't obviate the need for the PEP to address the question 
explicitly (and using overloaded functions for this strikes me as hitting a 
very small nail with a very large hammer).

With the function overloading solution, you would need to do three things in 
order to get two frameworks to cooperate:
   1. Define your own Annotation type and register it with the frameworks you 
are using
   2. Define a decorator to wrap the annotations in a function __signature__ 
into your custom annotation type
   3. Apply your decorator to functions before the decorators for the 
annotation libraries are invoked

Overloading a standard type (like tuple) wouldn't work, as you might have two 
different modules, both using the same annotation library, that want it to 
interpret tuples in two different ways (e.g. in module A, the library's info 
is at index 0, while in module B it is at index 1).

So, for example:

   @library_A_type_processor
   @library_B_docstring_processor
   @handle_annotations
   def func(a: (int, "an int"),
            b: (str, "a string"))
             -> (str, "returns a string, too!"):
     # do something

   def handle_annotations(f):
      note_dict = f.__signature__.annotations
      for param, note in note_dict.items():
              note_dict[param] = MyAnnotation(note)
      return f

However, what we're really talking about here is a scenario where you're 
defining your *own* custom annotation processor: you want the first part of 
the tuple in the expression handled by the type processing library, and the 
second part handled by the docstring processing library.

Which says to me that the right solution is for the annotation to be split up 
into its constituent parts before the libraries ever see it.

This could be done as Collin suggests by tampering with 
__signature__.annotations before calling each decorator, but I think it is 
cleaner to do it by defining a particular signature for decorators that are 
intended to process annotations.

Specifically, such decorators should accept a separate dictionary to use in 
preference to the annotations on the function itself:

   def process_function_annotations(f, annotations=None):
     # Process the function f
     # If annotations is not None, use it
     # otherwise, get the annotations from f.__signature__

Then our function declaration and decorator would look like:

   @handle_annotations
   def func(a: (int, "an int"), b: (str, "a string")) -> (str, "returns!"):
     # do something


   def handle_annotations(f):
      decorators = library_A_type_processor, library_B_docstring_processor
      note_dicts = {}, {}
      for param, note in f.__signature__.annotations.iteritems():
          for note_dict, subnote in zip(note_dicts, note):
              note_dict[param] = subnote
      for decorator, note_dict in zip(decorators, note_dicts):
          f = decorator(f, note_dict)
      return f

Writing a factory function to handle chaining of an arbitrary number of 
annotation-interpreting libraries would be trivial, with the set of decorators 
provided as positional arguments if your notes are in a tuple, and as 
keyword arguments if the notes are in a dictionary.
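That factory might be sketched as follows (hypothetical helper name; it assumes tuple-shaped annotations and decorators that accept an explicit note dict, as in the example above):

```python
def make_annotation_chainer(*decorators):
    """Build a decorator that splits each parameter's annotation tuple
    positionally across the given annotation-processing decorators."""
    def handle_annotations(f):
        # One note dict per cooperating decorator.
        note_dicts = tuple({} for _ in decorators)
        for param, notes in f.__signature__.annotations.items():
            for note_dict, subnote in zip(note_dicts, notes):
                note_dict[param] = subnote
        # Hand each decorator only the notes meant for it.
        for decorator, note_dict in zip(decorators, note_dicts):
            f = decorator(f, note_dict)
        return f
    return handle_annotations
```

A keyword-argument variant for dict-shaped notes would look the same, except each parameter's note dict would be indexed by decorator name instead of by position.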

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Sat Aug 12 10:13:44 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 12 Aug 2006 18:13:44 +1000
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
References: <43aa6ff70608091732o150a6674t4416f4b76d8bc40b@mail.gmail.com>
Message-ID: <44DD8DB8.3050102@gmail.com>

Collin Winter wrote:
> Return Values
> -------------
> 
> The examples thus far have omitted examples of how to annotate the
> type of a function's return value. This is done like so:
> 
> ::
>     def sum(*vargs: Number) -> Number:
>         ...
> 
> 
> The parameter list can now be followed by a literal ``->`` and
> a Python expression.  Like the annotations for parameters, this
> expression will be evaluated when the function is compiled.

I'd like to request that the annotation for the return type be *inside* the 
parentheses for the parameter list. Why, you ask?

Because, as soon as the annotations are at all verbose, you're going to want 
to split the function definition up so that each parameter gets its own line.

For the parameters, this works beautifully because parenthesis matching keeps 
the compiler from getting upset:

   def sum(seq: "the sequence of values to be added",
           init=0: "the initial value of the total"):
       # do it

But now try to document the return type on its own line:

   def sum(seq: "the sequence of values to be added",
           init=0: "the initial value of the total")
           -> "the summation of the sequence":
       # do it

Kaboom - SyntaxError on the second line because of the missing colon. However, 
if the return type annotation is *inside* the parentheses and separated by a 
comma, there's no problem:

   def sum(seq: "the sequence of values to be added",
           init=0: "the initial value of the total",
           -> "the summation of the sequence"):
       # do it

Having to use a line continuation just to be able to annotate the return type 
on a separate line would be an annoyance.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From jcarlson at uci.edu  Sat Aug 12 10:35:02 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sat, 12 Aug 2006 01:35:02 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>
References: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>
Message-ID: <20060812012526.195B.JCARLSON@uci.edu>


"Luis P Caamano" <lcaamano at gmail.com> wrote:
> It's a gun, here's a bullet, it's a tool, go get food but try not to
> shoot yourself.
> 
> I'm also -0 on this, not that I think my opinion counts though.  I'm
> -0 because Tomer pointed me to a nice recipe that uses ctypes to get
> to the C interface.  I'm happy with that and we can start using it
> right now.  Perhaps that should be as high as it gets expose so that
> it would be an automatic skill test?  If you can find it, you probably
> know how to use it and the kind of problems you might run into.

Remember that the meat of Tomer's recipe, the ctypes call, is the only
thing that is going to be documented in Python 2.5.  The functionality
of being able to kill threads with exceptions has existed since Python
2.3 (if I understood previous postings correctly), but has been
generally undocumented.  Because it is literally just a documentation
change, and not actually additional functionality, it *can* go into
Python 2.5.  All other feature additions are too late in the beta cycle
(Beta 3 is next week) to be added, unless someone manages to convince
the release manager that it should be allowed (I would put money on
that not happening).


 - Josiah

> On 8/11/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> >
> >
> > I believe that if a user cannot design and implement their own system to
> > handle when a thread can be killed or not to their own satisfaction,
> > then they have no business killing threads.
> >
> >
> >  - Josiah
> >
> 
> 
> -- 
> Luis P Caamano
> Atlanta, GA USA
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/jcarlson%40uci.edu


From jcarlson at uci.edu  Sat Aug 12 11:07:29 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sat, 12 Aug 2006 02:07:29 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <877j1emwbm.fsf@qrnik.zagroda>
References: <20060811125449.1940.JCARLSON@uci.edu>
	<877j1emwbm.fsf@qrnik.zagroda>
Message-ID: <20060812013530.195E.JCARLSON@uci.edu>


"Marcin 'Qrczak' Kowalczyk" <qrczak at knm.org.pl> wrote:
> Josiah Carlson <jcarlson at uci.edu> writes:
> > Threading is already difficult enough to do 'right' (see the dozens
> > of threads discussing why this is really the case), and designing
> > software that can survive the raising of an exception at any point
> > makes threading even more difficult.
> 
> That's why I'm proposing to provide ways to limit those "any points".
> 
> > I believe that you are attempting to design an interface to make
> > this particular feature foolproof.
> 
> No, I'm merely attempting to make it usable.
> 
> You are proposing to make it unusable?
> 
> Indeed, you are proposing to make it unusable.

Because you or anyone else can define a standard mechanism for handling
these points where threads are allowed to be killed, and can publish
it on the internet via the Python cookbook, etc., having nothing in the
standard library specifically supporting the operation doesn't make
anything unusable.

I'm not proposing to make it unusable, merely that it should not be made
any easier to use.  See Nick Coghlan's comment with regards to '...easy
things easy...'.


> > I believe that if a user cannot design and implement their own
> > system to handle when a thread can be killed or not to their own
> > satisfaction, then they have no business killing threads.
> 
> I have already implemented it. In my own language, where I have
> full control over the runtime.

I'm glad that you have managed to implement it in your programming
language.  But this discussion isn't about Kogut, Haskell, etc., this is
about Python.  Specifically what should and should not be available in
the Python standard library.

I've said it before, but apparently the following point is ignored, so
I'll say it again.  The 'kill thread' mechanism isn't available via some
threading.kill_thread(thr) function because Guido and other core
developers *of* Python do not want it to be generally acceptable for
users to kill arbitrary threads.  The introduction of methods of
controlling where a thread could be killed into the standard library
would be encouraging the 'kill thread' usage.

It would be far safer (and much less work for the developers of Python)
for users to just learn how to handle thread quitting using any of the
standard methods of doing so (check the value of a variable, wait for a
signal, etc.).

Never mind that any feature is going to have to wait 18+ months before
Python 2.6 comes out in order to get your proposed changes in.


> Now it would be nice if Python had usable asynchronous exceptions too.

Python has had usable asynchronous exceptions since Python 2.3 [1].


> If we are not brave enough, we can implement at least an equivalent
> of POSIX thread cancellation. It would be better than nothing, though
> not as useful, because the default mode allows interruption only at
> certain blocking primitives. In this scenario Unix signals need a
> different policy so a pure computation not performing I/O nor thread
> synchronization can be interrupted; Unix signals usually cause the
> whole process to abort so data integrity was less of a concern.
> 
> A language with GC and exceptions can do better, with a unified policy
> for thread cancellation and Unix signals and other asynchronous events.
> It can be done such that well-written libraries are safely interruptible
> even if exceptions may occur almost anywhere. Protection should be
> built into certain operations (e.g. try...finally extended with an
> "initially" clause, or taking a mutex), so that there is less work
> needed to make code safe to be interrupted; then quite often it's
> already safe.

I don't have much of a comment with regards to attempted unification of
signals, etc., as Windows signal handling is effectively useless (and my
primary development platform tends to be Windows).


 - Josiah

[1]
Python 2.3.5 (#62, Feb  8 2005, 16:23:02) [MSC v.1200 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import ctypes
>>> import threading
>>> import time
>>> def foo():
...     try:
...             while 1:
...                     time.sleep(.01)
...     finally:
...             print "I quit!"
...
>>> x = threading.Thread(target=foo)
>>> x.start()
>>> for i,j in threading._active.items():
...     if j is x:
...             break
...
>>> ctypes.pythonapi.PyThreadState_SetAsyncExc(i, ctypes.py_object(Exception))
1
>>> I quit!
Exception in thread Thread-2:Traceback (most recent call last):
  File "C:\python23\lib\threading.py", line 442, in __bootstrap
    self.run()
  File "C:\python23\lib\threading.py", line 422, in run
    self.__target(*self.__args, **self.__kwargs)
  File "<stdin>", line 4, in foo
Exception



From tim.peters at gmail.com  Sat Aug 12 12:29:07 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Sat, 12 Aug 2006 06:29:07 -0400
Subject: [Python-3000] threading, part 2 --- + a bit of ctypes FFI worry
Message-ID: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>

[Josiah Carlson]
> ...
> Python 2.3.5 (#62, Feb  8 2005, 16:23:02) [MSC v.1200 32 bit (Intel)] on win32
> Type "help", "copyright", "credits" or "license" for more information.
> >>> import ctypes
> >>> import threading
> >>> import time
> >>> def foo():
> ...     try:
> ...             while 1:
> ...                     time.sleep(.01)
> ...     finally:
> ...             print "I quit!"
> ...
> >>> x = threading.Thread(target=foo)
> >>> x.start()
> >>> for i,j in threading._active.items():
> ...     if j is x:
> ...             break
> ...
> >>> ctypes.pythonapi.PyThreadState_SetAsyncExc(i, ctypes.py_object(Exception))

As I discovered to my chagrin when I added a similar test to the test
suite a few days ago, that's got a subtle error on most 64-bit boxes.
When the ctypes docs talk about passing and returning integers, they
never explain what "integers" /means/, but it seems the docs
implicitly have a 32-bit-only view of the world here.  In reality
"integer" seems to mean the native C `int` type.  But a Python thread
id is a native C `long` (== a Python short integer), and the code
above fails in a baffling way on most 64-bit boxes:  the call returns
0 instead; i.e. the thread id isn't found, and no exception gets set.
So I believe that needs to be:

    ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_long(i),
        ctypes.py_object(Exception))

to make it portable.

It's unclear to me how to write portable ctypes code in the presence
of a gazillion integer typedefs and #defines, such as for Py_ssize_t.
That doesn't map to a fixed C integral type cross-platform, so what
can you do?  You're not required to answer that ;-)
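One partial answer, sketched below (an assumption-laden workaround, not a complete solution): probe sizes at runtime with ctypes.sizeof and pick the integer type that matches. Py_ssize_t, at least, is defined to have the same size as size_t:

```python
import ctypes

def int_type_of_size(nbytes, signed=True):
    """Return a ctypes integer type with the given size in bytes."""
    if signed:
        candidates = [ctypes.c_byte, ctypes.c_short, ctypes.c_int,
                      ctypes.c_long, ctypes.c_longlong]
    else:
        candidates = [ctypes.c_ubyte, ctypes.c_ushort, ctypes.c_uint,
                      ctypes.c_ulong, ctypes.c_ulonglong]
    for t in candidates:
        if ctypes.sizeof(t) == nbytes:
            return t
    raise ValueError("no %d-byte integer type found" % nbytes)

# Py_ssize_t is the signed counterpart of size_t, so match its size:
Py_ssize_t = int_type_of_size(ctypes.sizeof(ctypes.c_size_t), signed=True)
```

This only helps for typedefs whose size you can probe or already know; arbitrary #defines remain out of reach without header parsing.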

Thread ids may bite us someday too.  Python casts the platform's
notion of a thread id to C `long`, but there's no guarantee this won't
lose information (or is even legal) on all platforms.  We'd probably
be safer casting to, e.g., Py_uintptr_t (some thread implementations
return an index into a kernel or library thread-info table, but at
least some in my lifetime returned a pointer to a thread-info struct,
and that's definitely fatter than C `long` on some boxes).

> 1
> >>> I quit!
> Exception in thread Thread-2:Traceback (most recent call last):
>   File "C:\python23\lib\threading.py", line 442, in __bootstrap
>     self.run()
>   File "C:\python23\lib\threading.py", line 422, in run
>     self.__target(*self.__args, **self.__kwargs)
>   File "<stdin>", line 4, in foo
> Exception

It's really cool that you can do this from ctypes, eh?  That's exactly
the right level of abstraction for this attractive nuisance too ;-)

From greg.ewing at canterbury.ac.nz  Sat Aug 12 13:05:08 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 12 Aug 2006 23:05:08 +1200
Subject: [Python-3000] threading, part 2 --- + a bit of ctypes FFI worry
In-Reply-To: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
References: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
Message-ID: <44DDB5E4.9010903@canterbury.ac.nz>

Tim Peters wrote:

> It's unclear to me how to write portable ctypes code in the presence
> of a gazillion integer typedefs and #defines, such as for Py_ssize_t.

A start would be to have constants in the ctypes module
for Py_ssize_t and other such Python-defined API types.

--
Greg

From l.oluyede at gmail.com  Sat Aug 12 13:11:47 2006
From: l.oluyede at gmail.com (Lawrence Oluyede)
Date: Sat, 12 Aug 2006 13:11:47 +0200
Subject: [Python-3000] threading, part 2 --- + a bit of ctypes FFI worry
In-Reply-To: <44DDB5E4.9010903@canterbury.ac.nz>
References: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
	<44DDB5E4.9010903@canterbury.ac.nz>
Message-ID: <9eebf5740608120411m40da5724r11700fdbe509914@mail.gmail.com>

On 8/12/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Tim Peters wrote:
>
> > It's unclear to me how to write portable ctypes code in the presence
> > of a gazillion integer typedefs and #defines, such as for Py_ssize_t.
>
> A start would be to have constants in the ctypes module
> for Py_ssize_t and other such Python-defined API types.

rctypes and pypy tools are somewhat one step further than ctypes
machinery. In rctypes you can easily do something like:

size_t = ctypes_platform.SimpleType("size_t", c_ulong)

In this way you have a platform-safe data type to use in your code. The
second argument of SimpleType() is a hint for the tool.

You can also use ConstantInteger() and DefinedConstantInteger() to get
values of constants in header files like this:

BUFSIZ = ctypes_platform.ConstantInteger("BUFSIZ")

Maybe one day this can be ported to CPython ctypes from the RPython one.

-- 
Lawrence
http://www.oluyede.org/blog

From aahz at pythoncraft.com  Sat Aug 12 15:42:44 2006
From: aahz at pythoncraft.com (Aahz)
Date: Sat, 12 Aug 2006 06:42:44 -0700
Subject: [Python-3000]  Python 2.5 release schedule (was: threading, part 2)
In-Reply-To: <20060812012526.195B.JCARLSON@uci.edu>
References: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>
	<20060812012526.195B.JCARLSON@uci.edu>
Message-ID: <20060812134244.GA29374@panix.com>

[added python-dev to make sure everyone sees this]

On Sat, Aug 12, 2006, Josiah Carlson wrote:
>
> All other feature additions are too late in the Beta cycle (Beta 3 is
> next week)

For some reason, this is the second time I've seen this claim.  Beta 3
was released August 3 and next week is rc1.  We are right now in
complete feature lockdown; even documenting an existing API IMO requires
approval from the Release Manager.
-- 
Aahz (aahz at pythoncraft.com)           <*>         http://www.pythoncraft.com/

"Debugging is twice as hard as writing the code in the first place.
Therefore, if you write the code as cleverly as possible, you are, by
definition, not smart enough to debug it."  --Brian W. Kernighan

From aahz at pythoncraft.com  Sat Aug 12 15:44:28 2006
From: aahz at pythoncraft.com (Aahz)
Date: Sat, 12 Aug 2006 06:44:28 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>
References: <c56e219d0608111851i6f053407q96e5989fdff77848@mail.gmail.com>
Message-ID: <20060812134428.GB29374@panix.com>

On Fri, Aug 11, 2006, Luis P Caamano wrote:
>
> That's how I feel too Josiah.  In some ways, it's the same as writing
> device drivers in a pre-emptable kernel.  You can get interrupted and
> pre-empted by the hardware at any freaking time in any piece of code
> and your memory might go away so you better pin it and deal with the
> interrupts.  Forget about that and you end up with a nice kernel
> panic.  Still, we have all kinds of device drivers on SMP,
> pre-emptable kernels.  It can be done.

But Python is not the language/platform to do it.

(Yeah, someone else said that already, but I think it needs emphasis.)
-- 
Aahz (aahz at pythoncraft.com)           <*>         http://www.pythoncraft.com/

"Debugging is twice as hard as writing the code in the first place.
Therefore, if you write the code as cleverly as possible, you are, by
definition, not smart enough to debug it."  --Brian W. Kernighan

From pje at telecommunity.com  Sat Aug 12 17:36:51 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sat, 12 Aug 2006 11:36:51 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DD5DF0.40405@acm.org>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>

At 09:49 PM 8/11/2006 -0700, Talin wrote:
>Phillip J. Eby wrote:
>>Not at all.  A and B need only use overloadable functions, and the 
>>problem is trivially resolved by adding overloads.  The author of C can 
>>add an overload to "A" that will handle objects with 'next' attributes, 
>>or add one to "B" that handles tuples, or both.
>
>
>I'm still not sure what you are talking about - what is being overloaded here?
>
>Let me give you a better example. Suppose I have a 'docstring' annotation 
>and a 'getopt' annotation. The docstring annotation associates a string 
>with each argument, which can be inspected by an external documentation 
>scanner to produce documentation for that argument.
>
>Thus:
>
>    def myfunc( x : "The x coordinate", y : "The y coordinate" )
>       ...
>
>The 'getopt' annotation is used in conjunction with the 'getopt' 
>decorator, which converts from command-line arguments to python method 
>arguments. The idea is that you have a class that is acting as a back end 
>to a command-line shell. Each method in the class corresponds to a single 
>command. The annotations allow you to associate specific flags or switches 
>with particular arguments. So:
>
>class MyHandler( CommandLineHandler ):
>
>    @getopt
>    def list( infile:"i" = sys.stdin, outfile:"o" = sys.stdout ):
>       ...
>
>With the getopt handler in place, I can type the following shell command:
>
>    list -i <infile> -o <outfile>
>
>If either the -i or -o switch is omitted, then the corresponding argument 
>is either stdin or stdout.
>
>Additionally, the getopt module can generate 'usage' information for the 
>function in question:
>
>    Usage: list [-i infile] [-o outfile]
>
>Now, what happens if I want to use both docstrings and the getopt 
>decorator on the same function? They both expect to see annotations that 
>are strings! How do the doc extractor and the getopt decorator know 
>which strings belong to them, and which strings they should ignore?

Each one defines an overloaded function that performs the 
operation.  E.g.  "getArgumentOption(annotation)" and 
"getArgumentDoc(annotation)".

If somebody wants to use both decorators on the same function, they add 
methods to one or both of those functions to define how to handle their own 
type.  For example, I could create a "documented option" class that has 
attributes for the docstring and option character, and register methods 
with both getArgumentOption and getArgumentDoc to extract the right 
attributes from it.
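To make this concrete, here is a minimal runnable sketch of the scheme described above. The `Overloadable` class is a hand-rolled stand-in for Guido's overloaded-function prototype (its `.when()` registration style is modeled on that prototype), and `DocumentedOption` is the hypothetical "documented option" class from the paragraph above; none of these names come from an actual library.

```python
# Minimal stand-in for an overloaded-function library; '.when()' mimics
# the registration style of the Py3K prototype discussed in this thread.
class Overloadable:
    def __init__(self, default):
        self.default = default      # called for types nobody registered
        self.registry = {}

    def when(self, typ):
        def register(func):
            self.registry[typ] = func
            return func
        return register

    def __call__(self, arg):
        # Dispatch on the most specific registered type in the MRO.
        for typ in type(arg).__mro__:
            if typ in self.registry:
                return self.registry[typ](arg)
        return self.default(arg)

# Two independent operations; each ignores types it doesn't understand.
getArgumentDoc = Overloadable(lambda ann: None)
getArgumentOption = Overloadable(lambda ann: None)

class DocumentedOption:
    """Carries both a docstring and an option character."""
    def __init__(self, doc, option):
        self.doc = doc
        self.option = option

@getArgumentDoc.when(DocumentedOption)
def get_doc(ann):
    return ann.doc

@getArgumentOption.when(DocumentedOption)
def get_option(ann):
    return ann.option

ann = DocumentedOption("The input stream", "i")
print(getArgumentDoc(ann))     # -> The input stream
print(getArgumentOption(ann))  # -> i
```

Because each operation returns None for unregistered types, an annotation meant for one consumer is simply ignored by the other rather than misinterpreted.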


From pje at telecommunity.com  Sat Aug 12 18:12:26 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sat, 12 Aug 2006 12:12:26 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DD8A10.1040808@gmail.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060812113701.02343408@sparrow.telecommunity.com>

At 05:58 PM 8/12/2006 +1000, Nick Coghlan wrote:
>Phillip J. Eby wrote:
>>At 03:39 PM 8/12/2006 -0700, Talin <talin at acm.org> wrote:
>>>So programmer C, who wants to incorporate both A and B's work into his
>>>program, has a dilemma - each has a sharing mechanism, but the sharing
>>>mechanisms are different and incompatible. So he is unable to apply both
>>>A-type and B-type metadata to any given signature.
>>Not at all.  A and B need only use overloadable functions,
>
>Stop right there. "A and B need only use overloadable functions"? That 
>sounds an awful lot like placing a constraint on the way annotation 
>libraries are implemented in order to facilitate a single program using 
>multiple annotation libraries - which is exactly what Talin is saying is 
>needed!

You could perhaps look at it that way.  However, I'm simply using 
overloadable functions as a trivial example of how easy this is to handle 
without specifying a single mechanism.  There are numerous overloaded 
function implementations available, including ad-hoc registry-based ones 
(like those used by pickle), as well as other mechanisms besides 
overloaded functions that do the same thing - PEP 246 adaptation, for 
example, as used by Twisted and Zope.

My point is that:

1. trivial standard extension mechanisms (that are already in use in 
today's Python) allow libraries to offer compatibility between approaches, 
without choosing any blessed implementation or even approach to combination

2. there is no need to define a fixed semantic framework for 
annotations.  Guidelines for combinability (e.g. a standard interpretation 
for tuples or lists) might be a good idea, but it isn't *necessary* to 
mandate a single interpretation.


>(and using overloaded functions for this strikes me as hitting a very 
>small nail with a very large hammer).

Remember: Python is built from the ground up on overloaded 
functions.  len(), iter(), str(), repr(), hash(), int(), ...  You name it 
in builtins or operator, it's pretty much an overloaded function.

These functions differ from "full" overloaded functions in only these respects:

1. There is no framework to let you define new ones

2. They are single-dispatch only (except for the binary arithmetic 
operators, which have a crude double-dispatching protocol)

3. They do not allow third-party registration; classes must define 
__special__ methods to register implementations

(Some other overloaded functions in Python, such as pickle.dump and 
copy.copy, *do* allow third-party registrations, but they have ad-hoc 
implementations rather than using a common base implementation.)
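Point 3 can be illustrated in a few lines: for the built-in "overloads", defining the `__special__` method *is* the registration step.

```python
# Defining __len__ "registers" Ruler with the builtin overloaded
# function len(); no external registry is involved.
class Ruler:
    def __init__(self, marks):
        self.marks = marks

    def __len__(self):
        return len(self.marks)

print(len(Ruler([1, 2, 3])))   # -> 3
```

The limitation Phillip notes is that only the class author can do this registration, which is exactly what third-party registration mechanisms lift.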

So, saying that overloaded functions are a large hammer may or may not be 
meaningful, but it's certainly true that they are in *enormous* use in 
today's Python, even for very small nails like determining the length of an 
object.  :)

Indeed, the *default* way of doing almost anything in Python that involves 
multiple possible implementations is to define an overloaded function -- 
regardless of how small the nail might be.


>However, what we're really talking about here is a scenario where you're 
>defining your *own* custom annotation processor: you want the first part 
>of the tuple in the expression handled by the type processing library, and 
>the second part handled by the docstring processing library.
>
>Which says to me that the right solution is for the annotation to be split 
>up into its constituent parts before the libraries ever see it.
>
>This could be done as Collin suggests by tampering with 
>__signature__.annotations before calling each decorator, but I think it is 
>cleaner to do it by defining a particular signature for decorators that 
>are intended to process annotations.

Now you're embedding a particular implementation again.  The way to do this 
that imposes the least constraints on users, is to just have an 
'iter_annotations()' overloadable function, and let it iterate over lists 
and tuples, and yield anything else, e.g.:

     @iter_annotations.when(tuple)
     @iter_annotations.when(list)
     def iter_annotation_sequence(annotation):
         for a in annotation:
             for aa in iter_annotations(a):
                 yield aa

Now, if you have some custom annotation type that contains other 
annotations, you need only add a method to iter_annotations, and everything 
works.
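For readers without the prototype at hand, the sketch above can be rendered runnable with `functools.singledispatch` standing in for the prototype's `.when()` registration (the behavior is the same: sequences are flattened recursively, everything else is yielded as-is):

```python
from functools import singledispatch

@singledispatch
def iter_annotations(annotation):
    yield annotation          # default: any unknown object is one annotation

@iter_annotations.register(tuple)
@iter_annotations.register(list)
def _(annotation):
    for a in annotation:
        for aa in iter_annotations(a):
            yield aa

print(list(iter_annotations(["doc", ("opt", 3), 42])))
# -> ['doc', 'opt', 3, 42]
```

A custom container type joins the protocol by registering one more method, without changes to any existing code.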

In contrast, your approach is too limiting because you're *creating a 
framework* that then everyone has to conform to.  I want annotations to be 
framework-free.  I don't even think that the stdlib needs to provide an 
iter_annotations function, because there's no reason not to just define a 
method similar to the above for the specific operations you're doing.

In fact the general rule of overloadable functions is that the closer to 
the domain semantics the function is, the better.  For example, a 
'generateCodeFor(annotation)' overloaded function that can walk annotation 
sequences itself is a better idea than writing a non-overloaded function 
that uses iter_annotations() and then generates code for individual 
annotations, because it allows for better overloads.

For example, you might have a type that contains something that would 
ordinarily be considered separate annotation objects, but which the code 
generator could combine in some way to produce more optimal code.  Walking 
the annotations and then generating code would rob you of the opportunity 
to define an optimization overload in this case.

And *that* is why I don't think the stdlib should impose any semantics on 
annotations -- semantic imposition doesn't *fix* incompatibility, it 
*creates* it.

How?  Because if somebody needs to do something that doesn't fit within the 
imposed semantics, they are forced to create their own, and they now must 
reinvent everything so it works with their own!

This is the history of Python frameworks in a nutshell, and it's entirely 
avoidable.  We should leave the semantics open, precisely so that it will 
force people to make their code *extensible*.  As a side benefit, it 
provides a nice example of when and how to use overloaded functions 
effectively.


From pje at telecommunity.com  Sat Aug 12 18:39:15 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sat, 12 Aug 2006 12:39:15 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608112133w7eb2d0c6x287c021b108974b@mail.gmail.com>
References: <5.1.1.6.0.20060811225402.0228c178@sparrow.telecommunity.com>
	<mailman.34113.1155347488.27774.python-3000@python.org>
	<5.1.1.6.0.20060811225402.0228c178@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060812121239.03c1da60@sparrow.telecommunity.com>

At 12:33 AM 8/12/2006 -0400, Collin Winter wrote:
>>I don't see the point of this.  A decorator should be responsible for
>>manipulating the signature of its return value.  Meanwhile, the semantics
>>for combining annotations should be defined by an overloaded function like
>>"combineAnnotations(a1,a2)" that returns a new annotation.  There is no
>>need to have a special chaining decorator.
>>
>>May I suggest that you try using Guido's Py3K overloaded function
>>prototype?  I expect you'll find that if you play around with it a bit, it
>>will considerably simplify your view of what's required to do this.  It
>>truly isn't necessary to predefine what an annotation is, or even any
>>structural constraints on how they will be combined, since the user is able
>>to define for any given type how such things will be handled.
>
>I've looked at Guido's overloaded function prototype, and while I
>think I'm in the direction of understanding, I'm not quite there 100%.
>
>Could you illustrate (in code) what you've got in mind for how to
>apply overloaded functions to this problem space?

You just define an overloadable function for whatever operation you want to 
perform on annotations.  Then you define methods that implement the 
operation for known types, and a default method that ignores unknown 
types.  Then you're done.

If somebody wants to do more than one thing with the annotations on their 
functions, then everything "just works", since there is only one annotation 
per argument (per the PEP), and each operation is ignoring types it doesn't 
understand.

This leaves only one problem: the possibility of incompatible 
interpretations for a given type of annotation -- and it is easily solved 
by using some container or wrapper type, for which methods can be added to 
the respective operations.

So, let's say I'm using two decorators that have a common (and 
incompatible) interpretation for type "str".  I need only create a type 
that is unique to my program, and then define methods for the overloaded 
functions those decorators expose.

QED: any incompatibility can be trivially solved by introducing a new 
type.  However, the most likely source of conflict is the need to specify 
multiple, unrelated annotations for a given argument.  So, it's likely that 
most operations will want to interpret a list of annotations as just that: 
a list of annotations.

But there is no *requirement* that they do so.  Someone writing a library 
of their own that has a special use for lists is under no obligation to 
adhere to that pattern.  Remember: any conflict can be trivially solved by 
introducing a new type.

If you'd like me to sketch this out in code, fine, but you define the 
specific example you'd like to see.  To me, this all seems as obvious and 
straightforward as 2+2=4 implying that 4-2=2.  And it doesn't even have 
anything specifically to do with overloaded functions!

If you replace overloaded functions with functions that expect to call 
certain method names on the objects, *the exact same principles apply*.  As 
long as each operation gets a unique method name, any conflict can be 
trivially solved by introducing a new type that implements both methods.

The key here is that introspection and explicit dispatching are bad.  Code 
like this:

      def decorate(func):
          ...
          if isinstance(annotation, str):
              ...  # do something with the string

is wrong, wrong, *wrong*.  It should simply be doing the equivalent of:

          annotation.doWhatIWant()

Except in the overloaded function case, it's 
'doWhatIWant(annotation)'.  The latter spelling has the advantage that you 
don't have to be able to modify the 'str' class to add a 'doWhatIWant()' 
method.
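A compact illustration of the contrast (using `functools.singledispatch` as the overloading mechanism; `do_what_i_want` is the thread's illustrative name, not a real API):

```python
from functools import singledispatch

# The introspection style criticized above: the consumer itself
# type-checks and hard-codes what each type means.
def decorate_by_inspection(annotation):
    if isinstance(annotation, str):
        return "doc: " + annotation
    return None

# The "tell, don't ask" style: an overloadable function that third
# parties can extend for new annotation types without touching this code.
@singledispatch
def do_what_i_want(annotation):
    return None                # default: ignore types we don't understand

@do_what_i_want.register(str)
def _(annotation):
    return "doc: " + annotation
```

The two behave identically on strings, but only the second can be taught about a new annotation type from outside, which is the restriction overloaded functions remove.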

Is this clearer now?  This is known, by the way, as the "tell, don't ask" 
pattern.  In Python, we use the variant terms "duck typing" and "EAFP" 
(easier to ask forgiveness than permission), but "tell, don't ask" refers 
specifically to the idea that you should never dig around in an object's 
guts to perform an operation, and instead always delegate the operation to it.

Of course, delegation is impossible in the case of a "third-party" object 
being used -- i.e., one that can't be modified to add the necessary 
method.  Overloaded functions remove that restriction.

(This, by the way, is why I think Python should ultimately add an 
overloading syntax -- so that we could ultimately replace things like 'def 
__str__(self)' with something like 'defop str(self)'.  But that's not 
relevant to the immediate discussion.)


From paul at prescod.net  Sat Aug 12 21:38:06 2006
From: paul at prescod.net (Paul Prescod)
Date: Sat, 12 Aug 2006 12:38:06 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
Message-ID: <1cb725390608121238v427fe287s303e2acdda97bab5@mail.gmail.com>

Phillip. I'm having trouble following the discussion. I briefly caught up
when Talin got very concrete with syntax and I would appreciate if you could
offer some correspondingly remedial training.

Talin's example is that metadata inventor A documents that his/her users
should use this syntax for parameter docstrings:

def myfunc( x : "The x coordinate", y : "The y coordinate" )
      ...

Then metadata inventor B documents that his/her users should use this syntax
for getopt strings:

class MyHandler( CommandLineHandler ):

   @getopt
   def list( infile:"i" = sys.stdin, outfile:"o" = sys.stdout ):

Now the user is faced with the challenge of making these two work together
in order to get the best of both worlds. What does the user type?

The mechanism of overloading, function dispatching etc. is uninteresting to
me until I understand what goes in the user's Python file. Syntax is
important.

 Paul Prescod

From pje at telecommunity.com  Sat Aug 12 23:10:17 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sat, 12 Aug 2006 17:10:17 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608121238v427fe287s303e2acdda97bab5@mail.gmail.com>
References: <5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>

At 12:38 PM 8/12/2006 -0700, Paul Prescod wrote:
>Phillip. I'm having trouble following the discussion. I briefly caught up 
>when Talin got very concrete with syntax and I would appreciate if you 
>could offer some correspondingly remedial training.
>
>Talin's example is that metadata inventor A documents that his/her users 
>should use this syntax for parameter docstrings:
>
>def myfunc( x : "The x coordinate", y : "The y coordinate" )
>       ...
>
>Then metadata inventor B documents that his/her users should use this 
>syntax for getopt strings:
>
>class MyHandler( CommandLineHandler ):
>
>    @getopt
>    def list( infile:"i" = sys.stdin, outfile:"o" = sys.stdout ):
>
>Now the user is faced with the challenge of making these two work together 
>in order to get the best of both worlds. What does the user type?

As long as both inventors used overloadable functions, the user can type 
almost *anything they want to*, as long as:

1. It's consistent,
2. It's unambiguous, and
3. They've defined the appropriate overloads.

For example, they might use a 'docopt' class that allows both to be 
specified, or a pair of 'doc' and 'opt' objects in a list.


>The mechanism of overloading, function dispatching etc. is uninteresting 
>to me until I understand what goes in the user's Python file. Syntax is 
>important.

Indeed it is.  Hence the importance of not forcing some particular 
semantics, so as to allow the user to use the types and semantics of their 
choosing.

By the way, it should be understood that when I say "overloadable 
function", I simply mean some type-extensible dispatching mechanism.  If 
you exclude built-in types from consideration, and simply have special 
attribute or method names, then duck typing works just as well.  You can 
have decorators that use hasattr() and such to do their dirty work.

It's only if you want to have sensible meaning for built-in types that 
there even begins to be an illusion that conflicts are an issue.  However, 
the only built-in types likely to even be used in such a way are lists, 
dictionaries, tuples, and strings.  If there's more than one way to 
interpret them, depending on the operation, their use is inherently 
ambiguous, and it's up to the person combining them to supply the 
differentiation.

However, if you have:

    def myfunc( x : doc("The x coordinate"), y : doc("The y coordinate") )

There is no ambiguity.  Likewise:

    def cat( infile:opt("i") = sys.stdin, outfile:opt("o") = sys.stdout ):

is unambiguous.  And the interpretation of:

    def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
            outfile: [doc("output stream"), opt("o")] = sys.stdout
    ):

is likewise unambiguous, unless the creator of the documentation or option 
features has defined some other interpretation for a list than "recursively 
apply to contained items".  In which case, you need only do something like:

    def cat(infile: docopt("input stream", "i") = sys.stdin,
            outfile: docopt("output stream", "o") = sys.stdout
    ):

with an appropriate definition of methods for the 'docopt' type.

Since many people seem to be unfamiliar with overloaded functions, I would 
just like to take this opportunity to remind you that the actual overload 
mechanism is irrelevant.  If you gave 'doc' objects a 'printDocString()' 
method and 'opt' objects a 'setOptionName()' method, the exact same logic 
regarding extensibility applies.  The 'docopt' type would simply implement 
both methods.

This is normal, simple standard Python stuff; nothing at all fancy.  The 
only thing that overloaded functions add to this is that they allow you to 
(in effect) add methods to existing types without monkeypatching.  Thus, 
you can define overloads for built-in types, and types you didn't implement 
yourself.  Even if overloaded functions didn't exist, it wouldn't be 
necessary to invent them just to allow arbitrary annotation semantics!  It 
simply requires that operations that *use* annotations always follow the 
"tell, don't ask" pattern, whether it's done by duck typing, EAFP, or 
overloaded functions.
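A sketch of the duck-typed variant just described ('doc', 'opt', and 'docopt' are the thread's illustrative names, not a real library; for testability the "methods" here return the data a consumer would use rather than performing I/O):

```python
class doc:
    def __init__(self, text):
        self.text = text
    def printDocString(self):
        return self.text

class opt:
    def __init__(self, flag):
        self.flag = flag
    def setOptionName(self):
        return self.flag

class docopt(doc, opt):
    """One annotation object implementing both consumers' protocols."""
    def __init__(self, text, flag):
        doc.__init__(self, text)
        opt.__init__(self, flag)

def cat(infile: docopt("input stream", "i") = None,
        outfile: docopt("output stream", "o") = None):
    pass

# Each consumer asks only for the method it cares about ("tell, don't
# ask"), skipping annotations that lack it:
docs = {name: ann.printDocString()
        for name, ann in cat.__annotations__.items()
        if hasattr(ann, "printDocString")}
print(docs)   # -> {'infile': 'input stream', 'outfile': 'output stream'}
```

The same `docopt` object satisfies both the documentation consumer and the option consumer, which is the conflict-resolution-by-new-type pattern described earlier in the thread.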


From talin at acm.org  Sat Aug 12 23:07:18 2006
From: talin at acm.org (Talin)
Date: Sat, 12 Aug 2006 14:07:18 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608121238v427fe287s303e2acdda97bab5@mail.gmail.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>	
	<44DD5DF0.40405@acm.org>	
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<1cb725390608121238v427fe287s303e2acdda97bab5@mail.gmail.com>
Message-ID: <44DE4306.4070304@acm.org>

Paul Prescod wrote:
> Phillip. I'm having trouble following the discussion. I briefly caught up
> when Talin got very concrete with syntax and I would appreciate if you 
> could
> offer some correspondingly remedial training.
> 
> Talin's example is that metadata inventor A documents that his/her users
> should use this syntax for parameter docstrings:
> 
> def myfunc( x : "The x coordinate", y : "The y coordinate" )
>      ...

One important point I want to mention: I deliberately did *not* show a 
decorator for the example above. The reason for this is that the 
docstring annotations are not intended for consumption by a decorator 
function - they are intended for consumption by an external program that 
extracts documentation.

More specifically, this external doc extractor program would be part of 
a standard package of documentation tools, written by an entirely 
different author than the person actually writing 'myfunc'. This doc 
extractor knows nothing about decorators, and is unconcerned with their 
presence.

So I'd like Phillip to incorporate that into his explanation of how that 
is all supposed to work.

-- Talin



From talin at acm.org  Sun Aug 13 00:00:45 2006
From: talin at acm.org (Talin)
Date: Sat, 12 Aug 2006 15:00:45 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
Message-ID: <44DE4F8D.6050503@acm.org>

Phillip J. Eby wrote:
> At 12:38 PM 8/12/2006 -0700, Paul Prescod wrote:

> However, if you have:
> 
>    def myfunc( x : doc("The x coordinate"), y : doc("The y coordinate") )
> 
> There is no ambiguity.  Likewise:
> 
>    def cat( infile:opt("i") = sys.stdin, outfile:opt("o") = sys.stdout ):
> 
> is unambiguous.  And the interpretation of:
> 
>    def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
>            outfile: [doc("output stream"), opt("o")] = sys.stdout
>    ):

By doing this, you've already introduced an implicit requirement for 
annotations: Rather than saying that annotations can be "any format you 
want", the actual restriction is "any format you want that is 
distinguishable from other formats." More specifically, the rule is that 
annotations intended for different consumers must be distinguishable 
from each other via rule. This is in direct contradiction with the 
statement in the PEP that says that annotations have no predefined 
syntax or semantics -- they are required to have, at minimum, semantics 
sufficient to allow rule-based discrimination.

(BTW, I propose the term "Annotation Consumer" to mean a body of code 
that is intended to process annotations. You can have decorator-based 
consumers, as well as external consumers that are not part of the 
decorator stack and which inspect the function signature directly, 
without invoking the decorators.)

Let's use the term 'discriminator' to indicate any means, using function 
overloading or whatever, of determining which consumers should process 
which annotations. Let's also define the term 'discriminator protocol' to 
mean any input specifications to the discriminator - so in the above 
example, 'doc()' and 'opt()' are part of the discriminator protocol.

Now, you are trying very hard not to specify a standard discriminator 
protocol, but the fact is that if you don't do it, someone else will. 
Nobody wants to have to write their own discriminator for each 
application. And you can't mix discriminator protocols unless those 
protocols are a priori compatible.

Thus, there is very strong pressure to create a single, standard 
discriminator, or at least a standard discriminator protocol. The 
pressure is based on the fact that most users would rather deal with a 
protocol that someone else has written rather than writing their own. 
And because mixing protocols has the potential for discrimination 
errors, a heterogeneous environment with multiple protocols will 
inevitably degenerate into one where a single protocol has a monopoly.

So why don't you save us all the trouble and pain and just define the 
standard discrimination mechanism up front? As I have shown, it's going 
to happen anyway - it's inevitable - and delaying the decision simply 
means a lot of heartache for a lot of folk until the one true 
discriminator takes over. (Which is another thing that I was trying to 
illustrate with my SysEx story.)

As a footnote, I'd like to make a philosophical point about designing 
protocols. A 'protocol' (not in the technical sense, but in the sense of 
human relations) is simply an agreement to curtail the range of one's 
behavior to a restricted subset of what one is capable of, in order to 
facilitate cooperation between individuals. Language is a protocol - as 
I am typing this message, I implicitly agree to use words of English, 
rather than random made-up syllables, in order to facilitate 
understanding of my meaning.

Now, the curious and paradoxical thing about protocols is that in order 
to give the most freedom, you have to take some freedom away. Taking 
away certain freedoms can give you *more* freedom, because it allows you 
to predict and rely on the behaviors of the other participants in the 
protocol, enabling you to accomplish things that you wouldn't be able to 
do otherwise. For a given situation, there will be some "sweet spot", 
some balance between openness and restriction, that will give the 
largest amount of "effective" freedom and capability to the participants.

Here's an example: Cultures which have a strong mercantile ethic for 
fair dealing and enforcement of contracts tend to have vastly more 
efficient national economies. In countries where the mercantile ethic is 
poor, transaction costs are much higher - each individual has to spend 
effort vetting and enforcing each potential transaction, instead of 
being able to simply trust the other person. So by voluntarily 
restricting one's behavior to not unfairly take advantage of others and 
thus gain a temporary local advantage, one gains a huge advantage on the 
aggregate level.

For this reason, I am skeptical of the benefit of completely open-ended 
protocols. The value of the protocol is in the agreement between 
individuals - if the individuals don't agree on much, then there's not 
much value to be had.

-- Talin

From paul at prescod.net  Sun Aug 13 02:05:56 2006
From: paul at prescod.net (Paul Prescod)
Date: Sat, 12 Aug 2006 17:05:56 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
Message-ID: <1cb725390608121705s6e43b02fo28b4e83865c914ab@mail.gmail.com>

It seems to me that there are two very reasonable positions being expressed.
Is the following (non-normative) text a compromise?

"In order for processors of function annotations to work interoperably, they
must use a common interpretation of objects used as annotations on a
particular function. For example, one might interpret string annotations as
docstrings. Another might interpret them as path segments for a web
framework. For this reason, function annotation processors SHOULD avoid
assigning processor-specific meanings to types defined outside of the
processor's framework. For example, a Django processor could process
annotations of a type defined in a Zope package, but Zope's creators should
be considered the authorities on the type's meaning for the same reasons
that they would be considered authorities on the semantics of classes or
methods in their packages. This implies that the interpretation of built-in
types would be controlled by Python's developers and documented in Python's
documentation. This is just a best practice. Nothing in the language can or
should enforce this practice and there may be a few domains where there is a
strong argument for violating it (e.g. an education environment where saving
keystrokes may be more important than easing interoperability)."

"In Python 3000, semantics will be attached to the following types:
basestring and its subtypes are to be used for documentation (though they
are not necessarily the exclusive source of documentation about the type).
List and its subtypes are to be used for attaching multiple independent
annotations."

(does chaining make sense in this context?)

 Paul Prescod

From greg.ewing at canterbury.ac.nz  Sun Aug 13 03:26:26 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sun, 13 Aug 2006 13:26:26 +1200
Subject: [Python-3000] threading, part 2 --- + a bit of ctypes FFI worry
In-Reply-To: <9eebf5740608120411m40da5724r11700fdbe509914@mail.gmail.com>
References: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
	<44DDB5E4.9010903@canterbury.ac.nz>
	<9eebf5740608120411m40da5724r11700fdbe509914@mail.gmail.com>
Message-ID: <44DE7FC2.4030501@canterbury.ac.nz>

Lawrence Oluyede wrote:

> rctypes and pypy tools are somewhat one step further than ctypes
> machinery. In rctypes you can easily do something like:
> 
> size_t = ctypes_platform.SimpleType("size_t", c_ulong)

Does this work dynamically, or does it rely on
C code being generated and the C compiler working
out the details?

--
Greg


From l.oluyede at gmail.com  Sun Aug 13 03:42:44 2006
From: l.oluyede at gmail.com (Lawrence Oluyede)
Date: Sun, 13 Aug 2006 03:42:44 +0200
Subject: [Python-3000] threading, part 2 --- + a bit of ctypes FFI worry
In-Reply-To: <44DE7FC2.4030501@canterbury.ac.nz>
References: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
	<44DDB5E4.9010903@canterbury.ac.nz>
	<9eebf5740608120411m40da5724r11700fdbe509914@mail.gmail.com>
	<44DE7FC2.4030501@canterbury.ac.nz>
Message-ID: <9eebf5740608121842x4c1492baq9e049302905c2837@mail.gmail.com>

> Does this work dynamically, or does it rely on
> C code being generated and the C compiler working
> out the details?

It relies on C, which somewhat hinders the usefulness of the process.
There's also the code-generator option, but that again means a
compilation step.
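For comparison, plain ctypes can resolve a platform type like size_t at
runtime, with no C compiler involved -- a minimal sketch (ctypes.sizeof
and ctypes.c_size_t are real ctypes APIs; the candidate list is just
illustrative):

```python
import ctypes

# Pick the unsigned integer type whose size matches the platform's
# size_t, entirely at runtime -- no generated C code required.
candidates = (ctypes.c_ubyte, ctypes.c_ushort, ctypes.c_uint,
              ctypes.c_ulong, ctypes.c_ulonglong)
size_t = next(t for t in candidates
              if ctypes.sizeof(t) == ctypes.sizeof(ctypes.c_size_t))

print(size_t, ctypes.sizeof(size_t))
```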

-- 
Lawrence
http://www.oluyede.org/blog

From pje at telecommunity.com  Sun Aug 13 04:21:47 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sat, 12 Aug 2006 22:21:47 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608121705s6e43b02fo28b4e83865c914ab@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060812221550.0258ce68@sparrow.telecommunity.com>

At 05:05 PM 8/12/2006 -0700, Paul Prescod wrote:
>It seems to me that there are two very reasonable positions being 
>expressed. Is the following (non-normative) text a compromise?
>
>"In order for processors of function annotations to work interoperably, 
>they must use a common interpretation of objects used as annotations on a 
>particular function. For example, one might interpret string annotations 
as docstrings. Another might interpret them as path segments for a web 
>framework. For this reason, function annotation processors SHOULD avoid 
>assigning processor-specific meanings to types defined outside of the 
>processor's framework. For example, a Django processor could process 
>annotations of a type defined in a Zope package, but Zope's creators 
>should be considered the authorities on the type's meaning for the same 
>reasons that they would be considered authorities on the semantics of 
>classes or methods in their packages. This implies that the interpretation 
>of built-in types would be controlled by Python's developers and 
>documented in Python's documentation. This is just a best practice. 
>Nothing in the language can or should enforce this practice and there may 
>be a few domains where there is a strong argument for violating it (e.g. 
>an education environment where saving keystrokes may be more important 
>than easing interoperability)."

I mostly like this; the main issue I see is that as long as we're 
recommending best practices, we should recommend using tell-don't-ask (via 
duck typing protocols, adaptation, or overloaded functions) so that their 
libraries can be enhanced and extended by other developers.


>"In Python 3000, semantics will be attached to the following types: 
>basestring and its subtypes are to be used for documentation (though they 
>are not necessarily the exclusive source of documentation about the type). 
>List and its subtypes are to be used for attaching multiple independent 
>annotations."

I'm not sure why we would use strings for documentation, but I'm not 
opposed since it eliminates the question of multiple interpretations for 
strings.


>(does chaining make sense in this context?)

I don't know if I know what you mean by "chaining".  Good use of 
tell-don't-ask means that any interpretation of annotations nested in other 
annotations would be defined by the enclosing annotation (or in an overload 
for it).



From pje at telecommunity.com  Sun Aug 13 04:23:00 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sat, 12 Aug 2006 22:23:00 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <mailman.34193.1155432421.27774.python-3000@python.org>
Message-ID: <5.1.1.6.0.20060812215907.0226e808@sparrow.telecommunity.com>

At 03:00 PM 8/12/2006 -0700, Talin <talin at acm.org> wrote:
>Phillip J. Eby wrote:
> > At 12:38 PM 8/12/2006 -0700, Paul Prescod wrote:
>
> > However, if you have:
> >
> >    def myfunc( x : doc("The x coordinate"), y : doc("The y coordinate") )
> >
> > There is no ambiguity.  Likewise:
> >
> >    def cat( infile:opt("i") = sys.stdin, outfile:opt("o") = sys.stdout ):
> >
> > is unambiguous.  And the interpetation of:
> >
> >    def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
> >            outfile: [doc("output stream"), opt("o")] = sys.stdout
> >    ):
>
>By doing this, you've already introduced an implicit requirement for
>annotations: Rather than saying that annotations can be "any format you
>want", the actual restriction is "any format you want that is
>distinguishable from other formats."

And your point is what?


>  More specifically, the rule is that
>annotations intended for different consumers must be distinguishable
>from each other via rule. This is in direct contradiction with the
>statement in the PEP that says that annotations have no predefined
>syntax or semantics -- they are required to have, at minimum, semantics
>sufficient to allow rule-based discrimination.

You've lost me here entirely.  If we didn't want unambiguous semantics, 
we'd write programs in English, not Python.  :)


>(BTW, I propose the term "Annotation Consumer" to mean a body of code
>that is intended to process annotations. You can have decorator-based
>consumers, as well as external consumers that are not part of the
>decorator stack and which inspect the function signature directly,
>without invoking the decorators.)

Um, okay.  I'm not sure what benefit this new term adds over "operation 
that uses annotations", which is what I've been using, but whatever.


>Let's use the term 'discriminator' to indicate any means, using function
>overloading or whatever, of determining which consumers should process
>which annotations. Let's also define the term 'discriminator protocol' to
>   mean any input specifications to the discriminator - so in the above
>example, 'doc()' and 'opt()' are part of the discriminator protocol.

Um, what?  Why are you adding all this complication to a simple idea?

Duck typing is normal, simple, standard Python programming practice.  We 
use objects with methods all the time, and check for the existence of 
attributes all the time.

I don't understand why you insist on making that more complicated than it 
is.  It's really simple.  Annotations are objects.  Objects can be 
inspected, or selected by type.  You can do what you want to with them.

How complex is that?

(Meanwhile, I'm going to ignore all the red herrings about freedom and 
commerce and other rigamarole that has absolutely nothing to do with 
argument annotations.)

Going forward, may I suggest you take a look at Java and C# argument 
annotations before continuing to pursue this spurious line of 
reasoning?  I'm curious to see what your explanation will be for why these 
other languages don't have the problems that you claim will inevitably occur.

Meanwhile, if library authors write bad code because they don't understand 
basic OO concepts like duck typing and "tell, don't ask", then their users 
will educate them when they complain about not being able to use multiple 
annotation types.

Providing good examples and recommending best practices is one thing, but 
mandating a particular semantics is another.


From exarkun at divmod.com  Sun Aug 13 05:21:49 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Sat, 12 Aug 2006 23:21:49 -0400
Subject: [Python-3000] [Python-Dev] What is the status of file.readinto?
In-Reply-To: <ca471dc20608121928r2695f1b9s19d9159927be936@mail.gmail.com>
Message-ID: <20060813032149.1717.1953938655.divmod.quotient.21274@ohm>

On Sat, 12 Aug 2006 19:28:44 -0700, Guido van Rossum <guido at python.org> wrote:
>On 8/12/06, "Martin v. L?wis" <martin at v.loewis.de> wrote:
>> I can only guess why it may go away; my guess it will go away when
>> the buffer interface is removed from Python (then it becomes
>> unimplementable).
>
>In Py3k, the I/O APIs will be redesigned, especially the binary ones.
>My current idea is to have read() on a binary file return a bytes
>object. If readinto() continues to be necessary, please make sure the
>Py3k list (python-3000 at python.org) knows about your use case. We
>aren't quite writing up the I/O APIs in PEP-form, but when we do, that
>would be the right time to speak up.
>

The ability to read into pre-allocated memory is fairly important
for high-performance applications.  This should be preserved somehow
(and preferably given a real, supported API).
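The pattern in question -- reading into pre-allocated memory -- can be
sketched with readinto() and a reusable bytearray (io.BytesIO stands in
for a real file here):

```python
import io

# Reading into a caller-supplied buffer avoids allocating a fresh bytes
# object on every read; the buffer is allocated once and reused.
buf = bytearray(8192)          # preallocated once
view = memoryview(buf)
src = io.BytesIO(b"x" * 20000)

total = 0
while True:
    n = src.readinto(buf)      # fills buf in place, returns bytes read
    if not n:
        break
    total += n
    chunk = view[:n]           # zero-copy view of the valid region
print(total)                   # 20000
```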

Jean-Paul

From ironfroggy at gmail.com  Sun Aug 13 05:50:26 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Sat, 12 Aug 2006 23:50:26 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608121705s6e43b02fo28b4e83865c914ab@mail.gmail.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<1cb725390608121705s6e43b02fo28b4e83865c914ab@mail.gmail.com>
Message-ID: <76fd5acf0608122050v75aa6dbbs32bf05f85222fa7e@mail.gmail.com>

I am getting very tired of gmail's ignoring of the mailing-list
headers in context of replying! Anyway, here is what I accidentally
sent as personal messages related to this thread. Replying to Talin's
long story about MIDI devices:

WOW

I won't even pretend to reply with anything near a similar sized body
of text. Condolences go out to you for the water and laptop, by the
way. Anyways...

Although this is a humourous story (post it somewhere readily with
some more fleshiness, maybe!) and I enjoyed reading it quite a bit, I
saw where it was going very early on and disagreed immediately with the
point I see you trying to get across. The thing is, the situations are
too different to compare so bluntly. The era from which this story
comes was a different world, one far more brutal toward any attempt at
loose cooperation than what we have today, what with the internet and
this being largely open source software, not a hundred
and fifty competing MIDI vendors who think compatibility would just
make it easier to lose customers. The simplicity of the matter is
that there won't be that many annotation libraries, and mixing them
will be possible. When someone writes a good type-annotation
handling library, other people (even those writing other annotation
libraries) will use it, until it reaches the point that it will get
put into the standard library. And let no one pretend that will not
happen. De facto and even just mildly common libraries almost always
get pushed into the standard library eventually, but having some time
in the wild is good for evolution to take its course.

And to what Paul Said here:
On 8/12/06, Paul Prescod <paul at prescod.net> wrote:
> It seems to me that there are two very reasonable positions being expressed.
> Is the following (non-normative) text a compromise?
>
> "In order for processors of function annotations to work interoperably, they
> must use a common interpretation of objects used as annotations on a
> particular function. For example, one might interpret string annotations as
> docstrings. Another might interpret them as path segments for a web
> framework. For this reason, function annotation processors SHOULD avoid
> assigning processor-specific meanings to types defined outside of the
> processor's framework. For example, a Django processor could process
> annotations of a type defined in a Zope package, but Zope's creators should
> be considered the authorities on the type's meaning for the same reasons
> that they would be considered authorities on the semantics of classes or
> methods in their packages. This implies that the interpretation of built-in
> types would be controlled by Python's developers and documented in Python's
> documentation. This is just a best practice. Nothing in the language can or
> should enforce this practice and there may be a few domains where there is a
> strong argument for violating it (e.g. an education environment where
> saving keystrokes may be more important than easing interoperability)."
>
> "In Python 3000, semantics will be attached to the following types:
> basestring and its subtypes are to be used for documentation (though they
> are not necessarily the exclusive source of documentation about the type).
> List and its subtypes are to be used for attaching multiple independent
> annotations."
>
> (does chaining make sense in this context?)
>
>  Paul Prescod

I've been looking for a good place to pipe in with the suggestion of
defining that a dictionary as an annotation is taken as a mapping of
annotation type names to the annotation itself, such as using {'doc':
"The single character argument for the command line.", 'type': int} as
an annotation for some parameter in a function.
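Calvin's dict convention can be sketched in (later-standardized)
Python 3 annotation syntax; the consumer names 'doc' and 'opt' and the
helper get_annotation are purely illustrative, not part of any PEP:

```python
import sys

# A dict annotation maps consumer names to per-consumer data, so each
# tool indexes only the keys it understands and ignores the rest.
def cat(infile: {'doc': "input stream", 'opt': 'i'} = sys.stdin):
    ...

def get_annotation(func, arg, consumer):
    """Return the piece of arg's annotation intended for consumer."""
    note = func.__annotations__.get(arg)
    if isinstance(note, dict):
        return note.get(consumer)   # keys for other tools are skipped
    return None

print(get_annotation(cat, 'infile', 'doc'))   # input stream
```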

However, having read through all the posts I missed while recuperating
from a long trip I just returned from, I think this, coupled with taking
_any iterable_ (not just list and subtypes) and the whole "your type,
your annotation" guideline, is definitely sufficient for all uses.

From jimjjewett at gmail.com  Sun Aug 13 05:56:15 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Sat, 12 Aug 2006 23:56:15 -0400
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
Message-ID: <fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>

On 8/11/06, Jiwon Seo <seojiwon at gmail.com> wrote:
> When we have keyword-only arguments, do we allow 'keyword dictionary'
> argument? If that's the case, where would we want to place
> keyword-only arguments?

> Are we going to allow any of followings?

> 1. def foo(a, b,  *, key1=None, key2=None, **map)

Seems perfectly reasonable.
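Form 1 from the question, in the syntax PEP 3102 ultimately gave
Python 3: positional parameters, a bare *, keyword-only parameters, and
then **map collecting the remaining keyword arguments (the parameter
name "map" simply follows the question's spelling):

```python
def foo(a, b, *, key1=None, key2=None, **map):
    # key1/key2 can only be passed by keyword; extras land in map
    return a, b, key1, key2, map

print(foo(1, 2, key1='x', extra=3))
# (1, 2, 'x', None, {'extra': 3})
```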

I think the controversy was over whether or not to allow keyword-only
without a default.

> 2. def foo(a, b, *,  **map, key1=None, key2=None)

Seems backward, though I suppose we could adjust if we needed to.

> 3. def foo(a, b, *, **map)

What would the * even mean, since there aren't any named keywords to separate?

-jJ

From talin at acm.org  Sun Aug 13 06:05:27 2006
From: talin at acm.org (Talin)
Date: Sat, 12 Aug 2006 21:05:27 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060812215907.0226e808@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060812215907.0226e808@sparrow.telecommunity.com>
Message-ID: <44DEA507.9040900@acm.org>

Phillip J. Eby wrote:
> At 03:00 PM 8/12/2006 -0700, Talin <talin at acm.org> wrote:
>> Phillip J. Eby wrote:
>> > At 12:38 PM 8/12/2006 -0700, Paul Prescod wrote:
>>
>> > However, if you have:
>> >
>> >    def myfunc( x : doc("The x coordinate"), y : doc("The y 
>> coordinate") )
>> >
>> > There is no ambiguity.  Likewise:
>> >
>> >    def cat( infile:opt("i") = sys.stdin, outfile:opt("o") = 
>> sys.stdout ):
>> >
>> > is unambiguous.  And the interpetation of:
>> >
>> >    def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
>> >            outfile: [doc("output stream"), opt("o")] = sys.stdout
>> >    ):
>>
>> By doing this, you've already introduced an implicit requirement for
>> annotations: Rather than saying that annotations can be "any format you
>> want", the actual restriction is "any format you want that is
>> distinguishable from other formats."
> 
> And your point is what?
> 

My point is that this statement in Collin's PEP is wrong:

 > There is no worry that these libraries will assign semantics at
 > random, or that a variety of libraries will appear, each with varying
 > semantics and interpretations of what, say, a tuple of strings
 > means. The difficulty inherent in writing annotation interpreting
 > libraries will keep their number low and their authorship in the
 > hands of people who, frankly, know what they're doing.

The way I read this is "there is no need for annotations to be designed 
so as not to interfere with one another, nor does there need to be any 
mechanism defined in this PEP for resolving such interference". I and 
others have provided extensive use cases to show that unless care is 
taken, different annotations *will* step on each other's toes.

>>  More specifically, the rule is that
>> annotations intended for different consumers must be distinguishable
>> from each other via rule. This is in direct contradiction with the
>> statement in the PEP that says that annotations have no predefined
>> syntax or semantics -- they are required to have, at minimum, semantics
>> sufficient to allow rule-based discrimination.
> 
> You've lost me here entirely.  If we didn't want unambiguous semantics, 
> we'd write programs in English, not Python.  :)

Again, look at the language of the PEP.

>> (BTW, I propose the term "Annotation Consumer" to mean a body of code
>> that is intended to process annotations. You can have decorator-based
>> consumers, as well as external consumers that are not part of the
>> decorator stack and which inspect the function signature directly,
>> without invoking the decorators.)
> 
> Um, okay.  I'm not sure what benefit this new term adds over "operation 
> that uses annotations", which is what I've been using, but whatever.
> 
I'm just trying to get a handle on this stuff so that we can *talk* 
about it.

>> Let's use the term 'discriminator' to indicate any means, using function
>> overloading or whatever, of determining which consumers should process
>> which annotations. Let's also define the term 'discriminator protocol' to
>>   mean any input specifications to the discriminator - so in the above
>> example, 'doc()' and 'opt()' are part of the discriminator protocol.
> 
> Um, what?  Why are you adding all this complication to a simple idea?

I'm not adding anything to the concept, I am trying to come up with a 
way to *talk* about the concept. So far the whole conversation has 
gotten very confused because we're dealing with some highly abstract 
stuff here.

> Duck typing is normal, simple, standard Python programming practice.  We 
> use objects with methods all the time, and check for the existence of 
> attributes all the time.
> 
> I don't understand why you insist on making that more complicated than 
> it is.  It's really simple.  Annotations are objects.  Objects can be 
> inspected, or selected by type.  You can do what you want to with them.
> 
> How complex is that?

It gets complex when you have more than one inspector or selector. What 
we are arguing about is how much the various inspectors/selectors need 
to know about each other. And while the answer is hopefully "not much", 
I hope that I have shown that it cannot be "nothing at all". There has 
to be some ground rules for cooperation, or cooperation is impossible, 
that's basic logic.

> (Meanwhile, I'm going to ignore all the red herrings about freedom and 
> commerce and other rigamarole that has absolutely nothing to do with 
> argument annotations.)

Don't think of it as red herrings. Think of it as, um, "highly 
non-linear train of thought". :)

> Going forward, may I suggest you take a look at Java and C# argument 
> annotations before continuing to pursue this spurious line of 
> reasoning?  I'm curious to see what your explanation will be for why 
> these other languages don't have the problems that you claim will 
> inevitably occur.

Dude, you don't want to know how many man-years of C# programming I've 
done :)

Let's take C# attributes as an example. C# attributes have the following 
syntactical/semantic structure:

   1) They must be derived from the base class "Attribute". (This by 
itself is not really significant.)
   2) Attributes are distinguished by type, or in some cases by value.
   3) The types do not overlap.
   4) A given consumer of attributes can always distinguish attributes 
which are relevant to their purposes from attributes which are not, even 
against hypothetical future annotations which have not yet been established.

As a user, when I add an attribute to a method, I know that (a) there is 
a known consumer of that attribute, (b) that it is impossible for an 
attribute which is not intended for that consumer to be confused for one 
that is. If I set [Browseable(false)] on a property, I know exactly how 
that attribute is going to be interpreted, and by what component. If 
someone comes along later and adds a new annotation called 
"SortOfBrowseable", which has many of the same attributes as Browseable, 
there will never be the possibility that their annotation and mine can 
get confused with each other. (As opposed to Python, where it's 
relatively easy to have classes that masquerade as one another.)

The Annotation PEP, on the other hand, makes none of these guarantees, 
because it tries hard not to guarantee anything. It doesn't specify the 
mechanism by which one annotation is distinguished from another; unlike 
the C# attributes which are organized into a tree of types, the 
annotations have no organization and no categorization defined. Because 
there is no prohibition against category overlap, that means that the 
annotations that I write today might one day in the future match against 
a newly-created category, with results that I can't predict.

I also want to point out that C# attributes are very different from 
Python decorators, so you can't use analogies between them. Decorators 
are active agents - that is, they hook into the process of defining a 
method. Because of this, decorators have the option of having all of 
their semantic meaning buried within the decorator itself. In essence, 
the rule by which decorators "play nice" with each other is already 
defined - each gets a shot at modifying the function object, and each 
receives the result of the previous decorator.

C# attributes and function annotations, on the other hand, are purely 
passive - they have no knowledge of what they are attached to, and their 
only meaning is derived from external use. They themselves don't have to 
play nice with each other, but the interpreters / inspectors / consumers do.

> Meanwhile, if library authors write bad code because they don't 
> understand basic OO concepts like duck typing and "tell, don't ask", 
> then their users will educate them when they complain about not being 
> able to use multiple annotation types.
> 
> Providing good examples and recommending best practices is one thing, 
> but mandating a particular semantics is another.



From jcarlson at uci.edu  Sun Aug 13 06:16:18 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sat, 12 Aug 2006 21:16:18 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
References: <1cb725390608121238v427fe287s303e2acdda97bab5@mail.gmail.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
Message-ID: <20060812205512.197A.JCARLSON@uci.edu>


"Phillip J. Eby" <pje at telecommunity.com> wrote:
> However, if you have:
> 
>     def myfunc( x : doc("The x coordinate"), y : doc("The y coordinate") )
> 
> There is no ambiguity.  Likewise:
> 
>     def cat( infile:opt("i") = sys.stdin, outfile:opt("o") = sys.stdout ):
> 
> is unambiguous.  And the interpetation of:
> 
>     def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
>             outfile: [doc("output stream"), opt("o")] = sys.stdout
>     ):
> 
> is likewise unambiguous, unless the creator of the documentation or option 
> features has defined some other interpretation for a list than "recursively 
> apply to contained items".  In which case, you need only do something like:
> 
>     def cat(infile: docopt("input stream", "i") = sys.stdin,
>             outfile: docopt("output stream", "o") = sys.stdout
>     ):

I now understand where you were coming from with regards to this being
equivalent to pickle (at least pickle + copy_reg).  I think that if you
would have posted this particular sample a couple days ago, there
wouldn't have been the discussion (argument?) about incompatible
mechanisms for annotation processing.

With that said, the above is a protocol.  Just like __len__, __str__,
copy_reg, __reduce__, __setstate__, etc., are protocols.  It may not be
fully specified (when annotations are to be processed, if at all, by
whom, where the annotation registry is, etc.), but it is still a
protocol.
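The duck-typed "tell, don't ask" pattern under discussion can be
sketched concretely; the names doc, opt, apply_doc, and collect_docs
are illustrative (nothing here is specified by the PEP), and lists are
applied recursively to their items, as in the quoted examples:

```python
import sys

# Each annotation object may expose a prearranged method; a consumer
# calls it and silently skips annotations that lack it.
class doc:
    def __init__(self, text):
        self.text = text
    def apply_doc(self, name, out):
        out[name] = self.text

class opt:
    def __init__(self, flag):
        self.flag = flag
    # no apply_doc: the documentation consumer ignores opt() entirely

def collect_docs(func):
    out = {}
    for name, note in func.__annotations__.items():
        for item in (note if isinstance(note, list) else [note]):
            if hasattr(item, 'apply_doc'):   # tell, don't ask
                item.apply_doc(name, out)
    return out

def cat(infile: [doc("input stream"), opt("i")] = sys.stdin): ...

print(collect_docs(cat))   # {'infile': 'input stream'}
```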

Do we need any more specification for the PEP and 2.6/3k?  I don't know,
maybe. You claim no, with the history of PEAK and other languages as
proof that doing anything more is unnecessary.  And I can understand why
you would resist any further specification: PEAK has been doing
annotations for quite a while, and additional specifications could make
transitioning to these annotations a pain in the ass for you and your
users.

I'm personally not convinced that no further specification is desired or
necessary (provided we include a variant of the above example
annotations), but I also cannot convince myself that specifying anything
further would be flexible enough to not be a mistake.

 - Josiah


From pje at telecommunity.com  Sun Aug 13 07:05:13 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sun, 13 Aug 2006 01:05:13 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DEA507.9040900@acm.org>
References: <5.1.1.6.0.20060812215907.0226e808@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812215907.0226e808@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060813005228.022737c8@sparrow.telecommunity.com>

At 09:05 PM 8/12/2006 -0700, Talin wrote:
>What we are arguing about is how much the various inspectors/selectors 
>need to know about each other. And while the answer is hopefully "not 
>much", I hope that I have shown that it cannot be "nothing at all".

As I've previously stated, they need to know enough to ignore what they 
don't understand.  And, to be useful, they should allow user extension via 
duck typing or overloading.


>  There has to be some ground rules for cooperation, or cooperation is 
> impossible, that's basic logic.

See the ground rules provided above.


>>Going forward, may I suggest you take a look at Java and C# argument 
>>annotations before continuing to pursue this spurious line of 
>>reasoning?  I'm curious to see what your explanation will be for why 
>>these other languages don't have the problems that you claim will 
>>inevitably occur.
>
>Dude, you don't want to know how many man-years of C# programming I've done :)
>
>Let's take C# attributes as an example. C# attributes have the following 
>syntactical/semantic structure:
>
>   1) They must be derived from the base class "Attribute". (This by 
> itself is not really significant.)
>   2) Attributes are distinguished by type, or in some cases by value.
>   3) The types do not overlap.
>   4) A given consumer of attributes can always distinguish attributes 
> which are relevant to their purposes from attributes which are not, even 
> against hypothetical future annotations which have not yet been established.

I fail to see how this is different from what I've already said.


>As a user, when I add an attribute to a method, I know that (a) there is a 
>known consumer of that attribute, (b) That it is impossible for an 
>attribute which is not intended for that consumer to be confused for one 
>that is. If I set [Browseable(false)] on a property, I know exactly how 
>that attribute is going to be interpreted, and by what component. If 
>someone comes along later and adds a new annotation called 
>"SortOfBrowseable", which has many of the same attributes as Browseable, 
>there will never be the possibility that their annotation and mine can get 
>confused with each other.

Again, so far it sounds just like the existing proposal.


>  (As opposed to Python, where it's relatively easy to have classes that 
> masquerade as one another.)

That's a feature, not a bug.  :)


>The Annotation PEP, on the other hand, makes none of these guarantees, 
>because it tries hard not to guarantee anything. It doesn't specify the 
>mechanism by which one annotation is distinguished from another; Unlike 
>the C# attributes which are organized into a tree of types, the 
>annotations have no organization and no categorization defined. Because 
>there is no prohibition against category overlap, that means that the 
>annotations that I write today might one day in the future match against a 
>newly-created category, with results that I can't predict.

Not if the annotation consumers simply use a tell-don't-ask pattern -- a 
pattern which I've repeatedly explained, and which can be trivially 
implemented with either duck typing or overloading.


>I also want to point out that C# attributes are very different from Python 
>decorators, so you can't use analogies between them.

That statement makes me think that the reason we're not communicating is 
that you are talking about something else than I am.   I never compared 
Python decorators and C# attributes.  In fact, I've rarely mentioned 
decorators at all and have tried as much as possible to push decorators 
*out* of the conversation, because they are irrelevant.  Documentation 
tools, for example, are unlikely to use decorators.  Metaclasses also 
aren't decorators, but both documentation tools and metaclasses are likely 
candidates for consuming annotation data.

Thus, I prefer to talk about "operations using annotations" since 
decorators are only a kind of "delivery vector" for such 
annotation-consuming operations.


>C# attributes and function annotations, on the other hand, are purely 
>passive - they have no knowledge of what they are attached to, and their 
>only meaning is derived from external use. They themselves don't have to 
>play nice with each other, but the interpreters / inspectors / consumers do.

And precisely the same things are true of Python function annotations.  I'm 
still lost as to why you think there's something different going on 
here.  Python decorators simply provide a vector for immediate annotation 
processing -- one that is entirely orthogonal to the notion of annotations 
themselves.


From pje at telecommunity.com  Sun Aug 13 07:21:01 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sun, 13 Aug 2006 01:21:01 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <20060812205512.197A.JCARLSON@uci.edu>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<1cb725390608121238v427fe287s303e2acdda97bab5@mail.gmail.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060813010634.0228ee30@sparrow.telecommunity.com>

At 09:16 PM 8/12/2006 -0700, Josiah Carlson wrote:
>"Phillip J. Eby" <pje at telecommunity.com> wrote:
> > However, if you have:
> >
> >     def myfunc( x : doc("The x coordinate"), y : doc("The y coordinate") )
> >
> > There is no ambiguity.  Likewise:
> >
> >     def cat( infile:opt("i") = sys.stdin, outfile:opt("o") = sys.stdout ):
> >
> > is unambiguous.  And the interpetation of:
> >
> >     def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
> >             outfile: [doc("output stream"), opt("o")] = sys.stdout
> >     ):
> >
> > is likewise unambiguous, unless the creator of the documentation or option
> > features has defined some other interpretation for a list than 
> "recursively
> > apply to contained items".  In which case, you need only do something like:
> >
> >     def cat(infile: docopt("input stream", "i") = sys.stdin,
> >             outfile: docopt("output stream", "o") = sys.stdout
> >     ):
>
>I now understand where you were coming from with regards to this being
>equivalent to pickle (at least pickle + copy_reg).  I think that if you
>would have posted this particular sample a couple days ago, there
>wouldn't have been the discussion (argument?) about incompatible
>mechanisms for annotation processing.

Well, it just seemed to me that that was the One Obvious Way To Do It; more 
specifically, I couldn't conceive of any *other* way to do it!


>With that said, the above is a protocol.  Just like __len__, __str__,
>copy_reg, __reduce__, __setstate__, etc., are protocols.  It may not be
>fully specified (when annotations are to be processed, if at all, by
>whom, where the annotation registry is, etc.), but it is still a
>protocol.

Actually, it's a family of *patterns* for creating protocols.  It's not a 
protocol, incompletely specified or otherwise.  Note that the actual 
implementation of the tell-don't-ask pattern can be via:

1. duck typing (i.e., prearranged method names)
2. adaptation
3. overloaded functions (any of several implementations)
4. ad hoc type-based registries

So it isn't even a *meta*-protocol, just a pattern family.
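[Editorial sketch, not part of the original message:] Option 4 above, an ad hoc type-based registry, might look like the following; all names (`handlers`, `register`, `process_annotation`, `doc`) are invented for illustration.

```python
# Tell-don't-ask via an ad hoc type-based registry: consumers
# dispatch on the annotation's own type and skip unknown types
# rather than guessing at their meaning.
handlers = {}

def register(annotation_type):
    """Register a handler function for one annotation type."""
    def decorate(fn):
        handlers[annotation_type] = fn
        return fn
    return decorate

class doc(str):
    """A hypothetical documentation annotation."""

@register(doc)
def handle_doc(ann):
    return "doc: %s" % ann

def process_annotation(ann):
    handler = handlers.get(type(ann))
    return handler(ann) if handler is not None else None
```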


>Do we need any more specification for the PEP and 2.6/3k?  I don't know,
>maybe. You claim no, with the history of PEAK and other languages as
>proof that doing anything more is unnecessary.  And I can understand why
>you would resist any further specification: PEAK has been doing
>annotations for quite a while, and additional specifications could make
>transitioning to these annotations a pain in the ass for you and your
>users.

Not really; PEAK's annotations are currently only on *attributes* and 
*classes*, not functions, arguments, or return values.  I was merely using 
it as an example of how overloaded functions allow heterogeneous 
annotations to coexist without needing any prearranged common semantics.

But I don't believe we know enough *today* to be able to safely define a 
rigid specification without ruling out possibly-valid uses.  By making a 
less-rigid specification, we force annotation consumers to code 
defensively...  which is really the right thing to do in a heterogeneous 
environment anyway.


>I'm personally not convinced that no further specification is desired or
>necessary (provided we include a variant of the above example
>annotations),

As I said, I'd prefer to see the tell-don't-ask pattern specifically cited 
and recommended, perhaps with examples.

I'll note, however, that the only consequence of *not* following that 
pattern is that you create a non-extensible, non-interoperable framework -- 
of which Python has huge numbers already.  This is not so damaging an 
outcome as to be worrisome, any more than we worry about people creating 
incompatible metaclasses today!


>but I also cannot convince myself that specifying anything
>further would be flexible enough to not be a mistake.

Right - that's the bit I'm concerned about.  Python also usually doesn't 
impose such policy constraints on mechanism.  For example, function 
attributes can be or contain anything, and nobody has argued that there 
need to be prespecified combination semantics, despite the fact that 
multiple tools can be consumers of the attributes.
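[Editorial sketch, not part of the original message:] The function-attribute parallel can be shown directly; both attribute names below are hypothetical tool markers, not real framework APIs.

```python
# Function attributes already work this way: any tool can attach its
# own metadata, with no prescribed combination semantics between tools.
def handler(request):
    return "ok"

handler.route = "/status"          # a web framework's marker (hypothetical)
handler.doc_category = "internal"  # a doc generator's marker (hypothetical)
```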


From jimjjewett at gmail.com  Sun Aug 13 07:29:52 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Sun, 13 Aug 2006 01:29:52 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <20060812205512.197A.JCARLSON@uci.edu>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
Message-ID: <fb6fbf560608122229q56ccff29s780cbef862dc741c@mail.gmail.com>

On 8/13/06, Josiah Carlson <jcarlson at uci.edu> wrote:

> "Phillip J. Eby" <pje at telecommunity.com> wrote:
> > However, if you have:

> >     def myfunc( x : doc("The x coordinate"), y : doc("The y coordinate") )

> > There is no ambiguity.

Sure there is.  There will probably be several frameworks using the
magic name "doc".

This isn't a problem for the person writing myfunc, and therefore
isn't a problem for immediate decorators.  It is a problem for
inspection code that wants to present information about arbitrary
3rd-party libraries.

And once you get into multiple annotations, there will be some
frameworks that say "the doc annotation is mine, I'll ignore the opt
annotation" and others that say "oh, a dictionary of annotations, I
need to do this with name doc and that with name opt"

And of course, people won't really write doc("The x coordinate")
unless they're already thinking of other uses for a string; they'll
just write "The x coordinate" and someone later (perhaps from a
different package) will have to untangle what they meant -- short
expressions will end up being ambiguous almost from the start.
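[Editorial sketch, not part of the original message:] The ambiguity is easy to demonstrate; the function below is invented for this example.

```python
# A bare string annotation carries no marker saying which tool it
# belongs to; every consumer sees exactly the same value.
def move(x: "The x coordinate", y: "y offset in pixels"):
    return x + y

ann = move.__annotations__["x"]
looks_like_docs = isinstance(ann, str)          # True for a doc tool...
looks_like_path_segment = isinstance(ann, str)  # ...and for a web framework.
```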

Eventually, ways will be found to sort things out.  But there will be
less pain and backwards incompatibility if these issues are considered
from the start.


> Do we need any more specification for the PEP and 2.6/3k?  I don't know,
> maybe. You claim no, with the history of PEAK and other languages as
> proof that doing anything more is unnecessary.

The history of complaints about PEAK being hard to understand and
inadequately documented suggests that a fair number of people would
prefer additional guidance and handholding.  If annotations could only
be used safely by people who can understand PEAK, then offering
syntactic sugar to everyone would be asking for trouble.

-jJ

From paul at prescod.net  Sun Aug 13 08:00:36 2006
From: paul at prescod.net (Paul Prescod)
Date: Sat, 12 Aug 2006 23:00:36 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DEA507.9040900@acm.org>
References: <5.1.1.6.0.20060812215907.0226e808@sparrow.telecommunity.com>
	<44DEA507.9040900@acm.org>
Message-ID: <1cb725390608122300q3b20db1apc707e537c36fd0ee@mail.gmail.com>

I made a proposal that Phillip was mostly okay with. What do other
participants in the thread think? Would it move towards resolving this
thread?

"In order for processors of function annotations to work interoperably, they
must use a common interpretation of objects used as annotations on a
particular function. For example, one might interpret string annotations as
docstrings. Another might interpret them as path segments for a web
framework. For this reason, function annotation processors SHOULD avoid
assigning processor-specific meanings to types defined outside of the
processor's framework. For example, a Django processor could process
annotations of a type defined in a Zope package, but Zope's creators should
be considered the authorities on the type's meaning for the same reasons
that they would be considered authorities on the semantics of classes or
methods in their packages. This implies that the interpretation of built-in
types would be controlled by Python's developers and documented in Python's
documentation. This is just a best practice. Nothing in the language can or
should enforce this practice and there may be a few domains where there is a
strong argument for violating it (e.g. an education environment where
saving keystrokes may be more important than easing interoperability)."

"In Python 3000, semantics will be attached to the following types:
basestring and its subtypes are to be used for documentation (though they
are not necessarily the exclusive source of documentation about the type).
List and its subtypes are to be used for attaching multiple independent
annotations."
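[Editorial sketch, not part of the original message:] A consumer following the proposed convention (strings are documentation, lists carry multiple independent annotations) might look like this; `extract_docs` and the sample function are invented for illustration.

```python
def extract_docs(ann):
    """Collect documentation strings per the proposed convention."""
    if isinstance(ann, str):
        return [ann]
    if isinstance(ann, list):
        docs = []
        for item in ann:
            docs.extend(extract_docs(item))
        return docs
    return []  # some other tool's annotation: not ours to interpret

def cat(infile: ["input stream", 42] = None):
    pass
```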

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060812/d1576e7a/attachment.htm 

From pje at telecommunity.com  Sun Aug 13 08:06:50 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sun, 13 Aug 2006 02:06:50 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <fb6fbf560608122229q56ccff29s780cbef862dc741c@mail.gmail.com>
References: <20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
Message-ID: <5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>

At 01:29 AM 8/13/2006 -0400, Jim Jewett wrote:
>On 8/13/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
>>"Phillip J. Eby" <pje at telecommunity.com> wrote:
>> > However, if you have:
>
>> >     def myfunc( x : doc("The x coordinate"), y : doc("The y coordinate") )
>
>> > There is no ambiguity.
>
>Sure there is.  There will probably be several frameworks using the
>magic name "doc".
>
>This isn't a problem for the person writing myfunc, and therefore
>isn't a problem for immediate decorators.  It is a problem for
>inspection code that wants to present information about arbitrary
>3rd-party libraries.

By this argument, we shouldn't have metaclasses or function attributes, 
because they have the same "problem".

However, it's only a problem if you insist on writing brain-damaged 
code.  If you want interoperability here, you must write tell-don't-ask 
code.  This is true for *any* use case where frameworks might share 
objects; there is absolutely *nothing* special about annotations in this 
regard!

I'm really baffled by the controversy over this; is it really the case that 
so many people don't know what tell-don't-ask code is or why you want 
it?  I guess maybe it's something that's only grasped by people who have 
experience writing code intended for interoperability.

After you run into the issue a few times, you look for a solution, and end 
up with either duck typing, interfaces/adaptation, overloaded functions, or 
ad hoc registries.  ALL of these solutions are *more* than adequate to 
handle a simple thing like argument annotations.  That's why I keep 
describing this as a trivial thing: even *pickling* is more complicated 
than this is.  This is no more complex than len() or iter() or filter()!

However, it appears that mine is a minority opinion.  Unfortunately, I'm at 
a bit of a communication disadvantage, because if somebody wants to believe 
something is complicated, there is nothing that anybody can do to change 
their mind.  If you don't consider the possibility that it is way simpler 
than you think, you will never be able to see it.

The other possibility, of course, is that all of you have some horrendously 
complex use case in mind that I just don't "get".  But so far all the 
examples that anybody else has put forth have been practically whimsical in 
their triviality -- while I've been explaining how the same principles will 
even work for complex things like type-checking code generation, let alone 
the trivial examples.  So I don't think that's it.  And at least Paul and 
Josiah have shown that they "get" what I'm saying, so I don't think that 
the answer is simply that I'm crazy, either.

[Meanwhile, I'm not going to respond to the rest of your message, since it 
contained some things that appeared to me to be a mixture of ad hominem 
attack and straw man argument.  I hope that was not actually your intent.]


From ironfroggy at gmail.com  Sun Aug 13 08:07:19 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Sun, 13 Aug 2006 02:07:19 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608122300q3b20db1apc707e537c36fd0ee@mail.gmail.com>
References: <5.1.1.6.0.20060812215907.0226e808@sparrow.telecommunity.com>
	<44DEA507.9040900@acm.org>
	<1cb725390608122300q3b20db1apc707e537c36fd0ee@mail.gmail.com>
Message-ID: <76fd5acf0608122307m11d3128ah3791ded3b3df2cd@mail.gmail.com>

On 8/13/06, Paul Prescod <paul at prescod.net> wrote:
> I made a proposal that Phillip was mostly okay with. What do other
> participants in the thread think? Would it move towards resolving this
> thread?
>
> "In order for processors of function annotations to work interoperably, they
> must use a common interpretation of objects used as annotations on a
> particular function. For example, one might interpret string annotations as
> docstrings. Another might interpret them as path segments for a web
> framework. For this reason, function annotation processors SHOULD avoid
> assigning processor-specific meanings to types defined outside of the
> processor's framework. For example, a Django processor could process
> annotations of a type defined in a Zope package, but Zope's creators should
> be considered the authorities on the type's meaning for the same reasons
> that they would be considered authorities on the semantics of classes or
> methods in their packages. This implies that the interpretation of built-in
> types would be controlled by Python's developers and documented in Python's
> documentation. This is just a best practice. Nothing in the language can or
> should enforce this practice and there may be a few domains where there is a
> strong argument for violating it (e.g. an education environment where
> saving keystrokes may be more important than easing interoperability)."
>
>
> "In Python 3000, semantics will be attached to the following types:
> basestring and its subtypes are to be used for documentation (though they
> are not necessarily the exclusive source of documentation about the type).
> List and its subtypes are to be used for attaching multiple independent
> annotations."
>
>  Paul Prescod

+1

This needs to be resolved; willy-nilly use of built-in types, or of
someone else's types, doesn't seem like something anyone could support.

From paul at prescod.net  Sun Aug 13 08:39:32 2006
From: paul at prescod.net (Paul Prescod)
Date: Sat, 12 Aug 2006 23:39:32 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060812221550.0258ce68@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812221550.0258ce68@sparrow.telecommunity.com>
Message-ID: <1cb725390608122339m6087c604l85faeb89d6061524@mail.gmail.com>

On 8/12/06, Phillip J. Eby <pje at telecommunity.com> wrote:
>
>
> I mostly like this; the main issue I see is that as long as we're
> recommending best practices, we should recommend using tell-don't-ask (via
> duck typing protocols, adaptation, or overloaded functions) so that their
> libraries can be enhanced and extended by other developers.


Would you mind suggesting text for the PEP as an addendum to what I
proposed? And an example of both bad and good practice?

>"In Python 3000, semantics will be attached to the following types:
> >basestring and its subtypes are to be used for documentation (though they
> >are not necessarily the exclusive source of documentation about the
> type).
> >List and its subtypes are to be used for attaching multiple independent
> >annotations."
>
> I'm not sure why we would use strings for documentation, but I'm not
> opposed since it eliminates the question of multiple interpretations for
> strings.


I don't understand your point. Is there a better use for strings? Or a
better type to associate with documentation? Or you just don't see a need
for inline parameter documentation? The PEP itself used string docstrings as
an example.

>(does chaining make sense in this context?)
>
> I don't know if I know what you mean by "chaining".  Good use of
> tell-don't-ask means that any interpretation of annotations nested in
> other
> annotations would be defined by the enclosing annotation (or in an
> overload
> for it).


Yes, it's clear what nesting means. I'm not asking about nesting.

The question was whether there should be any relationship implied by the
fact that an annotation appears to the left or right of another annotation
in a list of annotations.

def a(b: [doc('x'), type('y')]): pass

Is there any sense in which the function 'x' should be passed context
information that would help it wrap or communicate with 'y'?

The most likely answer is "no" but function decorators do chain so I just
wanted to raise the issue in case anyone wanted to make the case that
parameter and return code annotations should as well.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060812/31cb1cab/attachment.html 

From paul at prescod.net  Sun Aug 13 08:47:29 2006
From: paul at prescod.net (Paul Prescod)
Date: Sat, 12 Aug 2006 23:47:29 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608111649g54e82dd6kef19862f0c281254@mail.gmail.com>
References: <mailman.34014.1155280218.27774.python-3000@python.org>
	<5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
	<20060811084623.1931.JCARLSON@uci.edu> <44DD073C.7030305@acm.org>
	<43aa6ff70608111649g54e82dd6kef19862f0c281254@mail.gmail.com>
Message-ID: <1cb725390608122347q2527151fiadf1a8fc7bcd4af5@mail.gmail.com>

On 8/11/06, Collin Winter <collinw at gmail.com> wrote:
>
> >>> def chain(*decorators):
> >>>     assert len(decorators) >= 2
> >>>
> >>>     def decorate(function):
> >>>         sig = function.__signature__
> >>>         original = sig.annotations
> >>>
> >>>         for i, dec in enumerate(decorators):
> >>>             fake = dict((p, original[p][i]) for p in original)
> >>>
> >>>             function.__signature__.annotations = fake
> >>>             function = dec(function)
> >>>
> >>>         function.__signature__.annotations = original
> >>>         return function
> >>>     return decorate


I must be confused. This is a function returning a function. Does that mean
that the thing showing up in the __signatures__ dictionary is a function? Or
does the caller need to use two sets of parentheses to call the factory
function and then the inner function?

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060812/3263fd44/attachment.htm 

From paul at prescod.net  Sun Aug 13 09:02:05 2006
From: paul at prescod.net (Paul Prescod)
Date: Sun, 13 Aug 2006 00:02:05 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <76fd5acf0608122011w442afac8o6bfaa7f42ec9cbcd@mail.gmail.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<1cb725390608121705s6e43b02fo28b4e83865c914ab@mail.gmail.com>
	<76fd5acf0608122011w442afac8o6bfaa7f42ec9cbcd@mail.gmail.com>
Message-ID: <1cb725390608130002gbe3cb88j301b451386c51328@mail.gmail.com>

On 8/12/06, Calvin Spealman <ironfroggy at gmail.com> wrote:
>
> I've been looking for a good place to pipe in with the suggestion of
> defining that a dictionary as an annotation is taken as a mapping of
> annotation type names to the annotation itself, such as using {'doc':
> "The single character argument for the command line.", 'type': int} as
> an annotation for some parameter in a function.


I think we need to decide whether metadata type identifiers are just strings
or whether they will typically be objects. I think that the arguments in
favour of objects are strong.

> However, reading through all the posts I missed while recuperating from a
> long trip I just returned from, I think this coupled with taking _any
> iterable_ (not just list and subtypes) and the whole "your type, your
> annotation" guideline, is definitely sufficient for all uses.
>

One reason not to treat any iterable as a list of decorators is that a
string is an iterable. Maybe strings won't be the only annotation that
people want to attach that happens to be iterable for unrelated reasons.

A second reason that I restricted it to lists in particular is to encourage
consistent syntax (rather than one person using a list, another a tuple, a
third a generator, etc.).

And overall it is just overgeneralization. YAGNI. Lists work fine.

def myProtocolChainer(*args):
    return list(doSomething(args))

It is easy to loosen the protocol in future versions if I turn out to be
wrong.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060813/a1656667/attachment.html 

From paul at prescod.net  Sun Aug 13 09:42:06 2006
From: paul at prescod.net (Paul Prescod)
Date: Sun, 13 Aug 2006 00:42:06 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
Message-ID: <1cb725390608130042h50c7d7f9oc4068f30f2b04bbb@mail.gmail.com>

> And the interpretation of:
>
>     def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
>             outfile: [doc("output stream"), opt("o")] = sys.stdout
>     ):
>
> is likewise unambiguous, unless the creator of the documentation or option
> features has defined some other interpretation for a list than
> "recursively
> apply to contained items".


If the meaning is "unambiguous unless...", then it is ambiguous. So, as per
my previous proposal, I think you and I agree that we should disallow the
stupid interpretation by encoding the obvious one in the PEP.

In which case, you need only do something like:
>
>     def cat(infile: docopt("input stream", "i") = sys.stdin,
>             outfile: docopt("output stream", "o") = sys.stdout
>     ):
>
> with an appropriate definition of methods for the 'docopt' type.


Given that there are an infinite number of tools in the universe that could
be processing "doc" and "opt" annotations, how would the user KNOW that
there is one out there with a stupid interpretation of lists? They might
annotate thousands of classes before finding out that some hot tool that
they were planning to use next year is incompatible. So let's please define
a STANDARD way of attaching multiple annotations to a parameter. Lists seem
like a no-brainer choice for that.
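[Editorial sketch, not part of the original message:] The "recursively apply to contained items" rule for lists could be implemented like this; the `doc`, `opt`, and `collect` names are illustrative, not from any real framework.

```python
# Each processor walks lists recursively and keeps only the
# annotation types it owns, ignoring everyone else's.
class doc(str):
    pass

class opt(str):
    pass

def collect(annotation, wanted):
    if isinstance(annotation, list):
        found = []
        for item in annotation:
            found.extend(collect(item, wanted))
        return found
    return [annotation] if type(annotation) is wanted else []

annotation = [doc("input stream"), opt("i")]
```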

Since many people seem to be unfamiliar with overloaded functions, I would
> just like to take this opportunity to remind you that the actual overload
> mechanism is irrelevant.  If you gave 'doc' objects a 'printDocString()'
> method and 'opt' objects a 'setOptionName()' method, the exact same logic
> regarding extensibility applies.  The 'docopt' type would simply implement
> both methods.
>
> This is normal, simple standard Python stuff; nothing at all fancy.


The context is a little bit different than standard duck typing.

Let's say I define a function like this:

def car(b):
    "b is a list-like object"
    return b[0]

Then someone comes along and does something I never expected. They invent a
type representing a list of bits in a bitfield. They pass it to my function
and everything works trivially. But something important happened: the
programmer ASSERTED, by passing the bit list to the function 'car', that it
is a list-like object. My code wouldn't have tried to treat it as a list if
the user hadn't passed it as one explicitly.

Now look at it from the point of view of function annotations. As we said
before, the annotations are inert. They are just attached. There is some
code like a type checker or documentation generator that comes along after
the fact and scoops them up to do something with them. The user did not
assert (at the language level!) that any particular annotation applies to
any particular annotation processor. The annotation processor is just
looking for stuff that it recognizes. But what if it thinks it recognizes
something but does not?

Consider this potential case:

BobsDocumentationGenerator.py:

class BobsDocumentationGeneratorAnnotation:
    def __init__(self, doc):
        self.doc = doc
    def printDocument(self):
        print self.doc
    def sideEffect(self):
        deleteHardDrive()

def BobsDocumentationGenerator(annotation):
    if hasattr(annotation, "printDocument"):
        annotation.printDocument()

SamsDocumentationGenerator.py:

class SamsDocumentationGeneratorAnnotation:
    def __init__(self, doc):
        self.doc = doc
    def printDocument(self):
        return self.doc
    def sideEffect(self):
        email(self.doc, "python-dev at pytho...")

def SamsDocumentationGenerator(annotation):
    if hasattr(annotation, "printDocument"):
        print annotation.printDocument()
        annotation.sideEffect()

These objects, _by accident_ have the same method signature but different
side effects and return values. Nobody anywhere in the system made an
incorrect assertion. They just happened to be unlucky in the naming of their
methods. (unbelievably unlucky but you get the drift)

One simple way to make it unambiguous would be to do a test more like:

   if hasattr(annotation, SamsDocumentationGenerator.uniqueObject): ...

The association of the unique object with an annotator object would be an
explicit assertion of compatibility.
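[Editorial sketch, not part of the original message:] The unique-marker test described above could be implemented as follows; all names are hypothetical, and a distinctive string attribute name stands in for the unique object.

```python
# A processor trusts only annotations that explicitly carry its own
# marker, so an accidental method-name collision cannot be mistaken
# for an assertion of compatibility.
SAMS_MARKER = "_sams_doc_generator_marker"  # illustrative unique name

class SamsAnnotation:
    def __init__(self, doc):
        self.doc = doc
    def printDocument(self):
        return self.doc

setattr(SamsAnnotation, SAMS_MARKER, True)  # explicit opt-in

class Lookalike:
    def printDocument(self):  # same method name purely by accident
        return "not really Sam's"

def sams_generator(annotation):
    if getattr(annotation, SAMS_MARKER, False):
        return annotation.printDocument()
    return None  # no marker: ignore, never guess
```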

Can we agree that the PEP should describe strategies that people should use
to make their annotation recognition strategies unambiguous and
failure-proof?

I think that merely documenting appropriately defensive techniques might be
enough to make Talin happy. Note that it isn't the processing code that
needs to be defensive (in the sense of try/catch blocks). It is the whole
recognition strategy that the processing code uses. Whatever recognition
strategy it uses must be unambiguous. It seems like it would hurt nobody to
document this and suggest some unambiguous techniques.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060813/8fd0e73f/attachment-0001.htm 

From jcarlson at uci.edu  Sun Aug 13 09:59:06 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 13 Aug 2006 00:59:06 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060813010634.0228ee30@sparrow.telecommunity.com>
References: <20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813010634.0228ee30@sparrow.telecommunity.com>
Message-ID: <20060812233132.197F.JCARLSON@uci.edu>


"Phillip J. Eby" <pje at telecommunity.com> wrote:
> At 09:16 PM 8/12/2006 -0700, Josiah Carlson wrote:
> >"Phillip J. Eby" <pje at telecommunity.com> wrote:
> > > However, if you have:
> > >
> > >     def myfunc( x : doc("The x coordinate"), y : doc("The y coordinate") )
> > >
> > > There is no ambiguity.  Likewise:
> > >
> > >     def cat( infile:opt("i") = sys.stdin, outfile:opt("o") = sys.stdout ):
> > >
> > > is unambiguous.  And the interpretation of:
> > >
> > >     def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
> > >             outfile: [doc("output stream"), opt("o")] = sys.stdout
> > >     ):
> > >
> > > is likewise unambiguous, unless the creator of the documentation or option
> > > features has defined some other interpretation for a list than 
> > "recursively
> > > apply to contained items".  In which case, you need only do something like:
> > >
> > >     def cat(infile: docopt("input stream", "i") = sys.stdin,
> > >             outfile: docopt("output stream", "o") = sys.stdout
> > >     ):
> >
> >I now understand where you were coming from with regards to this being
> >equivalent to pickle (at least pickle + copy_reg).  I think that if you
> >would have posted this particular sample a couple days ago, there
> >wouldn't have been the discussion (argument?) about incompatible
> >mechanisms for annotation processing.
> 
> Well, it just seemed to me that that was the One Obvious Way To Do It; more 
> specifically, I couldn't conceive of any *other* way to do it!

Perhaps, but it was also obvious that very few people knew what the heck
you were talking about (hence the "how" and "what do you mean" queries).

Try to remember that while you may be old-hat at annotations, perhaps
not everyone discussing them at the moment has your particular
experience and assumptions.  Also, when you hand-wave with "it's trivial",
it's more than a little frustrating, because while it may be "trivial"
to you, it's certainly not trivial to the asker (why would they be
asking otherwise?)


> >With that said, the above is a protocol.  Just like __len__, __str__,
> >copy_reg, __reduce__, __setstate__, etc., are protocols.  It may not be
> >fully specified (when annotations are to be processed, if at all, by
> >whom, where the annotation registry is, etc.), but it is still a
> >protocol.
> 
> Actually, it's a family of *patterns* for creating protocols.  It's not a 
> protocol, incompletely specified or otherwise.  Note that the actual 
> implementation of the tell-don't-ask pattern can be via:

Here's my take: Protocol in this context is a set of rules for the
definition of the annotations and their interaction with the handler for
the annotations.  For what we seem to have agreed upon, the definition
is via a base class or instance, and the annotation handling is left up
to the user to define (via the four methods you offered, or even others).

If you want to call it a 'pattern', 'protocol', 'meta-protocol', or
whatever, they are all effectively the same thing in this context; a way
of writing annotations that can later be seen as having a (hopefully
unambiguous) meaning.


> But I don't believe we know enough *today* to be able to safely define a 
> rigid specification without ruling out possibly-valid uses.  By making a 
> less-rigid specification, we force annotation consumers to code 
> defensively...  which is really the right thing to do in a heterogeneous 
> environment anyway.

Right.  I'm in no way suggesting that a 'rigid' specification be
developed, and I'm generally on the fence about whether *any*
specification should be done.  But really, the more I think about it,
the more I believe that *something* should be offered as a starting
point. Whether it is in the Python cookbook, a 3rd party module or
package, etc. As long as it includes a link from the standard Python
documentation where annotations are discussed, I think that would be
satisfactory.


> >but I also cannot convince myself that specifying anything
> >further would be flexible enough to not be a mistake.
> 
> Right - that's the bit I'm concerned about.  Python also usually doesn't 
> impose such policy constraints on mechanism.  For example, function 
> attributes can be or contain anything, and nobody has argued that there 
> need to be prespecified combination semantics, despite the fact that 
> multiple tools can be consumers of the attributes.

Ahh, but function decorators *do* have a specified combination semantic;
specifically an order of application and chaining (the return from the
first decorator will be passed to the second decorator, etc.).

If we were to specify anything, I would suggest we define an order of
annotation calling, which would also define a chaining order if
applicable.  Maybe it is completely obvious, but one should never
underestimate what kinds of silly things users will do.
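[Editorial sketch, not part of the original message:] The decorator ordering referred to above can be demonstrated directly.

```python
# Decorators apply bottom-up: the one nearest the function runs
# first, and each receives the previous decorator's return value.
applied = []

def first(fn):
    applied.append("first")
    return fn

def second(fn):
    applied.append("second")
    return fn

@first
@second
def f():
    return "done"
```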


You responded to Jim Jewett
> [Meanwhile, I'm not going to respond to the rest of your message, since it 
> contained some things that appeared to me to be a mixture of ad hominem 
> attack and straw man argument.  I hope that was not actually your intent.]

As a point of reference, even after you linked the documentation about
PEAK, I still had *no idea* what the heck you meant about PEAK
annotations or their implications to function argument annotations. I
like to believe that I'm not stupid, but maybe I'm wrong, or maybe the
documentation could be better (this isn't an insult, I'm quite
experienced at writing poor documentation)?

 - Josiah


From paul at prescod.net  Sun Aug 13 10:06:01 2006
From: paul at prescod.net (Paul Prescod)
Date: Sun, 13 Aug 2006 01:06:01 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
Message-ID: <1cb725390608130106y3cf29002q6c63dd6ac1ce04d4@mail.gmail.com>

Sorry to write so many emails, but I want to get in one last point tonight
(I'm sure I'll regret posting late at night).

Jim's email seems not to have gotten through to the whole list. There's a
lot of that going around.

On 8/12/06, Phillip J. Eby <pje at telecommunity.com> wrote:
>
> >Sure there is.  There will probably be several frameworks using the
> >magic name "doc".
> >
> >This isn't a problem for the person writing myfunc, and therefore
> >isn't a problem for immediate decorators.  It is a problem for
> >inspection code that wants to present information about arbitrary
> >3rd-party libraries.
>
> By this argument, we shouldn't have metaclasses or function attributes,
> because they have the same "problem".


I don't think Jim's issue is a real one (according to the snippet I see in
your email) because doc is an object defined in one and only one place in
Python. It has a unique id(). If two people use the name "doc" then they
will be addressable as module1.doc() and module2.doc(). No problem.
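A small sketch of the point above, with hypothetical stand-ins for the two modules' doc types: recognition keyed on the actual object or type, rather than the bare name "doc", cannot collide:

```python
class Module1Doc:                     # stands in for module1.doc
    def __init__(self, text):
        self.text = text

class Module2Doc:                     # stands in for module2.doc
    def __init__(self, text):
        self.text = text

def module1_processor(ann):
    # Type identity, not the spelling of the name "doc", drives
    # recognition, so module2's doc is simply ignored here.
    if isinstance(ann, Module1Doc):
        return ann.text
    return None

assert module1_processor(Module1Doc("input stream")) == "input stream"
assert module1_processor(Module2Doc("input stream")) is None
```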

However, it's only a problem if you insist on writing brain-damaged
> code.  If you want interoperability here, you must write tell-don't-ask
> code.  This is true for *any* use case where frameworks might share
> objects; there is absolutely *nothing* special about annotations in this
> regard!


There is something different about annotations than everything else in
Python so far. Annotations are the first feature other than docstrings
(which are proto-annotations) in core Python where third party tools are
supposed to go trolling through your objects FINDING STUFF that they may
decide is interesting or not to them. When you attach a metaclass or a
decorator, you INVOKE CODE that you have installed on your hard drive and if
it crashes then you load up your debugger and see what happened.

When you attach an annotation, you are just adding information that code
OUTSIDE OF YOUR CONTROL will poke around and interpret (the metadata
processor, like a type checker or documentation generator). What you do when
you attach an annotation is make an assertion. You always want to be
confident that you and the person writing the processor code have the same
understanding of the assertion you are making. You do not want to attach a
list because you are asserting that the list is a container for a bunch of
other assertions about the contents of the list whereas the person writing
the processing code thinks that you are asserting that the variable will be
of TYPE list.

Now I'm sure that with all of your framework programming you've run into
this many times and have many techniques for making these assertions
unambiguous. All we need to do is document them so that people who are not
as knowledgable will not get themselves into trouble. It isn't sufficient to
say: "Only smart people will use this stuff so we need not worry" which is
what the original PEP said. Even if it is true, I don't understand why we
would bother taking the risk when the alternative is so low-cost. Define the
behaviour for interpreting a few built-in types and define guidelines and
best practices for other types.

After you run into the issue a few times, you look for a solution, and end
> up with either duck typing, interfaces/adaptation, overloaded functions,
> or
> ad hoc registries.  ALL of these solutions are *more* than adequate to
> handle a simple thing like argument annotations.  That's why I keep
> describing this as a trivial thing: even *pickling* is more complicated
> than this is.  This is no more complex than len() or iter() or filter()!


Pickling works because of the underscores and magic like "
__safe_for_unpickling__". len() works because of __len__, etc. There are
reasons there are underscores there. You understand them, I understand them,
Talin understands them. That doesn't mean that they are self-evident. A
lesser inventor might have used a method just called "safe_for_pickling" and
some unlucky programmer at Bick's might have accidentally triggered
unexpected aspects of the protocol while documenting the properties of
cucumbers.

These are not universally understood techniques. Let's just document them in
the PEP.

However, it appears that mine is a minority opinion.  Unfortunately, I'm at
> a bit of a communication disadvantage, because if somebody wants to
> believe
> something is complicated, there is nothing that anybody can do to change
> their mind.  If you don't consider the possibility that it is way simpler
> than you think, you will never be able to see it.


If it wasn't at least a bit complicated then there would be no underscores.
The underscores are there to prevent SOMETHING bad from happening, right?

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060813/f99f4ac2/attachment.htm 

From paul at prescod.net  Sun Aug 13 10:17:26 2006
From: paul at prescod.net (Paul Prescod)
Date: Sun, 13 Aug 2006 01:17:26 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608111649g54e82dd6kef19862f0c281254@mail.gmail.com>
References: <mailman.34014.1155280218.27774.python-3000@python.org>
	<5.1.1.6.0.20060811112118.023af398@sparrow.telecommunity.com>
	<20060811084623.1931.JCARLSON@uci.edu> <44DD073C.7030305@acm.org>
	<43aa6ff70608111649g54e82dd6kef19862f0c281254@mail.gmail.com>
Message-ID: <1cb725390608130117p7f393441ld43f4f901728b316@mail.gmail.com>

On 8/11/06, Collin Winter <collinw at gmail.com> wrote:
...

What Josiah is hinting at -- and what Talin describes more explicitly
> -- is the problem of how exactly "chaining" annotation interpreters
> will work.


I don't think the question is really how to chain them. The question is how
to avoid them stepping on top of each other accidentally.

The case I've thought out the most completely is that of using
> decorators to analyse/utilise the annotations:


This is not as interesting a case as the following:

annotation scheme 1 is invented by person 1
annotation scheme 2 is invented by person 2
person 3 must use them together on a single function
persons 4 through 1000 write programs that hunt for annotation scheme 1
objects on functions in modules.
persons 2000 through 4000 write programs that hunt for annotation scheme 2
objects.

How can persons 4 through 4000 be confident when they see an annotation on
an object that they are interpreting it as person 3 intended? How can they
be confident that they are not accidentally processing an object (a list, a
string, a file, a customer object, whatever) that was intended to be an
assertion in annotation scheme 1 according to the rules of annotation scheme
2?

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060813/7929f61c/attachment.html 

From talin at acm.org  Sun Aug 13 10:18:18 2006
From: talin at acm.org (Talin)
Date: Sun, 13 Aug 2006 01:18:18 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608130042h50c7d7f9oc4068f30f2b04bbb@mail.gmail.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>	
	<44DD5DF0.40405@acm.org>	
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>	
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<1cb725390608130042h50c7d7f9oc4068f30f2b04bbb@mail.gmail.com>
Message-ID: <44DEE04A.4090708@acm.org>

Paul Prescod wrote:
>> And the interpretation of:
>>
>>     def cat(infile: [doc("input stream"), opt("i")] = sys.stdin,
>>             outfile: [doc("output stream"), opt("o")] = sys.stdout
>>     ):
>>
>> is likewise unambiguous, unless the creator of the documentation or 
>> option
>> features has defined some other interpretation for a list than
>> "recursively
>> apply to contained items".
> 
> 
> If the meaning is "unambiguous unless..." then it is ambiguous. So as per my
> previous proposal I think that you and I agree that we should disallow the
> stupid interpretation by encoding the obvious one in the PEP.
> 
> In which case, you need only do something like:
>>
>>     def cat(infile: docopt("input stream", "i") = sys.stdin,
>>             outfile: docopt("output stream", "o") = sys.stdout
>>     ):
>>
>> with an appropriate definition of methods for the 'docopt' type.
> 
> 
> Given that there are an infinite number of tools in the universe that could
> be processing "doc" and "opt" annotations, how would the user KNOW that
> there is one out there with a stupid interpretation of lists? They might
> annotate thousands of classes before finding out that some hot tool that
> they were planning to use next year is incompatible. So let's please define
> a STANDARD way of attaching multiple annotations to a parameter. Lists seem
> like a no-brainer choice for that.
> 
> Since many people seem to be unfamiliar with overloaded functions, I would
>> just like to take this opportunity to remind you that the actual overload
>> mechanism is irrelevant.  If you gave 'doc' objects a 'printDocString()'
>> method and 'opt' objects a 'setOptionName()' method, the exact same logic
>> regarding extensibility applies.  The 'docopt' type would simply 
>> implement
>> both methods.
>>
>> This is normal, simple standard Python stuff; nothing at all fancy.
> 
> 
> The context is a little bit different than standard duck typing.
> 
> Let's say I define a function like this:
> 
> def car(b):
> "b is a list-like object"
> return b[0]
> 
> Then someone comes along and does something I never expected. They invent a
> type representing a list of bits in a bitfield. They pass it to my function
> and everything works trivially. But there's something important that
> happened. The programmer ASSERTED, by passing the bit list to the function,
> that it is a list-like object. My code wouldn't have tried to treat it
> as a list if the user hadn't passed it as one explicitly.
> 
> Now look at it from the point of view of function annotations. As we said
> before, the annotations are inert. They are just attached. There is some
> code like a type checker or documentation generator that comes along after
> the fact and scoops them up to do something with them. The user did not
> assert (at the language level!) that any particular annotation applies to
> any particular annotation processor. The annotation processor is just
> looking for stuff that it recognizes. But what if it thinks it recognizes
> something but does not?
> 
> Consider this potential case:
> 
> BobsDocumentationGenerator.py:
> 
> class BobsDocumentationGeneratorAnnotation:
>    def __init__...
>    def printDocument(self):
>        print self.doc
>    def sideEffect(self):
>        deleteHardDrive()
> 
> def BobsDocumentationGenerator(annotation):
>   if hasattr(annotation, "printDocument"):
>       annotation.printDocument()
> 
> SamsDocumentationGenerator.py:
> 
> class SamsDocumentationGeneratorAnnotation:
>    def __init__...
>    def printDocument(self):
>        return self.doc
>    def sideEffect(self):
>        email(self.doc, "python-dev at pytho...")
> 
> def SamsDocumentationGenerator(annotation):
>   if hasattr(annotation, "printDocument"):
>       print annotation.printDocument()
>       annotation.sideEffect()
> 
> These objects, _by accident_ have the same method signature but different
> side effects and return values. Nobody anywhere in the system made an
> incorrect assertion. They just happened to be unlucky in the naming of 
> their
> methods. (unbelievably unlucky but you get the drift)
> 
> One simple way to make it unambiguous would be to do a test more like:
> 
>   if hasattr(annotation, SamsDocumentationGenerator.uniqueObject): ...
> 
> The association of the unique object with an annotator object would be an
> explicit assertion of compatibility.
> 
> Can we agree that the PEP should describe strategies that people should use
> to make their annotation recognition strategies unambiguous and
> failure-proof?
> 
> I think that merely documenting appropriately defensive techniques might be
> enough to make Talin happy. Note that it isn't the processing code that
> needs to be defensive (in the sense of try/catch blocks). It is the whole
> recognition strategy that the processing code uses. Whatever recognition
> strategy it uses must be unambiguous. It seems like it would hurt nobody to
> document this and suggest some unambiguous techniques.
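A minimal sketch of the unique-marker recognition strategy quoted above (all names hypothetical): the marker's identity, not a method name, is the explicit assertion of compatibility:

```python
# Each processor owns a unique marker object; annotations opt in by
# carrying it as an attribute, which is an explicit compatibility claim.
MARKER = object()   # stand-in for SamsDocumentationGenerator.uniqueObject

class SamsAnnotation:
    sams_marker = MARKER
    def __init__(self, doc):
        self.doc = doc
    def printDocument(self):
        return self.doc

def sams_generator(ann):
    # Only annotations explicitly carrying Sam's marker are processed.
    if getattr(ann, "sams_marker", None) is MARKER:
        return ann.printDocument()
    return None

class LookalikeAnnotation:          # same method name, but no marker
    def printDocument(self):
        return "not for Sam"

assert sams_generator(SamsAnnotation("hello")) == "hello"
assert sams_generator(LookalikeAnnotation()) is None
```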

This says pretty much what I was trying to say, only better :)

I think I am going to chill out on this topic for a bit - it seems that 
there are folks who have a better understanding of the issue than I do, 
and mainly the only reason I was commenting on the PEP was because that 
was what was asked for. I don't really have a big stake in the whole 
annotation effort, there are other issues that I am really more 
interested in.

-- Talin


From paul at prescod.net  Sun Aug 13 10:24:00 2006
From: paul at prescod.net (Paul Prescod)
Date: Sun, 13 Aug 2006 01:24:00 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <20060812233132.197F.JCARLSON@uci.edu>
References: <20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813010634.0228ee30@sparrow.telecommunity.com>
	<20060812233132.197F.JCARLSON@uci.edu>
Message-ID: <1cb725390608130124m2e3a3254v40058e23c2b6b737@mail.gmail.com>

On 8/13/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> ...
> If we were to specify anything, I would suggest we define an order of
> annotation calling, which would also define a chaining order if
> applicable.  Maybe it is completely obvious, but one should never
> underestimate what kinds of silly things users will do.
>

Annotations are not called. They are not like decorators. Decorators
typically "wrap" a function. Annotations are just attached to it. A
decorator must be a callable. An annotation could be just the number "5".
Decorators build on each other, perhaps changing the function's behaviour.
Annotations (should!) just accumulate and typically do not change the
parameter's behaviour. The PEP does not say how you would define annotations
that just accumulate but it seems common sense to me that it would be
through a list syntax. I think that the PEP should just say that.
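A sketch of that reading, runnable under the annotation syntax Python 3 eventually adopted (the doc/opt constructors are hypothetical): the list merely accumulates inert values, and nothing is called by the language:

```python
def doc(text):            # hypothetical annotation constructor;
    return ("doc", text)  # the language never calls these itself
def opt(flag):
    return ("opt", flag)

# Both annotations are attached, inert, as items of a plain list.
def cat(infile: [doc("input stream"), opt("i")] = None):
    pass

# Only a later processor inspects them, via the function object.
assert cat.__annotations__["infile"] == [("doc", "input stream"), ("opt", "i")]
```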

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060813/b232372c/attachment.htm 

From jcarlson at uci.edu  Sun Aug 13 10:53:23 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 13 Aug 2006 01:53:23 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608130124m2e3a3254v40058e23c2b6b737@mail.gmail.com>
References: <20060812233132.197F.JCARLSON@uci.edu>
	<1cb725390608130124m2e3a3254v40058e23c2b6b737@mail.gmail.com>
Message-ID: <20060813013709.1982.JCARLSON@uci.edu>


"Paul Prescod" <paul at prescod.net> wrote:
> On 8/13/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> >
> > ...
> > If we were to specify anything, I would suggest we define an order of
> > annotation calling, which would also define a chaining order if
> > applicable.  Maybe it is completely obvious, but one should never
> > underestimate what kinds of silly things users will do.
> 
> Annotations are not called. They are not like decorators.

Right.  What I meant (which perhaps wasn't what I said) was that we
should define the order in which functions that operate on these
annotations execute, regardless of the mechanism.  Say, for example, I
have the following function definition:

    def foo(arg1:[bar(1), baz(2)]):
        ...

However the (unspecified user defined machinery that handles the)
annotation processing gets to foo(), if it knows about how to handle the
'bar' and 'baz' annotations, a properly written annotation processor
will handle the 'bar' annotation before the 'baz' annotation.
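A sketch (hypothetical bar/baz classes, Python 3 annotation syntax) of a processor respecting that left-to-right order:

```python
class bar:
    def __init__(self, n):
        self.n = n

class baz:
    def __init__(self, n):
        self.n = n

def foo(arg1: [bar(1), baz(2)]):
    pass

def process(func):
    # Walk each parameter's annotation list left to right, so the
    # 'bar' annotation is always handled before the 'baz' annotation.
    handled = []
    for name, anns in func.__annotations__.items():
        for ann in anns:
            handled.append(type(ann).__name__)
    return handled

assert process(foo) == ["bar", "baz"]
```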


 - Josiah


From talin at acm.org  Sun Aug 13 13:07:44 2006
From: talin at acm.org (Talin)
Date: Sun, 13 Aug 2006 04:07:44 -0700
Subject: [Python-3000] Python/C++ question
In-Reply-To: <ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>
References: <44DA6C01.2040904@acm.org>
	<ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>
Message-ID: <44DF0800.4060204@acm.org>

Guido van Rossum wrote:
> On 8/9/06, Talin <talin at acm.org> wrote:
> For the majority of Python developers it's probably the other way
> around. It's been 15 years since I wrote C++, and unlike C, that
> language has changed a lot since then...
> 
> It would be a complete rewrite; I prefer doing a gradual
> transmogrification of the current codebase into Py3k rather than
> starting from scratch (read Joel Spolsky on why).

BTW, Should this be added to PEP 3099?

(Although I do think that a gradual transition is certainly possible, I 
am not going to push for it.)

-- Talin

From talin at acm.org  Sun Aug 13 13:30:00 2006
From: talin at acm.org (Talin)
Date: Sun, 13 Aug 2006 04:30:00 -0700
Subject: [Python-3000] Bound and unbound methods
Message-ID: <44DF0D38.6070507@acm.org>

One of the items in PEP 3100 is getting rid of unbound methods. I want 
to explore a heretical notion, which is getting rid of bound methods as 
well.

Now, to be honest, I rather like bound methods. I like being able to 
capture a method call, store it in a variable, and call it later.

However, I also realize that requiring every access to a class variable 
to instantiate a new method object is expensive, to say the least.

Calling a callable would not require a bound method - the 'self' 
parameter would be just another argument. User-defined functions would 
then be no different from native built-in functions or other callables.

You would still need some way to explicitly bind a method if you wanted 
to store it in a variable, perhaps using something like the various 
wrappers in module 'functional'. It would be extra typing, but for me at 
least it's not something I do very often, and it would at least have the 
virtue that the intent of the code would be more visually obvious. 
(Also, I tend to find, in my code at least, that I more often use 
closures to accomplish the same thing, which are both clearer to read 
and more powerful.)
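The explicit binding Talin describes could be spelled with a partial-application wrapper; a sketch using functools.partial (the module went by the name 'functional' in earlier drafts):

```python
import functools

class Greeter(object):
    def greet(self, name):
        return "hi " + name

g = Greeter()

# Explicitly bind 'self' up front, instead of relying on an
# implicitly created bound-method object.
bound = functools.partial(Greeter.greet, g)
assert bound("world") == "hi world"
```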

Now, one remaining problem to be solved is whether or not to pass 'self' 
as an argument to the resulting callable. I suppose that could be 
handled by inspecting the attributes of the callable and adding the 
extra 'self' argument at the last minute if it's not a static method. I 
suspect such tests would be relatively fast, much less than the time 
needed to instantiate and initialize a new method object.

Anyway, I just wanted to throw that out there. Feel free to -1 away... :)

-- Talin

From g.brandl at gmx.net  Sun Aug 13 14:24:59 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Sun, 13 Aug 2006 14:24:59 +0200
Subject: [Python-3000] Python/C++ question
In-Reply-To: <44DF0800.4060204@acm.org>
References: <44DA6C01.2040904@acm.org>	<ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>
	<44DF0800.4060204@acm.org>
Message-ID: <ebn5ms$mne$1@sea.gmane.org>

Talin wrote:
> Guido van Rossum wrote:
>> On 8/9/06, Talin <talin at acm.org> wrote:
>> For the majority of Python developers it's probably the other way
>> around. It's been 15 years since I wrote C++, and unlike C, that
>> language has changed a lot since then...
>> 
>> It would be a complete rewrite; I prefer doing a gradual
>> transmogrification of the current codebase into Py3k rather than
>> starting from scratch (read Joel Spolsky on why).
> 
> BTW, Should this be added to PEP 3099?

Yes, why not.

Georg


From pje at telecommunity.com  Sun Aug 13 19:28:42 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Sun, 13 Aug 2006 13:28:42 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608130106y3cf29002q6c63dd6ac1ce04d4@mail.gmail.com>
References: <5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>

At 01:06 AM 8/13/2006 -0700, Paul Prescod wrote:
>There is something different about annotations than everything else in 
>Python so far. Annotations are the first feature other than docstrings 
>(which are proto-annotations) in core Python where third party tools are 
>supposed to go trolling through your objects FINDING STUFF that they may 
>decide is interesting or not to them.

You make it sound like we've never had documentation tools before, or web 
servers.  Zope has been trolling through Python objects "finding stuff" 
since *1996*.  It's not at all a coincidence that the first 
interface/adaptation systems for Python (AFAIK) were built for Zope.

So some people in the Python community have had an entire *decade* of 
experience with this kind of thing.  It's just a guess, but some of them 
might actually know a thing or two about the subject by now.  ;-)


>Now I'm sure that with all of your framework programming you've run into 
>this many times and have many techniques for making these assertions 
>unambiguous. All we need to do is document them so that people who are not 
>as knowledgable will not get themselves into trouble.

Sure.  Here are two nice articles that people can read to understand the 
basic ideas of "tell, don't ask".  One by the "Pragmatic Programmers":

     http://www.pragmaticprogrammer.com/articles/jan_03_enbug.pdf


And another by Allen Holub on the evils of getters and setters, that 
touches on the same principles:

     http://www.javaworld.com/javaworld/jw-09-2003/jw-0905-toolbox.html



>It isn't sufficient to say: "Only smart people will use this stuff so we 
>need not worry" which is what the original PEP said. Even if it is true, I 
>don't understand why we would bother taking the risk when the alternative 
>is so low-cost.

There are so many other pitfalls to writing extensible and interoperable 
code in Python, why focus so much effort on such an incredibly minor 
one?  The truth is that hardly anybody cares about writing extensible or 
interoperable code except framework developers -- and they've already *got* 
solutions.  Twisted or Zope developers would see this as a trivial use case 
for adaptation, and PEAK developers would use either adaptation or generic 
functions, and keep on moving with nary a speedbump.

Nonetheless, I don't object to documenting best practices; I just don't 
want to mandate a *particular* solution -- with one exception.

If Py3K is going to include overloaded functions, then that should be 
considered the One Obvious Way to work with annotations, since it's an 
"included battery" (and none of the existing 
interface/adaptation/overloading toolkits are likely to work as-is in Py3K 
without some porting effort).  But if Py3K doesn't include overloading or 
adaptation, then the One Obvious Way will be "whatever a knowledgeable 
framework programmer wants to do."


>Pickling works because of the underscores and magic like " 
>__safe_for_unpickling__". len() works because of __len__, etc. There are 
>reasons there are underscores there. You understand them, I understand 
>them, Talin understands them. That doesn't mean that they are 
>self-evident. A lesser inventor might have used a method just called 
>"safe_for_pickling" and some unlucky programmer at Bick's might have 
>accidentally triggered unexpected aspects of the protocol while 
>documenting the properties of cucumbers.

Note that you're pointing out a problem that already exists today in 
Python, and has for some time.  It's why the Zope folks use interfaces and 
adaptation, and why I use overloaded functions.  The problem has nothing to 
do with annotations as such, so if you want to solve that problem, you 
should be pushing for overloaded functions in the stdlib, and using 
annotations as an example of why they're good to have.


>Can we agree that the PEP should describe strategies that people should 
>use to make their annotation recognition strategies unambiguous and 
>failure-proof?

Absolutely - and I recommended that we recommend "tell, don't ask" 
processing using one of the following techniques:

1. duck typing
2. adaptation
3. overloaded functions
4. type registries

You seem to be arguing that duck typing is inadequate because it is 
name-based and names can conflict.  I agree, which is why I believe #2-4 
are better: they don't rely on mere name matching.  However, duck typing is 
still *adequate* as long as names are sufficiently descriptive or at least 
lengthy enough to prevent collision.  Including a package-specific 
namespace prefix like "foo_printDocumentation" is sufficient best practice 
to avoid duck typing name collisions in virtually all cases.
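A sketch of that prefixed duck-typing convention (hypothetical names, borrowing the foo_printDocumentation example above):

```python
class DocAnnotation:
    # The package-specific "foo_" prefix makes an accidental name
    # collision with an unrelated object's method very unlikely.
    def foo_printDocumentation(self):
        return "input stream"

def process(ann):
    # "Tell, don't ask": dispatch on the prefixed method and
    # silently skip anything that doesn't provide it.
    method = getattr(ann, "foo_printDocumentation", None)
    if method is not None:
        return method()
    return None

assert process(DocAnnotation()) == "input stream"
assert process([1, 2, 3]) is None  # unrecognized annotation is skipped
```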

I'm just baffled why all this focus on the issue on such a minor thing, 
when Python has far more pitfalls to interoperability than this.  But I 
guess if you see this as the first time that objects might be implicitly 
used by something, I suppose it makes sense.  But it's really not the first 
time, and these are well-understood problems among developers of major 
Python frameworks, especially Zope.


>I think that merely documenting appropriately defensive techniques might 
>be enough to make Talin happy. Note that it isn't the processing code that 
>needs to be defensive (in the sense of try/catch blocks). It is the whole 
>recognition strategy that the processing code uses. Whatever recognition 
>strategy it uses must be unambiguous. It seems like it would hurt nobody 
>to document this and suggest some unambiguous techniques.

I already recommended that we do this, and have repeated my recommendation 
above for your convenience.


From steven.bethard at gmail.com  Sun Aug 13 19:29:20 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Sun, 13 Aug 2006 11:29:20 -0600
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <44DF0D38.6070507@acm.org>
References: <44DF0D38.6070507@acm.org>
Message-ID: <d11dcfba0608131029t32a82113yf851e1c6cfce23c2@mail.gmail.com>

On 8/13/06, Talin <talin at acm.org> wrote:
> One of the items in PEP 3100 is getting rid of unbound methods. I want
> to explore a heretical notion, which is getting rid of bound methods as
> well.

I believe you're suggesting that the code that I just wrote moments
ago would stop working::

    get_features = self._get_document_features
    return [get_features(i, document_graph, comparable_graphs)
            for i, document_graph in enumerate(document_graphs)]

The line ``get_features = ...`` expects the function stored to be
bound to ``self``.  I write code like this *all the time*,
particularly when I have a long method name that needs to be used in a
complex expression and I want to keep my lines within the suggested 79
characters.

If I understand the proposal right and my code above would be
invalidated, I'm a strong -1 to this. It would break an enormous
amount of my code.

STeVe
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From jcarlson at uci.edu  Sun Aug 13 19:58:33 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 13 Aug 2006 10:58:33 -0700
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <44DF0D38.6070507@acm.org>
References: <44DF0D38.6070507@acm.org>
Message-ID: <20060813102036.1985.JCARLSON@uci.edu>


Talin <talin at acm.org> wrote:
> 
> One of the items in PEP 3100 is getting rid of unbound methods. I want 
> to explore a heretical notion, which is getting rid of bound methods as 
> well.
> 
> Now, to be honest, I rather like bound methods. I like being able to 
> capture a method call, store it in a variable, and call it later.
> 
> However, I also realize that requiring every access to a class variable 
> to instantiate a new method object is expensive, to say the least.

Well, it's up-front vs. at-access.  For instances whose methods are
generally used rarely, the up-front cost of instantiating every method
is high in comparison (unless there are a relatively large number of
method accesses), and technically infinite if applied to all objects.
Why?

I have a class foo, I instantiate foo, now all of foo's methods get
instantiated.  Ahh, but foo's methods are also instances of function. It
doesn't really have any new methods on foo's methods, but they do have
attributes that are instances, so we will need to instantiate all of the
methods' attributes' methods, and recursively, to infinity.  The
non-creation of instantiated methods for objects is a lazy-evaluation
technique to prevent infinite recursion, in general.
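The lazy creation Josiah describes is easy to observe directly (modern syntax; the same holds for new-style classes in Python 2):

```python
class Foo(object):
    def bar(self):
        pass

f = Foo()

# Each attribute access manufactures a fresh bound-method object...
assert f.bar is not f.bar

# ...while "instantiating" the method caches a single object on the
# instance, so repeated accesses return the very same object.
f.bar = f.bar
assert f.bar is f.bar
```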

On the other hand, it may make sense to offer a metaclass and/or
decorator that signals that a single method instance should be created
for particular methods up-front, rather than at-access to those methods.
But what kind of difference could we expect?  42%/28% improvement for
class methods/object methods in 2.4 respectively, and 45%/26%
improvement in 2.5 beta.  This does not include actually calling the
methods.


> Now, one remaining problem to be solved is whether or not to pass 'self' 
> as an argument to the resulting callable. I suppose that could be 
> handled by inspecting the attributes of the callable and adding the 
> extra 'self' argument at the last minute if its not a static method. I 
> suspect such tests would be relatively fast, much less than the time 
> needed to instantiate and initialize a new method object.

I think that a change requiring calls of the form
obj.instancemethod(obj, ...) is a non-starter.


I'm -1 for instantiating all methods (for the infinite cost reasons),
and -1 for int, long, list, tuple, dict, float (method access is
generally limited for these objects).  I'm +0 for offering a suitable
metaclass and/or decorator, but believe it would be better suited for
the Python cookbook, as the performance improvement, once function calls
are taken into consideration, is significantly smaller.

 - Josiah

[1]

Timings for accessing instance methods

Python 2.4.3 (#69, Mar 29 2006, 17:35:34) [MSC v.1310 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import time
>>>
>>> def test(n):
...     _time = time
...
...     class foo:
...         def bar(self):
...             pass
...     xr = xrange(n)
...     x = foo()
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'class method', time.time()-t
...
...     x.bar = x.bar
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'instantiated class method', time.time()-t
...
...     class foo(object):
...         def bar(self):
...             pass
...
...     x = foo()
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'object method', time.time()-t
...
...     x.bar = x.bar
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'instantiated object method', time.time()-t
...
...     class foo(object):
...         __slots__ = 'bar'
...         def __init__(self):
...             self.bar = self._bar
...         def _bar(self):
...             pass
...
...     x = foo()
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'instantiated object __slot__ method', time.time()-t
...
>>> test(5000000)
class method 1.96799993515
instantiated class method 1.14100003242
object method 1.71900010109
instantiated object method 1.23399996758
instantiated object __slot__ method 1.26600003242
>>>

Python 2.5b2 (r25b2:50512, Jul 11 2006, 10:16:14) [MSC v.1310 32 bit (Intel)] on
 win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import time
>>>
>>> def test(n):
...     _time = time
...
...     class foo:
...         def bar(self):
...             pass
...     xr = xrange(n)
...     x = foo()
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'class method', time.time()-t
...
...     x.bar = x.bar
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'instantiated class method', time.time()-t
...
...     class foo(object):
...         def bar(self):
...             pass
...
...     x = foo()
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'object method', time.time()-t
...
...     x.bar = x.bar
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'instantiated object method', time.time()-t
...
...     class foo(object):
...         __slots__ = 'bar'
...         def __init__(self):
...             self.bar = self._bar
...         def _bar(self):
...             pass
...
...     x = foo()
...     t = time.time()
...     for i in xr:
...         x.bar
...     print 'instantiated object __slot__ method', time.time()-t
...
>>> test(5000000)
class method 1.98500013351
instantiated class method 1.09299993515
object method 1.67199993134
instantiated object method 1.23500013351
instantiated object __slot__ method 1.23399996758
>>>


From paul at prescod.net  Sun Aug 13 19:57:20 2006
From: paul at prescod.net (Paul Prescod)
Date: Sun, 13 Aug 2006 10:57:20 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
Message-ID: <1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>

If we get past the meta-discussion, I don't really see any disagreement
left. I'll grit my teeth and avoid commenting on the meta-discussion. ;)

My proposed text for the PEP is as follows:

"In order for processors of function annotations to work interoperably, they
must use a common interpretation of objects used as annotations on a
particular function. For example, one might interpret string annotations as
docstrings. Another might interpret them as path segments for a web
framework. For this reason, function annotation processors SHOULD avoid
assigning processor-specific meanings to types defined outside of the
processor's framework. For example, a Django processor could process
annotations of a type defined in a Zope package, but Zope's creators should
be considered the authorities on the type's meaning for the same reasons
that they would be considered authorities on the semantics of classes or
methods in their packages."

"This implies that the interpretation of built-in types would be controlled
by Python's developers and documented in Python's documentation. This is
just a best practice. Nothing in the language can or should enforce this
practice and there may be a few domains where there is a strong argument for
violating it ( e.g. an education environment where saving keystrokes may be
more important than easing interoperability)."

"In Python 3000, semantics will be attached to the following types: objects
of type string (or subtype of string) are to be used for documentation
(though they are not necessarily the exclusive source of documentation about
the type). Objects of type list (or subtype of list) are to be used for
attaching multiple independent annotations."

"Developers who define new metadata frameworks SHOULD choose explicit and
unambiguous mechanisms for associating objects with their frameworks.
Furthermore, they SHOULD consider that some users may wish to extend their
frameworks and should support that. For example, they could use Python 3000
overloaded functions, some form of registry, some kind of interface or some
unambiguously recognizable method signature protocol (e.g.
_pytypelib_type_check())."
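A minimal sketch of the conventions proposed above, written with the parameter-annotation syntax Python 3 eventually adopted (the function and its annotation values are invented purely for illustration):

```python
# Hypothetical function illustrating the proposed conventions:
# a string annotation is documentation; a list attaches several
# independent annotations to the same parameter.
def parse(data: "raw bytes from the wire",
          strict: ["reject unknown fields", bool] = True):
    pass

anns = parse.__annotations__
assert anns["data"] == "raw bytes from the wire"
assert anns["strict"] == ["reject unknown fields", bool]
```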

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060813/9edf62a9/attachment.htm 

From talin at acm.org  Sun Aug 13 22:08:10 2006
From: talin at acm.org (Talin)
Date: Sun, 13 Aug 2006 13:08:10 -0700
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <20060813102036.1985.JCARLSON@uci.edu>
References: <44DF0D38.6070507@acm.org> <20060813102036.1985.JCARLSON@uci.edu>
Message-ID: <44DF86AA.7050207@acm.org>

Josiah Carlson wrote:
> Talin <talin at acm.org> wrote:
>> One of the items in PEP 3100 is getting rid of unbound methods. I want 
>> to explore a heretical notion, which is getting rid of bound methods as 
>> well.
>>
>> Now, to be honest, I rather like bound methods. I like being able to 
>> capture a method call, store it in a variable, and call it later.
>>
>> However, I also realize that requiring every access to a class variable 
>> to instantiate a new method object is expensive, to say the least.
> 
> Well, it's up-front vs. at-access.  For instances whose methods are
> generally used rarely, the up-front cost of instantiating every method
> is high in comparison (unless there are a relatively large number of
> method accesses), and technically infinite if applied to all objects.
> Why?
> 
> I have a class foo, I instantiate foo, now all of foo's methods get
> instantiated.  Ahh, but foo's methods are also instances of function. It
> doesn't really have any new methods on foo's methods, but they do have
> attributes that are instances, so we will need to instantiate all of the
> methods' attributes' methods, and recursively, to infinity.  The
> non-creation of instantiated methods for objects is a lazy-evaluation
> technique to prevent infinite recursion, in general.
> 
> On the other hand, it may make sense to offer a metaclass and/or
> decorator that signals that a single method instance should be created
> for particular methods up-front, rather than at-access to those methods.
> But what kind of difference could we expect?  42%/28% improvement for
> class methods/object methods in 2.4 respectively, and 45%/26%
improvement in 2.5 beta.  This does not include actually calling the
> methods.

No, I wasn't proposing that methods be bound up front...read on.

>> Now, one remaining problem to be solved is whether or not to pass 'self' 
>> as an argument to the resulting callable. I suppose that could be 
>> handled by inspecting the attributes of the callable and adding the 
>> extra 'self' argument at the last minute if its not a static method. I 
>> suspect such tests would be relatively fast, much less than the time 
>> needed to instantiate and initialize a new method object.
> 
> I think that a change that required calls of the form
> obj.instancemethod(obj, ...) would be a non-starter.

Yes, that's a non-starter, but that's not what I was proposing either.

I see that I left an important piece out of my proposal, which I'll need 
to explain.

Right now, when you say: 'obj.instancemethod()', there are in fact two 
distinct operations going on. The first is the lookup of the attribute 
'instancemethod', and the second is the invoking of the resulting callable.

In order to get rid of the creation of method objects, the compiler 
would have to recognize these two operations and combine them into a 
single "call method" opcode - one which looks up the attribute, but 
leaves the original object pointer on the stack, and then invokes the 
resulting callable, along with the object pointer.

So essentially the 'bind' operation is moved into the method invocation 
code - which eliminates the need to create a holding object to remember 
the binding information.

Hmmmm....I wonder if it could be made to work in a 
backwards-compatible way. In other words, suppose the existing logic of 
creating a method object were left in place, however the 
'obj.instancemethod()' pattern would bypass all of that. In other words, 
the compiler would note the combination of the attribute access and the 
call, and combine them into an opcode that skips the whole method 
creation step. (Maybe it already does this and I'm just being stupid.)
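The two distinct operations are easy to observe in Python itself; a fused call-method operation would skip the intermediate bound-method object entirely. A sketch of the idea, not of any actual implementation:

```python
class Foo:
    def bar(self):
        return 42

obj = Foo()

# Step 1: attribute lookup.  The function's __get__ creates a fresh
# bound-method object on every access.
m1 = obj.bar
m2 = obj.bar
assert m1 is not m2   # two distinct method objects...
assert m1 == m2       # ...wrapping the same function and instance

# Step 2: the call itself.
assert m1() == 42

# A fused "call method" operation amounts to looking the function up
# on the class and passing the instance explicitly, with no
# intermediate method object created:
assert Foo.__dict__['bar'](obj) == 42
```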

> I'm -1 for instantiating all methods (for the infinite cost reasons),
> and -1 for int, long, list, tuple, dict, float (method access is
> generally limited for these objects).  I'm +0 for offering a suitable
> metaclass and/or decorator, but believe it would be better suited for
> the Python cookbook, as the performance improvement is significantly
> smaller once the cost of actually calling the methods is taken into account.

Thanks for the timing information by the way.

From thomas at python.org  Sun Aug 13 23:22:32 2006
From: thomas at python.org (Thomas Wouters)
Date: Sun, 13 Aug 2006 23:22:32 +0200
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <44DF86AA.7050207@acm.org>
References: <44DF0D38.6070507@acm.org> <20060813102036.1985.JCARLSON@uci.edu>
	<44DF86AA.7050207@acm.org>
Message-ID: <9e804ac0608131422w3bd95d57gb1c195e16dc1f9bd@mail.gmail.com>

On 8/13/06, Talin <talin at acm.org> wrote:

> Hmmmm....I wonder if it could be made to work in a
> backwards-compatible way. In other words, suppose the existing logic of
> creating a method object were left in place, however the
> 'obj.instancemethod()' pattern would bypass all of that. In other words,
> the compiler would note the combination of the attribute access and the
> call, and combine them into an opcode that skips the whole method
> creation step. (Maybe it already does this and I'm just being stupid.)


Been there, done that, bought the T-shirt (well, it was just a PyCon-1
T-shirt):

http://sourceforge.net/tracker/index.php?func=detail&aid=709744&group_id=5470&atid=305470

Back then, the end result of that particular change was very tiny, and it
wasn't even taking new-style classes into account (which would have made it
more complex). It may be worth re-trying anyway, especially for python-3000:
no classic classes to worry about. And quite a lot has changed in the
compiler and opcode dispatcher in the mean time. I am completely -1 on
getting rid of bound methods, though.

-- 
Thomas Wouters <thomas at python.org>

Hi! I'm a .signature virus! copy me into your .signature file to help me
spread!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060813/5d70b8e9/attachment.html 

From ark-mlist at att.net  Mon Aug 14 00:47:31 2006
From: ark-mlist at att.net (Andrew Koenig)
Date: Sun, 13 Aug 2006 18:47:31 -0400
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <44DF0D38.6070507@acm.org>
Message-ID: <000901c6bf2a$76781270$6402a8c0@arkdesktop>

> However, I also realize that requiring every access to a class variable
> to instantiate a new method object is expensive, to say the least.

Why does every access to a class variable have to instantiate a new method
object?



From tomerfiliba at gmail.com  Mon Aug 14 01:03:06 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Mon, 14 Aug 2006 01:03:06 +0200
Subject: [Python-3000]  Bound and unbound methods
Message-ID: <1d85506f0608131603u39be2727ie0b2f15db3dee69f@mail.gmail.com>

[Josiah]
> I'm -1 for instantiating all methods (for the infinite cost reasons),
> and -1 for int, long, list, tuple, dict, float (method access is
> generally limited for these objects).  I'm +0 for offering a suitable
> metaclass and/or decorator, but believe it would be better suited for
> the Python cookbook, as the performance improvement is significantly
> smaller once the cost of actually calling the methods is taken into account.

http://sebulba.wikispaces.com/receip+prebound

i'm sorry, i just love descriptors too much. it kept me out of bed,
until i wrote it down :)



-tomer

From ncoghlan at gmail.com  Mon Aug 14 03:40:43 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 14 Aug 2006 11:40:43 +1000
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>	<20060812205512.197A.JCARLSON@uci.edu>	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
Message-ID: <44DFD49B.4030308@gmail.com>

Paul Prescod wrote:
> If we get past the meta-discussion, I don't really see any disagreement 
> left. I'll grit my teeth and avoid commenting on the meta-discussion. ;)

Ah, so I'm not the only one doing that then };>

> My proposed text for the PEP is as follows:

Generally +1, except for this bit:

> "In Python 3000, semantics will be attached to the following types: 
> objects of type string (or subtype of string) are to be used for 
> documentation (though they are not necessarily the exclusive source of 
> documentation about the type). Objects of type list (or subtype of list) 
> are to be used for attaching multiple independent annotations."

Interpretations of string & list subtypes should be up to whoever creates 
those subtypes - it's only the builtins themselves that python-dev should be 
the authority for.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Mon Aug 14 04:27:57 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 14 Aug 2006 12:27:57 +1000
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <44DF0D38.6070507@acm.org>
References: <44DF0D38.6070507@acm.org>
Message-ID: <44DFDFAD.3010408@gmail.com>

Talin wrote:
> Anyway, I just wanted to throw that out there. Feel free to -1 away... :)

Based on the later discussion, I see two interesting possibilities:

1. A special CALL_METHOD opcode that the compiler emits when it spots the 
".NAME(ARGS)" pattern. This could simply be an optimisation performed by the 
bytecode emitter when processing an AST Call node with an Attribute node as 
the "func" subnode (it would need to poke around inside the Attribute node, 
rather than generating the Attribute node's code normally, though). For 
functions, this opcode could bypass __get__ and invoke __call__ directly with 
the right arguments. Put the actual optimisation into PyObject_CallMethod and 
call that from the new opcode, and more than just the eval loop would benefit.

This could also be done by the addition of a MethodCall AST node, and an 
AST->AST optimizing pass that took the Call+Attribute node and merged them 
into a single MethodCall node (The concrete parser can't look far enough ahead 
to figure out that a given attribute access is part of a method call).

Option 1 is focused on the speedup Talin mentioned. Aside from the downside of 
additional complexity in the code generation phase, I don't see any real 
downside - __get__ will only be bypassed when the interpreter *knows* what the 
descriptor would do.

2. Rewrite the __get__ methods on functions, classmethod and staticmethod to 
cache the resulting method object in the class dictionary or instance 
dictionary. This would entail making method objects descriptors that returned 
a bound copy of themselves when retrieved through an instance. That way, for 
methods that are never called, the method objects are never created, but for 
methods that are used, the method object is created only once. Something would 
need to be done to make this work for objects without an instance dictionary 
- those could either continue to not cache their instance methods, they could 
have a lazily initialized __dict__ pointer that is instantiated the first time 
it is needed instead of no dict at all (yay, attributes on object() 
instances!), or else there could be an id() keyed cache internal to the 
interpreter.

I personally would favour the option of making __dict__ available by default 
(i.e. put that behaviour in object), with no caching occurring if the object 
had no __dict__ attribute at all. Tuples and the numeric types could continue 
not to support attributes (as allocating space for an extra pointer would be a 
big size increase for them in their general usage pattern, and they don't 
generally have methods that are called from Python), while the other builtin 
types would acquire a usable __dict__ attribute (which may not be instantiated 
until the first time it is needed, although if instance methods get cached, it 
would be needed most of the time, so the extra complexity of lazy 
initialization may not be worth it).
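The core mechanism of option 2 can be sketched as a decorator-descriptor for ordinary classes that have a __dict__ (the name `cached_method` is made up here; this is an illustration of the caching idea, not the proposed interpreter change itself):

```python
class cached_method:
    """Non-data descriptor: on first access through an instance,
    store the bound method in the instance dict.  Instance
    attributes shadow non-data descriptors, so later lookups
    skip __get__ entirely and reuse the one method object."""
    def __init__(self, func):
        self.func = func

    def __get__(self, obj, objtype=None):
        if obj is None:
            return self.func
        bound = self.func.__get__(obj, objtype)
        obj.__dict__[self.func.__name__] = bound
        return bound

class Widget:
    @cached_method
    def ping(self):
        return "pong"

w = Widget()
assert w.ping is w.ping    # the same method object on every access
assert w.ping() == "pong"
assert 'ping' in w.__dict__  # cached on first use, never before
```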

The interesting benefit of option 2 is that "assert list.append is 
list.append" would now succeed, as would "s = []; assert s.append is 
s.append". "assert [].index is [].index" would still fail though, as different 
instances would get their own bound methods.

The downside of option 2 is that it is slightly more likely to break stuff due 
to the changes in semantics, and that it is a case of a genuine space-speed 
tradeoff - this approach *will* use more memory than the current approach, 
because bound method objects are always allocated permanently instead of being 
ephemeral things.

OTOH, if you did both option 1 and option 2, the caching would occur only if 
you retrieved a method without calling it immediately, and be bypassed most of 
the time.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From greg.ewing at canterbury.ac.nz  Mon Aug 14 04:31:46 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 14 Aug 2006 14:31:46 +1200
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <44DF86AA.7050207@acm.org>
References: <44DF0D38.6070507@acm.org> <20060813102036.1985.JCARLSON@uci.edu>
	<44DF86AA.7050207@acm.org>
Message-ID: <44DFE092.8030604@canterbury.ac.nz>

Talin wrote:
> the compiler would note the combination of the attribute access and the 
> call, and combine them into an opcode that skips the whole method 
> creation step.

Something like that could probably be made to work. You'd
want to be careful to do the optimisation only when the
attribute in question is an ordinary attribute, not
a property or other descriptor.
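A tiny illustration of why the optimisation has to be conservative: if a fused call-method operation naively skipped the descriptor protocol, a property returning a callable would behave differently (both classes here are invented for illustration):

```python
class Plain:
    def greet(self):
        return "hi"

class Tricky:
    @property
    def greet(self):
        # The attribute access itself does work and returns a fresh
        # callable; a fused opcode that bypassed this property's
        # __get__ would change the program's behaviour.
        return lambda: "hi from property"

assert Plain().greet() == "hi"
assert Tricky().greet() == "hi from property"
```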

I'm also -1 on eliminating bound methods entirely.
I worked through that idea in considerable depth during my
discussions with the author of Prothon, which was also to
have been without any notion of bound methods. The
consequences are further-reaching than you might think at
first. The bottom line is that without bound methods,
Python wouldn't really be Python any more.

--
Greg

From greg.ewing at canterbury.ac.nz  Mon Aug 14 05:22:10 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 14 Aug 2006 15:22:10 +1200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
Message-ID: <44DFEC62.6000904@canterbury.ac.nz>

Phillip J. Eby wrote:

> Since many people seem to be unfamiliar with overloaded functions, I would 
> just like to take this opportunity to remind you that the actual overload 
> mechanism is irrelevant.

I don't think it's the concept of overloadable functions
that people are having trouble with here, but that you
haven't clearly explained *how* they would be applied
to solving this particular problem.

You seem to think the answer to that is so obvious
that it doesn't need mentioning, but we're not all
up to the same mental speed as you on this.

Perhaps you could provide a complete worked-out
example for people to look at?

--
Greg

From greg.ewing at canterbury.ac.nz  Mon Aug 14 05:22:22 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 14 Aug 2006 15:22:22 +1200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
Message-ID: <44DFEC6E.8020603@canterbury.ac.nz>

Phillip J. Eby wrote:

> Not at all.  A and B need only use overloadable functions, and the problem 
> is trivially resolved by adding overloads.  The author of C can add an 
> overload to "A" that will handle objects with 'next' attributes, or add one 
> to "B" that handles tuples, or both.

Phillip, you still haven't explained what to do if
the code processing the annotations is in a separate
program altogether, to which the user has no access
in order to overload methods or perform other such
modifications.

--
Greg

From pje at telecommunity.com  Mon Aug 14 06:21:27 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Mon, 14 Aug 2006 00:21:27 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DFEC6E.8020603@canterbury.ac.nz>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060814002014.02dbe9d0@sparrow.telecommunity.com>

At 03:22 PM 8/14/2006 +1200, Greg Ewing wrote:
>Phillip J. Eby wrote:
>
>>Not at all.  A and B need only use overloadable functions, and the 
>>problem is trivially resolved by adding overloads.  The author of C can 
>>add an overload to "A" that will handle objects with 'next' attributes, 
>>or add one to "B" that handles tuples, or both.
>
>Phillip, you still haven't explained what to do if
>the code processing the annotations is in a separate
>program altogether, to which the user has no access
>in order to overload methods or perform other such
>modifications.

It can't be a "separate program altogether", since to get at the 
annotations, the program must import the module that contains them.  Thus, 
the registration need only occur in some module imported by the module that 
uses the annotations.


From pje at telecommunity.com  Mon Aug 14 06:52:41 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Mon, 14 Aug 2006 00:52:41 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44DFEC62.6000904@canterbury.ac.nz>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060814002138.02909ad0@sparrow.telecommunity.com>

At 03:22 PM 8/14/2006 +1200, Greg Ewing wrote:
>Phillip J. Eby wrote:
>>Since many people seem to be unfamiliar with overloaded functions, I 
>>would just like to take this opportunity to remind you that the actual 
>>overload mechanism is irrelevant.
>
>I don't think it's the concept of overloadable functions
>that people are having trouble with here, but that you
>haven't clearly explained *how* they would be applied
>to solving this particular problem.

In the same way that plain old standard Python duck typing would be 
used.  The only differences between overloaded functions and duck typing 
are that:

1. Overloaded functions can't accidentally collide, the way names chosen 
for duck typing can.

2. Third parties can declare overloaded methods without monkeypatching, but 
duck typing requires that you be the author of the object in question or 
that you be able to monkeypatch the type to add methods.

3. You can usually define some default behavior for an unrecognized type - 
as though you could add methods to the 'object' type.

4. Overloaded functions can dispatch on more than one type at the same 
time, or do other things, depending on their implementation.

Aside from these extra features of overloaded functions, there isn't much 
difference between overloading and duck typing; it's merely the difference 
between:

       someOb.quack()

and:

       quack(someOb)

So, if you can imagine handling annotations using duck typing and 
hasattr(), then you can imagine doing it with overloaded functions.  If you 
can't imagine using duck typing or hasattr() to process some annotations 
and ignore the ones you don't understand, then I don't really know how I 
would explain it.
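For readers who want something concrete: functools.singledispatch, which Python later grew, behaves like a basic overloaded function in exactly this sense. A sketch of annotation processing in that style (every name below is invented for illustration):

```python
from functools import singledispatch

@singledispatch
def process_annotation(ann):
    # Default for unrecognised types: ignore them, as a
    # well-behaved annotation consumer should.
    return None

@process_annotation.register(str)
def _(ann):
    return ("doc", ann)

@process_annotation.register(list)
def _(ann):
    # A list bundles several independent annotations.
    return [process_annotation(a) for a in ann]

# Third parties can register handlers for their own types without
# monkeypatching anything -- the point being made above.
assert process_annotation("hello") == ("doc", "hello")
assert process_annotation(3.14) is None
assert process_annotation(["a", 1]) == [("doc", "a"), None]
```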


>You seem to think the answer to that is so obvious
>that it doesn't need mentioning, but we're not all
>up to the same mental speed as you on this.
>
>Perhaps you could provide a complete worked-out
>example for people to look at?

I did - the PEAK documentation links I gave previously included a doctest 
that walked through the definition of a 'Message()' attribute annotation 
that prints a message at class definition (or other metadata definition) 
time.  The other two links showed examples of using attribute annotations 
for declaring security permissions and command-line options.

Some people said they didn't "get" anything from those links, but I'm at 
somewhat of a loss to understand why.  The examples there are very short 
and simple; in fact the complete Message implementation, including imports 
and overload declarations is only *6 lines long*.

So, my only guess is that the people who looked at that skimmed right past 
it, looking for something more complicated!  They probably then proceeded 
to the rest of the documentation and got bogged down in other aspects of 
the framework that aren't related to this discussion.

Therefore, if anybody would like to provide an example of how *they* would 
write code for some function attribute scenario, I'll happily modify it to 
demonstrate tell-don't-ask with either duck typing, adaptation, 
overloading, or whatever you like.  But from a communication POV, it 
doesn't make sense to me to try and write an example, since it's going to 
come from *my* worldview (in which this is a trivial problem) and not the 
worldview of the people who don't understand it.

It seems to me that the right way to proceed is to have somebody provide an 
example in *their* worldview, so that when I alter it they will have a 
reference point for what I'm talking about.  (Notice that this seemed to 
work well for Josiah and Paul when I reworked Paul's example.)


From theller at python.net  Mon Aug 14 16:55:24 2006
From: theller at python.net (Thomas Heller)
Date: Mon, 14 Aug 2006 16:55:24 +0200
Subject: [Python-3000] threading, part 2 --- + a bit of ctypes FFI worry
In-Reply-To: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
References: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
Message-ID: <ebq2ss$qhf$1@sea.gmane.org>

Tim Peters schrieb:
> [Josiah Carlson]
>> ...
>> Python 2.3.5 (#62, Feb  8 2005, 16:23:02) [MSC v.1200 32 bit (Intel)] on win32
>> Type "help", "copyright", "credits" or "license" for more information.
>> >>> import ctypes
>> >>> import threading
>> >>> import time
>> >>> def foo():
>> ...     try:
>> ...             while 1:
>> ...                     time.sleep(.01)
>> ...     finally:
>> ...             print "I quit!"
>> ...
>> >>> x = threading.Thread(target=foo)
>> >>> x.start()
>> >>> for i,j in threading._active.items():
>> ...     if j is x:
>> ...             break
>> ...
>> >>> ctypes.pythonapi.PyThreadState_SetAsyncExc(i, ctypes.py_object(Exception))
> 
> As I discovered to my chagrin when I added a similar test to the test
> suite a few days ago, that's got a subtle error on most 64-bit boxes.
> When the ctypes docs talk about passing and returning integers, they
> never explain what "integers" /means/, but it seems the docs
> implicitly have a 32-bit-only view of the world here.  In reality
> "integer" seems to mean the native C `int` type.

'ctypes.c_int' and 'ctypes.c_long' correspond to the C 'int' and 'long' types.
If you think that the docs could be clearer, please suggest changes.

> But a Python thread
> id is a native C `long` (== a Python short integer), and the code
> above fails in a baffling way on most 64-bit boxes:  the call returns
> 0 instead; i.e. the thread id isn't found, and no exception gets set.
> So I believe that needs to be:
> 
>     ctypes.pythonapi.PyThreadState_SetAsyncExc(
>         ctypes.c_long(i),
>         ctypes.py_object(Exception))
> 
> to make it portable.

Right.  A little bit more safety might be gained by setting the argtypes attribute
of the PyThreadState_SetAsyncExc function in this way:

ctypes.pythonapi.PyThreadState_SetAsyncExc.argtypes = ctypes.c_long, ctypes.py_object

This way the wrapping of arguments is automatic.
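Putting Tim's portability fix and the argtypes suggestion together, a complete sketch that declares the signature once and then raises an exception in a worker thread. This is adapted to modern Python, where thread ids are unsigned, so the exact argtype is an assumption that depends on the interpreter version:

```python
import ctypes
import threading
import time

set_async_exc = ctypes.pythonapi.PyThreadState_SetAsyncExc
# Declaring argtypes makes ctypes convert the Python thread id
# correctly even on 64-bit platforms -- the bug discussed above.
# Current CPython takes an unsigned long for the thread id.
set_async_exc.argtypes = (ctypes.c_ulong, ctypes.py_object)
set_async_exc.restype = ctypes.c_int

finished = []

def worker():
    try:
        while True:
            time.sleep(0.01)
    except KeyboardInterrupt:
        # The asynchronously raised exception lands here.
        finished.append(True)

t = threading.Thread(target=worker)
t.start()
time.sleep(0.05)
n = set_async_exc(t.ident, ctypes.py_object(KeyboardInterrupt))
t.join()
assert n == 1               # exactly one thread state matched the id
assert finished == [True]   # the worker saw the exception and exited
```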

> It's unclear to me how to write portable ctypes code in the presence
> of a gazillion integer typedefs and #defines, such as for Py_ssize_t.
> That doesn't map to a fixed C integral type cross-platform, so what
> can you do?  You're not required to answer that ;-)

This would probably have to be exported from the C code.  Currently ctypes has
the basic (integer) types c_byte, c_short, c_int, c_long, c_longlong, plus
their unsigned variants.  On 32-bit platforms, c_int is an alias to c_long.

Sized ints are defined: c_int8, c_int16, c_int32, c_int64, (plus the unsigned
variants again), also as aliases to the 10 basic integer types.

It *should* be possible, with some checks, to find out the size
of Py_ssize_t at runtime (unless it is a configurable option)...
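One such runtime check, under the assumption that on CPython Py_ssize_t has pointer width (later ctypes versions added ctypes.c_ssize_t directly, making this unnecessary):

```python
import ctypes
import struct

# struct.calcsize("P") is the size of a pointer; on CPython,
# Py_ssize_t is the signed integer type of the same width, so it
# selects the matching sized alias at runtime.
c_py_ssize_t = {4: ctypes.c_int32, 8: ctypes.c_int64}[struct.calcsize("P")]

assert ctypes.sizeof(c_py_ssize_t) == struct.calcsize("P")
```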

> Thread ids may bite us someday too.  Python casts the platform's
> notion of a thread id to C `long`, but there's no guarantee this won't
> lose information (or is even legal) on all platforms.  We'd probably
> be safer casting to, e.g., Py_uintptr_t (some thread implementions
> return an index into a kernel or library thread-info table, but at
> least some in my lifetime returned a pointer to a thread-info struct,
> and that's definitely fatter than C `long` on some boxes).
> 
>> 1
>> >>> I quit!
>> Exception in thread Thread-2:Traceback (most recent call last):
>>   File "C:\python23\lib\threading.py", line 442, in __bootstrap
>>     self.run()
>>   File "C:\python23\lib\threading.py", line 422, in run
>>     self.__target(*self.__args, **self.__kwargs)
>>   File "<stdin>", line 4, in foo
>> Exception
> 
> It's really cool that you can do this from ctypes, eh?  That's exactly
> the right level of abstraction for this attractive nuisance too ;-)

;-)

Thomas


From guido at python.org  Mon Aug 14 17:31:31 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 08:31:31 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
	<ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>
Message-ID: <ca471dc20608140831g453e1b7emf6fee0f2c14c71b3@mail.gmail.com>

After thinking about it some more, IMO for most purposes ctypes is
really quite sub-optimal. I think it would make more sense to work on
Parrot support for Python. Sure, in the short term  ctypes is more
practical than Parrot -- in its most recent incarnation, the latter
doesn't even list Python as a supported language -- a regression from
last year when Python support was among the best. But in the long
term, Parrot (like .NET or Jython do in other contexts) offers
cross-language interoperability, and perhaps even (like .NET and
Jython) automatic generation of wrappers.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From exarkun at divmod.com  Mon Aug 14 17:33:48 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Mon, 14 Aug 2006 11:33:48 -0400
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <ca471dc20608140831g453e1b7emf6fee0f2c14c71b3@mail.gmail.com>
Message-ID: <20060814153348.1717.1313126828.divmod.quotient.22734@ohm>

On Mon, 14 Aug 2006 08:31:31 -0700, Guido van Rossum <guido at python.org> wrote:
>After thinking about it some more, IMO for most purposes ctypes is
>really quite sub-optimal. I think it would make more sense to work on
>Parrot support for Python. Sure, in the short term  ctypes is more
>practical than Parrot -- in its most recent incarnation, the latter
>doesn't even list Python as a supported language -- a regression from
>last year when Python support was among the best. But in the long
>term, Parrot (like .NET or Jython do in other contexts) offers
>cross-language interoperability, and perhaps even (like .NET and
>Jython) automatic generation of wrappers.
>

This is a joke, right?

Jean-Paul

From guido at python.org  Mon Aug 14 18:09:49 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 09:09:49 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <20060814153348.1717.1313126828.divmod.quotient.22734@ohm>
References: <ca471dc20608140831g453e1b7emf6fee0f2c14c71b3@mail.gmail.com>
	<20060814153348.1717.1313126828.divmod.quotient.22734@ohm>
Message-ID: <ca471dc20608140909o730ab1e0i86c6d562cfa90abd@mail.gmail.com>

No. Why would it be a joke? Because it's a Perl thing? Because it
doesn't acknowledge Python's obvious supremacy in the universe of
languages? Because it admits that other projects sometimes have good
ideas? Because it's a good idea to have to write separate wrappers
around every useful library for each dynamic language separately?
Because Parrot isn't real? IMO it's pretty real already -- the 0.4.6
release supports Ruby, Javascript, Tcl, and a bunch more (possibly
even Perl 6 :-). I wouldn't be surprised if Parrot reached maturity
around the same time as Py3k.

--Guido

On 8/14/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
> On Mon, 14 Aug 2006 08:31:31 -0700, Guido van Rossum <guido at python.org> wrote:
> >After thinking about it some more, IMO for most purposes ctypes is
> >really quite sub-optimal. I think it would make more sense to work on
> >Parrot support for Python. Sure, in the short term  ctypes is more
> >practical than Parrot -- in its most recent incarnation, the latter
> >doesn't even list Python as a supported language -- a regression from
> >last year when Python support was among the best. But in the long
> >term, Parrot (like .NET or Jython do in other contexts) offers
> >cross-language interoperability, and perhaps even (like .NET and
> >Jython) automatic generation of wrappers.
> >
>
> This is a joke, right?
>
> Jean-Paul
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From exarkun at divmod.com  Mon Aug 14 19:20:00 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Mon, 14 Aug 2006 13:20:00 -0400
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <ca471dc20608140909o730ab1e0i86c6d562cfa90abd@mail.gmail.com>
Message-ID: <20060814172000.1717.863905740.divmod.quotient.22821@ohm>

On Mon, 14 Aug 2006 09:09:49 -0700, Guido van Rossum <guido at python.org> wrote:
>On 8/14/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
>>On Mon, 14 Aug 2006 08:31:31 -0700, Guido van Rossum <guido at python.org> 
>>wrote:
>> >After thinking about it some more, IMO for most purposes ctypes is
>> >really quite sub-optimal. I think it would make more sense to work on
>> >Parrot support for Python. Sure, in the short term  ctypes is more
>> >practical than Parrot -- in its most recent incarnation, the latter
>> >doesn't even list Python as a supported language -- a regression from
>> >last year when Python support was among the best. But in the long
>> >term, Parrot (like .NET or Jython do in other contexts) offers
>> >cross-language interoperability, and perhaps even (like .NET and
>> >Jython) automatic generation of wrappers.
>> >
>>
>>This is a joke, right?
>>
>No. Why would it be a joke? Because it's a Perl thing? Because it
>doesn't acknowledge Python's obvious supremacy in the universe of
>languages? Because it admits that other projects sometimes have good
>ideas?

Heh.  Strawmen, all.  I assure you, none of these objections ever entered
my mind.

>Because it's a good idea to have to write separate wrappers
>around every useful library for each dynamic language separately?

If a project has done this successfully, I don't think I've seen it.  Can
you point out some examples where this has been accomplished in a useful
form?  The nearest thing I can think of is SWIG, which is basically a
failure.

This is not to say that it is not a noble goal, but I think it remains to
be shown that Parrot is actually a solution here.

>Because Parrot isn't real? IMO it's pretty real already -- the 0.4.6
>release supports Ruby, Javascript, Tcl, and a bunch more (possibly
>even Perl 6 :-). I wouldn't be surprised if Parrot reached maturity
>around the same time as Py3k.
>

Parrot has been around for quite a while now without accomplishing anything
much of practical value.  Does anyone _use_ it for Ruby, JavaScript, or Tcl?
(I know no one uses it for Perl 6 ;)

For five years of development by a pretty large community, that's not showing
a lot.  The reason I suspected a joke is that you seem to want to discard a
fairly good existing widely used solution in favor of one that's just vapor
right now.  Granted Py3k is a ways off, but it's not /that/ far off.  We're
talking about a year or two here.  Is Parrot going to be as solid in a year
as ctypes already is?  I doubt it.

If you /really/ want to look outside of the Python community for solutions
here, the lisp community has thought about this for a long time.  Instead of
looking at Parrot, you should look at the ffi provided by almost any lisp
runtime.

Jean-Paul

From guido at python.org  Mon Aug 14 19:38:25 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 10:38:25 -0700
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>
Message-ID: <ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>

Not remembering the PEP in detail, I agree with Jim's resolution of all these.

I guess the right rule is that all positional arguments come first
(first the regular ones, then * or *args). Then come the keyword
arguments, again, first the regular ones (name=value), then **kwds.

I believe the PEP doesn't address the opposite use case: positional
arguments that should *not* be specified as keyword arguments. For
example, I might want to write

  def foo(a, b): ...

but I don't want callers to be able to call it as foo(b=1, a=2) or
even foo(a=2, b=1).

A realistic example is the write() method of file objects. We really
don't want people starting to say f.write(s="abc") because even if
that works for the current file type you're using, it won't work if an
instance of some other class implementing write() is substituted --
write() is always documented as an API taking a positional argument,
so different "compatible" classes are likely to have different
argument names. Currently this is enforced because the default file
type is implemented in C and it doesn't have keyword arguments; but in
Py3k it may well be implemented in Python and then we currently have
no decent way to say "this should really be a positional argument".
(There's an analogy to forcing keyword arguments using **, using *args
for all arguments and parsing that explicitly -- but that's tedious
for a fairly common use case.)

Perhaps we can use ** without following identifier to signal this?
It's not entirely analogous to * without following identifier, but at
least somewhat similar.
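The tedious workaround alluded to above can be spelled out concretely (the names here are illustrative only, not from any PEP):

```python
# Forcing a positional-only argument today: accept *args and unpack by
# hand.  Tedious for such a common case, which is the point.
def write(*args):
    if len(args) != 1:
        raise TypeError("write() takes exactly one positional argument")
    (s,) = args
    return len(s)
```

With this, write(s="abc") fails with a TypeError instead of binding the keyword, because there is no named parameter to bind to.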

--Guido

On 8/12/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 8/11/06, Jiwon Seo <seojiwon at gmail.com> wrote:
> > When we have keyword-only arguments, do we allow 'keyword dictionary'
> > argument? If that's the case, where would we want to place
> > keyword-only arguments?
>
> > Are we going to allow any of followings?
>
> > 1. def foo(a, b,  *, key1=None, key2=None, **map)
>
> Seems perfectly reasonable.
>
> I think the controversy was over whether or not to allow keyword-only
> without a default.
>
> > 2. def foo(a, b, *,  **map, key1=None, key2=None)
>
> Seems backward, though I suppose we could adjust if we needed to.
>
> > 3. def foo(a, b, *, **map)
>
> What would the * even mean, since there aren't any named keywords to separate?
>
> -jJ
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From steven.bethard at gmail.com  Mon Aug 14 19:49:36 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Mon, 14 Aug 2006 11:49:36 -0600
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>
	<ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>
Message-ID: <d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>

On 8/14/06, Guido van Rossum <guido at python.org> wrote:
> I believe the PEP doesn't address the opposite use case: positional
> arguments that should *not* be specified as keyword arguments. For
> example, I might want to write
>
>   def foo(a, b): ...
>
> but I don't want callers to be able to call it as foo(b=1, a=2) or
> even foo(a=2, b=1).

Another use case is when you want to accept the arguments of another
callable, but you have your own positional arguments::

    >>> class Wrapper(object):
    ...     def __init__(self, func):
    ...         self.func = func
    ...     def __call__(self, *args, **kwargs):
    ...         print 'calling wrapped function'
    ...         return self.func(*args, **kwargs)
    ...
    >>> @Wrapper
    ... def func(self, other):
    ...     return self, other
    ...
    >>> func(other=1, self=2)
    Traceback (most recent call last):
      File "<interactive input>", line 1, in ?
    TypeError: __call__() got multiple values for keyword argument 'self'

It would be really nice in the example above to mark ``self`` in
``__call__`` as a positional only argument.
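One workaround that works today (a sketch, not something the PEP proposes) is to give ``__call__`` no named ``self`` at all and peel the instance off by hand:

```python
# Because __call__ declares no parameter named 'self', a caller's
# self= keyword lands harmlessly in **kwargs instead of colliding.
class Wrapper(object):
    def __init__(self, func):
        self.func = func
    def __call__(*args, **kwargs):
        wrapper, args = args[0], args[1:]   # peel off the instance by hand
        return wrapper.func(*args, **kwargs)

@Wrapper
def func(self, other):
    return self, other
```

Now ``func(other=1, self=2)`` returns ``(2, 1)`` rather than raising TypeError -- but having to contort ``__call__`` like this is exactly why a real positional-only marker would be nicer.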

> Perhaps we can use ** without following identifier to signal this?
> It's not entirely analogous to * without following identifier, but at
> least somewhat similar.

I'm certainly not opposed to going this way, but I don't think it
would solve the problem above since you still need to take keyword
arguments.

STeVe
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From guido at python.org  Mon Aug 14 20:04:19 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 11:04:19 -0700
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>
	<ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>
	<d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>
Message-ID: <ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>

On 8/14/06, Steven Bethard <steven.bethard at gmail.com> wrote:
> On 8/14/06, Guido van Rossum <guido at python.org> wrote:
> > I believe the PEP doesn't address the opposite use case: positional
> > arguments that should *not* be specified as keyword arguments. For
> > example, I might want to write
> >
> >   def foo(a, b): ...
> >
> > but I don't want callers to be able to call it as foo(b=1, a=2) or
> > even foo(a=2, b=1).
>
> Another use case is when you want to accept the arguments of another
> callable, but you have your own positional arguments::
>
>     >>> class Wrapper(object):
>     ...     def __init__(self, func):
>     ...         self.func = func
>     ...     def __call__(self, *args, **kwargs):
>     ...         print 'calling wrapped function'
>     ...         return self.func(*args, **kwargs)
>     ...
>     >>> @Wrapper
>     ... def func(self, other):
>     ...     return self, other
>     ...
>     >>> func(other=1, self=2)
>     Traceback (most recent call last):
>       File "<interactive input>", line 1, in ?
>     TypeError: __call__() got multiple values for keyword argument 'self'
>
> It would be really nice in the example above to mark ``self`` in
> ``__call__`` as a positional only argument.

But this is a rather unusual use case, isn't it? It's due to the bound
methods machinery. Do you have other use cases? I would assume that
normally such wrappers take their own control arguments in the form of
keyword-only arguments (that are unlikely to conflict with arguments
of the wrapped method).

> > Perhaps we can use ** without following identifier to signal this?
> > It's not entirely analogous to * without following identifier, but at
> > least somewhat similar.
>
> I'm certainly not opposed to going this way, but I don't think it
> would solve the problem above since you still need to take keyword
> arguments.

Can you elaborate?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Mon Aug 14 20:08:56 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 11:08:56 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <20060814172000.1717.863905740.divmod.quotient.22821@ohm>
References: <ca471dc20608140909o730ab1e0i86c6d562cfa90abd@mail.gmail.com>
	<20060814172000.1717.863905740.divmod.quotient.22821@ohm>
Message-ID: <ca471dc20608141108u118487ccw16cc8527c6f24744@mail.gmail.com>

On 8/14/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
> On Mon, 14 Aug 2006 09:09:49 -0700, Guido van Rossum <guido at python.org> wrote:
> >On 8/14/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
> >>This is a joke, right?
> >Because it's a good idea to have to write separate wrappers
> >around every useful library for each dynamic language separately?
>
> If a project has done this successfully, I don't think I've seen it.  Can
> you point out some examples where this has been accomplished in a useful
> form?  The nearest thing I can think of is SWIG, which is basically a
> failure.

SWIG is not my favorite (mostly because I don't like C++ much) but
it's used very effectively here at Google (for example); I wouldn't
dream of calling it a failure.

I also consider .NET's CLR a success, based on the testimony of Jim
Hugunin (who must be Microsoft's most reluctant employee :).

And I see the JVM as a successful case too -- Jython can link to
anything written in Java or compiled to JVM bytecode, and so can other
languages that use JVM introspection the same way as Jython (I hear
there's a Ruby analogue).

The major difference between all these examples and ctypes is that
ctypes has no way of introspecting the wrapped library; you have to
repeat everything you know about the API in your calls to ctypes (and
as was just shown in another thread about 64-bit issues, that's not
always easy).
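To make the contrast concrete, here is roughly what that repetition looks like (a POSIX-only sketch; ``CDLL(None)`` hands back the already-loaded C library):

```python
import ctypes

# The C signature must be restated by hand -- ctypes cannot introspect
# it from the shared library.  Guess c_long instead of c_size_t for the
# return type and a 64-bit platform can silently corrupt the result.
libc = ctypes.CDLL(None)
libc.strlen.argtypes = [ctypes.c_char_p]
libc.strlen.restype = ctypes.c_size_t
n = libc.strlen(b"hello")
```

Every client language doing its own version of these three declaration lines, for every function in every library, is the duplication at issue.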

> This is not to say that it is not a noble goal, but I think it remains to
> be shown that Parrot is actually a solution here.

Parrot definitely has to show itself still. But a year ago Sam Ruby
reported on his efforts of making Python work on Parrot, and he
sounded like it was a very feasible proposition.

> Parrot has been around for quite a while now without accomplishing anything
> much of practical value.  Does anyone _use_ it for Ruby, JavaScript, or Tcl?
> (I know no one uses it for Perl 6 ;)
>
> For five years of development by a pretty large community, that's not showing
> a lot.  The reason I suspected a joke is that you seem to want to discard a
> fairly good existing widely used solution in favor of one that's just vapor
> right now.  Granted Py3k is a ways off, but it's not /that/ far off.  We're
> talking about a year or two here.  Is Parrot going to be as solid in a year
> as ctypes already is?  I doubt it.

That's not exactly the point I am making. I find Parrot's approach,
assuming the project won't fail due to internal friction, much more
long-term viable than ctypes. The big difference being (I hope)
introspective generation of APIs rather than having to repeat the
linkage information in each client language.

> If you /really/ want to look outside of the Python community for solutions
> here, the lisp community has thought about this for a long time.  Instead of
> looking at Parrot, you should look at the ffi provided by almost any lisp
> runtime.

This seems a mostly theoretical viewpoint to me. Can you point me to
an example of a Python-like language that is successful in reusing a
Lisp runtime? (And I don't consider Lisp or Scheme Python-like in this
context. ;-)

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Mon Aug 14 20:13:56 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 11:13:56 -0700
Subject: [Python-3000] Python/C++ question
In-Reply-To: <ebn5ms$mne$1@sea.gmane.org>
References: <44DA6C01.2040904@acm.org>
	<ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>
	<44DF0800.4060204@acm.org> <ebn5ms$mne$1@sea.gmane.org>
Message-ID: <ca471dc20608141113y15e6ba9u3ea405905a0ca0ad@mail.gmail.com>

On 8/13/06, Georg Brandl <g.brandl at gmx.net> wrote:
> Talin wrote:
> > Guido van Rossum wrote:
> >> On 8/9/06, Talin <talin at acm.org> wrote:
> >> For the majority of Python developers it's probably the other way
> >> around. It's been 15 years since I wrote C++, and unlike C, that
> >> language has changed a lot since then...
> >>
> >> It would be a complete rewrite; I prefer doing a gradual
> >> transmogrification of the current codebase into Py3k rather than
> >> starting from scratch (read Joel Spolsky on why).
> >
> > BTW, Should this be added to PEP 3099?
>
> Yes, why not.

Although perhaps it makes more sense to add something positive to PEP 3000, e.g.

Implementation Language
=======================

Python 3000 will be implemented in C, and the implementation will be
derived as an evolution of the Python 2 code base. This reflects my
views (which I share with Joel Spolsky) on the dangers of complete
rewrites. Since Python 3000 as a language is a relatively mild
improvement on Python 2, we can gain a lot by not attempting to
reimplement the language from scratch. I am not against parallel
from-scratch implementation efforts, but my own efforts will be
directed at the language and implementation that I know best.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Mon Aug 14 20:17:14 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 11:17:14 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <1d85506f0608111713m15cf2e67v8b94f06c928e9125@mail.gmail.com>
References: <1d85506f0608111713m15cf2e67v8b94f06c928e9125@mail.gmail.com>
Message-ID: <ca471dc20608141117l1c61247fy39bae2b00d45675d@mail.gmail.com>

On 8/11/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> i mailed this to several people separately, but then i thought it could
> benefit the entire group:
>
> http://sebulba.wikispaces.com/recipe+thread2
>
> it's an implementation of the proposed " thread.raise_exc", through an extension
> to the threading.Thread class. you can test it for yourself; if it proves useful,
> it should be exposed as thread.raise_exc in the stdlib (instead of the ctypes
> hack)... and of course it should be reflected in threading.Thread as well.

Cool. Question: what's the problem with raising exception instances?
Especially in the light of my proposal to use

  raise SomeException(42)

in preference over (and perhaps exclusively instead of)

  raise SomeException, 42

in Py3k. The latter IMO is a relic from the days of string exceptions
which are as numbered as they come. :-)
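For illustration, the instance form carries its value along with the exception, where the handler can get at it (sketched here with the Py3k ``except ... as`` spelling):

```python
# Raising an instance: the payload (42) travels with the exception
# and is available on .args in the handler.
try:
    raise ValueError(42)
except ValueError as e:
    caught = e.args
```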

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul at prescod.net  Mon Aug 14 20:40:04 2006
From: paul at prescod.net (Paul Prescod)
Date: Mon, 14 Aug 2006 11:40:04 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <ca471dc20608140831g453e1b7emf6fee0f2c14c71b3@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
	<ca471dc20608101050m231b618asc5c70181ed4078dc@mail.gmail.com>
	<ca471dc20608140831g453e1b7emf6fee0f2c14c71b3@mail.gmail.com>
Message-ID: <1cb725390608141140g480e0c66q6f1e74f32ad1e540@mail.gmail.com>

I guess I don't see ctypes and Parrot solving the same problem at all. My
idea with ctypes was the opposite of choosing a new runtime. It was to help
various runtimes (PyPy, Jython, IronPython, CPython 2.5, CPython 3.0,
Parrot, ...) to compete on their own merits (primarily performance and
interoperability) and not on the basis that they don't support some Python
library whether it be "crypt" or "pyopengl". It would also be nice to move
beyond the situation where everyone in the world must re-release their C
modules (no matter how trivial) every time Python goes through a minor
upgrade. Does Parrot solve these problems or exacerbate them?

Also, Parrot seems like a bit of a random choice considering the fact that
there are many candidates for a next-generation Python runtime: PyPy,
IronPython/mono, etc. They have both come much further, much quicker, than
Parrot. I'm a bit skeptical of the Parrot story after the Guile mess. It was
supposed to be a multi-language dynamic runtime as well. But that's a
digression. I don't think you're betting on any particular strategy, just
saying that we should watch Parrot and see how it turns out.

But anyhow, my original suggestion did not start with ctypes at all. From my
point of view, the goal is to express Pythonic constructs in Python (whether
using Ctypes, Pyrex, rctypes, or whaver) where possible rather than
expressing Pythonic constructs in C (PyErr_SetString, PyDict_SetItem, etc.).
Then each runtime can map the Pythonic constructs to their internal model
and use their native FFI strategy (JNI, P/Invoke, libffi) to handle the C
stuff. The actual details of the syntax do not matter to me (though they do
matter!). I also do not care whether it uses a compiler strategy like Pyrex
or a runtime model like ctypes, or a dual-mode strategy like
PyPy/extcompiler.

I accept the current limitations of this technique when it comes to
(especially) C++, and therefore don't promote it as a panacea.

Let me ask a question about our current status. If there were a requirement
to do a simple wrapper library like "crypt" or "getpasswd"...is there any
high level wrapping strategy that you would allow into the standard library?
A ctypes-based module? The C output of a Pyrex compiler? The output of SWIG?


Or is hand-written C code the only thing you want for extensions in the
Python library? Even if the answer is "hand-written C code" it might be nice
to have an explicit statement so that people know in advance. I propose that
if the developer can make the case that a ctypes-based library is more
maintainable than the C code would be, and performance is acceptable for the
problem domain, that the ctypes-based library be acceptable. Would you agree
that that small step is reasonable?

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060814/04bca409/attachment.htm 

From jimjjewett at gmail.com  Mon Aug 14 21:11:00 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Mon, 14 Aug 2006 15:11:00 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
Message-ID: <fb6fbf560608141211k528e754eva03e90612dff9ecd@mail.gmail.com>

On 8/13/06, Paul Prescod <paul at prescod.net> wrote:
> My proposed text for the PEP is as follows:

Mostly good.  A few remaining comments...

Should annotation objects with defined semantics have some standard
way to indicate this?  (By analogy, new exceptions *should* inherit
from Exception; should annotation objects inherit from an Annotation
class, at least as a mixin?)

> "This implies that the interpretation of built-in types would be controlled
> by Python's developers and documented in Python's documentation.

It also implies that the interpretation of annotations made with a
built-in type should be safe -- they shouldn't trigger any
irreversible actions.

>  "In Python 3000, semantics will be attached to the following types: objects
> of type string (or subtype of string) are to be used for documentation
> (though they are not necessarily the exclusive source of documentation about
> the type). Objects of type list (or subtype of list) are to be used for
> attaching multiple independent annotations."

subtypes should be available for other frameworks.

This implies that something other than lists should be used if the
annotations are not independent.  The obvious candidates are tuples
and dicts, but this should be explicit (or explicitly not defined).

The definition of a type as an annotation should probably be either
defined or explicitly undefined.  Earlier discussions talked about
things like


    def f(a: int, b: (float | Decimal), c: [int, str, X]) -> str

This implied that a type object would represent the type of the
argument (but would it be safe to call as an adapter?), that special
syntactic support might be added for type unions, and that the
"independent" part of the list specification should probably be
repeated at least in an example.  I'm not sure if these implications
*should* be true, but they're obvious enough to some people (and not
to others) that the decision should be explicit.
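A sketch of how such a signature might look and be inspected (the ``__annotations__`` attribute, the string annotation, and the list form are all assumptions here, not settled semantics):

```python
# Hypothetical annotated signature: a type object, a documentation
# string, a list of independent annotations, and a return annotation,
# all collected into a dict on the function object.
def f(a: int, b: "a float or Decimal", c: [int, str]) -> str:
    return str(a)

notes = f.__annotations__   # e.g. {'a': int, ..., 'return': str}
```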

-jJ

From jcarlson at uci.edu  Mon Aug 14 21:15:18 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Mon, 14 Aug 2006 12:15:18 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608141117l1c61247fy39bae2b00d45675d@mail.gmail.com>
References: <1d85506f0608111713m15cf2e67v8b94f06c928e9125@mail.gmail.com>
	<ca471dc20608141117l1c61247fy39bae2b00d45675d@mail.gmail.com>
Message-ID: <20060814121235.19A8.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> 
> On 8/11/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> > i mailed this to several people separately, but then i thought it could
> > benefit the entire group:
> >
> > http://sebulba.wikispaces.com/recipe+thread2
> >
> > it's an implementation of the proposed " thread.raise_exc", through an extension
> > to the threading.Thread class. you can test it for yourself; if it proves useful,
> > it should be exposed as thread.raise_exc in the stdlib (instead of the ctypes
> > hack)... and of course it should be reflected in threading.Thread as well.
> 
> Cool. Question: what's the problem with raising exception instances?
> Especially in the light of my proposal to use
> 
>   raise SomeException(42)
> 
> in preference over (and perhaps exclusively instead of)

The problem is that it is not implemented in the underlying CPython API
PyThreadState_SetAsyncExc function.

 - Josiah


From g.brandl at gmx.net  Mon Aug 14 21:12:50 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Mon, 14 Aug 2006 21:12:50 +0200
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608141117l1c61247fy39bae2b00d45675d@mail.gmail.com>
References: <1d85506f0608111713m15cf2e67v8b94f06c928e9125@mail.gmail.com>
	<ca471dc20608141117l1c61247fy39bae2b00d45675d@mail.gmail.com>
Message-ID: <ebqhvi$80m$1@sea.gmane.org>

Guido van Rossum wrote:
> On 8/11/06, tomer filiba <tomerfiliba at gmail.com> wrote:
>> i mailed this to several people separately, but then i thought it could
>> benefit the entire group:
>>
>> http://sebulba.wikispaces.com/recipe+thread2
>>
>> it's an implementation of the proposed " thread.raise_exc", through an extension
>> to the threading.Thread class. you can test it for yourself; if it proves useful,
>> it should be exposed as thread.raise_exc in the stdlib (instead of the ctypes
>> hack)... and of course it should be reflected in threading.Thread as well.
> 
> Cool. Question: what's the problem with raising exception instances?
> Especially in the light of my proposal to use
> 
>   raise SomeException(42)
> 
> in preference over (and perhaps exclusively instead of)
> 
>   raise SomeException, 42
> 
> in Py3k. The latter IMO is a relic from the days of string exceptions
> which are as numbered as they come. :-)

I think this is the answer:

http://mail.python.org/pipermail/python-dev/2006-August/068165.html

Georg


From g.brandl at gmx.net  Mon Aug 14 21:13:50 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Mon, 14 Aug 2006 21:13:50 +0200
Subject: [Python-3000] Python/C++ question
In-Reply-To: <ca471dc20608141113y15e6ba9u3ea405905a0ca0ad@mail.gmail.com>
References: <44DA6C01.2040904@acm.org>	<ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>	<44DF0800.4060204@acm.org>
	<ebn5ms$mne$1@sea.gmane.org>
	<ca471dc20608141113y15e6ba9u3ea405905a0ca0ad@mail.gmail.com>
Message-ID: <ebqi1f$80m$2@sea.gmane.org>

Guido van Rossum wrote:
> On 8/13/06, Georg Brandl <g.brandl at gmx.net> wrote:
>> Talin wrote:
>> > Guido van Rossum wrote:
>> >> On 8/9/06, Talin <talin at acm.org> wrote:
>> >> For the majority of Python developers it's probably the other way
>> >> around. It's been 15 years since I wrote C++, and unlike C, that
>> >> language has changed a lot since then...
>> >>
>> >> It would be a complete rewrite; I prefer doing a gradual
>> >> transmogrification of the current codebase into Py3k rather than
>> >> starting from scratch (read Joel Spolsky on why).
>> >
>> > BTW, Should this be added to PEP 3099?
>>
>> Yes, why not.
> 
> Although perhaps it makes more sense to add something positive to PEP 3000, e.g.
> 
> Implementation Language
> =======================
> 
> Python 3000 will be implemented in C, and the implementation will be
> derived as an evolution of the Python 2 code base. This reflects my
> views (which I share with Joel Spolsky) on the dangers of complete
> rewrites. Since Python 3000 as a language is a relatively mild
> improvement on Python 2, we can gain a lot by not attempting to
> reimplement the language from scratch. I am not against parallel
> from-scratch implementation efforts, but my own efforts will be
> directed at the language and implementation that I know best.

I had already added something to PEP 3099, but if you like that approach
better, I'll add that to PEP 3000.

Georg


From tim.peters at gmail.com  Mon Aug 14 21:15:30 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Mon, 14 Aug 2006 15:15:30 -0400
Subject: [Python-3000] threading, part 2
In-Reply-To: <ca471dc20608141117l1c61247fy39bae2b00d45675d@mail.gmail.com>
References: <1d85506f0608111713m15cf2e67v8b94f06c928e9125@mail.gmail.com>
	<ca471dc20608141117l1c61247fy39bae2b00d45675d@mail.gmail.com>
Message-ID: <1f7befae0608141215y72e827cfo4f541b7e5fe927a8@mail.gmail.com>

[tomer filiba]
>> i mailed this to several people separately, but then i thought it could
>> benefit the entire group:
>>
>> http://sebulba.wikispaces.com/recipe+thread2
>>
>> it's an implementation of the proposed "thread.raise_exc",
>> ...

[Guido]
> Cool. Question: what's the problem with raising exception instances?

See

    http://mail.python.org/pipermail/python-dev/2006-August/068165.html

Short course:  in ceval.c,

    x = tstate->async_exc;
    ...
    PyErr_SetNone(x);

That is, with the current code it's only possible to set the exception
type via PyThreadState_SetAsyncExc(); the exception value is forced to
None/NULL.  What was the intent ;-)?

Example:

"""
from time import sleep
import ctypes, thread, sys, threading

setexc = ctypes.pythonapi.PyThreadState_SetAsyncExc

f_done = threading.Event()
def f():
    try:
        while 1:
            sleep(1)
    finally:
        f_done.set()

tid = thread.start_new_thread(f, ())
exc = ValueError("13")

setexc(ctypes.c_long(tid),
       ctypes.py_object(exc))

f_done.wait()
"""

Output:

Unhandled exception in thread started by <function f at 0x009E8370>
Traceback (most recent call last):
  File "setexc.py", line 12, in f
    f_done.set()
  File "C:\Code\python\lib\threading.py", line 351, in set
    self.__cond.release()
SystemError: 'finally' pops bad exception


Change `exc` to, e.g.,

exc = ValueError

and then it's fine:

Unhandled exception in thread started by <function f at 0x009E8370>
Traceback (most recent call last):
  File "setexc.py", line 12, in f
    f_done.set()
  File "C:\Code\python\lib\threading.py", line 349, in set
    self.__cond.notifyAll()
  File "C:\Code\python\lib\threading.py", line 265, in notifyAll
    self.notify(len(self.__waiters))
  File "C:\Code\python\lib\threading.py", line 258, in notify
    waiter.release()
ValueError

From jimjjewett at gmail.com  Mon Aug 14 21:26:18 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Mon, 14 Aug 2006 15:26:18 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
Message-ID: <fb6fbf560608141226j7dd377e5pd361d9a340f70265@mail.gmail.com>

On 8/13/06, Phillip J. Eby <pje at telecommunity.com> wrote:

> However, it's only a problem if you insist on writing brain-damaged
> code.  If you want interoperability here, you must write tell-don't-ask
> code.  ... is it really the case that
> so many people don't know what tell-don't-ask code is or why you want
> it?  I guess maybe it's something that's only grasped by people who have
> experience writing code intended for interoperability.

> [Meanwhile, I'm not going to respond to the rest of your message, since it
> contained some things that appeared to me to be a mixture of ad hominem
> attack and straw man argument.  I hope that was not actually your intent.]

I did not intend to insult you.

My point is simply that what is obvious to you -- and even what is
obvious to almost anyone experienced enough to be reading this message
-- won't be obvious to everyone first starting out.

I want to be able to use a new programmer's first contribution.

I absolutely don't want to tell them "Great, but you really should
have used XYZ.  We didn't really make that explicit because
experienced folks tend to do it naturally."

-jJ

From exarkun at divmod.com  Mon Aug 14 21:34:25 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Mon, 14 Aug 2006 15:34:25 -0400
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <ca471dc20608141108u118487ccw16cc8527c6f24744@mail.gmail.com>
Message-ID: <20060814193425.1717.135462452.divmod.quotient.22922@ohm>

On Mon, 14 Aug 2006 11:08:56 -0700, Guido van Rossum <guido at python.org> wrote:
>On 8/14/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
>>On Mon, 14 Aug 2006 09:09:49 -0700, Guido van Rossum <guido at python.org> 
>>wrote:
>I also consider .NET's CLR a success, based on the testimony of Jim
>Hugunin (who must be Microsoft's most reluctant employee :).
>
>And I see the JVM as a successful case too -- Jython can link to
>anything written in Java or compiled to JVM bytecode, and so can other
>languages that use JVM introspection the same way as Jython (I hear
>there's a Ruby analogue).

These successes are necessarily limited in scope.  Jython can use any
Java library, and that's great, as far as it goes.  Clearly, though,
it isn't a complete solution.  Relying on Parrot to have a rich library
of wrapper modules seems ill-advised.  If it /already/ had a rich library,
then maybe it would seem more reasonable.

>
>The major difference between all these examples and ctypes is that
>ctypes has no way of introspecting the wrapped library; you have to
>repeat everything you know about the API in your calls to ctypes (and
>as was just shown in another thread about 64-bit issues, that's not
>always easy).

The codegenerator package which is closely related to ctypes is capable of
this as well.  PyPy has a complete ctypes-based OpenSSL wrapper which is
automatically generated.

>That's not exactly the point I am making. I find Parrot's approach,
>assuming the project won't fail due to internal friction, much more
>long-term viable than ctypes. The big difference being (I hope)
>introspective generation of APIs rather than having to repeat the
>linkage information in each client language.

Given the existence of codegenerator, do you still find Parrot's approach
more viable?  It seems to me that it easily levels the playing field, and
makes ctypes still more attractive than Parrot, since it side-steps the
not insignificant internal political issues with the Parrot team.

>This seems a mostly theoretical viewpoint to me. Can you point me to
>an example of a Python-like language that is successful in reusing a
>Lisp runtime? (And I don't consider Lisp or Scheme Python-like in this
>context. ;-)

PyPy has a Common Lisp backend.  It's not the primary target, but it's not
inconceivable that it could someday provide an ffi from a Common Lisp
runtime to Python programs.

There has also been work done on an IL backend for PyPy.  This could be
used to make any CLR library available to Python programs.

Of course, with those two examples in hand, we see a fundamental drawback
to the Parrot-style solution (of which these are both essentially examples).
What if I want to use the CL FFI at the same time as a library exposed via
.NET?  I'm out of luck.  Had the libraries I wanted both been wrapped with
ctypes, I could have used them both from either runtime.

In general, what are alternate runtimes like PyPy to do if Parrot becomes
the de facto standard for extension modules?  Link against Parrot?  Suffer
without those modules until someone does a custom binding for that runtime?

Jean-Paul

From collinw at gmail.com  Mon Aug 14 21:41:11 2006
From: collinw at gmail.com (Collin Winter)
Date: Mon, 14 Aug 2006 15:41:11 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
Message-ID: <43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>

On 8/13/06, Paul Prescod <paul at prescod.net> wrote:
> "In order for processors of function annotations to work interoperably, they
> must use a common interpretation of objects used as annotations on a
> particular function. For example, one might interpret string annotations as
> docstrings. Another might interpret them as path segments for a web
> framework. For this reason, function annotation processors SHOULD avoid
> assigning processor-specific meanings to types defined outside of the
> processor's framework. For example, a Django processor could process
> annotations of a type defined in a Zope package, but Zope's creators should
> be considered the authorities on the type's meaning for the same reasons
> that they would be considered authorities on the semantics of classes or
> methods in their packages."

The way I read this, it forces (more or less) each
annotation-consuming library to invent new ways to spell Python's
built-in types.

I read all this as saying that annotation processors should avoid
using Python's lists, tuples and dicts in annotations (since whatever
semantics the Python developers come up with will inevitably be
incompatible with what some library writer needs/wants). Each
processor library will then define my_processor.List,
my_processor.Tuple, my_processor.Dict, etc as alternate spellings for
[x, y, z], (x, y, z), {x: y} and so on.
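A quick sketch of what those processor-specific spellings might look like (the class names are hypothetical, echoing the `my_processor.List` idea above):

```python
# Hypothetical spellings an annotation consumer might invent so that
# the plain built-in containers stay free of processor-specific meaning.
class List:
    def __init__(self, *items):
        self.items = items

class Tuple:
    def __init__(self, *items):
        self.items = items

class Dict:
    def __init__(self, key, value):
        self.key, self.value = key, value

# Where plain Python would write [int, str], this processor's users write:
spec = List(int, str)
```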

> "This implies that the interpretation of built-in types would be controlled
> by Python's developers and documented in Python's documentation.

The inherent difficulty in defining a standard interpretation for
these types is what motivated me to leave this up to the authors of
annotation consumers. I don't mean "it was hard so I gave up"; I can
easily come up with a standard, but it will probably be of limited or
no utility to some section of the possible userbase.

If you have an idea, though, feel free to propose something concrete.

>  "In Python 3000, semantics will be attached to the following types: objects
> of type string (or subtype of string) are to be used for documentation
> (though they are not necessarily the exclusive source of documentation about
> the type). Objects of type list (or subtype of list) are to be used for
> attaching multiple independent annotations."

Does this mean all lists "are to be used for attaching multiple
independent annotations", or just top-level lists (ie, "def foo(a: [x,
y])" indicates two independent annotations)? What does "def foo(a: [x,
[y, z]])" indicate?

Collin Winter

From paul at prescod.net  Mon Aug 14 22:00:59 2006
From: paul at prescod.net (Paul Prescod)
Date: Mon, 14 Aug 2006 13:00:59 -0700
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <ca471dc20608141108u118487ccw16cc8527c6f24744@mail.gmail.com>
References: <ca471dc20608140909o730ab1e0i86c6d562cfa90abd@mail.gmail.com>
	<20060814172000.1717.863905740.divmod.quotient.22821@ohm>
	<ca471dc20608141108u118487ccw16cc8527c6f24744@mail.gmail.com>
Message-ID: <1cb725390608141300o7b6e6503x23e6c7b9cf31b92f@mail.gmail.com>

On 8/14/06, Guido van Rossum <guido at python.org> wrote:
>
>
> The major difference between all these examples and ctypes is that
> ctypes has no way of introspecting the wrapped library; you have to
> repeat everything you know about the API in your calls to ctypes (and
> as was just shown in another thread about 64-bit issues, that's not
> always easy).


An excellent point and very clarifying (though I still don't totally
understand the relationship with Parrot).

What do you think about techniques like these:

 * http://starship.python.net/crew/theller/ctypes/old/codegen.html

 * http://lists.copyleft.no/pipermail/pyrex/2006-June/001885.html

I agree that this is an issue.

On the other hand, given N methods and objects that you need wrapped,
you will in general need to make N individual mapping statements no
matter what technology you use. The question is how many lines each
mapping takes. Ctypes currently requires you to re-declare what you
know about the C library. Hand-written C extensions require you to
jump through other hoops.

For example, looking at Pygame ctypes, consider the following method:

    def __copy__(self):
        return Rect(self.x, self.y, self.w, self.h)

That's the ctypes version. Here's the C version:

/* for copy module */
static PyObject* rect_copy(PyObject* oself, PyObject* args)
{
    PyRectObject* self = (PyRectObject*)oself;
    return PyRect_New4(self->r.x, self->r.y, self->r.w, self->r.h);
}

static struct PyMethodDef rect_methods[] =
{
...        {"__copy__",            (PyCFunction)rect_copy, 0, NULL},...
};

So there is some repetition there as well (casts, function name
duplications, etc.).

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060814/a281094f/attachment.htm 

From paul at prescod.net  Mon Aug 14 22:20:54 2006
From: paul at prescod.net (Paul Prescod)
Date: Mon, 14 Aug 2006 13:20:54 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
	<43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>
Message-ID: <1cb725390608141320n11683af8q27a75309011a512c@mail.gmail.com>

On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
>
> The way I read this, it forces (more or less) each
> annotation-consuming library to invent new ways to spell Python's
> built-in types.


I think that this is related to your other question: what if an
annotation-consuming library wanted to use Python's built-in types
nested within its own top-level structures?

def foo(a: xxx([x, y, z])): ...

I would say that the innermost list has its semantics (as metadata) defined
by "xxx", not raw Python. That's the only reasonable thing.
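A minimal sketch of the wrapper idea (`xxx` is Paul's placeholder name; the class body here is purely illustrative):

```python
class xxx:
    """Hypothetical annotation wrapper: once a list is wrapped in it,
    the list's interpretation belongs to this framework, not to
    'raw' Python."""
    def __init__(self, payload):
        self.payload = payload

def foo(a: xxx([int, str, float])):
    return a

# The consumer retrieves the wrapper and interprets the inner list itself:
ann = foo.__annotations__['a']
```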

> "This implies that the interpretation of built-in types would be
> controlled
> > by Python's developers and documented in Python's documentation.
>
> The inherent difficulty in defining a standard interpretation for
> these types is what motivated me to leave this up to the authors of
> annotation consumers.


There are three issues: first, we need to RESERVE the types for
standardization by Guido and crew. Second, we can decide to do the
standardization at any point. Third, we absolutely need a standard for
multiple independent annotations on a parameter. Using lists is a
no-brainer. So let's do that.

> If you have an idea, though, feel free to propose something concrete.


Yes, my proposal is here:

>  "In Python 3000, semantics will be attached to the following types:
> objects
> > of type string (or subtype of string) are to be used for documentation
> > (though they are not necessarily the exclusive source of documentation
> about
> > the type). Objects of type list (or subtype of list) are to be used for
> > attaching multiple independent annotations."
>
> Does this mean all lists "are to be used for attaching multiple
> independent annotations", or just top-level lists (ie, "def foo(a: [x,
> y])" indicates two independent annotations)? What does "def foo(a: [x,
> [y, z]])" indicate?


I meant only top-level lists. I hadn't thought through nesting.

def foo(a: [x, y, [a, b, c]]): ...

This should probably be just handled recursively or disallowed. I don't feel
strongly either way.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060814/ff1b6d61/attachment.html 

From jimjjewett at gmail.com  Mon Aug 14 22:24:15 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Mon, 14 Aug 2006 16:24:15 -0400
Subject: [Python-3000] PEP3102 Keyword-Only Arguments; Signature
Message-ID: <fb6fbf560608141324ie44f75anc33876a18ae202e0@mail.gmail.com>

On 8/14/06, Steven Bethard <steven.bethard at gmail.com> wrote:
> On 8/14/06, Guido van Rossum <guido at python.org> wrote:
> > I believe the PEP doesn't address the opposite use case: positional
> > arguments that should *not* be specified as keyword arguments.

...
> It would be really nice in the example above to mark ``self`` in
> ``__call__`` as a positional only argument.

Would this have to be in the standard function prologue, or would it
be acceptable to modify a function's Signature object?

As I see it, each argument can be any combination of the following:

    positional
    keyword
    named
    defaulted
    annotated

I can see some value in supporting all 32 possibilities, but doing it
directly as part of the def syntax might get awkward.

Most arguments are both positional and keyword.  The bare * will
support keyword-only, and you're asking for positional-only.  (An
argument which is neither positional nor keyword doesn't make sense.)

Today (except in extension code), an argument that isn't named only
appears courtesy of *args or **kwargs.

Today, named + keyword <==> defaulted

Today, arguments are not annotated.

Would it be acceptable if functions contained a (possibly implicit)
Signature object, and the way to get the odd combinations were through
modifying that?

For example:

    def unnamedargs(func):
        for arg in func.Signature:
            arg.name=None
        return func
...
        @unnamedargs
        def write(self, s):


-jJ

From steven.bethard at gmail.com  Mon Aug 14 22:34:54 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Mon, 14 Aug 2006 14:34:54 -0600
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>
	<ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>
	<d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>
	<ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>
Message-ID: <d11dcfba0608141334g8f55c82h44616c0f02397f2c@mail.gmail.com>

On 8/14/06, Guido van Rossum <guido at python.org> wrote:
> On 8/14/06, Steven Bethard <steven.bethard at gmail.com> wrote:
> > On 8/14/06, Guido van Rossum <guido at python.org> wrote:
> > > I believe the PEP doesn't address the opposite use case: positional
> > > arguments that should *not* be specified as keyword arguments. For
> > > example, I might want to write
> > >
> > >   def foo(a, b): ...
> > >
> > > but I don't want callers to be able to call it as foo(b=1, a=2) or
> > > even foo(a=2, b=1).
> >
> > Another use case is when you want to accept the arguments of another
> > callable, but you have your own positional arguments::
> >
> >     >>> class Wrapper(object):
> >     ...     def __init__(self, func):
> >     ...         self.func = func
> >     ...     def __call__(self, *args, **kwargs):
> >     ...         print 'calling wrapped function'
> >     ...         return self.func(*args, **kwargs)
> >     ...
> >     >>> @Wrapper
> >     ... def func(self, other):
> >     ...     return self, other
> >     ...
> >     >>> func(other=1, self=2)
> >     Traceback (most recent call last):
> >       File "<interactive input>", line 1, in ?
> >     TypeError: __call__() got multiple values for keyword argument 'self'
> >
> > It would be really nice in the example above to mark ``self`` in
> > ``__call__`` as a positional only argument.
>
> But this is a rather unusual use case isn't it? It's due to the bound
> methods machinery. Do you have other use cases?

Well, for example, unitest.TestCase.failUnlessRaises works this way.
Here's the method signature::

    def failUnlessRaises(self, excClass, callableObj, *args, **kwargs):

Which means that if you write::

    self.failUnlessRaises(TypeError, my_func, callableObj=foo)

you'll get an error since there's a name clash between the callableObj
taken by failUnlessRaises and the one taken by the my_func object.

OTOH, I haven't run into this error because I don't use camelCase
names.  Perhaps the right answer is to always use camelCase for any
arguments whose names you don't want conflicting, and then any
PEP 8 compliant code will never have problems. ;-)
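The clash can be reproduced with a plain function (a simplified, hypothetical stand-in for `unittest.TestCase.failUnlessRaises`, without the `self` machinery):

```python
def fail_unless_raises(excClass, callableObj, *args, **kwargs):
    # simplified stand-in for unittest.TestCase.failUnlessRaises
    try:
        callableObj(*args, **kwargs)
    except excClass:
        return
    raise AssertionError("%s not raised" % excClass.__name__)

def my_func(callableObj):
    raise TypeError("boom")

# The keyword is intended for my_func, but it collides with the
# parameter of the same name in fail_unless_raises:
try:
    fail_unless_raises(TypeError, my_func, callableObj=print)
except TypeError as e:
    print(e)  # "got multiple values for argument 'callableObj'"
```

Passing `callableObj` positionally instead works fine, which is exactly the asymmetry being discussed.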

> > > Perhaps we can use ** without following identifier to signal this?
> > > It's not entirely analogous to * without following identifier, but at
> > > least somewhat similar.
> >
> > I'm certainly not opposed to going this way, but I don't think it
> > would solve the problem above since you still need to take keyword
> > arguments.
>
> Can you elaborate?

Well, taking the failUnlessRaises signature above, if you wanted to
specify that ``self``, ``excClass`` and ``callableObj`` were
positional only arguments, I believe you'd have to write::

    def failUnlessRaises(self, excClass, callableObj, *args, **):

I believe that means that you can't use failUnlessRaises to call a
method that expects keyword arguments, e.g.::

    self.assertRaises(OptionError, parser.add_option, type='foo')


STeVe
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From paul at prescod.net  Mon Aug 14 22:51:10 2006
From: paul at prescod.net (Paul Prescod)
Date: Mon, 14 Aug 2006 13:51:10 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <fb6fbf560608141211k528e754eva03e90612dff9ecd@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
	<fb6fbf560608141211k528e754eva03e90612dff9ecd@mail.gmail.com>
Message-ID: <1cb725390608141351n78099df6s6bf4359758d18b10@mail.gmail.com>

On 8/14/06, Jim Jewett <jimjjewett at gmail.com> wrote:
>
> Should annotation objects with defined semantics have some standard
> way to indicate this?  (By analogy, new exceptions *should* inherit
> from Exception; should annotation objects inherit from an Annotation
> class, at least as a mixin?)


All annotation objects have defined semantics (somewhere) or else they are
useless. I don't see any benefit in making them inherit from anything in
particular. Python has a very specific reason for requiring that in the
exception case. I'd rather not complicate the design without a good reason.

> "This implies that the interpretation of built-in types would be
> controlled
> > by Python's developers and documented in Python's documentation.
>
> It also implies that the interpretation of annotations made with a
> built-in type should be safe -- they shouldn't trigger any
> irreversible actions.


I disagree and don't think you can come up with a clear definition of
"irreversible" in any case. Is spitting out text to a stream "irreversible"?
I'd rather not complicate stuff.

>  "In Python 3000, semantics will be attached to the following types:
> objects
> > of type string (or subtype of string) are to be used for documentation
> > (though they are not necessarily the exclusive source of documentation
> about
> > the type). Objects of type list (or subtype of list) are to be used for
> > attaching multiple independent annotations."
>
> subtypes should be available for other frameworks.


I'd be happy to remove the whole subtype clause. I don't care much either
way. But anyhow I (now) disagree that there is a problem as stated. If a
framework wants to use a subtype of list they just need to wrap it in a
top-level wrapper that makes the association.

def foo(a: xxx(mylist_subtype(a, b, c))):

This is clear thanks to Collin Winter's recent post.

> This implies that something other than lists should be used if the
> annotations are not independent.  The obvious candidates are tuples
> and dicts, but this should be explicit (or explicitly not defined).


The "dependence" between annotations is totally up to the framework. To repeat
the example:

def foo(a: xxx(mylist_subtype(a, b, c))):

xxx might say that a is passed as a ".next" attribute to b which is passed
as a ".next" attribute to "c". Or xxx might say that "a" is passed to "b"'s
constructor which is passed to "c"'s constructor. Remember that "xxx" is
executable so it could do whatever it wants. It should just document what it
did so that various libraries know how to navigate the object structure it
creates.

> The definition of a type as an annotation should probably be either
> defined or explicitly undefined.  Earlier discussions talked about
> things like
>
>     def f (a:int, b:(float | Decimal), c:[int, str, X]) ->str)


I think that's a separate (large!) PEP. This PEP should disallow frameworks
from inventing their own meaning for this syntax (requiring them to at least
wrap). Then Guido and crew can dig into this issue on their own schedule.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060814/1629696c/attachment.html 

From collinw at gmail.com  Mon Aug 14 23:03:56 2006
From: collinw at gmail.com (Collin Winter)
Date: Mon, 14 Aug 2006 16:03:56 -0500
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608141320n11683af8q27a75309011a512c@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
	<43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>
	<1cb725390608141320n11683af8q27a75309011a512c@mail.gmail.com>
Message-ID: <43aa6ff70608141403i36dfeefcn2cb1aa7f803b5579@mail.gmail.com>

On 8/14/06, Paul Prescod <paul at prescod.net> wrote:
> There are three issues: first, we need to RESERVE the types for
> standardization by Guido and crew.

You're just pushing the decision off to someone else. Regardless of
who makes it, decisions involving the built-in types are going to make
some group unhappy. This list saw several discussions related to
standard interpretations for the built-in types back in May and June;
here's a selection for you to catch up on:

http://mail.python.org/pipermail/python-3000/2006-May/002134.html
http://mail.python.org/pipermail/python-3000/2006-May/002216.html
http://mail.python.org/pipermail/python-3000/2006-June/002438.html

One particularly divisive issue is whether tuples should be treated as
fixed- or arbitrary-length containers. Concretely, does
"tuple(Number)" match only 1-tuples with a single Number element, or
does it match all tuples that have only Number elements?
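The two readings can be spelled out as checker functions (hypothetical code, just to make the ambiguity concrete):

```python
from numbers import Number

def matches_fixed(value, *elem_types):
    # "tuple(Number)" as a fixed-length spec: only 1-tuples match
    return (isinstance(value, tuple)
            and len(value) == len(elem_types)
            and all(isinstance(v, t) for v, t in zip(value, elem_types)))

def matches_homogeneous(value, elem_type):
    # "tuple(Number)" as an arbitrary-length spec: any all-Number tuple
    return (isinstance(value, tuple)
            and all(isinstance(v, elem_type) for v in value))

print(matches_fixed((1,), Number))          # True
print(matches_fixed((1, 2), Number))        # False
print(matches_homogeneous((1, 2), Number))  # True
```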

Regardless of which you pick, somebody's going to be pissed.

> Second, we can decide to do the standardization at any point.

Um, "at any point"? You mean it's conceivable that this
standardisation could come *after* Python ships with function
annotations?

Collin Winter

From paul at prescod.net  Mon Aug 14 23:18:07 2006
From: paul at prescod.net (Paul Prescod)
Date: Mon, 14 Aug 2006 14:18:07 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608141403i36dfeefcn2cb1aa7f803b5579@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
	<43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>
	<1cb725390608141320n11683af8q27a75309011a512c@mail.gmail.com>
	<43aa6ff70608141403i36dfeefcn2cb1aa7f803b5579@mail.gmail.com>
Message-ID: <1cb725390608141418y4c111070l73554a2a959e5d72@mail.gmail.com>

On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
>
> On 8/14/06, Paul Prescod <paul at prescod.net> wrote:
> > There are three issues: first, we need to RESERVE the types for
> > standardization by Guido and crew.
>
> You're just pushing the decision off to someone else. Regardless of
> who makes it, decisions involving the built-in types are going to make
> some group unhappy.


Yes, I know. I spent about a month of my life going through the same process
back around 2003.

> Second, we can decide to do the standardization at any point.
>
> Um, "at any point"? You mean it's conceivable that this
> standardisation could come *after* Python ships with function
> annotations?


Sure. Why not?

All I'm saying is that the "function annotations" PEP should not depend on
the "function annotations for static type declarations" PEP. That was
implicit in your original pre-PEP!

If the "static type declarations PEP" misses the Python 3000 deadline then
the function annotations feature is still valuable. The former could be used
as a testbed for the latter:

def myfunc(NumTuples: [typepackage1(tuple(Number)),
                       typepackage2("tuple(Number+)")]): ...

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060814/2ac569eb/attachment.htm 

From collinw at gmail.com  Mon Aug 14 23:23:48 2006
From: collinw at gmail.com (Collin Winter)
Date: Mon, 14 Aug 2006 16:23:48 -0500
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608141418y4c111070l73554a2a959e5d72@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
	<43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>
	<1cb725390608141320n11683af8q27a75309011a512c@mail.gmail.com>
	<43aa6ff70608141403i36dfeefcn2cb1aa7f803b5579@mail.gmail.com>
	<1cb725390608141418y4c111070l73554a2a959e5d72@mail.gmail.com>
Message-ID: <43aa6ff70608141423w64afca33uc284417cec4a62fe@mail.gmail.com>

On 8/14/06, Paul Prescod <paul at prescod.net> wrote:
> On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
> > On 8/14/06, Paul Prescod <paul at prescod.net> wrote:
> > > Second, we can decide to do the standardization at any point.
> >
> > Um, "at any point"? You mean it's conceivable that this
> > standardisation could come *after* Python ships with function
> > annotations?
>
> Sure. Why not?

Because not having standardised meanings at the same time as the
feature becomes available says to developers, "don't use the built-in
types in your annotations because we might give them a meaning
later...or maybe we won't...but in the meantime, you're going to need
to invent new spellings for lists, tuples, dicts, sets, strings, just
in case". As someone writing an annotation consumer, that comes across
as an incredibly arbitrary decision that forces me to do a lot of
extra work.

Collin Winter

From collinw at gmail.com  Mon Aug 14 23:59:38 2006
From: collinw at gmail.com (Collin Winter)
Date: Mon, 14 Aug 2006 16:59:38 -0500
Subject: [Python-3000] Conventions for annotation consumers (was: Re: Draft
	pre-PEP: function annotations)
Message-ID: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>

On 8/14/06, Paul Prescod <paul at prescod.net> wrote:
> Third, we absolutely need a standard for
> multiple independent annotations on a parameter. Using lists is a
> no-brainer. So let's do that.

The problem with using lists is that it's impossible for non-decorator
annotation consumers to know which element "belongs" to them.

Way back in http://mail.python.org/pipermail/python-3000/2006-August/002865.html,
Nick Coghlan said:
> However, what we're really talking about here is a scenario where you're
> defining your *own* custom annotation processor: you want the first part of
> the tuple in the expression handled by the type processing library, and the
> second part handled by the docstring processing library.
>
> Which says to me that the right solution is for the annotation to be split up
> into its constituent parts before the libraries ever see it.
>
> This could be done as Collin suggests by tampering with
> __signature__.annotations before calling each decorator, but I think it is
> cleaner to do it by defining a particular signature for decorators that are
> intended to process annotations.
>
> Specifically, such decorators should accept a separate dictionary to use in
> preference to the annotations on the function itself:
>
>    process_function_annotations(f, annotations=None):
>      # Process the function f
>      # If annotations is not None, use it
>      # otherwise, get the annotations from f.__signature__

I've come to like this idea more and more. Here's my stab at a
dict-based convention for specifying annotations for decorator-style
consumers:

There are several annotation consumers, docstring, typecheck and
constrain_values. Respectively, these treat annotations as
documentation; as restrictions on the type of an argument; as
restrictions on the values of an argument.

Each of these is defined something like

def consumer(annotated_function, annotations=sentinel):
    ...

If the consumer isn't given an `annotations` parameter, it is free to
assume it is the only consumer for the annotations on that function
and is free to treat the annotation expressions however it sees fit.
However, if it is given an `annotations` argument, it should observe
those annotations and only those annotations.

The more complete example:

@multiple_annotations(docstring, typecheck, constrain_values)
def foo(a: {'docstring': "Frobnication count",
            'typecheck': Number,
            'constrain_values': range(3, 9)},
        b: {'typecheck': Number,
            # This can be only 4, 8 or 12
            'constrain_values': [4, 8, 12]}) -> {'typecheck': Number}:
    ...


Here, multiple_annotations assumes that the annotation dicts are keyed
on consumer.__name__; the test "if consumer.__name__ in
per_parameter_annotations" should do nicely for figuring out whether a
given consumer should be provided an `annotations` argument. (It is up
to multiple_annotations() to decide whether "consumer.__name__ in
per_parameter_annotations == False" should raise an exception.)

Collin Winter

From jimjjewett at gmail.com  Tue Aug 15 00:03:17 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Mon, 14 Aug 2006 18:03:17 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608141351n78099df6s6bf4359758d18b10@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
	<fb6fbf560608141211k528e754eva03e90612dff9ecd@mail.gmail.com>
	<1cb725390608141351n78099df6s6bf4359758d18b10@mail.gmail.com>
Message-ID: <fb6fbf560608141503s53f0beafx729321f74bda917c@mail.gmail.com>

On 8/14/06, Paul Prescod <paul at prescod.net> wrote:

> > > "This implies that the interpretation of built-in types would be
> controlled
> > > by Python's developers and documented in Python's documentation.

> > It also implies that the interpretation of annotations made with a
> > built-in type should be safe -- they shouldn't trigger any
> > irreversible actions.

> I disagree and don't think you can come up with a clear definition of
> "irreversible" in any case. Is spitting out text to a stream "irreversible"?
> I'd rather not complicate stuff.

That part is admittedly a guideline for development of python, rather
than with python.  The question is what happens with something like

    def f(a:int): ...

If the thing starts compiling (like Pyrex) to code which assumes an
int and doesn't verify, that would be a disaster waiting to happen --
unless int were explicitly reserved to the python core more strongly
than the proposed wording implies.

> I'd be happy to remove the whole subtype clause. I don't care much either
> way. But anyhow I (now) disagree that there is a problem as stated. If a
> framework wants to use a subtype of list they just need to wrap it in a
> top-level wrapper that makes the association.

> def foo(a: xxx(mylist_subtype(a, b, c))):

mylist_subtype is as unique as an object (but not as a name); if xxx
is sufficient disambiguation, then so is mylist_subtype on its own.

> > This implies that something other than lists should be used if the
> > annotations are not independent.  The obvious candidates are tuples
> > and dicts, but this should be explicit (or explicitly not defined).

> The "dependence" between notations is totally up to the framework. To repeat
> the example:

For builtin lists, the meaning should be reserved to the python core.
What does the following mean?

    def f(a:[int, str])

I assume it doesn't mean a list of int and str (because lists are used
for independent annotations).  I assume it also doesn't mean "int _or_
str" because the annotations are independent.  If the two are
supposed to be used together, then they should be chained with
something other than list.

> > The definition of a type as an annotation should probably be either
> > defined or explicitly undefined.  Earlier discussions talked about
> > things like

> >     def f (a:int, b:(float | Decimal), c:[int, str, X]) ->str)

> I think that's a separate (large!) PEP.

Agreed.  But I think the PEP should explicitly reserve the
(annotational) meaning of

(1)  builtin and standard library types, such as int and Decimal
(2)  The results of combining types with operators (such as |, +=, etc)
(3)  lists, tuples, and dictionaries of the above

It doesn't have to say what they mean, but it has to warn that a
standard meaning is contemplated, and that 3rd parties should consider
them reserved.

-jJ

From jimjjewett at gmail.com  Tue Aug 15 00:22:45 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Mon, 14 Aug 2006 18:22:45 -0400
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
Message-ID: <fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>

On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
> On 8/14/06, Paul Prescod <paul at prescod.net> wrote:

> The problem with using lists is that it's impossible for non-decorator
> annotation consumers to know which element "belongs" to them.

The ones whose type they own -- which is why I see at least some
parallel to exceptions and their inheritance-based semantics.

    def f(a:[mytype("asdfljasdf"),
             zope.mypackage.something(b,d,e),
             "a string",
             mytype([47]),
             15]):
        """Example of long compound annotations

Maybe annotations this size should just be restricted to Signature modification
instead of allowing them in the actual declaration?  At least by style guides?
"""

By the defined meaning of list, these are 5 independent annotations.

Whoever defined mytype controls the meaning of the mytype annotations;
anyone not familiar with that package should ignore them (and hope
there were no side effects in the expressions that generated them).

zope.mypackage controls that annotation; anyone not familiar with that
product should ignore it (and hope there were no side effects ...)

"a string" and 15 are builtin types -- so their semantics are defined
by core python, which says that they are documentation only --
stripping them off or changing them wouldn't break a properly written
program.

> Here, multiple_annotations assumes that the annotation dicts are keyed
> on consumer.__name__;

Too many consumers will call themselves "wrapper" or some such.  You
should key on the actual type object -- in which case you probably
want isinstance to support subtypes.
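A sketch of keying on the annotation's type rather than its name, so that `isinstance` also catches subtypes (the `mytype` marker class and `pick_annotations` helper are hypothetical):

```python
class mytype(str):
    """Marker type owned by one particular annotation consumer."""

def pick_annotations(annotation, owned_type):
    """From a list-style compound annotation, return only the
    elements whose type this consumer owns (subtypes included)."""
    items = annotation if isinstance(annotation, list) else [annotation]
    return [item for item in items if isinstance(item, owned_type)]

# Five independent annotations; only the mytype ones are "ours".
ann = [mytype("asdfljasdf"), "a string", mytype("other"), 15]
mine = pick_annotations(ann, mytype)
```

A bare `str` like `"a string"` is not picked up, because `mytype` subclasses `str` and not the other way around; ownership flows down the inheritance tree only.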

-jJ

From paul at prescod.net  Tue Aug 15 00:48:33 2006
From: paul at prescod.net (Paul Prescod)
Date: Mon, 14 Aug 2006 15:48:33 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608141423w64afca33uc284417cec4a62fe@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
	<43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>
	<1cb725390608141320n11683af8q27a75309011a512c@mail.gmail.com>
	<43aa6ff70608141403i36dfeefcn2cb1aa7f803b5579@mail.gmail.com>
	<1cb725390608141418y4c111070l73554a2a959e5d72@mail.gmail.com>
	<43aa6ff70608141423w64afca33uc284417cec4a62fe@mail.gmail.com>
Message-ID: <1cb725390608141548l2cf6f484rd6cf909cdb3637e7@mail.gmail.com>

On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
>
> Because not having standardised meanings at the same time as the
> feature becomes available says to developers, "don't use the built-in
> types in your annotations because we might give them a meaning
> later...or maybe we won't...but in the meantime, you're going to need
> to invent new spellings for lists, tuples, dicts, sets, strings, just
> in case". As someone writing an annotation consumer, that comes across
> as an incredibly arbitrary decision that forces me to do a lot of
> extra work.


No, you aren't going to have to invent new spellings.  As per my previous
email, this should be allowed:

def myfunc( NumTuples: [typepackage1(tuple(int)),
                        typepackage2("tuple(Number+)")]): ...

All you need to do is declare the fact that you are using the built-in types
in a non-standard way by wrapping them in your own annotation constructor.
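Concretely, the wrapping described above might look like this (the `typepackage` class is a hypothetical stand-in for a consumer's own annotation constructor):

```python
class typepackage:
    """Wrapper claiming a built-in value for one consumer's use.

    The wrapped built-in (here a tuple of types) carries whatever
    meaning the consumer assigns; the bare built-in types keep any
    standard meaning Python later defines.
    """
    def __init__(self, spec):
        self.spec = spec

    def __repr__(self):
        return 'typepackage(%r)' % (self.spec,)

# The enclosing list keeps its proposed standard meaning (a list of
# independent annotations); the wrapped tuple is unambiguously owned
# by typepackage's consumer.
def myfunc(num_tuples: [typepackage((int, int))]):
    return num_tuples
```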

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060814/206991d8/attachment.html 

From collinw at gmail.com  Tue Aug 15 00:51:40 2006
From: collinw at gmail.com (Collin Winter)
Date: Mon, 14 Aug 2006 17:51:40 -0500
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
Message-ID: <43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>

On 8/14/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
> > The problem with using lists is that its impossible for non-decorator
> > annotation consumers to know which element "belongs" to them.
>
> The ones whose type they own -- which is why I see at least some
> parallel to exceptions, and its inheritance based semantics.
>
>     def f(a:[mytype("asdfljasdf"),
>              zope.mypackage.something(b,d,e),
>              "a string",
>              mytype([47]),
>              15]):
>
> Whoever defined mytype controls the meaning of the mytype annotations;
> anyone not familiar with that package should ignore them (and hope
> there were no side effects in the expressions that generated them).
>
> zope.mypackage controls that annotation; anyone not familiar with that
> product should ignore it (and hope there were no side effects ...)

As hideous as I think this is from an aesthetics/visual noise
standpoint, it's probably the only reliable way to let both decorator-
and non-decorator-based consumers work.

What would the rule be about top-level types? Would you have to use a
list, or could a set or dict be used?

Collin Winter

From paul at prescod.net  Tue Aug 15 01:18:14 2006
From: paul at prescod.net (Paul Prescod)
Date: Mon, 14 Aug 2006 16:18:14 -0700
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
Message-ID: <1cb725390608141618r11e61720y7ad3c1ab410dccc5@mail.gmail.com>

On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
>
> What would the rule be about top-level types? Would you have to use a
> list, or could a set or dict be used?


I argue for restricting to a list for the following reasons:

1. Better to just pick something for visual consistency (someone said they
liked tuples but I find all of the rounded parens confusing)

 2.  May want to use other types for other meanings in the future.

 3. What do you do with the keys of the dictionary? Is this back to
connecting decorators to annotations by name or something? The string
namespace is not very manageable.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060814/4deed1fd/attachment.htm 

From guido at python.org  Tue Aug 15 02:26:42 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 17:26:42 -0700
Subject: [Python-3000] PEP3102 Keyword-Only Arguments; Signature
In-Reply-To: <fb6fbf560608141324ie44f75anc33876a18ae202e0@mail.gmail.com>
References: <fb6fbf560608141324ie44f75anc33876a18ae202e0@mail.gmail.com>
Message-ID: <ca471dc20608141726m3ffc6f88o4013370def76b809@mail.gmail.com>

On 8/14/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 8/14/06, Steven Bethard <steven.bethard at gmail.com> wrote:
> > On 8/14/06, Guido van Rossum <guido at python.org> wrote:
> > > I believe the PEP doesn't address the opposite use case: positional
> > > arguments that should *not* be specified as keyword arguments.
>
> ...
> > It would be really nice in the example above to mark ``self`` in
> > ``__call__`` as a positional only argument.
>
> Would this have to be in the standard function prologue, or would it
> be acceptable to modify a function's Signature object?
>
> As I see it, each argument can be any combination of the following:
>
>     positional
>     keyword
>     named
>     defaulted
>     annotated
>
> I can see some value in supporting all 32 possibilities, but doing it
> directly as part of the def syntax might get awkward.

Perhaps. Though you're making it seem worse than it is by adding
annotated (which should be considered completely orthogonal to the
rest, and may not combine with everything else).

> Most arguments are both positional and keyword.  The bare * will
> support keyword-only, and you're asking for positional-only.  (An
> argument which is neither positional nor keyword doesn't make sense.)
>
> Today (except in extension code), an argument that isn't named only
> appears courtesy of *args or **kwargs.
>
> Today, named + keyword <==> defaulted

I'm not sure I follow. You seem to be perpetuating the eternal
misunderstanding that from the caller's POV this is not a keyword
argument:

  def foo(a): pass

In fact, calling foo(a=1) is totally legal.

> Today, arguments are not annotated.
>
> Would it be acceptable if functions contained a (possibly implicit)
> Signature object, and the way to get the odd combinations were through
> modifying that?
>
> For example:
>
>     def unnamedargs(func):
>         for arg in func.Signature:
>             arg.name=None
>         return func
> ...
>         @unnamedargs
>         def write(self, s):

This seems a last-resort approach; I'd rather do something less
drastic. Unfortunately the more I think about it the less I like using
'**' without a following name for this feature.

PS whenever you respond to something it becomes a new thread in Gmail.
Is your mail app perhaps not properly inserting In-reply-to headers?
Or do you forge a reply by creating a new message with the same
subject and "Re:" prepended?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From pje at telecommunity.com  Tue Aug 15 03:08:25 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Mon, 14 Aug 2006 21:08:25 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <mailman.34357.1155590629.27774.python-3000@python.org>
Message-ID: <5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>

At 1:51 PM 8/14/2006 -0700, "Paul Prescod" <paul at prescod.net> wrote:
>On 8/14/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> > The definition of a type as an annotation should probably be either
> > defined or explicitly undefined.  Earlier discussions talked about
> > things like
> >
> >     def f (a:int, b:(float | Decimal), c:[int, str, X]) ->str)
>
>
>I think that's a separate (large!) PEP. This PEP should disallow frameworks
>from inventing their own meaning for this syntax (requiring them to at least
>wrap). Then Guido and crew can dig into this issue on their own schedule.

I see we haven't made nearly as much progress on the concept of "no 
predefined semantics" as I thought we had.  :(

i.e., -1 on constraining what types mean.


From ironfroggy at gmail.com  Tue Aug 15 03:10:13 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Mon, 14 Aug 2006 21:10:13 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<20060812205512.197A.JCARLSON@uci.edu>
	<5.1.1.6.0.20060813013329.0226d240@sparrow.telecommunity.com>
	<5.1.1.6.0.20060813125944.056a3f40@sparrow.telecommunity.com>
	<1cb725390608131057y122b0c0wf81611e136659793@mail.gmail.com>
	<43aa6ff70608141241w43b7b694k77e63ba6766a1f55@mail.gmail.com>
Message-ID: <76fd5acf0608141810gf062eabh76b0ca92d61372b1@mail.gmail.com>

On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
> On 8/13/06, Paul Prescod <paul at prescod.net> wrote:
> > "In order for processors of function annotations to work interoperably, they
> > must use a common interpretation of objects used as annotations on a
> > particular function. For example, one might interpret string annotations as
> > docstrings. Another might interpet them as path segments for a web
> > framework. For this reason, function annotation processors SHOULD avoid
> > assigning processor-specific meanings to types defined outside of the
> > processor's framework. For example, a Django processor could process
> > annotations of a type defined in a Zope package, but Zope's creators should
> > be considered the authorities on the type's meaning for the same reasons
> > that they would be considered authorities on the semantics of classes or
> > methods in their packages."
>
> The way I read this, it forces (more or less) each
> annotation-consuming library to invent new ways to spell Python's
> built-in types.
>
> I read all this as saying that annotation processors should avoid
> using Python's lists, tuples and dicts in annotations (since whatever
> semantics the Python developers come up with will inevitably be
> incompatible with what some library writer needs/wants). Each
> processor library will then define my_processor.List,
> my_processor.Tuple, my_processor.Dict, etc as alternate spellings for
> [x, y, z], (x, y, z), {x: y} and so on.

I'm sorry but I don't see the logic here. Why will all the annotation
libraries need to invent stand-ins for the built-in types? They just
shouldn't define any meaning to standard types as annotations, leaving
the interpretation of int in 'def foo(a: int)' up to the python
developers. The only thing I can figure is that you see this need in
order for other annotation libraries to handle associating types with
arguments, but there is evidence that this shouldn't be done directly
with built-in type objects (unless defined by python itself). Using
the types directly doesn't cover important use-cases like adapting;
even though we can expect it to be safe with builtin types, we cannot
be sure of this with all types, so there is a good chance the type
annotations will take the form of

   def foo(a: argtype(int))
   def bar(b: argtype(Baz, adapter=Baz.adaptFrom))

which defines that foo takes an int object and bar takes a Baz
instance, which can be adapted to with the classmethod Baz.adaptFrom.
Maybe Baz' constructor takes a database connection and object ID, and
would break just being passed a random object. In this case, we don't
need to use my_anno.Integer or something like that, because we aren't
(and shouldn't) use the built-in type objects directly as our
annotation objects.
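A sketch of the `argtype` wrapper described above (the name and the `adapter` keyword are from the proposal; the checking/adaptation logic is an assumption about how a consumer might use it):

```python
class argtype:
    """Annotation wrapper: a required type plus an optional adapter."""
    def __init__(self, type_, adapter=None):
        self.type = type_
        self.adapter = adapter

    def coerce(self, value):
        """Return value if it matches, adapt it if possible,
        else raise TypeError."""
        if isinstance(value, self.type):
            return value
        if self.adapter is not None:
            return self.adapter(value)
        raise TypeError("expected %s, got %r"
                        % (self.type.__name__, value))

def foo(a: argtype(int)):
    return a * 2
```

A consumer would pull `foo.__annotations__['a']` and call its `coerce` method on each incoming argument, rather than guessing at the meaning of a bare `int`.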

I'll propose this as a new rule the PEP should define that annotation
handling libraries should not only avoid expecting instances of
built-in types as annotations (lists and strings, etc.) but also those
types themselves (using the int object itself as an annotation). It
may seem terribly convenient to use types directly, but its becoming
more and more apparent that all annotations should be wrapped in
something by which the meaning of the annotation can be reliably and
safely determined by its type, and no built-in type really does that
in an agreeable way.

Also, Collin Winter said:
> One particularly divisive issue is whether tuples should be treated as
> fixed- or arbitrary-length containers. Concretely, does
> "tuple(Number)" match only 1-tuples with a single Number element, or
> does it match all tuples that have only Number elements?

I would personally be completely averse to the use of any containers
as a way of meaning "this argument is a list/tuple of some specific
types". On one hand, this is the realm of the individual annotation
libraries, so it isn't even relevant to this conversation. However,
when it is done, a specific type to represent the concept would be
more prudent. For example, I would like to annotate with listOf(str,
int) or tupleOf(multiple(bool)) to mean "a list of a str and an int"
and "a tuple of multiple bool objects", respectively.
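A sketch of what such container annotations might look like (the `listOf` and `multiple` names follow the suggestion above; the checking semantics are assumptions):

```python
class multiple:
    """Marks 'zero or more of this type' inside a container spec."""
    def __init__(self, type_):
        self.type = type_

class listOf:
    """Annotation meaning: a list whose elements match the given
    specs positionally, or a single multiple(...) spec for
    arbitrary-length homogeneous lists."""
    def __init__(self, *specs):
        self.specs = specs

    def check(self, value):
        if not isinstance(value, list):
            return False
        if len(self.specs) == 1 and isinstance(self.specs[0], multiple):
            elem = self.specs[0].type
            return all(isinstance(v, elem) for v in value)
        return (len(value) == len(self.specs)
                and all(isinstance(v, s)
                        for v, s in zip(value, self.specs)))
```

Using a dedicated `listOf`/`multiple` type sidesteps the fixed- vs. arbitrary-length ambiguity entirely: the annotation says which it means.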

From greg.ewing at canterbury.ac.nz  Tue Aug 15 03:13:53 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 15 Aug 2006 13:13:53 +1200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060814002014.02dbe9d0@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060814002014.02dbe9d0@sparrow.telecommunity.com>
Message-ID: <44E11FD1.1020201@canterbury.ac.nz>

Phillip J. Eby wrote:

> It can't be a "separate program altogether", since to get at the 
> annotations, the  program must import the module that contains them.

Why? I can imagine something like a documentation
generator or static type checker that just parses the
source, being careful not to execute anything.

Also, even if it does work by importing the module,
how is the module being imported supposed to know
which annotation processor is going to be processing
its annotations, and therefore what generic methods
need to be overridden, and how to go about doing
that -- assuming there is no standardisation of any
sort?

--
Greg

From greg.ewing at canterbury.ac.nz  Tue Aug 15 03:19:26 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 15 Aug 2006 13:19:26 +1200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060814002138.02909ad0@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<5.1.1.6.0.20060814002138.02909ad0@sparrow.telecommunity.com>
Message-ID: <44E1211E.5040308@canterbury.ac.nz>

Phillip J. Eby wrote:

 > The examples there are very short
> and simple; in fact the complete Message implementation, including 
> imports and overload declarations is only *6 lines long*.
> 
> So, my only guess is that the people who looked at that skimmed right 
> past it, looking for something more complicated!

If it really is that short and simple, why not just post
the whole thing? Then there's no danger of anyone getting
lost in parts of the documentation they're not supposed
to be looking at.

--
Greg

From ncoghlan at gmail.com  Tue Aug 15 03:25:57 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 15 Aug 2006 11:25:57 +1000
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <ca471dc20608141108u118487ccw16cc8527c6f24744@mail.gmail.com>
References: <ca471dc20608140909o730ab1e0i86c6d562cfa90abd@mail.gmail.com>	<20060814172000.1717.863905740.divmod.quotient.22821@ohm>
	<ca471dc20608141108u118487ccw16cc8527c6f24744@mail.gmail.com>
Message-ID: <44E122A5.6090203@gmail.com>

Guido van Rossum wrote:
> On 8/14/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
>> On Mon, 14 Aug 2006 09:09:49 -0700, Guido van Rossum <guido at python.org> wrote:
>>> On 8/14/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
>>>> This is a joke, right?
>>> Because it's a good idea to have to write separate wrappers
>>> around every useful library for each dynamic languague separately?
>> If a project has done this successfully, I don't think I've seen it.  Can
>> you point out some examples where this has been accomplished in a useful
>> form?  The nearest thing I can think of is SWIG, which is basically a
>> failure.
> 
SWIG is not my favorite (mostly because I don't like C++ much) but
> it's used very effectively here at Google (for example); I wouldn't
> dream of calling it a failure.

I've found SWIG to be especially effective when using it to wrap a library I 
have control over, so I can tweak the interface to avoid stressing the code 
generator too much. Running it over arbitrary C libraries requires a fair bit 
of work defining the necessary typemaps (although you still have the benefit 
of writing the typemap for a given style of interface *once* instead of for 
every function that uses it).

However, in the context of this discussion, a SWIG-like tool that produced 
pure Python ctypes-based code would be a vast improvement. Taking the SWIG 
typemaps for the Python C API as a starting point, you could even do it with 
SWIG itself (rather than reinventing the wheel, as codegen's components for 
parsing C header files appear to do).

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From pje at telecommunity.com  Tue Aug 15 03:33:03 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Mon, 14 Aug 2006 21:33:03 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44E11FD1.1020201@canterbury.ac.nz>
References: <5.1.1.6.0.20060814002014.02dbe9d0@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060814002014.02dbe9d0@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060814212620.025d8f70@sparrow.telecommunity.com>

At 01:13 PM 8/15/2006 +1200, Greg Ewing wrote:
>Phillip J. Eby wrote:
>
>>It can't be a "separate program altogether", since to get at the 
>>annotations, the  program must import the module that contains them.
>
>Why? I can imagine something like a documentation
>generator or static type checker that just parses the
>source, being careful not to execute anything.

How is such a thing going to know what doc("foo") means at the time the 
code is run?  What about closures, dynamic imports, etc.?


>Also, even if it does work by importing the module,
>how is the module being imported supposed to know
>which annotation processor is going to be processing
>its annotations, and therefore what generic methods
>need to be overridden, and how to go about doing
>that -- assuming there is no standardisation of any
>sort?

Weak imports are a good solution for the case where interop is 
optional.  You do something like:

     @whenImported('some.doc.processor')
     def registerDocHandler(processor):
         @processor.someOverloadedFunction.when(SomeType)
         def handleTypeDefinedByThisModule(...):
             ...

The idea here being that the registration occurs if and only if the 
some.doc.processor module is imported during the lifetime of the 
program.  See http://cheeseshop.python.org/pypi/Importing for a package 
that contains a non-decorator version of this functionality.

Anyway, the idea here is that if you create a library with a bunch of 
annotation types in it, you use weak importing to optionally register 
handlers for whatever processors are out there that you want to 
support.  Also, other people can of course define their own third-party 
glue modules that provide this kind of support for some given combination 
of annotation types and processors.


From pje at telecommunity.com  Tue Aug 15 03:37:47 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Mon, 14 Aug 2006 21:37:47 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <44E1211E.5040308@canterbury.ac.nz>
References: <5.1.1.6.0.20060814002138.02909ad0@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<44DD5DF0.40405@acm.org>
	<5.1.1.6.0.20060812113118.0293d2d8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060812165132.0226e550@sparrow.telecommunity.com>
	<5.1.1.6.0.20060814002138.02909ad0@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060814213434.027739f0@sparrow.telecommunity.com>

At 01:19 PM 8/15/2006 +1200, Greg Ewing wrote:
>Phillip J. Eby wrote:
>
> > The examples there are very short
>>and simple; in fact the complete Message implementation, including 
>>imports and overload declarations is only *6 lines long*.
>>So, my only guess is that the people who looked at that skimmed right 
>>past it, looking for something more complicated!
>
>If it really is that short and simple, why not just post
>the whole thing? Then there's no danger of anyone getting
>lost in parts of the documentation they're not supposed
>to be looking at.

Here are the most relevant bits excerpted from the text:

To create a new kind of metadata, we need to create a class that represents
the metadata, and then add a method to  the ``binding.declareAttribute()``
generic function.  For our example, we'll create a ``Message`` metadata type
that just prints a message when the metadata is registered::

     >>> class Message(str):
     ...     pass

     >>> def print_message(classobj, attrname, metadata):
     ...     print metadata, "(%s.%s)" % (classobj.__name__, attrname)

     >>> binding.declareAttribute.addMethod(Message,print_message)

Now, we'll see if it works::

     >>> class Foo: pass
     >>> binding.declareAttribute(Foo, 'bar', Message("testing"))
     testing (Foo.bar)

In addition to defining your own metadata types, ``declareAttribute()`` has
built-in semantics for ``None`` and sequence types.  The former is a no-op, and
the latter re-invokes ``declareAttribute()`` on the sequence contents::

     >>> binding.declareAttribute(Foo, 'baz',
     ...     [Message('test1'), Message('test2')]
     ... )
     test1 (Foo.baz)
     test2 (Foo.baz)

     >>> binding.declareAttribute(Foo, 'spam', None)     # no-op


From tim.peters at gmail.com  Tue Aug 15 03:39:42 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Mon, 14 Aug 2006 21:39:42 -0400
Subject: [Python-3000] threading, part 2 --- + a bit of ctypes FFI worry
In-Reply-To: <ebq2ss$qhf$1@sea.gmane.org>
References: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
	<ebq2ss$qhf$1@sea.gmane.org>
Message-ID: <1f7befae0608141839vf45bddaw7a19701d766cb4af@mail.gmail.com>

[Tim Peters]
>> ...
>> When the ctypes docs talk about passing and returning integers, they
>> never explain what "integers" /means/, but it seems the docs
>> implicitly have a 32-bit-only view of the world here.  In reality
>> "integer" seems to mean the native C `int` type.

[Thomas Heller]
> 'ctypes.c_int' and 'ctypes.c_long' correspond to the C 'int' and 'long' types.

Sure, that's clear.  It's where the docs talk about (the unqualified)
"integers", and the quotes there aren't just to scare you ;-).  Like
in:

    http://starship.python.net/crew/theller/ctypes/tutorial.html

near the end of section "Calling functions":

    Python integers, strings and unicode strings are the only objects that can
    directly be used as parameters in these function calls.

What does the word "integers" /mean/ there?

> If you think that the docs could be clearer, please suggest changes.

I can't, because I don't know what was intended.  Python integers come
in two flavors, `int` and `long`, so I assumed at first that the
"Python integers" in the above probably meant "a Python (short) int"
(which is a C `long`).  But writing the thread test using that
assumption failed on some 64-bit buildbots.  After staring at the
specific ways it failed, my next guess was that by "Python integers"
the docs don't really mean Python integers at all, but C's `int`.
That's what convinced me to /try/ wrapping the thread id in
ctypes.c_long(), and the test problems went away then, so I did too
:-)

I searched all the docs for the word "integers" and never found out
what was intended.  So you could search the docs for the same thing.
Like, still in the tutorial, at the start of section "Return types":

    By default functions are assumed to return integers.

Or in the reference docs:

    Note that all these functions are assumed to return integers,
which is of course
    not always the truth, so you have to assign the correct restype
attribute to use
    these functions.

and the description of memmove():

    memmove(dst, src, count)

    Same as the standard C memmove library function: copies count bytes from
    src to dst. dst and src must be integers or ...

Python has at least three meanings for the word "integer" (short,
long, & "either"), and C has at least 10 (signed & unsigned char,
short, int, long, & long long), so the unqualified "integer" is highly
ambiguous.  While in many contexts that doesn't much matter, in ctypes
it does.
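
(The ambiguity is easy to make concrete with ctypes itself; exact sizes
are platform-dependent, so only the orderings below are guaranteed.)

```python
import ctypes

# c_int and c_long mirror the platform's C `int` and `long`.  On LP64
# systems (64-bit Linux/macOS) sizeof(long) == 8 while sizeof(int) == 4,
# which is exactly where an unqualified "integer" becomes ambiguous.
int_size = ctypes.sizeof(ctypes.c_int)
long_size = ctypes.sizeof(ctypes.c_long)

# C guarantees only the ordering of the widths, not concrete values:
assert ctypes.sizeof(ctypes.c_short) <= int_size <= long_size
assert long_size <= ctypes.sizeof(ctypes.c_longlong)

# Wrapping a value explicitly removes the ambiguity Tim describes:
tid = ctypes.c_long(123456789)
assert tid.value == 123456789
```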

From ncoghlan at gmail.com  Tue Aug 15 03:44:10 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 15 Aug 2006 11:44:10 +1000
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>	<ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>	<d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>
	<ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>
Message-ID: <44E126EA.5000200@gmail.com>

Guido van Rossum wrote:
>> It would be really nice in the example above to mark ``self`` in
>> ``__call__`` as a positional only argument.
> 
> But this is a rather unusual use case isn't it? It's due to the bound
> methods machinery. Do you have other use cases? I would assume that
> normally such wrappers take their own control arguments in the form of
> keyword-only arguments (that are unlikely to conflict with arguments
> of the wrapped method).
> 

I'd like a syntax or convention for it so I can document the signature of 
functions written in C that accept positional-only arguments using Python's 
own function definition notation ;)

I'd also like to be able to use it to say "I'm not sure about this parameter 
name yet, so don't rely on it staying the same!" while developing an API.

However, I'm also wondering if we need an actual syntax, or if a simple 
convention would do the trick: start the names of positional-only arguments 
with an underscore.

Then Steven's examples would become:

     >>> class Wrapper(object):
     ...     def __init__(self, func):
     ...         self.func = func
     ...     def __call__(_self, *args, **kwargs):
     ...         print 'calling wrapped function'
     ...         return self.func(*args, **kwargs)
     ...

     def failUnlessRaises(_self, _excClass, _callableObj, *args, **kwargs):

With the 'best practice' being that any function that accepts arbitrary kwargs 
should use an underscore on its named parameters.

The only way to screw the latter example up would be for a caller to do:

   self.failUnlessRaises(TypeError, my_func, _callableObj=foo)

And if the 'underscore indicates positional only' convention were adopted 
officially, it would be trivial for PyLint/PyChecker to flag any call that 
specifies a name starting with an underscore as a keyword argument.
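
(The collision the convention avoids can be sketched directly; ``Broken``,
``Fixed`` and ``show`` are illustrative names.)

```python
# With a plain `self` parameter, a caller's self= keyword argument
# collides with the method's own (already bound) first parameter:
class Broken:
    def call(self, func, **kwargs):
        return func(**kwargs)

class Fixed:
    def call(_self, _func, **kwargs):  # positional-only by convention
        return _func(**kwargs)

def show(**kwargs):
    return sorted(kwargs)

try:
    Broken().call(show, self=1)  # 'self' given twice -> TypeError
    collided = False
except TypeError:
    collided = True

assert collided
assert Fixed().call(show, self=1) == ["self"]  # passes through cleanly
```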

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Tue Aug 15 03:49:05 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 15 Aug 2006 11:49:05 +1000
Subject: [Python-3000] Python/C++ question
In-Reply-To: <ebqi1f$80m$2@sea.gmane.org>
References: <44DA6C01.2040904@acm.org>	<ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>	<44DF0800.4060204@acm.org>	<ebn5ms$mne$1@sea.gmane.org>	<ca471dc20608141113y15e6ba9u3ea405905a0ca0ad@mail.gmail.com>
	<ebqi1f$80m$2@sea.gmane.org>
Message-ID: <44E12811.5090709@gmail.com>

Georg Brandl wrote:
> Guido van Rossum wrote:
>> Implementation Language
>> ==================
>>
>> Python 3000 will be implemented in C, and the implementation will be
>> derived as an evolution of the Python 2 code base. This reflects my
>> views (which I share with Joel Spolsky) on the dangers of complete
>> rewrites. Since Python 3000 as a language is a relatively mild
>> improvement on Python 2, we can gain a lot by not attempting to
>> reimplement the language from scratch. I am not against parallel
>> from-scratch implementation efforts, but my own efforts will be
>> directed at the language and implementation that I know best.
> 
> I had already added something to PEP 3099, but if you like that approach
> better, I'll add that to PEP 3000.

You can always keep both :)

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From tjreedy at udel.edu  Tue Aug 15 03:53:27 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Mon, 14 Aug 2006 21:53:27 -0400
Subject: [Python-3000] Python/C++ question
References: <44DA6C01.2040904@acm.org>	<ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>	<44DF0800.4060204@acm.org><ebn5ms$mne$1@sea.gmane.org><ca471dc20608141113y15e6ba9u3ea405905a0ca0ad@mail.gmail.com>
	<ebqi1f$80m$2@sea.gmane.org>
Message-ID: <ebr9eq$cfc$1@sea.gmane.org>


"Georg Brandl" <g.brandl at gmx.net> wrote in message 
news:ebqi1f$80m$2 at sea.gmane.org...
> Guido van Rossum wrote:
>> Implementation Language
>> ==================
>>
>> Python 3000 will be implemented in C, and the implementation will be
>> derived as an evolution of the Python 2 code base. This reflects my
>> views (which I share with Joel Spolsky) on the dangers of complete
>> rewrites. Since Python 3000 as a language is a relatively mild
>> improvement on Python 2, we can gain a lot by not attempting to
>> reimplement the language from scratch. I am not against parallel
>> from-scratch implementation efforts, but my own efforts will be
>> directed at the language and implementation that I know best.
>
> I had already added something to PEP 3099, but if you like that approach
> better, I'll add that to PEP 3000.

Please add this.  It clearly says what and why, and will answer questions 
that are sure to come.  I would leave a comment in 3099 also.

tjr




From alexander.belopolsky at gmail.com  Tue Aug 15 04:15:36 2006
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Mon, 14 Aug 2006 22:15:36 -0400
Subject: [Python-3000] [Python-Dev] Type of range object members
In-Reply-To: <44E12B0A.9020907@gmail.com>
References: <d38f5330608141608o7c748a20ka1daa2504896b213@mail.gmail.com>
	<ca471dc20608141632r173e8d60o20af3787b7efbc9d@mail.gmail.com>
	<44E12B0A.9020907@gmail.com>
Message-ID: <2DA248BC-5534-4CE5-A9C8-84259E8A71B2@local>


On Aug 14, 2006, at 10:01 PM, Nick Coghlan wrote:

> Guido van Rossum wrote:
>> Methinks that as long as PyIntObject uses long (see intobject.h)
>> there's no point in changing this to long.
>
> Those fields are going to have to change to PyObject* eventually  
> if xrange() is going to become the range() replacement in Py3k. . .
>

In this case it will become indistinguishable from

typedef struct {
     PyObject_HEAD
     PyObject *start, *stop, *step;      /* not NULL */
} PySliceObject;

See sliceobject.h.  Would it make sense to unify rangeobject with  
PySliceObject?


From ncoghlan at gmail.com  Tue Aug 15 04:34:18 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 15 Aug 2006 12:34:18 +1000
Subject: [Python-3000] [Python-Dev] Type of range object members
In-Reply-To: <2DA248BC-5534-4CE5-A9C8-84259E8A71B2@local>
References: <d38f5330608141608o7c748a20ka1daa2504896b213@mail.gmail.com>
	<ca471dc20608141632r173e8d60o20af3787b7efbc9d@mail.gmail.com>
	<44E12B0A.9020907@gmail.com>
	<2DA248BC-5534-4CE5-A9C8-84259E8A71B2@local>
Message-ID: <44E132AA.1020900@gmail.com>

Alexander Belopolsky wrote:
> 
> On Aug 14, 2006, at 10:01 PM, Nick Coghlan wrote:
> 
>> Guido van Rossum wrote:
>>> Methinks that as long as PyIntObject uses long (see intobject.h)
>>> there's no point in changing this to long.
>>
>> Those fields are going to have to change to PyObject* eventually if 
>> xrange() is going to become the range() replacement in Py3k. . .
>>
> 
> In this case it will become indistinguishable from
> 
> typedef struct {
>     PyObject_HEAD
>     PyObject *start, *stop, *step;      /* not NULL */
> } PySliceObject;
> 
> See sliceobject.h .  Would it make sense to unify rangeobject with 
> PySliceObject?
> 

Not really. The memory layouts may end up being the same in Py3k, but they're 
still different types. The major differences between the two types just happen 
to lie in the methods they support (as defined by the value of the type 
pointer in PyObject_HEAD), rather than the data they contain.

Besides, the range object may actually keep the current optimised behaviour 
for dealing with PyInt values, only falling back to PyObject* if one of start, 
stop or step was too large to fit into a PyInt.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From greg.ewing at canterbury.ac.nz  Tue Aug 15 04:49:42 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 15 Aug 2006 14:49:42 +1200
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060814212620.025d8f70@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060814002014.02dbe9d0@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060811224658.0226da70@sparrow.telecommunity.com>
	<5.1.1.6.0.20060814002014.02dbe9d0@sparrow.telecommunity.com>
	<5.1.1.6.0.20060814212620.025d8f70@sparrow.telecommunity.com>
Message-ID: <44E13646.9020709@canterbury.ac.nz>

Phillip J. Eby wrote:

> How is such a thing going to know what doc("foo") means at the time the 
> code is run?  What about closures, dynamic imports, etc.?

Annotations intended for such external processors would
have to be designed not to rely on anything dynamic,
i.e. be purely declarative.

Maybe this is why we're having trouble communicating.
You seem to be thinking of annotations purely as
dynamic things that affect the execution of the
program. I'm thinking of them as something that will
just as likely be used in a declarative way, possibly
by tools that don't execute the code at all, but do
something entirely different with it.

> Weak imports are a good solution for the case where interop is 
> optional.  You do something like:
> 
>     @whenImported('some.doc.processor')
>     def registerDocHandler(processor):
>         @processor.someOverloadedFunction.when(SomeType)
>         def handleTypeDefinedByThisModule(...):
>             ...

But this requires the module using the annotations to
anticipate all the processors that will potentially
process its annotations, and teach each one of them
about itself.

> Also, other people can of course define their own third-party
> glue modules that provide this kind of support for some given 
> combination of annotation types and processors.

I don't see how a third party can do this, because only the
module containing the annotations can know what idiosynchratic
scheme it's chosen for combining them.

--
Greg




From guido at python.org  Tue Aug 15 04:58:12 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 19:58:12 -0700
Subject: [Python-3000] Python/C++ question
In-Reply-To: <ebr9eq$cfc$1@sea.gmane.org>
References: <44DA6C01.2040904@acm.org>
	<ca471dc20608091618k3cffb3ewa3029794f0f02761@mail.gmail.com>
	<44DF0800.4060204@acm.org> <ebn5ms$mne$1@sea.gmane.org>
	<ca471dc20608141113y15e6ba9u3ea405905a0ca0ad@mail.gmail.com>
	<ebqi1f$80m$2@sea.gmane.org> <ebr9eq$cfc$1@sea.gmane.org>
Message-ID: <ca471dc20608141958k2e88485r85a5c80cce5b561@mail.gmail.com>

+1

On 8/14/06, Terry Reedy <tjreedy at udel.edu> wrote:
>
> "Georg Brandl" <g.brandl at gmx.net> wrote in message
> news:ebqi1f$80m$2 at sea.gmane.org...
> > Guido van Rossum wrote:
> >> Implementation Language
> >> ==================
> >>
> >> Python 3000 will be implemented in C, and the implementation will be
> >> derived as an evolution of the Python 2 code base. This reflects my
> >> views (which I share with Joel Spolsky) on the dangers of complete
> >> rewrites. Since Python 3000 as a language is a relatively mild
> >> improvement on Python 2, we can gain a lot by not attempting to
> >> reimplement the language from scratch. I am not against parallel
> >> from-scratch implementation efforts, but my own efforts will be
> >> directed at the language and implementation that I know best.
> >
> > I had already added something to PEP 3099, but if you like that approach
> > better, I'll add that to PEP 3000.
>
> Please add this.  It clearly says what and why and  will answer questions
> that are sure to come.  I would leave a comment in 3099 also.
>
> tjr
>
>
>
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Tue Aug 15 05:00:32 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 20:00:32 -0700
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <44E126EA.5000200@gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>
	<ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>
	<d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>
	<ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>
	<44E126EA.5000200@gmail.com>
Message-ID: <ca471dc20608142000i6879c970u9de31abe178434a9@mail.gmail.com>

On 8/14/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
> However, I'm also wondering if we need an actual syntax, or if a simple
> convention would do the trick: start the names of positional-only arguments
> with an underscore.

Hm... and perhaps we could forbid keyword arguments starting with an
underscore in the call syntax?
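
(A hypothetical runtime sketch of that restriction; a real version would
live in the call syntax or a linter, and all names below are illustrative.)

```python
import functools

# Reject any keyword argument whose name starts with an underscore,
# i.e. treat underscore-named parameters as positional-only.
def no_underscore_keywords(func):
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        for name in kwargs:
            if name.startswith("_"):
                raise TypeError(
                    "%s: %r is positional-only" % (func.__name__, name))
        return func(*args, **kwargs)
    return wrapper

@no_underscore_keywords
def fail_unless_raises(_excClass, _callableObj, *args, **kwargs):
    try:
        _callableObj(*args, **kwargs)
    except _excClass:
        return True
    raise AssertionError("did not raise")

assert fail_unless_raises(ZeroDivisionError, lambda: 1 / 0)

try:
    fail_unless_raises(ValueError, int, _callableObj=int)
    rejected = False
except TypeError:
    rejected = True
assert rejected
```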

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Tue Aug 15 05:01:57 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 20:01:57 -0700
Subject: [Python-3000] threading, part 2 --- + a bit of ctypes FFI worry
In-Reply-To: <1f7befae0608141839vf45bddaw7a19701d766cb4af@mail.gmail.com>
References: <1f7befae0608120329wc646164w25ca4875da4cc5c0@mail.gmail.com>
	<ebq2ss$qhf$1@sea.gmane.org>
	<1f7befae0608141839vf45bddaw7a19701d766cb4af@mail.gmail.com>
Message-ID: <ca471dc20608142001n68a9b560p5d788391629640a4@mail.gmail.com>

Perhaps this thread can be moved back to python-dev? I'm not sure how
relevant a discussion of the ambiguities in ctypes' docs are for Py3k.

On 8/14/06, Tim Peters <tim.peters at gmail.com> wrote:
> [Tim Peters]
> >> ...
> >> When the ctypes docs talk about passing and returning integers, they
> >> never explain what "integers" /means/, but it seems the docs
> >> implicitly have a 32-bit-only view of the world here.  In reality
> >> "integer" seems to mean the native C `int` type.
>
> [Thomas Heller]
> > 'ctypes.c_int' and 'ctypes.c_long' correspond to the C 'int' and 'long' types.
>
> Sure, that's clear.  It's where the docs talk about (the unqualified)
> "integers", and the quotes there aren't just to scare you ;-).  Like
> in:
>
>     http://starship.python.net/crew/theller/ctypes/tutorial.html
>
> near the end of section "Calling functions":
>
>     Python integers, strings and unicode strings are the only objects that can
>     directly be used as parameters in these function calls.
>
> What does the word "integers" /mean/ there?
>
> > If you think that the docs could be clearer, please suggest changes.
>
> I can't, because I don't know what was intended.  Python integers come
> in two flavors, `int` and `long`, so I assumed at first that the
> "Python integers" in the above probably meant "a Python (short) int"
> (which is a C `long`).  But writing the thread test using that
> assumption failed on some 64-bit buildbots.  After staring at the
> specific ways it failed, my next guess was that by "Python integers"
> the docs don't really mean Python integers at all, but C's `int`.
> That's what convinced me to /try/ wrapping the thread id in
> ctypes.c_long(), and the test problems went away then, so I did too
> :-)
>
> I searched all the docs for the word "integers" and never found out
> what was intended.  So you could search the docs for the same thing.
> Like, still in the tutorial, at the start of section "Return types":
>
>     By default functions are assumed to return integers.
>
> Or in the reference docs:
>
>     Note that all these functions are assumed to return integers,
> which is of course
>     not always the truth, so you have to assign the correct restype
> attribute to use
>     these functions.
>
> and the description of memmove():
>
>     memmove(dst, src, count)
>
>     Same as the standard C memmove library function: copies count bytes from
>     src to dst. dst and src must be integers or ...
>
> Python has at least three meanings for the word "integer" (short,
> long, & "either"), and C has at least 10 (signed & unsigned char,
> short, int, long, & long long), so the unqualified "integer" is highly
> ambiguous.  While in many contexts that doesn't much matter, in ctypes
> it does.


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Tue Aug 15 05:04:27 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 14 Aug 2006 20:04:27 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
References: <mailman.34357.1155590629.27774.python-3000@python.org>
	<5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
Message-ID: <ca471dc20608142004t7f0eb56cj6efa40947504dc01@mail.gmail.com>

On 8/14/06, Phillip J. Eby <pje at telecommunity.com> wrote:
> At 1:51 PM 8/14/2006 -0700, "Paul Prescod" <paul at prescod.net> wrote:
> >On 8/14/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> > > The definition of a type as an annotation should probably be either
> > > defined or explicitly undefined.  Earlier discussions talked about
> > > things like
> > >
> > >     def f (a:int, b:(float | Decimal), c:[int, str, X]) ->str)
> >
> >
> >I think that's a separate (large!) PEP. This PEP should disallow frameworks
> >from inventing their own meaning for this syntax (requiring them to at least
> >wrap). Then Guido and crew can dig into this issue on their own schedule.
>
> I see we haven't made nearly as much progress on the concept of "no
> predefined semantics" as I thought we had.  :(
>
> i.e., -1 on constraining what types mean.

Haven't I said that the whole time? I *thought* that Collin's PEP
steered clear from the topic too. At the same time, does this preclude
having some kind of "default" type notation in the standard library?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From steven.bethard at gmail.com  Tue Aug 15 05:11:40 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Mon, 14 Aug 2006 21:11:40 -0600
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <ca471dc20608142000i6879c970u9de31abe178434a9@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>
	<ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>
	<d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>
	<ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>
	<44E126EA.5000200@gmail.com>
	<ca471dc20608142000i6879c970u9de31abe178434a9@mail.gmail.com>
Message-ID: <d11dcfba0608142011t5bdce999r2f547c3d9fe53fcb@mail.gmail.com>

[Steven Bethard]
> It would be really nice in the example above to mark ``self`` in
> ``__call__`` as a positional only argument.

[Nick Coghlan]
> However, I'm also wondering if we need an actual syntax, or if a simple
> convention would do the trick: start the names of positional-only arguments
> with an underscore.

That would certainly be good enough for me.  As long as it's
documented and there's somewhere to point to when someone does it
wrong, it solves my problem.

[Guido van Rossum]
> Hm... and perhaps we could forbid keyword arguments starting with an
> underscore in the call syntax?

-0.  As long as the convention exists somewhere, I don't think this
buys us too much. I think supplying a keyword argument when you should
be using a positional is about the same level of
willing-to-shoot-yourself-in-the-foot as using attributes that are
supposed to be private (the other place where leading underscores are
suggested).

Steve
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From aahz at pythoncraft.com  Tue Aug 15 05:37:59 2006
From: aahz at pythoncraft.com (Aahz)
Date: Mon, 14 Aug 2006 20:37:59 -0700
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <ca471dc20608142000i6879c970u9de31abe178434a9@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>
	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>
	<ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>
	<d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>
	<ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>
	<44E126EA.5000200@gmail.com>
	<ca471dc20608142000i6879c970u9de31abe178434a9@mail.gmail.com>
Message-ID: <20060815033759.GA4078@panix.com>

On Mon, Aug 14, 2006, Guido van Rossum wrote:
> On 8/14/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
>>
>> However, I'm also wondering if we need an actual syntax, or if a simple
>> convention would do the trick: start the names of positional-only arguments
>> with an underscore.
> 
> Hm... and perhaps we could forbid keyword arguments starting with an
> underscore in the call syntax?

Do you mean forbid by convention or syntactically?  I'm -1 on the latter;
that would be far too much gratuitous code breakage.
-- 
Aahz (aahz at pythoncraft.com)           <*>         http://www.pythoncraft.com/

"Debugging is twice as hard as writing the code in the first place.
Therefore, if you write the code as cleverly as possible, you are, by
definition, not smart enough to debug it."  --Brian W. Kernighan

From alexander.belopolsky at gmail.com  Tue Aug 15 06:15:54 2006
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Tue, 15 Aug 2006 00:15:54 -0400
Subject: [Python-3000] [Python-Dev] Type of range object members
In-Reply-To: <44E132AA.1020900@gmail.com>
References: <d38f5330608141608o7c748a20ka1daa2504896b213@mail.gmail.com>
	<ca471dc20608141632r173e8d60o20af3787b7efbc9d@mail.gmail.com>
	<44E12B0A.9020907@gmail.com>
	<2DA248BC-5534-4CE5-A9C8-84259E8A71B2@local>
	<44E132AA.1020900@gmail.com>
Message-ID: <F01A6726-B65D-42CF-9449-59FD851DC0A1@local>


On Aug 14, 2006, at 10:34 PM, Nick Coghlan wrote:

> Alexander Belopolsky wrote:
[snip]
>> Would it make sense to unify rangeobject with PySliceObject?
>
> Not really. The memory layouts may end up being the same in Py3k,  
> but they're still different types. The major differences between  
> the two types just happen to lie in the methods they support (as  
> defined by the value of the type pointer in PyObject_HEAD), rather  
> than the data they contain.

The slice objects support a single method, "indices", which I have to  
admit I had not seen until a minute ago.  (I grepped through the  
standard library and did not see it used anywhere.)  The slice  
attributes start/stop/step are probably more useful, but I don't see  
why those cannot be added to the range object.
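
(For reference, the slice API under discussion; this is standard Python
behaviour.)

```python
# slice objects expose start/stop/step plus the indices() method:
# indices(length) clips the slice to a sequence of the given length
# and normalises negative/None values.
s = slice(1, 10, 2)
assert (s.start, s.stop, s.step) == (1, 10, 2)
assert s.indices(5) == (1, 5, 2)   # stop clipped to the length

r = slice(None, None, -1)          # the [::-1] "reverse" slice
assert r.indices(5) == (4, -1, -1)

# range(*s.indices(n)) yields exactly the indices the slice selects:
assert list(range(*s.indices(5))) == [1, 3]
```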

>
> Besides, the range object may actually keep the current optimised  
> behaviour for dealing with PyInt values, only falling back to  
> PyObject* if one of start, stop or step was too large to fit into a  
> PyInt.


How would that hurt reusing them for slicing?

From jcarlson at uci.edu  Tue Aug 15 07:43:17 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Mon, 14 Aug 2006 22:43:17 -0700
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <20060815033759.GA4078@panix.com>
References: <ca471dc20608142000i6879c970u9de31abe178434a9@mail.gmail.com>
	<20060815033759.GA4078@panix.com>
Message-ID: <20060814223931.19BD.JCARLSON@uci.edu>


Aahz <aahz at pythoncraft.com> wrote:
> 
> On Mon, Aug 14, 2006, Guido van Rossum wrote:
> > On 8/14/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
> >>
> >> However, I'm also wondering if we need an actual syntax, or if a simple
> >> convention would do the trick: start the names of positional-only arguments
> >> with an underscore.
> > 
> > Hm... and perhaps we could forbid keyword arguments starting with an
> > underscore in the call syntax?
> 
> Do you mean forbid by convention or syntactically?  I'm -1 on the latter;
> that would be far too much gratuitous code breakage.

At least 40 examples of it being used in a keyword argument in the 2.5b2
standard library (so sayeth my regular expression of '\((.*?\s)?_\w*=' ).

 - Josiah


From ncoghlan at gmail.com  Tue Aug 15 08:44:25 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 15 Aug 2006 16:44:25 +1000
Subject: [Python-3000] PEP3102 Keyword-Only Arguments
In-Reply-To: <d11dcfba0608142011t5bdce999r2f547c3d9fe53fcb@mail.gmail.com>
References: <b008462b0608111620q709e691fqdb6283b194e1a893@mail.gmail.com>	
	<fb6fbf560608122056w5a9af394ga358614c0d8d10d7@mail.gmail.com>	
	<ca471dc20608141038w55d67754s9407f52eaa5ce64b@mail.gmail.com>	
	<d11dcfba0608141049p24c03471k3c0252bd188ee5e7@mail.gmail.com>	
	<ca471dc20608141104i154efbfehf88e8f10f7877ea8@mail.gmail.com>	
	<44E126EA.5000200@gmail.com>	
	<ca471dc20608142000i6879c970u9de31abe178434a9@mail.gmail.com>
	<d11dcfba0608142011t5bdce999r2f547c3d9fe53fcb@mail.gmail.com>
Message-ID: <44E16D49.2010601@gmail.com>

Steven Bethard wrote:
> [Steven Bethard]
>> It would be really nice in the example above to mark ``self`` in
>> ``__call__`` as a positional only argument.
> 
> [Nick Coghlan]
>> However, I'm also wondering if we need an actual syntax, or if a simple
>> convention would do the trick: start the names of positional-only 
>> arguments
>> with an underscore.
> 
> That would certainly be good enough for me.  As long as it's
> documented and there's somewhere to point to when someone does it
> wrong, it solves my problem.

Putting something in PEP 8's section on naming conventions should do the trick 
(along with updating the standard library so that things like UserDict that 
accept arbitrary **kwargs use it for their positional arguments).

That would also serve as a reminder that the support for keyword arguments 
means that the parameter *names* are part of the public interface of a Python 
function along with their positions and types.

> 
> [Guido van Rossum]
>> Hm... and perhaps we could forbid keyword arguments starting with an
>> underscore in the call syntax?
> 
> -0.  As long as the convention exists somewhere, I don't think this
> buys us too much. I think supplying a keyword argument when you should
> be using a positional is about the same level of
> willing-to-shoot-yourself-in-the-foot as using attributes that are
> supposed to be private (the other place where leading underscores are
> suggested).

That's exactly the comparison I was aiming for - you *can* if you really have 
to, but you also *shouldn't*. And if you do, you'd better include a comment 
explaining why you have to if you don't want any reviewers complaining about it ;)

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From paul at prescod.net  Tue Aug 15 15:56:18 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 06:56:18 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
References: <mailman.34357.1155590629.27774.python-3000@python.org>
	<5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
Message-ID: <1cb725390608150656o2587c0ddx2af8e7df80f8e7b8@mail.gmail.com>

On 8/14/06, Phillip J. Eby <pje at telecommunity.com> wrote:
>
> At 1:51 PM 8/14/2006 -0700, "Paul Prescod" <paul at prescod.net> wrote:
> >On 8/14/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> > > The definition of a type as an annotation should probably be either
> > > defined or explicitly undefined.  Earlier discussions talked about
> > > things like
> > >
> > >     def f (a:int, b:(float | Decimal), c:[int, str, X]) ->str)
> >
> >
> >I think that's a separate (large!) PEP. This PEP should disallow
> frameworks
> >from inventing their own meaning for this syntax (requiring them to at
> least
> >wrap). Then Guido and crew can dig into this issue on their own schedule.
>
> I see we haven't made nearly as much progress on the concept of "no
> predefined semantics" as I thought we had.  :(
>


i.e., -1 on constraining what types mean.
>
>
I don't understand what you're saying.

1. Do you (still?) agree that the meaning of the list type should be defined
as a semantically neutral container for other annotations?

2. Do you (still?) agree that the meanings of ALL built-in types at the
top-level should be reserved for the Python language designers and should
not be randomly used by framework developers. In other words: the function
type declaration syntax above should not be used by one third party type
checker while another third-party type checker uses the same structure to
mean something totally different. Note that I don't mind if they have
conflicting semantics for the same expression as long as the end-user is
forced to declare which semantic model they are using:

tc = typechecker.typecheck
tl = typelinter.check_types

def f (a:tc(int),
        b:tc(float | Decimal),
        c:tc([int, str, X])) -> tc(str)

def g (a:tl(int),
         b:tl(float | Decimal),
         c:tl([int, str, X])) -> tl(str)

3. Do you agree that 1. and 2. together promote the experimentation and
variety that we need?

def f (a: [tc(int), tl("Integer")],
        b: [tc(float | Decimal), tl(Or("float", "Decimal"))],
        c: [tc([int, str, X]), tl(listOf("Integer", "string", "X"))]) ->
                        [tc(str), tl(str)]

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060815/31a4e803/attachment.htm 

From paul at prescod.net  Tue Aug 15 16:04:19 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 07:04:19 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <ca471dc20608142004t7f0eb56cj6efa40947504dc01@mail.gmail.com>
References: <mailman.34357.1155590629.27774.python-3000@python.org>
	<5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
	<ca471dc20608142004t7f0eb56cj6efa40947504dc01@mail.gmail.com>
Message-ID: <1cb725390608150704x4ef5a9abm532cd7ebaae511d@mail.gmail.com>

On 8/14/06, Guido van Rossum <guido at python.org> wrote:
>
>
> Haven't I said that the whole time? I *thought* that Collin's PEP
> steered clear from the topic too. At the same time, does this preclude
> having some kind of "default" type notation in the standard library?


The PEP steered TOO far clear of this topic. If it is a total free-for-all,
then when and if we do come up with a standard syntax (whether in 2006 or
2010) it will conflict with deployed code that used the same syntax to mean
something different than the standard. And even if there is never, ever,
going to be a standard, it must be possible for tools reading the
annotations to know whether the user intended their markup to conform to
metadata syntax 1, where "int" means "32-bit int", or metadata syntax 2,
where it means "arbitrary-sized int". Similarly, they must know whether the
annotator intended to use metadata syntax 1, where "tuple" means "fixed
size, heterogeneous", or syntax 2, where it means "immutable list".

Finally, there must be a standard way for attaching more than one annotation
to a single parameter. The PEP did not define a syntax for that.

I think that there must be enough standardized infrastructure that
annotation processors can recognize the annotations that are applicable to
them and act on them, even if the user has chosen to use more than one
annotation scheme.
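The "recognize the annotations applicable to them" idea can be sketched
concretely. The wrapper class names below (TypeCheckAnno, DocAnno,
annotations_for) are hypothetical illustrations, not part of any proposal
in this thread; the convention assumed is the one discussed here, where a
bare list is a semantically neutral container for other annotations:

```python
# Hypothetical sketch: each annotation scheme ships one wrapper class,
# and a consumer picks out only the instances it understands.

class TypeCheckAnno:          # hypothetical wrapper for a type checker
    def __init__(self, spec):
        self.spec = spec

class DocAnno:                # hypothetical wrapper for a doc tool
    def __init__(self, text):
        self.text = text

def annotations_for(annotation, wrapper_cls):
    """Return the sub-annotations belonging to one consumer.

    A bare list is treated as a neutral container of annotations,
    per the convention discussed in this thread."""
    if isinstance(annotation, list):
        return [a for a in annotation if isinstance(a, wrapper_cls)]
    if isinstance(annotation, wrapper_cls):
        return [annotation]
    return []

# One parameter carries two schemes' annotations; each tool sees only its own.
anno = [TypeCheckAnno(int), DocAnno("frobnication count")]
print(annotations_for(anno, TypeCheckAnno))
print(annotations_for(anno, DocAnno))
```

A tool that does not know a wrapper class simply gets an empty list back,
which matches the "ignore annotations you don't recognize" convention.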

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060815/b8793f1c/attachment.html 

From collinw at gmail.com  Tue Aug 15 16:06:28 2006
From: collinw at gmail.com (Collin Winter)
Date: Tue, 15 Aug 2006 09:06:28 -0500
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
Message-ID: <43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>

On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
> On 8/14/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> > On 8/14/06, Collin Winter <collinw at gmail.com> wrote:
> > > The problem with using lists is that it's impossible for non-decorator
> > > annotation consumers to know which element "belongs" to them.
> >
> > The ones whose type they own -- which is why I see at least some
> > parallel to exceptions, and its inheritance based semantics.
> >
> >     def f(a:[mytype("asdfljasdf"),
> >              zope.mypackage.something(b,d,e),
> >              "a string",
> >              mytype([47]),
> >              15):
> >
> > Whoever defined mytype controls the meaning of the mytype annotations;
> > anyone not familiar with that package should ignore them (and hope
> > there were no side effects in the expressions that generated them).
> >
> > zope.mypackage controls that annotation; anyone not familiar with that
> > product should ignore it (and hope there were no side effects ...)
>
> As hideous as I think this is from an aesthetics/visual noise
> standpoint, it's probably the only reliable way to let both decorator-
> and non-decorator-based consumers work.

I've changed my mind. This idea isn't going to work at all.

The sticking point is that while this might allow decorator and
non-decorator-based consumers to operate side-by-side *within a single
program*, it makes it impossible for things like pychecker or an
optimising compiler to take advantage of the annotations.

Here's another stab at my earlier idea:

Here's the modified example:

@docstring
@typechecker
@constrain_values
def foo(a: {'doc': "Frobnication count",
           'type': Number,
           'constrain_values': range(3, 9)},
       b: {'type': Number,
            # This can be only 4, 8 or 12
           'constrain_values': [4, 8, 12]}) -> {'type': Number}


We're still using dicts to hold the annotations, but instead of having
the dict keyed on the name (function.__name__) of the annotation
consumer, the keys are arbitrary (for certain values of "arbitrary").
To enable both in-program and static analysis, the most prominent keys
will be specified by the PEP. In this example, "type" and "doc" are
reserved keys; anything that needs the intended type of an annotation
will look at the "type" key, anything that's looking for special doc
strings will look at the "doc" key. Any other consumers are free to
define whatever keys they want (e.g., "constrain_values", above), so
long as they stay away from the reserved strings.

The dict form will be required, even if there's only one type of
annotation. To modify the example above to only use typechecker(),

@typechecker
def foo(a: {'type': Number},
       b: {'type': Number}) -> {'type': Number}
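Since the proposed syntax doesn't exist yet, the dict-keyed convention can
be simulated today with a decorator. Everything below (the `annotate`
helper, the `__annotations_sim__` attribute) is a hypothetical stand-in for
illustration only; the point shown is that a consumer reads just its own
reserved key ('type' here) and ignores the rest:

```python
# Illustrative sketch of the dict-keyed convention: each parameter's
# annotation is a dict, and each consumer reads only its reserved key.

def annotate(**arg_annos):
    """Simulate `def foo(a: {...})` by stashing annotation dicts."""
    def deco(func):
        func.__annotations_sim__ = arg_annos   # hypothetical attribute
        return func
    return deco

def typechecker(func):
    """Consumer: enforce the 'type' key at call time, ignore other keys."""
    annos = getattr(func, '__annotations_sim__', {})
    def wrapper(**kwargs):
        for name, value in kwargs.items():
            expected = annos.get(name, {}).get('type')
            if expected is not None and not isinstance(value, expected):
                raise TypeError("%s must be %s" % (name, expected.__name__))
        return func(**kwargs)
    return wrapper

@typechecker
@annotate(a={'type': int, 'doc': "Frobnication count"},
          b={'type': int})
def foo(a, b):
    return a + b

print(foo(a=1, b=2))   # passes the check
```

A docstring consumer would read only the 'doc' key of the same dicts and
never touch 'type', which is how the scheme lets consumers coexist.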


I'm going to raise the bar for future ideas on this subject: any
proposals must be able to address the following use cases:

1) Static analysis tools (pychecker, optimising compilers, etc) must
be able to use the annotations
2) Decorator-based annotation consumers must be able to use the annotations
3) Non-decorator-based annotation consumers (pydoc, etc) must be able
to use the annotations

Proposals that do not address all of these will not be considered.

Collin Winter

From p.f.moore at gmail.com  Tue Aug 15 16:38:34 2006
From: p.f.moore at gmail.com (Paul Moore)
Date: Tue, 15 Aug 2006 15:38:34 +0100
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
Message-ID: <79990c6b0608150738p37debcf4qfa97400d9c17ba52@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> Here's the modified example
>
> @docstring
> @typechecker
> @constrain_values
> def foo(a: {'doc': "Frobnication count",
>            'type': Number,
>            'constrain_values': range(3, 9)},
>        b: {'type': Number,
>             # This can be only 4, 8 or 12
>            'constrain_values': [4, 8, 12]}) -> {'type': Number}
>

I've been keeping out of this - I haven't followed the discussions,
and I am certainly not up to speed on the various subtleties, but
*surely* there's no intention that a monstrosity like this would count
as a "normal" function definition in Py3K???!!!!

> I'm going to raise the bar for future ideas on this subject: any
> proposals must be able to address the following use cases:
[...]
> Proposals that do not address all of these will not be considered.

Can I suggest a further constraint - anything that results in the
definition of a simple 2-argument function not fitting on a single
source line is probably unworkable in practice?

Paul.

From paul at prescod.net  Tue Aug 15 17:18:31 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 08:18:31 -0700
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
Message-ID: <1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>

I totally do not understand the requirement for the dictionary and its extra
overhead.

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
>
>
> @typechecker
> def foo(a: {'type': Number},
>        b: {'type': Number}) -> {'type': Number}
>
>
> I'm going to raise the bar for future ideas on this subject: any
> proposals must be able to address the following use cases:
>
> 1) Static analysis tools (pychecker, optimising compilers, etc) must
> be able to use the annotations
> 2) Decorator-based annotation consumers must be able to use the
> annotations
> 3) Non-decorator-based annotation consumers (pydoc, etc) must be able
> to use the annotations


Consider the following syntax:

class MyType:
   def __init__(self, name):
        self.name = name

Number = MyType("Number")
Tuple = MyType("Tuple")

def foo(a: tc(Number)) -> Tuple(Number, Number)

1. Static analysis tools can deal with this as much as with ANY truly
Pythonic syntax. Their ability to deal will depend (as in any syntax) on
their ability to do module or whole-program analysis. In your syntax, or
mine, "Number" could be defined dynamically. In either case, someone could
say "Number = None" and confuse everything.

2. A decorator-based analysis could look at __signature__ and do what it
needs.

3. Similarly for non-decorator analyzers.

In fact, given that decorators are just syntactic sugar for function calls,
I don't see why they should require special consideration at all. If the
syntax works well for non-decorator consumers then decorators will be just a
special case. As far as static analysis tools: Python has never made major
concessions to them. Minor concessions, yes.

I'd ask that you add the following requirement:

 * must define how multiple annotation syntaxes can assign potentially
differing meanings to built-in types and objects, on the same parameter,
without actually conflicting

My full program (meeting all requirements) follows.

 Paul Prescod

====

def simulate_signature(sig):
    "simulates the signature feature of Python 3000"
    def _(func):
        func.__signature__ = sig
        return func
    return _

def my_print_signature(func):
    "a demo decorator that prints signatures."
    if hasattr(func, "__signature__"):
        sig = func.__signature__
        [my_print_arg(name, value) for name, value in sig.items()]
    return func

def my_print_arg(name, annotation):
    """print a single argument's
        declaration, skipping unknown anno types."""
    if isinstance(annotation, list):
        [my_print_arg(name, anno)
                for anno in annotation]
    elif conformsToInterface(annotation, MyType):
        print name
        annotation.print_arg()

def conformsToInterface(object, interface):
   "naive implementation of interfaces"
   return isinstance(object, interface)

class MyType:
   def __init__(self, *children):
        self.children = children
   def print_arg(self):
        print self.children

#defined in your module. I have no knowledge of it
class YourType:
   def __init__(self, *stuff):
        pass

# a simple signature

# real syntax should be:
# def foo(bar: MyType(int))
@simulate_signature({"bar": MyType(int)})
def foo(bar):
        return (bar, bar)

# use print signature decorator

# real syntax should be:
# def foo2(bar: [MyType(int)...]) -> [MyType(...)]
@my_print_signature
@simulate_signature({"bar": [MyType(int), YourType("int")],
                        "return": [MyType(tuple([int, int])),
                                   YourType("tuple of int,int")]})
def foo2(bar):
        return (bar, bar)

# can also be used as non-decorator
for name, val in vars().items():
    my_print_signature(val)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060815/6cee76da/attachment.html 

From collinw at gmail.com  Tue Aug 15 17:36:25 2006
From: collinw at gmail.com (Collin Winter)
Date: Tue, 15 Aug 2006 10:36:25 -0500
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
Message-ID: <43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>

On 8/15/06, Paul Prescod <paul at prescod.net> wrote:
> I totally do not understand the requirement for the dictionary and its extra
> overhead.

Under your proposal, annotation consumer libraries have to provide
wrappers for Python's built-in types, since the only way a library has
of knowing whether it should process a given object is by applying a
subclass test.

Extending this same idea to static analysis tools, tools like
pychecker or an optimising compiler would have to supply their own
such wrapper classes. This would be a huge burden, not just on the
authors of such tools, but also on those wishing to use these tools.

I want people to be able to use Python's built-in types without ugly
wrapper classes or any other similar impediments to their pre-existing
Python workflow/thought patterns.

Collin Winter

From pje at telecommunity.com  Tue Aug 15 18:05:22 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Tue, 15 Aug 2006 12:05:22 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608150704x4ef5a9abm532cd7ebaae511d@mail.gmail.com
 >
References: <ca471dc20608142004t7f0eb56cj6efa40947504dc01@mail.gmail.com>
	<mailman.34357.1155590629.27774.python-3000@python.org>
	<5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
	<ca471dc20608142004t7f0eb56cj6efa40947504dc01@mail.gmail.com>
Message-ID: <5.1.1.6.0.20060815120206.026052e8@sparrow.telecommunity.com>

At 07:04 AM 8/15/2006 -0700, Paul Prescod wrote:

>On 8/14/06, Guido van Rossum <<mailto:guido at python.org>guido at python.org> 
>wrote:
>>
>>Haven't I said that the whole time? I *thought* that Collin's PEP
>>steered clear from the topic too. At the same time, does this preclude
>>having some kind of "default" type notation in the standard library?
>
>The PEP steered TOO far clear of this topic. If it is a total free-for-all
>then when and if we do come up with a standard syntax (whether in 2006 or
>2010) it will conflict with deployed code that used the same syntax to mean
>something different than the standard. And even if there is never, ever,
>going to be a standard, it must be possible for tools reading the
>annotations to know whether the user intended their markup to conform to
>metadata syntax 1, where "int" means "32-bit int", or metadata syntax 2
>where it means "arbitrary-sized int". Similarly, they must know whether
>the annotator intended to use metadata syntax 1, where "tuple" means "fixed
>size, heterogeneous", or syntax 2, where it means "immutable list".

On the contrary - it is precisely this looseness that the PEP meant to 
specify, and that I support.  The alternative is too restrictive.

Meanwhile, the absence of predefined semantics does *not* preclude a 
default type notation existing in the standard library, any more than the 
absence of a predefined semantics for docstrings or function attributes 
prevents the stdlib from containing docstring processors or tools that 
operate on function attributes.


From pje at telecommunity.com  Tue Aug 15 18:09:48 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Tue, 15 Aug 2006 12:09:48 -0400
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <1cb725390608150656o2587c0ddx2af8e7df80f8e7b8@mail.gmail.co
 m>
References: <5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
	<mailman.34357.1155590629.27774.python-3000@python.org>
	<5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060815120531.02601dd8@sparrow.telecommunity.com>

At 06:56 AM 8/15/2006 -0700, Paul Prescod wrote:
>On 8/14/06, Phillip J. Eby 
><<mailto:pje at telecommunity.com>pje at telecommunity.com> wrote:
>>At 1:51 PM 8/14/2006 -0700, "Paul Prescod" 
>><<mailto:paul at prescod.net>paul at prescod.net> wrote:
>> >On 8/14/06, Jim Jewett 
>> <<mailto:jimjjewett at gmail.com>jimjjewett at gmail.com> wrote:
>> > > The definition of a type as an annotation should probably be either
>> > > defined or explicitly undefined.  Earlier discussions talked about
>> > > things like
>> > >
>> > >     def f (a:int, b:(float | Decimal), c:[int, str, X]) -> str
>> >
>> >
>> >I think that's a separate (large!) PEP. This PEP should disallow frameworks
>> >from inventing their own meaning for this syntax (requiring them to at
>> >least wrap). Then Guido and crew can dig into this issue on their own
>> >schedule.
>>
>>I see we haven't made nearly as much progress on the concept of "no
>>predefined semantics" as I thought we had.  :(
>
>
>>i.e., -1 on constraining what types mean.
>
>I don't understand what you're saying.

I'm saying that we don't need a predefined semantics for annotation objects 
of type 'type'; i.e. the PEP need not define what "a:int" means.  I'm 
roughly +0 on having predefined semantics for annotation objects of type 
'list' and 'str'.


>1. Do you (still?) agree that the meaning of the list type should be 
>defined as a semantically neutral container for other annotations?

I believe it should be a recommended best practice -- "defined" is too 
strong a word.


>2. Do you (still?) agree that the meanings of ALL built-in types at the 
>top-level should be reserved for the Python language designers and should 
>not be randomly used by framework developers. In other words: the function 
>type declaration syntax above should not be used by one third party type 
>checker while another third-party type checker uses the same structure to 
>mean something totally different. Note that I don't mind if they have 
>conflicting semantics for the same expression as long as the end-user is 
>forced to declare which semantic model they are using:

I don't see a reason to require an explicit wrapper except as a 
disambiguator.  That is, until you *actually* need them, 
discriminator-wrappers are a YAGNI.


From paul at prescod.net  Tue Aug 15 20:09:17 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 11:09:17 -0700
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
Message-ID: <1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
>
> On 8/15/06, Paul Prescod <paul at prescod.net> wrote:
> > I totally do not understand the requirement for the dictionary and its
> extra
> > overhead.
>
> Under your proposal, annotation consumer libraries have to provide
> wrappers for Python's built-in types, since the only way a library has
> of knowing whether it should process a given object is by applying a
> subclass test.
>
> Extending this same idea to static analysis tools, tools like
> pychecker or an optimising compiler would have to supply their own
> such wrapper classes. This would be a huge burden, not just on the
> authors of such tools, but also on those wishing to use these tools.


No, this is incorrect. Metadata is just metadata. Libraries act on
metadata.  There is a many-to-many relationship. You could go and define
Collin's type metadata syntax. You create a library of wrappers (really you
need only ONE wrapper). Then you could convince the writers of PyPy to use
the same syntax. So there would be one set of annotations used by two
libraries.

Here's what the definition of the one wrapper could look like:

class my_type:
   def __init__(self, data):
       self.data = data

That's it. That's all you need to implement.

> I want people to be able to use Python's built-in types without ugly
> wrapper classes or any other similar impediments to their pre-existing
> Python workflow/thought patterns.


The wrapper class doesn't need to be ugly. Just:

from typecheck import my_type as t

def foo(a: t(int, int), b: t("abc")): ...

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060815/6a95fa1a/attachment.html 

From collinw at gmail.com  Tue Aug 15 20:28:24 2006
From: collinw at gmail.com (Collin Winter)
Date: Tue, 15 Aug 2006 13:28:24 -0500
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
Message-ID: <43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>

On 8/15/06, Paul Prescod <paul at prescod.net> wrote:
> On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> > Extending this same idea to static analysis tools, tools like
> > pychecker or an optimising compiler would have to supply their own
> > such wrapper classes. This would be a huge burden, not just on the
> > authors of such tools, but also on those wishing to use these tools.
>
> No, this is incorrect. Metadata is just metadata. Libraries act on metadata.
>  There is a many to many relationship. You could go and define Collin's type
> metadata syntax. You create a library of wrappers (really you need only ONE
> wrapper). Then you could convince the writers of PyPy to use the same
> syntax. So there would be one set of annotations used by two libraries.

If multiple libraries use the same wrappers, then I can't use more
than one of these libraries at the same time. If a typechecking
consumer, a docstring consumer and PyPy all use the same wrapper (or
"syntax" -- you switch terms between sentences), then I can't have
typechecking and docstrings on the same functions, and I can't do
either if I'm running my program with PyPy.

Collin Winter

From paul at prescod.net  Tue Aug 15 20:54:48 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 11:54:48 -0700
Subject: [Python-3000] Draft pre-PEP: function annotations
In-Reply-To: <5.1.1.6.0.20060815120531.02601dd8@sparrow.telecommunity.com>
References: <mailman.34357.1155590629.27774.python-3000@python.org>
	<5.1.1.6.0.20060814174448.025d9018@sparrow.telecommunity.com>
	<5.1.1.6.0.20060815120531.02601dd8@sparrow.telecommunity.com>
Message-ID: <1cb725390608151154y6ef8138dy1b029f8b84339fa9@mail.gmail.com>

On 8/15/06, Phillip J. Eby <pje at telecommunity.com> wrote:
>
>
> I don't see a reason to require an explicit wrapper except as a
> disambiguator.  That is, until you *actually* need them,
> discriminator-wrappers are a YAGNI.



How will you know you "actually" need them until you run a tool on your code
and it crashes or gives the wrong result? And what will you do then, go and
clean up your code? And what if the libraries have defined no disambiguation
syntax? Then what?

Function attributes are at least disambiguated by name. You can't put a
function attribute on a function without giving it a name. We need at least
this level of disambiguation for metadata.

Docstrings have become somewhat of a mess of various meanings. Back in the
late 90s I attached XPaths to them and the Spark guy attached parser grammar
instructions. Pydoc, pointed at one of my xpath-embedding classes, would
produce useless gibberish. So in that sense there was a clash. Given that
pydoc is seldom a mission-critical part of any system, this is a minor
issue. Confused type declarations could cause bigger problems, from crashed
compilers to segmentation faults in interpreters.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060815/92902a10/attachment.htm 

From paul at prescod.net  Tue Aug 15 21:07:57 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 12:07:57 -0700
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
Message-ID: <1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
>
> If multiple libraries use the same wrappers, then I can't use more
> than one of these libraries at the same time. If a typechecking
> consumer, a docstring consumer and PyPy all use the same wrapper (or
> "syntax" -- you switch terms between sentences), then I can't have
> typechecking and docstrings on the same functions, and I can't do
> either if I'm running my program with PyPy.


There is a MANY TO MANY relationship between syntaxes (as denoted by
wrappers) and tools that work on those syntaxes.

Think of it by analogy: there are programming languages and there are
interpreters. Some programming languages run on multiple interpreters
(e.g. Python on .NET, JVM, PyPy, CPython). Some interpreters run multiple
languages (e.g. .NET, JVM). Some interpreters run a single language
(CPython).

Or another analogy from my domain: there are a variety of XML syntaxes. Some
are designed for a single program. Others, like Atom, are designed for many,
many programs. Also, some programs can handle a single input format. Others
(like RSS/Atom readers) can consume many.

A Typechecking consumer and a PyPy compiler consumer might work on the same
annotations because they are both interested in TYPES (but doing different
things with them). These type consumers might also choose to implement more
than one type checking syntax, if there were a good reason that more than
one arose (perhaps Unix types versus .NET types).

A docstring consumer and a typechecking consumer would *by definition* use
different syntaxes/frameworks/wrappers because the information that they are
looking for is different! But there could be hundreds of docstring consumers
(as there are today!). Docstrings are a special case because the syntax for
them is fairly obvious (an unadorned string).

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060815/9fa6ca8b/attachment.html 

From collinw at gmail.com  Tue Aug 15 21:13:16 2006
From: collinw at gmail.com (Collin Winter)
Date: Tue, 15 Aug 2006 14:13:16 -0500
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
Message-ID: <43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>

On 8/15/06, Paul Prescod <paul at prescod.net> wrote:
> A Typechecking consumer and a PyPy compiler consumer might work on the same
> annotations because they are both interested in TYPES (but doing different
> things with them). These type consumers might also choose to implement more
> than one type checking syntax, if there were a good reason that more than
> one arose (perhaps Unix types versus .NET types).
>
> A docstring consumer and a typechecking consumer would *by definition* use
> different syntaxes/frameworks/wrappers because the information that they are
> looking for is different! But there could be hundreds of docstring consumers
> (as there are today!). Docstrings are a special case because the syntax for
> them is fairly obvious (an unadorned string).

So basically what you're saying is that there would be a more-or-less
standard wrapper for each application of function annotations. How is
this significantly better than my dict-based approach, which uses
standardised dict keys to indicate the kind of metadata?

Collin Winter

From paul at prescod.net  Tue Aug 15 21:30:36 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 12:30:36 -0700
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
Message-ID: <1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
>
> So basically what you're saying is that there would be a more-or-less
> standard wrapper for each application of function annotations.


No, I explicitly said that there may or may not arise standards based upon
the existence or non-existence of community consensus and convergence of
requirements. Just as there may or may not arise a standard Python web
application framework depending on whether the community converges or does
not.

How is
> this significantly better than my dict-based approach, which uses
> standardised dict keys to indicate the kind of metadata?


The dict-based approach introduces an extra namespace to manage. What if two
different groups start fighting over the keyword "type" or "doc" or "lock"?
Python already has a module system that allows you to use the word "type"
and me to use the word "type" without conflict (though I can't guarantee
that it won't be confusing!). Python's module system allows renaming and
abbreviating: both valuable features.

Also, the dict-based approach is just more punctuation to type. What is the
dict equivalent for:

def foo(a: type(int)) -> type(int):
  ...

versus

def foo(a: {"type":int}) -> {"type": int}:

In my approach you could do this:

Int = type(int)

def foo(a: Int) -> Int:
  ...

 Paul Prescod

From collinw at gmail.com  Tue Aug 15 22:13:19 2006
From: collinw at gmail.com (Collin Winter)
Date: Tue, 15 Aug 2006 15:13:19 -0500
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
Message-ID: <43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>

On 8/15/06, Paul Prescod <paul at prescod.net> wrote:
> On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> > How is
> > this significantly better than my dict-based approach, which uses
> > standardised dict keys to indicate the kind of metadata?
>
> The dict-based approach introduces an extra namespace to manage. What if two
> different groups start fighting over the keyword "type" or "doc" or "lock"?

How do you foresee this arising? Do you think users will start wanting
to apply several different typechecking systems to the same function?

The idea behind these standard keys is to a) keep them limited in
number, and b) keep them limited in scope. At the moment, I can only
foresee two of these: "type" and "doc". My justification for "type" is
that users won't be using multiple type systems on the same parameter
(and if they are, that's their own problem); for "doc", it's that a
docstring is just a Python string, and there's really only one way to
look at it within the scope of documentation strings.

Beyond these applications, the annotation consumers are on their own.
Consumers that operate in the same domain may well coordinate their
keys, and popular keys might make it into the list of standard keys
(like the process for getting a module into the stdlib).
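Collin's scheme is concrete enough to sketch. The reserved keys and the `get_metadata` helper below are illustrative only, not part of any accepted PEP:

```python
# Hypothetical sketch of the dict-based convention: each annotation is
# a dict whose reserved keys ("type", "doc") are shared by all
# consumers.  The key names and helper are illustrative.
def foo(a: {"type": int, "doc": "frobnication count"}) -> {"type": int}:
    return a * 2

def get_metadata(func, param, key):
    """Return one kind of metadata for one parameter, if present."""
    ann = func.__annotations__.get(param, {})
    return ann.get(key)

print(get_metadata(foo, "a", "type"))  # <class 'int'>
print(get_metadata(foo, "a", "doc"))   # frobnication count
```

A typechecker would only ever look at the "type" key; a doc tool would only look at "doc".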

I hope to have a second draft of the pre-PEP within a few days that
includes this idea.

Collin Winter

From ironfroggy at gmail.com  Wed Aug 16 00:20:14 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Tue, 15 Aug 2006 18:20:14 -0400
Subject: [Python-3000] Fwd: Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
Message-ID: <76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> On 8/15/06, Paul Prescod <paul at prescod.net> wrote:
> > On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> > > How is
> > > this significantly better than my dict-based approach, which uses
> > > standardised dict keys to indicate the kind of metadata?
> >
> > The dict-based approach introduces an extra namespace to manage. What if two
> > different groups start fighting over the keyword "type" or "doc" or "lock"?
>
> How do you foresee this arising? Do you think users will start wanting
> to apply several different typechecking systems to the same function?
>
> The idea behind these standard keys is to a) keep them limited in
> number, and b) keep them limited in scope. At the moment, I can only
> foresee two of these: "type" and "doc". My justification for "type" is
> that users won't be using multiple type systems on the same parameter
> (and if they are, that their own problem); for "doc" is that a
> docstring is just a Python string, and there's really only own way to
> look at that within the scope of documentation strings.
>
> Beyond these applications, the annotation consumers are on their own.
> Consumers that operate in the same domain may well coordinate their
> keys, and popular keys might make it into the list of standard keys
> (like the process for getting a module into the stdlib).
>
> I hope to have a second draft of the pre-PEP within a few days that
> includes this idea.
>
> Collin Winter

The dictionary approach, although it is what I was originally planning
to support, is just too ugly and too limited. String keys can be
ambiguous, but objects cannot. The arguments against the better
approaches, which you keep repeating, just don't hold up. The
non-dictionary, multiple-annotation proposals stand up to your
requirements perfectly, and fulfill them even better than the
dictionary approach does.

1) Static analysis tools (pychecker, optimising compilers, etc) must
be able to use the annotations

As in every example given so far, the annotations would be instantiated
within the function definition itself, so the form 'def
foo(a: Bar(baz))' is to be expected. This form could even be
documented as the preferred way, as opposed to instantiating the
annotation object beforehand and simply using its name in the
function definition. This leads to simple parsing by external tools,
which would be able to deduce what Bar is (because earlier in the file
there was a 'from bar import Bar').

Dictionary string keys are just too limited and offer too much chance
for conflicts. Better to avoid them now than after there are
established and conflicting libraries expecting different things.

2) Decorator-based annotation consumers must be able to use the annotations
3) Non-decorator-based annotation consumers (pydoc, etc) must be able
to use the annotations

A simple filter on the type of the annotations (perhaps via a helper
function in some basic annotation utility library) will let any
consumer get the kinds of annotations it needs.
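Such a filter can be sketched in a few lines. The TypeAnnotation/DocAnnotation marker classes and the helper name below are hypothetical, not an established library:

```python
# A rough sketch of object-based filtering: each consumer asks for
# annotations of its own marker type and ignores the rest.
class TypeAnnotation:
    def __init__(self, tp):
        self.tp = tp

class DocAnnotation:
    def __init__(self, text):
        self.text = text

def annotations_of(func, kind):
    """Yield (param, annotation) pairs whose annotation is a `kind`."""
    for name, ann in func.__annotations__.items():
        if isinstance(ann, kind):
            yield name, ann

def foo(a: TypeAnnotation(int), b: DocAnnotation("a flag")):
    pass

# A type checker sees only TypeAnnotation instances; a doc tool would
# ask for DocAnnotation instead.
print([name for name, _ in annotations_of(foo, TypeAnnotation)])  # ['a']
```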

In the end, the biggest argument against the dictionary approach is
that it is simply too ugly, and that ugliness would be almost
impossible to avoid even for a single annotation on a parameter.

From collinw at gmail.com  Wed Aug 16 00:29:48 2006
From: collinw at gmail.com (Collin Winter)
Date: Tue, 15 Aug 2006 17:29:48 -0500
Subject: [Python-3000] Fwd: Conventions for annotation consumers (was:
	Re: Draft pre-PEP: function annotations)
In-Reply-To: <76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
	<76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
Message-ID: <43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>

On 8/15/06, Calvin Spealman <ironfroggy at gmail.com> wrote:
> On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
>> 1) Static analysis tools (pychecker, optimising compilers, etc) must
>> be able to use the annotations
>
> As in any example given so far, the annotations would be instansiated
> within the function definition itself, which means the form 'def
> foo(a: Bar(baz))' is to be expected. This form could even be
> documented as the prefered way, as opposed to instansiating the
> annotation object before hand and simply using its name in the
> function definition. This leads to simple parsing by external tools,
> which would be able to deduce what bar is (because before that line
> there was an 'from bar import Bar'.

How exactly do they "deduce" what Bar is, just from the "from bar
import Bar" line? pychecker would have to import and compile the Bar
module first. What if being able to import bar depends on some import
hooks that some other module (imported before bar) installed? I guess
you'd have to follow the entire import graph just to make sure. Oh,
and you'd have to end up running the module being analysed in case
*it* installs some import hooks -- or maybe it defines Bar itself.

Your proposal isn't workable.

Collin Winter

From jimjjewett at gmail.com  Wed Aug 16 01:08:41 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Tue, 15 Aug 2006 19:08:41 -0400
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
Message-ID: <fb6fbf560608151608g57320277v9caa4ade2aef4462@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:

> Here's another stab at my earlier idea: ...

> We're still using dicts to hold the annotations, but instead of having
> the dict keyed on the name (function.__name__) of the annotation
> consumer, the keys are arbitrary (for certain values of "arbitrary").
> To enable both in-program and static analysis, the most prominent keys
> will be specified by the PEP. In this example, "type" and "doc" are
> reserved keys; anything that needs the intended type of an annotation
> will look at the "type" key, anything that's looking for special doc
> strings will look at the "doc" key. Any other consumers are free to
> define whatever keys they want (e.g., "constrain_values", above), so
> long as they stay away from the reserved strings.

That seems to get the worst of both worlds.

Static tools will now know that something is intended to express type
information, but still won't know whether it describes typical usage,
an invariant, or an adapter that will make any argument work.

Meanwhile, two different systems can still clash on "constrain_values"
(as well as "type"), without the benefit of an actual type object (or
a name associated with an import) to disambiguate.

> 1) Static analysis tools (pychecker, optimising compilers, etc) must
> be able to use the annotations

If the ownership is by object type, then static tools can get at least
a pretty good idea by looking at the name used to construct those
types.  Realistically, if

    >>> from zope.mypackage import something as anno1
    ...
    >>> def f(a: anno1("asfd")): ...

does not provide enough information, then nothing static ever will.
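With the benefit of hindsight, the purely static pairing Jim describes can be sketched with the `ast` module (which post-dates this thread); no imports are executed, and the zope.mypackage names are Jim's hypothetical example:

```python
# Pair each annotation call with the import that introduced its name,
# without importing or running the analysed module at all.
import ast

source = '''
from zope.mypackage import something as anno1

def f(a: anno1("asfd")):
    pass
'''

tree = ast.parse(source)

origins = {}  # local name -> "module.original_name"
for node in ast.walk(tree):
    if isinstance(node, ast.ImportFrom):
        for alias in node.names:
            origins[alias.asname or alias.name] = "%s.%s" % (node.module, alias.name)

results = []
for node in ast.walk(tree):
    if isinstance(node, ast.FunctionDef):
        for arg in node.args.args:
            ann = arg.annotation
            if isinstance(ann, ast.Call) and isinstance(ann.func, ast.Name):
                results.append((arg.arg, origins.get(ann.func.id, "<unknown>")))

print(results)  # [('a', 'zope.mypackage.something')]
```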

-jJ

From jimjjewett at gmail.com  Wed Aug 16 01:22:24 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Tue, 15 Aug 2006 19:22:24 -0400
Subject: [Python-3000] Conventions for annotation consumers (was: Re:
	Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
Message-ID: <fb6fbf560608151622v7672fd5arb2d3dbac00a6e680@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> ... that users won't be using multiple type systems on the same parameter
> (and if they are, that their own problem); for "doc" is that a
> docstring is just a Python string, and there's really only own way to
> look at that within the scope of documentation strings.

oh ye of little cynicism.

(1)  I might well restrict *myself* to a single type system.  But that
doesn't mean I don't ever want to use someone else's modules, or that
I don't want a doc tool to handle them.

(2)  doc strings already exist, and have already grown inconsistent
microstructure.

"""one line summary -- may or may not include the call signature

Longer documentation, which may or may not also include doctests or
ReST or html or sample calls in a non-doctest format or magic tokens
used by various frameworks, such as Design By Contract wrappers.

Oh, and that first blank line?  Some tools rely on it.  Some functions
don't use it.

Of course, some functions don't use docstrings at all, because the
writers are already afraid that a framework like unittest will
misinterpret them."""

-jJ

From tim.hochberg at ieee.org  Wed Aug 16 01:26:30 2006
From: tim.hochberg at ieee.org (Tim Hochberg)
Date: Tue, 15 Aug 2006 16:26:30 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers (was:
 Re: Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>	<76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
	<43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
Message-ID: <ebtl7o$jd7$1@sea.gmane.org>

Collin Winter wrote:
> On 8/15/06, Calvin Spealman <ironfroggy at gmail.com> wrote:
>> On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
>>> 1) Static analysis tools (pychecker, optimising compilers, etc) must
>>> be able to use the annotations
>> As in any example given so far, the annotations would be instansiated
>> within the function definition itself, which means the form 'def
>> foo(a: Bar(baz))' is to be expected. This form could even be
>> documented as the prefered way, as opposed to instansiating the
>> annotation object before hand and simply using its name in the
>> function definition. This leads to simple parsing by external tools,
>> which would be able to deduce what bar is (because before that line
>> there was an 'from bar import Bar'.
> 
> How exactly do they "deduce" what Bar is, just from the "from bar
> import Bar" line? pychecker would have to import and compile the Bar
> module first. What if being able to import bar depends on some import
> hooks that some other module (imported before bar) installed? I guess
> you'd have to follow the entire import graph just to make sure. Oh,
> and you'd have to end up running the module being analysed in case
> *it* installs some import hooks -- or maybe it defines Bar itself.

Why does PyChecker need to "deduce" what Bar is at all? Either bar.Bar
is something that PyChecker knows about, because it indicates
something that it knows how to check, or it's something it doesn't
know about, in which case it can safely ignore it. I fail to see any
significant difference in


def foo(a: Bar(baz)): ...

versus

def foo(a: {'Bar' : baz}): ...

except that the latter is harder to read and more prone to name collisions.

> 
> Your proposal isn't workable.

I, at least, fail to see why at this point.

-tim



From pje at telecommunity.com  Wed Aug 16 02:17:29 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Tue, 15 Aug 2006 20:17:29 -0400
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <mailman.34618.1155684147.27774.python-3000@python.org>
Message-ID: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>

At 05:29 PM 8/16/2006 -0500, "Collin Winter" <collinw at gmail.com> wrote:
>On 8/15/06, Calvin Spealman <ironfroggy at gmail.com> wrote:
> > On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> >> 1) Static analysis tools (pychecker, optimising compilers, etc) must
> >> be able to use the annotations
> >
> > As in any example given so far, the annotations would be instansiated
> > within the function definition itself, which means the form 'def
> > foo(a: Bar(baz))' is to be expected. This form could even be
> > documented as the prefered way, as opposed to instansiating the
> > annotation object before hand and simply using its name in the
> > function definition. This leads to simple parsing by external tools,
> > which would be able to deduce what bar is (because before that line
> > there was an 'from bar import Bar'.
>
>How exactly do they "deduce" what Bar is, just from the "from bar
>import Bar" line? pychecker would have to import and compile the Bar
>module first. What if being able to import bar depends on some import
>hooks that some other module (imported before bar) installed? I guess
>you'd have to follow the entire import graph just to make sure. Oh,
>and you'd have to end up running the module being analysed in case
>*it* installs some import hooks -- or maybe it defines Bar itself.
>
>Your proposal isn't workable.

By that logic, neither is Python.  :)

I think you mean the reverse; the proposal instead shows that requirement 
#1 is what's not workable here.

I'm frankly baffled by the amount of "protect users from incompatibility" 
ranting that this issue has generated.  If I wanted to use Java, I'd know 
where to find it.  Guido has said time and again that Python's balance 
favors the individual developer at the expense of the group where 
"consenting adults" is concerned, and Py3K isn't intended to change that 
balance.

Personally, I thought Guido's original proposal for function annotations, 
which included a __typecheck__ operator that was replaceable on a 
per-module basis (and defaulted to a no-op), was the perfect thing -- 
neither too much semantics nor too little.  I'd like to have it back, 
please.  :)


From guido at python.org  Wed Aug 16 02:21:27 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 15 Aug 2006 17:21:27 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <1d85506f0608110033k2eac1f9h10908ddbef5db8c3@mail.gmail.com>
References: <1d85506f0608101214g594d2dal282ab2ae60f29f11@mail.gmail.com>
	<ca471dc20608101621j557f735cs10f4f491eb3b2ee5@mail.gmail.com>
	<1d85506f0608110033k2eac1f9h10908ddbef5db8c3@mail.gmail.com>
Message-ID: <ca471dc20608151721g1f426a8i8fd80eecdb18ddfd@mail.gmail.com>

On 8/11/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> [Guido]
> > I expect that Jython doesn't implement this; it doesn't handle ^C either AFAIK.
>
> threads are at most platform agnostic (old unices, embedded systems, etc.
> are not likely to have thread support)

I'm not sure what "platform agnostic" means to you. I think you mean
"a platform dependent optional feature"?

> so keeping this in mind, and having interrupt_main part of the standard
> thread API, which as you say, may not be implementation agnostic,
> why is thread.raise_exc(id, excobj) a bad API?

Because it is more general than interrupt_main(). I'm happy to declare
the latter a CPython exclusive feature that not all other platforms
may support even if they have threads. raise_exc() would have at best
the same status; I imagine the set of platforms where it can be
implemented is smaller than the set of platforms that can support
interrupt_main().
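For reference, the CPython-only mechanism behind the proposed raise_exc() (the "ctypes hack" referred to later in this thread) can be sketched like this; it is a fragile, implementation-specific recipe, not a portable API:

```python
# CPython's PyThreadState_SetAsyncExc schedules an exception *class*
# to be raised in another thread.  CPython-only, and inherently racy
# -- exactly the concern discussed above.
import ctypes

def raise_exc(thread_id, exctype):
    """Ask CPython to raise exctype in the thread with the given id."""
    res = ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_long(thread_id), ctypes.py_object(exctype))
    if res == 0:
        raise ValueError("invalid thread id")
    elif res > 1:
        # More than one thread state was affected: undo and bail out.
        ctypes.pythonapi.PyThreadState_SetAsyncExc(
            ctypes.c_long(thread_id), None)
        raise SystemError("PyThreadState_SetAsyncExc failed")
```

Note that the exception is only delivered the next time the target thread executes bytecode, which is why a thread blocked in a C call cannot be interrupted this way.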

> and as i recall, dotNET's Thread.AbortThread or whatever it's called
> works that way (raising an exception in the other thread), so IronPython
> for once, should be happy with it.

But Jython?

> by the way, is the GIL part of the python standard? i.e., does IronPython
> implement it, although it shouldn't be necessary in dotNET?

No. Neither Jython nor IronPython have it. But since the presence of
the GIL is never directly detectable from Python code, I'm not sure
how it *could* be part of the language standard.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 16 02:23:42 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 15 Aug 2006 17:23:42 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <bb8868b90608110847wb5465eekd13cdb454eeac4cb@mail.gmail.com>
References: <1d85506f0608101214g594d2dal282ab2ae60f29f11@mail.gmail.com>
	<ca471dc20608101621j557f735cs10f4f491eb3b2ee5@mail.gmail.com>
	<1d85506f0608110033k2eac1f9h10908ddbef5db8c3@mail.gmail.com>
	<bb8868b90608110847wb5465eekd13cdb454eeac4cb@mail.gmail.com>
Message-ID: <ca471dc20608151723g7e04325dvd1de4bdb75296d30@mail.gmail.com>

On 8/11/06, Jason Orendorff <jason.orendorff at gmail.com> wrote:
> On 8/11/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> > why is thread.raise_exc(id, excobj) a bad API?
>
> It breaks seemingly innocent code in subtle ways.  Worse, the breakage
> will always be a race condition, so it'll be especially hard to
> reproduce and debug.

So is KeyboardInterrupt. But at least that can't be raised in threads.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 16 02:28:21 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 15 Aug 2006 17:28:21 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <bb8868b90608110904s289236d7i9b60f14969966625@mail.gmail.com>
References: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>
	<20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>
	<20060811082620.192E.JCARLSON@uci.edu>
	<bb8868b90608110904s289236d7i9b60f14969966625@mail.gmail.com>
Message-ID: <ca471dc20608151728s16b8806fk56cbfba639d9233a@mail.gmail.com>

On 8/11/06, Jason Orendorff <jason.orendorff at gmail.com> wrote:
> On 8/11/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > Slawomir Nowaczyk <slawomir.nowaczyk.847 at student.lu.se> wrote:
> > > But it should not be done lightly and never when the code is not
> > > specifically expecting it.
> >
> > If you don't want random exceptions being raised in your threads, then
> > don't use this method that is capable of raising exceptions somewhat
> > randomly.
>
> I agree.  The only question is how dire the warnings should be.
>
> I'll answer that question with another question:  Are we going to make
> the standard library robust against asynchronous exceptions?  For
> example, class Thread has an attribute __stopped that is set using
> code similar to the example code I posted.  An exception at just the
> wrong time would kill the thread while leaving __stopped == False.
>
> Maybe that particular case is worth fixing, but to find and fix them
> all?  Better to put strong warnings on this one method: may cause
> unpredictable brokenness.

That is a rather special case since this code (unlike most stdlib
code) can assume it won't get asynchronous exceptions like
KeyboardInterrupt, since that can't be raised in threads.

I expect that the unpredictable brokenness is even bigger in most user
code -- *most* people can't write threadsafe code if their life
depended on it. I believe the only exception I know is Tim Peters.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 16 02:29:32 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 15 Aug 2006 17:29:32 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <87fyg32oo8.fsf@qrnik.zagroda>
References: <c56e219d0608102001i44b1267dqb581c2171ced33ce@mail.gmail.com>
	<20060811102346.EFC4.SLAWOMIR.NOWACZYK.847@student.lu.se>
	<20060811082620.192E.JCARLSON@uci.edu> <87fyg32oo8.fsf@qrnik.zagroda>
Message-ID: <ca471dc20608151729v2392d140q46ce96f71341896@mail.gmail.com>

On 8/11/06, Marcin 'Qrczak' Kowalczyk <qrczak at knm.org.pl> wrote:
> I do want asynchronous exceptions, but not anywhere, only in selected
> regions (or excluding selected regions). This can be designed well.

Please write a proto-PEP.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 16 02:40:29 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 15 Aug 2006 17:40:29 -0700
Subject: [Python-3000] threading, part 2
In-Reply-To: <ebqhvi$80m$1@sea.gmane.org>
References: <1d85506f0608111713m15cf2e67v8b94f06c928e9125@mail.gmail.com>
	<ca471dc20608141117l1c61247fy39bae2b00d45675d@mail.gmail.com>
	<ebqhvi$80m$1@sea.gmane.org>
Message-ID: <ca471dc20608151740k3ce102a3oe17be8e542e10b1b@mail.gmail.com>

On 8/14/06, Georg Brandl <g.brandl at gmx.net> wrote:
> Guido van Rossum wrote:
> > On 8/11/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> >> i mailed this to several people separately, but then i thought it could
> >> benefit the entire group:
> >>
> >> http://sebulba.wikispaces.com/recipe+thread2
> >>
> >> it's an implementation of the proposed " thread.raise_exc", through an extension
> >> to the threading.Thread class. you can test it for yourself; if it proves useful,
> >> it should be exposed as thread.raise_exc in the stdlib (instead of the ctypes
> >> hack)... and of course it should be reflected in threading.Thread as welll.
> >
> > Cool. Question: what's the problem with raising exception instances?
> > Especially in the light of my proposal to use
> >
> >   raise SomeException(42)
> >
> > in preference over (and perhaps exclusively instead of)
> >
> >   raise SomeException, 42
> >
> > in Py3k. The latter IMO is a relic from the days of string exceptions
> > which are as numbered as they come. :-)
>
> I think this is the answer:
>
> http://mail.python.org/pipermail/python-dev/2006-August/068165.html

Hopefully we can fix this in 2.6 or 3.0.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From greg.ewing at canterbury.ac.nz  Wed Aug 16 03:09:54 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 16 Aug 2006 13:09:54 +1200
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
 Conventions for annotation consumers)
In-Reply-To: <43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
Message-ID: <44E27062.2040406@canterbury.ac.nz>

Collin Winter wrote:

> @docstring
> @typechecker
> @constrain_values
> def foo(a: {'doc': "Frobnication count",
>            'type': Number,
>            'constrain_values': range(3, 9)},
>        b: {'type': Number,
>             # This can be only 4, 8 or 12
>            'constrain_values': [4, 8, 12]}) -> {'type': Number}

There's another thing that's bothering me about all this.
The main reason Guido rejected the originally suggested
syntax for function decorators was that it put too much
stuff into the function header and obscured the signature.

Now we seem to be about to open ourselves up to the
same problem on an even bigger scale. Who can honestly
say that the above function declaration is easy to read?
To me it looks downright ugly.

--
Greg

From ironfroggy at gmail.com  Wed Aug 16 03:21:22 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Tue, 15 Aug 2006 21:21:22 -0400
Subject: [Python-3000] Fwd: Conventions for annotation consumers (was:
	Re: Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
	<76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
	<43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
Message-ID: <76fd5acf0608151821m25ab8048ia6f3d6f288d59338@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> On 8/15/06, Calvin Spealman <ironfroggy at gmail.com> wrote:
> > On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
> >> 1) Static analysis tools (pychecker, optimising compilers, etc) must
> >> be able to use the annotations
> >
> > As in any example given so far, the annotations would be instansiated
> > within the function definition itself, which means the form 'def
> > foo(a: Bar(baz))' is to be expected. This form could even be
> > documented as the prefered way, as opposed to instansiating the
> > annotation object before hand and simply using its name in the
> > function definition. This leads to simple parsing by external tools,
> > which would be able to deduce what bar is (because before that line
> > there was an 'from bar import Bar'.
>
> How exactly do they "deduce" what Bar is, just from the "from bar
> import Bar" line? pychecker would have to import and compile the Bar
> module first. What if being able to import bar depends on some import
> hooks that some other module (imported before bar) installed? I guess
> you'd have to follow the entire import graph just to make sure. Oh,
> and you'd have to end up running the module being analysed in case
> *it* installs some import hooks -- or maybe it defines Bar itself.
>
> Your proposal isn't workable.
>
> Collin Winter

Any external tool that needs to analyze the annotations statically
would either already know what the module bar and the object bar.Bar
are, or it would ignore them. It therefore has no need to import or
statically parse the modules imported for annotation objects at all.
For example, you might write 'from annodoc import doc' and then 'def
foo(a: doc("the only argument"))'; a documentation generator would
already know what the annodoc module is and wouldn't need to introspect
it in order to understand the annotations.
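
That pattern can be sketched concretely in the eventual Python 3
annotation syntax. This is only an illustrative sketch: the "annodoc"
module and its "doc" wrapper are hypothetical names from the example
above, not a real library.

```python
# Hypothetical "doc" annotation object, as in the annodoc example above.
class doc:
    """Wraps a human-readable description of a parameter."""
    def __init__(self, text):
        self.text = text

def foo(a: doc("the only argument")):
    return a

def generate_docs(func):
    # A documentation generator keeps only the annotations it
    # recognizes (instances of its own doc type) and ignores the rest.
    return {name: ann.text
            for name, ann in func.__annotations__.items()
            if isinstance(ann, doc)}

print(generate_docs(foo))  # {'a': 'the only argument'}
```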

Your outright refusal to accept the arguments against these points of
your proposal is dragging this discussion out to an unneeded length.
The majority consensus is pointing away from the dictionary
multi-annotations you propose and from the leave-and-let-be standpoint
you originally tried to keep, while type-based annotations seem much
better agreed upon and have more support. This continually stretching
debate needs to reach a consensus, and the best-received idea might
not be yours.

We really need to see the PEP draft updated to reflect something of a
solution to these issues. There is much less disagreement than the
volume of discussion would suggest, so the answers are clear enough to
move forward with.

From greg.ewing at canterbury.ac.nz  Wed Aug 16 03:32:51 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 16 Aug 2006 13:32:51 +1200
Subject: [Python-3000] Conventions for annotation consumers
In-Reply-To: <1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
Message-ID: <44E275C3.2070508@canterbury.ac.nz>

Paul Prescod wrote:
> What if 
> two different groups start fighting over the keyword "type" or "doc" or 
> "lock"? Python already has a module system that allows you to use the 
> word "type" and me to use the word "type" without conflict

But, in general, performing this disambiguation requires
executing the module that is making the annotations. For
a processor that only wants to deal with the source, this
is undesirable.

--
Greg

From ironfroggy at gmail.com  Wed Aug 16 04:18:15 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Tue, 15 Aug 2006 22:18:15 -0400
Subject: [Python-3000] Conventions for annotation consumers
In-Reply-To: <44E275C3.2070508@canterbury.ac.nz>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<44E275C3.2070508@canterbury.ac.nz>
Message-ID: <76fd5acf0608151918j3d572b7cq9d61b5170ce966a3@mail.gmail.com>

On 8/15/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Paul Prescod wrote:
> > What if
> > two different groups start fighting over the keyword "type" or "doc" or
> > "lock"? Python already has a module system that allows you to use the
> > word "type" and me to use the word "type" without conflict
>
> But, in general, performing this disambiguation requires
> executing the module that is making the annotations. For
> a processor that only wants to deal with the source, this
> is undesirable.

The path to the module should be considered more like a namespace
identifier. When I see that the annotation Number comes from
annolib.types, 'annolib.types' can be taken as a unique namespace
identifier giving the context of the name 'Number'. This doesn't need
any processing of the annolib.types module itself, because the contents
of that module are not important, only the name.
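
That namespace-identifier idea can be sketched with a purely static
scan (using the modern ast module, which did not exist at the time):
the tool resolves the annotation name to a dotted path from the import
statements alone and never imports the hypothetical annolib package.

```python
import ast

SOURCE = """
from annolib.types import Number

def f(x: Number):
    return x
"""

tree = ast.parse(SOURCE)

# Map local names to fully qualified dotted paths, from imports only.
imported = {}
for node in ast.walk(tree):
    if isinstance(node, ast.ImportFrom):
        for alias in node.names:
            imported[alias.asname or alias.name] = (
                node.module + "." + alias.name)

# Resolve each parameter annotation to its namespace identifier.
for node in ast.walk(tree):
    if isinstance(node, ast.FunctionDef):
        for arg in node.args.args:
            if isinstance(arg.annotation, ast.Name):
                print(arg.arg, "->", imported.get(arg.annotation.id))
# x -> annolib.types.Number
```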

From guido at python.org  Wed Aug 16 06:04:41 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 15 Aug 2006 21:04:41 -0700
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
	Conventions for annotation consumers)
In-Reply-To: <44E27062.2040406@canterbury.ac.nz>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<44E27062.2040406@canterbury.ac.nz>
Message-ID: <ca471dc20608152104m49517746yc6ef3340f6fc53f4@mail.gmail.com>

On 8/15/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Collin Winter wrote:
>
> > @docstring
> > @typechecker
> > @constrain_values
> > def foo(a: {'doc': "Frobnication count",
> >            'type': Number,
> >            'constrain_values': range(3, 9)},
> >        b: {'type': Number,
> >             # This can be only 4, 8 or 12
> >            'constrain_values': [4, 8, 12]}) -> {'type': Number}
>
> There's another thing that's bothering me about all this.
> The main reason Guido rejected the originally suggested
> syntax for function decorators was that it put too much
> stuff into the function header and obscured the signature.
>
> Now we seem to be about to open ourselves up to the
> same problem on an even bigger scale. Who can honestly
> say that the above function declaration is easy to read?
> To me it looks downright ugly.

It's a worst-case scenario suggesting how one could solve a very hairy
problem. I don't expect that something this extreme will be at all
common (otherwise I'd be against it too).

PS. http://meyerweb.com/eric/comment/chech.html

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 16 06:13:11 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 15 Aug 2006 21:13:11 -0700
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <44DFE092.8030604@canterbury.ac.nz>
References: <44DF0D38.6070507@acm.org> <20060813102036.1985.JCARLSON@uci.edu>
	<44DF86AA.7050207@acm.org> <44DFE092.8030604@canterbury.ac.nz>
Message-ID: <ca471dc20608152113n66471411qf72144022e88f04d@mail.gmail.com>

On 8/13/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Talin wrote:
> > the compiler would note the combination of the attribute access and the
> > call, and combine them into an opcode that skips the whole method
> > creation step.
>
> Something like that could probably be made to work. You'd
> want to be careful to do the optimisation only when the
> attribute in question is an ordinary attribute, not
> a property or other descriptor.
>
> I'm also -1 on eliminating bound methods entirely.
> I worked through that idea in considerable depth during my
> discussions with the author of Prothon, which was also to
> have been without any notion of bound methods. The
> consequences are further-reaching than you might think at
> first. The bottom line is that without bound methods,
> Python wouldn't really be Python any more.


Right. I'm against anything that changes the current semantics. I'm
all for a compiler optimization that turns "<expr> . <name> ( <args>
)" into a single opcode that somehow manages to avoid creating the
bound object. As long as it also does the right thing in case the name
refers to something that's not quite a standard method -- be it a
class method or static method, or a class, or anything else callable
(or even not callable :-). And it would be fine if that optimization
wasn't used if there are keyword arguments, or *args or **kwds, or
more than N arguments for some N > 3 or so.
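
The cost being discussed can be observed directly: in CPython, each
attribute access on an instance builds a fresh bound-method object
before the call, which is exactly what such an opcode would avoid. A
minimal demonstration:

```python
class C:
    def m(self):
        return 42

c = C()

# Two attribute accesses yield two distinct bound-method objects
# wrapping the same underlying function.
assert c.m is not c.m
assert c.m.__func__ is C.m
assert c.m() == 42
```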

But, as Thomas says, it was tried before and didn't quite work. Maybe
we can borrow some ideas from IronPython, which boasts a 7x faster
method call (or was it function call? it was a call anyway); and
according to Jim Hugunin only half of that speed-up (on a linear or
logarithmic scale? he didn't say) can be explained through the .NET
JIT.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul at prescod.net  Wed Aug 16 07:22:34 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 22:22:34 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers (was:
	Re: Draft pre-PEP: function annotations)
In-Reply-To: <43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
	<76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
	<43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
Message-ID: <1cb725390608152222j32727946ob3c07e43fd004299@mail.gmail.com>

On 8/15/06, Collin Winter <collinw at gmail.com> wrote:
>
>
> How exactly do they "deduce" what Bar is, just from the "from bar
> import Bar" line? pychecker would have to import and compile the Bar
> module first. What if being able to import bar depends on some import
> hooks that some other module (imported before bar) installed? I guess
> you'd have to follow the entire import graph just to make sure. Oh,
> and you'd have to end up running the module being analysed in case
> *it* installs some import hooks -- or maybe it defines Bar itself.


The end-user and the type checker creator can negotiate the boundary between
convenience and easy-to-parse syntax. At first the type checker creator
might say that things must be in a very predictable form with no variants
and no renames. Then they might do a bit more analysis and be able to handle
renames. Then they might evolve towards whole-program analysis and be able
to handle very complicated imports.

Surely you know that decorators can also be renamed, imported, etc. Same
with base classes (which are considered key to type checking). This is just
how Python works. Where people need to use static subsets of Python (like
RPython, or the "freeze" program or the compilable subset used by Jython)
they just define the subset and move on. The language's core behaviour
is defined dynamically.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060815/fa53a041/attachment.html 

From paul at prescod.net  Wed Aug 16 07:34:46 2006
From: paul at prescod.net (Paul Prescod)
Date: Tue, 15 Aug 2006 22:34:46 -0700
Subject: [Python-3000] Conventions for annotation consumers
In-Reply-To: <44E275C3.2070508@canterbury.ac.nz>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<44E275C3.2070508@canterbury.ac.nz>
Message-ID: <1cb725390608152234g4010aedbs9cb92c2c361b390f@mail.gmail.com>

On 8/15/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
>
> Paul Prescod wrote:
> > What if
> > two different groups start fighting over the keyword "type" or "doc" or
> > "lock"? Python already has a module system that allows you to use the
> > word "type" and me to use the word "type" without conflict
>
> But, in general, performing this disambiguation requires
> executing the module that is making the annotations. For
> a processor that only wants to deal with the source, this
> is undesirable.


This is true for every proposal we've described. Proposal 1 is:

Foo(int)
Bar(module.type1)

Proposal 2 is:

{"Foo": int,
"Bar": module.type1}

In either case, "int" and "module.type1" can be rebound. To say otherwise is
to change Python's evaluation model drastically.

>>> int = None
>>> float = file
>>>

Once you accept Python's dynamism, it makes sense to accept it both for the
annotation "key" as for the "value". If you can convince Guido and the rest
of the Python-dev team to reject it, then you can reject it for both
equally. So the issue is a red herring.
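
The point can be sketched in the eventual annotation syntax (Foo here
is a hypothetical wrapper standing in for Proposal 1, not a real
class): both proposal styles evaluate their annotation expressions at
definition time, so a rebound name affects them equally.

```python
class Foo:
    """Hypothetical annotation wrapper from Proposal 1."""
    def __init__(self, tp):
        self.tp = tp

# Proposal 1: wrapper objects.
def f(x: Foo(int)):
    return x

# Proposal 2: dicts keyed by consumer name.
def g(x: {"Foo": int}):
    return x

# Either way, the annotation captured whatever "int" was bound to
# at the moment the def statement executed.
assert f.__annotations__["x"].tp is int
assert g.__annotations__["x"]["Foo"] is int
```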

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060815/02465dc1/attachment.htm 

From ironfroggy at gmail.com  Wed Aug 16 07:48:00 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Wed, 16 Aug 2006 01:48:00 -0400
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <ca471dc20608152113n66471411qf72144022e88f04d@mail.gmail.com>
References: <44DF0D38.6070507@acm.org> <20060813102036.1985.JCARLSON@uci.edu>
	<44DF86AA.7050207@acm.org> <44DFE092.8030604@canterbury.ac.nz>
	<ca471dc20608152113n66471411qf72144022e88f04d@mail.gmail.com>
Message-ID: <76fd5acf0608152248j76f38d2x88ba241a8c66c835@mail.gmail.com>

On 8/16/06, Guido van Rossum <guido at python.org> wrote:
> On 8/13/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> > Talin wrote:
> > > the compiler would note the combination of the attribute access and the
> > > call, and combine them into an opcode that skips the whole method
> > > creation step.
> >
> > Something like that could probably be made to work. You'd
> > want to be careful to do the optimisation only when the
> > attribute in question is an ordinary attribute, not
> > a property or other descriptor.
> >
> > I'm also -1 on eliminating bound methods entirely.
> > I worked through that idea in considerable depth during my
> > discussions with the author of Prothon, which was also to
> > have been without any notion of bound methods. The
> > consequences are further-reaching than you might think at
> > first. The bottom line is that without bound methods,
> > Python wouldn't really be Python any more.
>
>
> Right. I'm against anything that changes the current semantics. I'm
> all for a compiler optimization that turns "<expr> . <name> ( <args>
> )" into a single opcode that somehow manages to avoid creating the
> bound object. As long as it also does the right thing in case the name
> refers to something that's not quite a standard method -- be it a
> class method or static method, or a class, or anything else callable
> (or even not callable :-). And it would be fine if that optimization
> wasn't used if there are keyword arguments, or *args or **kwds, or
> more than N arguments for some N > 3 or so.
>
> But, as Thomas says, it was tried before and didn't quite work. Maybe
> we can borrow some ideas from IronPython, which boasts a 7x faster
> method call (or was it function call? it was a call anyway); and
> according to Jim Hugunin only half of that speed-up (on a linear or
> logarithmic scale? he didn't say) can be explained through the .NET
> JIT.
>
> --
> --Guido van Rossum (home page: http://www.python.org/~guido/)

Would a possible special method name __methodcall__ be accepted, where
if it exists on a callable, you can expect to use it as __call__ but
with the understanding that it accepts <expr> as self when called in
an optimizable form? This would reduce a method call to two attribute
lookups before the call, instead of an instantiation and all the heavy
lifting currently done. For normal functions, 'f.__methodcall__ is
f.__call__' may be true, but the existence of the __methodcall__ name
itself gives you an extra contract.

From jimjjewett at gmail.com  Wed Aug 16 16:26:44 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Wed, 16 Aug 2006 10:26:44 -0400
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
	Conventions for annotation consumers)
In-Reply-To: <ca471dc20608152104m49517746yc6ef3340f6fc53f4@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<44E27062.2040406@canterbury.ac.nz>
	<ca471dc20608152104m49517746yc6ef3340f6fc53f4@mail.gmail.com>
Message-ID: <fb6fbf560608160726j3abf6237m8250b9483ecff011@mail.gmail.com>

On 8/16/06, Guido van Rossum <guido at python.org> wrote:
> On 8/15/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:

    [9 lines for a two-argument def]

> > There's another thing that's bothering me about all this.
> > The main reason Guido rejected the originally suggested
> > syntax for function decorators was that it put too much
> > stuff into the function header and obscured the signature.

> It's a worst-case scenario suggesting how one could solve a very hairy
> problem. I don't expect that something this extreme will be at all
> common (otherwise I'd be against it too).

Yes and no; I don't think it will be that uncommon to have multiple
annotations, somewhat similar to "public static final int".  Also note
that needing to disambiguate the annotations will tend to increase
their length.

I hope that needing more than one line per argument will be unusual,
but needing more than one line for a definition may not be.

That is one reason I wonder whether all annotations/modifications have
to actually be part of the prologue, or whether they could be applied
to the Signature afterwards.

-jJ

From guido at python.org  Wed Aug 16 16:45:46 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 16 Aug 2006 07:45:46 -0700
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
	Conventions for annotation consumers)
In-Reply-To: <fb6fbf560608160726j3abf6237m8250b9483ecff011@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<44E27062.2040406@canterbury.ac.nz>
	<ca471dc20608152104m49517746yc6ef3340f6fc53f4@mail.gmail.com>
	<fb6fbf560608160726j3abf6237m8250b9483ecff011@mail.gmail.com>
Message-ID: <ca471dc20608160745t4c662158pbcd05ecfed0c5ef@mail.gmail.com>

On 8/16/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> Yes and no; I don't think it will be that uncommon to have multiple
> annotations, somewhat similar to "public static final int".  Also note
> that needing to disambiguate the annotations will tend to increase
> their length.

God save us from public static final int.

> I hope that needing more than one line per argument will be unusual,
> but needing more than one line for a definition may not be.

I expect the latter will be too, as it would only matter for code that
somehow straddles two or more frameworks.

> That is one reason I wonder whether all annotations/modifications have
> to actually be part of the prologue, or whether they could be applied
> to the Signature afterwards.

And how would that reduce the clutter? The information still has to be
entered by the user, presumably with the same disambiguating tags, and
some punctuation.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From collinw at gmail.com  Wed Aug 16 17:09:39 2006
From: collinw at gmail.com (Collin Winter)
Date: Wed, 16 Aug 2006 10:09:39 -0500
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
References: <mailman.34618.1155684147.27774.python-3000@python.org>
	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
Message-ID: <43aa6ff70608160809qb8882e1m6b471fda3eee8d10@mail.gmail.com>

On 8/15/06, Phillip J. Eby <pje at telecommunity.com> wrote:
> Personally, I thought Guido's original proposal for function annotations,
> which included a __typecheck__ operator that was replaceable on a
> per-module basis (and defaulted to a no-op), was the perfect thing --
> neither too much semantics nor too-little.  I'd like to have it back,
> please.  :)

I'd be perfectly happy to go back to talking about "type annotations",
rather than the more general "function annotations", especially since
most of the discussion thus far has been about how to do multiple things
with annotations at the same time. Restricting annotations to type
information would be fine by me.

Collin Winter

From guido at python.org  Wed Aug 16 17:45:12 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 16 Aug 2006 08:45:12 -0700
Subject: [Python-3000] Conventions for annotation consumers
In-Reply-To: <44E275C3.2070508@canterbury.ac.nz>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<1cb725390608150818r5e95b3fdw31b6998bc32051bb@mail.gmail.com>
	<43aa6ff70608150836o8f9970dr3974935edefb9f3d@mail.gmail.com>
	<1cb725390608151109t8c58f89p43ba13472031201a@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<44E275C3.2070508@canterbury.ac.nz>
Message-ID: <ca471dc20608160845j3f78fa21r645c193cfb7fb41a@mail.gmail.com>

On 8/15/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> But, in general, performing this disambiguation requires
> executing the module that is making the annotations. For
> a processor that only wants to deal with the source, this
> is undesirable.

Um, when did we start off in the direction of source-level processing
of function annotations? Are we still talking about Python? I'm
confused (especially since this thread seems to start in the middle).

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 16 17:48:46 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 16 Aug 2006 08:48:46 -0700
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <76fd5acf0608152248j76f38d2x88ba241a8c66c835@mail.gmail.com>
References: <44DF0D38.6070507@acm.org> <20060813102036.1985.JCARLSON@uci.edu>
	<44DF86AA.7050207@acm.org> <44DFE092.8030604@canterbury.ac.nz>
	<ca471dc20608152113n66471411qf72144022e88f04d@mail.gmail.com>
	<76fd5acf0608152248j76f38d2x88ba241a8c66c835@mail.gmail.com>
Message-ID: <ca471dc20608160848o9812ed1jb83f94e4ec013c09@mail.gmail.com>

On 8/15/06, Calvin Spealman <ironfroggy at gmail.com> wrote:
> On 8/16/06, Guido van Rossum <guido at python.org> wrote:
> > Right. I'm against anything that changes the current semantics. I'm
> > all for a compiler optimization that turns "<expr> . <name> ( <args>
> > )" into a single opcode that somehow manages to avoid creating the
> > bound object. As long as it also does the right thing in case the name
> > refers to something that's not quite a standard method -- be it a
> > class method or static method, or a class, or anything else callable
> > (or even not callable :-). And it would be fine if that optimization
> > wasn't used if there are keyword arguments, or *args or **kwds, or
> > more than N arguments for some N > 3 or so.
> >
> > But, as Thomas says, it was tried before and didn't quite work. Maybe
> > we can borrow some ideas from IronPython, which boasts a 7x faster
> > method call (or was it function call? it was a call anyway); and
> > according to Jim Hugunin only half of that speed-up (on a linear or
> > logarithmic scale? he didn't say) can be explained through the .NET
> > JIT.

> Would a possible special method name __methodcall__ be accepted, where
> if it exists on a callable, you can expect to use it as __call__ but
> with the understanding that it accepts <expr> as self when called in
> an optimizable form? This would reduce the method call to two
> attribute lookups before the call instead of an instansiation and all
> the heavy lifting currently done. For normal functions,
> 'f.__methodcall__ is f.__call__' may be true, but the existance of
> that __methodcall__ name just gives you an extra contract.

I'd like to answer "no" (since I think this whole idea is not a very
fruitful avenue) but frankly, I have no idea what you are trying to
describe. Are you even aware of the descriptor protocol (__get__) and
how it's used to create a bound method (or something else)?

No reply is needed.
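
For readers following along: the descriptor protocol referred to here
is what creates bound methods today. Attribute access on an instance
invokes the function's __get__, so both forms below produce equivalent
bound methods.

```python
class C:
    def m(self):
        return self

c = C()

bound = c.m                                  # normal attribute access
also_bound = C.__dict__["m"].__get__(c, C)   # explicit descriptor call

# Both are bound to c and wrap the same underlying function.
assert bound() is c
assert also_bound() is c
assert bound.__func__ is also_bound.__func__
```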

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From pje at telecommunity.com  Wed Aug 16 18:35:00 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Wed, 16 Aug 2006 12:35:00 -0400
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <43aa6ff70608160809qb8882e1m6b471fda3eee8d10@mail.gmail.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<mailman.34618.1155684147.27774.python-3000@python.org>
	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060816123318.02406018@sparrow.telecommunity.com>

At 10:09 AM 8/16/2006 -0500, Collin Winter wrote:
>On 8/15/06, Phillip J. Eby <pje at telecommunity.com> wrote:
>>Personally, I thought Guido's original proposal for function annotations,
>>which included a __typecheck__ operator that was replaceable on a
>>per-module basis (and defaulted to a no-op), was the perfect thing --
>>neither too much semantics nor too-little.  I'd like to have it back,
>>please.  :)
>
>I'd be perfectly happy to go back to talking about "type annotations",
>rather than the more general "function annotations", especially since
>most of the discussion thus far has been about how to do multiple things
>with annotations at the same time. Restricting annotations to type
>information would be fine by me.

Who said anything about restricting annotations to type information?  I 
just said I liked Guido's original proposal better -- because it doesn't 
restrict a darned thing, and makes it clear that the semantics are up to you.

The annotations of course should still be exposed as a function attribute.


From paul at prescod.net  Wed Aug 16 18:38:21 2006
From: paul at prescod.net (Paul Prescod)
Date: Wed, 16 Aug 2006 09:38:21 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <43aa6ff70608160809qb8882e1m6b471fda3eee8d10@mail.gmail.com>
References: <mailman.34618.1155684147.27774.python-3000@python.org>
	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<43aa6ff70608160809qb8882e1m6b471fda3eee8d10@mail.gmail.com>
Message-ID: <1cb725390608160938h7ddcd317o39e21aac0416a432@mail.gmail.com>

On 8/16/06, Collin Winter <collinw at gmail.com> wrote:
>
> I'd be perfectly happy to go back to talking about "type annotations",
> rather than the more general "function annotations", especially since
> most of the discussion thus far has been about how to do multiple things
> with annotations at the same time. Restricting annotations to type
> information would be fine by me.


I don't understand why we would want to go backwards. You wrote a PEP. We
haven't suggested any major technical changes to it, rather just a few
guidelines. How would restricting the domain of the PEP solve any issues
about dynamicity?

By the way, I think it may be naive to presume that there is only one
relevant type system. People may well want to establish mappings from their
types to programming language types. For example, to COM types, .NET types
and Java types. 80% of these may be inferable from platform-independent
declarations but the other 20% may require a second layer of
platform-specific type declarations.

Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060816/bafe8aa0/attachment.html 

From collinw at gmail.com  Wed Aug 16 18:41:38 2006
From: collinw at gmail.com (Collin Winter)
Date: Wed, 16 Aug 2006 11:41:38 -0500
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <5.1.1.6.0.20060816123318.02406018@sparrow.telecommunity.com>
References: <mailman.34618.1155684147.27774.python-3000@python.org>
	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060816123318.02406018@sparrow.telecommunity.com>
Message-ID: <43aa6ff70608160941j724324b2kd8653df2374778be@mail.gmail.com>

On 8/16/06, Phillip J. Eby <pje at telecommunity.com> wrote:
> At 10:09 AM 8/16/2006 -0500, Collin Winter wrote:
> >On 8/15/06, Phillip J. Eby <pje at telecommunity.com> wrote:
> >>Personally, I thought Guido's original proposal for function annotations,
> >>which included a __typecheck__ operator that was replaceable on a
> >>per-module basis (and defaulted to a no-op), was the perfect thing --
> >>neither too much semantics nor too-little.  I'd like to have it back,
> >>please.  :)
> >
> >I'd be perfectly happy to go back to talking about "type annotations",
> >rather than the more general "function annotations", especially since
> >most of the discussion thus far has been about how to do multiple things
> >with annotations at the same time. Restricting annotations to type
> >information would be fine by me.
>
> Who said anything about restricting annotations to type information?  I
> just said I liked Guido's original proposal better -- because it doesn't
> restrict a darned thing, and makes it clear that the semantics are up to you.
>
> The annotations of course should still be exposed as a function attribute.

Sorry, I meant "restrict" as in having it stated that the annotations
are for typechecking, rather than attempting to support a dozen
different uses simultaneously. The annotations would still be
free-form, with the semantics up to whoever's implementing the
__typecheck__ function, and Python itself wouldn't take any steps to
enforce what can or can't go in the annotations.
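
The shape of that might be sketched as follows; __typecheck__ and the
typechecked decorator are hypothetical names illustrating the proposal,
not anything that exists in Python.

```python
# Default, replaceable hook: a no-op, so annotations cost nothing
# unless a module opts in to enforcement.
def __typecheck__(func, args_dict):
    """Default hook: do nothing."""

def typechecked(func):
    def wrapper(*args):
        names = func.__code__.co_varnames[:func.__code__.co_argcount]
        # The hook is looked up globally at call time, so rebinding
        # the module-level __typecheck__ changes behavior.
        __typecheck__(func, dict(zip(names, args)))
        return func(*args)
    return wrapper

@typechecked
def add(a: int, b: int):
    return a + b

assert add(1, 2) == 3  # default hook: nothing is enforced

# A module could rebind the hook to enforce its annotations:
def strict(func, args_dict):
    for name, value in args_dict.items():
        expected = func.__annotations__.get(name)
        if expected is not None and not isinstance(value, expected):
            raise TypeError(name)

__typecheck__ = strict
assert add(1, 2) == 3  # still fine; add("a", "b") would now raise
```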

Is this more along the lines of what you meant?

Collin Winter

From pje at telecommunity.com  Wed Aug 16 18:54:02 2006
From: pje at telecommunity.com (Phillip J. Eby)
Date: Wed, 16 Aug 2006 12:54:02 -0400
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <43aa6ff70608160941j724324b2kd8653df2374778be@mail.gmail.com>
References: <5.1.1.6.0.20060816123318.02406018@sparrow.telecommunity.com>
	<mailman.34618.1155684147.27774.python-3000@python.org>
	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060816123318.02406018@sparrow.telecommunity.com>
Message-ID: <5.1.1.6.0.20060816124756.023cc448@sparrow.telecommunity.com>

At 11:41 AM 8/16/2006 -0500, Collin Winter wrote:
>On 8/16/06, Phillip J. Eby <pje at telecommunity.com> wrote:
>>At 10:09 AM 8/16/2006 -0500, Collin Winter wrote:
>> >On 8/15/06, Phillip J. Eby <pje at telecommunity.com> wrote:
>> >>Personally, I thought Guido's original proposal for function annotations,
>> >>which included a __typecheck__ operator that was replaceable on a
>> >>per-module basis (and defaulted to a no-op), was the perfect thing --
>> >>neither too much semantics nor too-little.  I'd like to have it back,
>> >>please.  :)
>> >
>> >I'd be perfectly happy to go back to talking about "type annotations",
>> >rather than the more general "function annotations", especially since
>> >most of the discussion thus far has been about how to do multiple things
>> >with annotations at the same time. Restricting annotations to type
>> >information would be fine by me.
>>
>>Who said anything about restricting annotations to type information?  I
>>just said I liked Guido's original proposal better -- because it doesn't
>>restrict a darned thing, and makes it clear that the semantics are up to you.
>>
>>The annotations of course should still be exposed as a function attribute.
>
>Sorry, I meant "restrict" as in having it stated that the annotations
>are for typechecking, rather than attempting to support a dozen
>different uses simultaneously. The annotations would still be
>free-form, with the semantics up to whoever's implementing the
>__typecheck__ function, and Python itself wouldn't take any steps to
>enforce what can or can't go in the annotations.
>
>Is this more along the lines of what you meant?

Yes, but that doesn't mean the notion of "type" can't be fairly 
expansive.  For example, I can foresee wanting to use this "type" 
information to manage marshalling from web forms or XML-RPC 
requests...  defining command-line options and help...  GUI field/widget 
information for command objects, and so on.

In other words, I want open-ended annotation semantics to allow all sorts 
of metadata-driven behavior.

I think the notion that there's a problem with "attempting to support a 
dozen different uses simultaneously" is a red herring.  Docstrings and 
function attributes do just that, and civilization as we know it has not 
collapsed.
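
Eby's parallel can be made concrete. A minimal sketch (the consumer names
are hypothetical; it assumes only that annotations end up exposed as a
function attribute, as proposed earlier in the thread):

```python
# Hypothetical: one free-form annotation dict read by several unrelated
# consumers, much as docstrings already serve help(), doctest, and doc tools.
def set_width(w: {"type": int,
                  "doc": "widget width in pixels",
                  "cli_flag": "--width"}) -> None:
    pass

ann = set_width.__annotations__["w"]  # annotations are plain data
assert ann["type"] is int             # a typechecker reads one key
assert ann["cli_flag"] == "--width"   # a CLI generator reads another
```

Each consumer reads the keys it cares about and ignores the rest, which is
exactly how docstring tools already coexist.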


From paul at prescod.net  Wed Aug 16 18:55:31 2006
From: paul at prescod.net (Paul Prescod)
Date: Wed, 16 Aug 2006 09:55:31 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <43aa6ff70608160941j724324b2kd8653df2374778be@mail.gmail.com>
References: <mailman.34618.1155684147.27774.python-3000@python.org>
	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060816123318.02406018@sparrow.telecommunity.com>
	<43aa6ff70608160941j724324b2kd8653df2374778be@mail.gmail.com>
Message-ID: <1cb725390608160955y6a9776c8x4db1cab893a24875@mail.gmail.com>

On 8/16/06, Collin Winter <collinw at gmail.com> wrote:
>
> Sorry, I meant "restrict" as in having it stated that the annotations
> are for typechecking, rather than attempting to support a dozen
> different uses simultaneously. The annotations would still be
> free-form, with the semantics up to whoever's implementing the
> __typecheck__ function, and Python itself wouldn't take any steps to
> enforce what can or can't go in the annotations.


Nobody ever suggested that Python should take any steps to enforce what can
or can't go in the annotations! It seems that we're inventing disagreement
where there is none. All I ever suggested is a) that we put some guidelines
in the spec *discouraging* people from using built-in Python types for their
own private meanings without some kind of discriminator clarifying that they
are doing so and b) that we define the shared meanings of a couple of useful
types: lists and tuples. This leaves the Python development team the maximum
latitude to specify the meanings for the other types (especially type type)
later.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060816/2505b89f/attachment.htm 

From guido at python.org  Wed Aug 16 18:57:20 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 16 Aug 2006 09:57:20 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers (was:
	Re: Draft pre-PEP: function annotations)
In-Reply-To: <1cb725390608152222j32727946ob3c07e43fd004299@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
	<76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
	<43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
	<1cb725390608152222j32727946ob3c07e43fd004299@mail.gmail.com>
Message-ID: <ca471dc20608160957v1934bd7bi9b0ed3ecfb2bcef7@mail.gmail.com>

There's much in this thread that I haven't followed, for lack of time.

But it seems clear to me that you've wandered off the path now that
you're discussing what should go into the annotations and how to make
it so that multiple frameworks can coexist.

I don't see how any of that can be analyzed up front -- you have to
build an implementation and try to use it and *then* perhaps you can
think about the problems that occur.

Collin wrote a great PEP that doesn't commit to any kind of semantics
for annotations. (I still have to read it more closely, but from
skimming, it looks fine.) Let's focus some efforts on implementing
that first, and see how we can use it, before we consider the use case
of a framework for frameworks.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul at prescod.net  Wed Aug 16 18:58:29 2006
From: paul at prescod.net (Paul Prescod)
Date: Wed, 16 Aug 2006 09:58:29 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <1cb725390608160955y6a9776c8x4db1cab893a24875@mail.gmail.com>
References: <mailman.34618.1155684147.27774.python-3000@python.org>
	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060816123318.02406018@sparrow.telecommunity.com>
	<43aa6ff70608160941j724324b2kd8653df2374778be@mail.gmail.com>
	<1cb725390608160955y6a9776c8x4db1cab893a24875@mail.gmail.com>
Message-ID: <1cb725390608160958s1c8985f3i432ac41cf30d570a@mail.gmail.com>

I said "lists and tuples" where I meant "lists and strings".

On 8/16/06, Paul Prescod <paul at prescod.net> wrote:
>
> On 8/16/06, Collin Winter <collinw at gmail.com> wrote:
>
> > Sorry, I meant "restrict" as in having it stated that the annotations
> > are for typechecking, rather than attempting to support a dozen
> > different uses simultaneously. The annotations would still be
> > free-form, with the semantics up to whoever's implementing the
> > __typecheck__ function, and Python itself wouldn't take any steps to
> > enforce what can or can't go in the annotations.
>
>
> Nobody ever suggested that Python should take any steps to enforce what
> can or can't go in the annotations! It seems that we're inventing
> disagreement where there is none. All I ever suggested is a) that we put
> some guidelines in the spec *discouraging* people from using built-in Python
> types for their own private meanings without some kind of discriminator
> clarifying that they are doing so and b) that we define the shared meanings
> of a couple of useful types: lists and tuples. This leaves the Python
> development team the maximum latitude to specify the meanings for the other
> types (especially type type) later.
>
>  Paul Prescod
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060816/8810da78/attachment.html 

From jcarlson at uci.edu  Wed Aug 16 19:03:05 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 16 Aug 2006 10:03:05 -0700
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
	Conventions for annotation consumers)
In-Reply-To: <ca471dc20608160745t4c662158pbcd05ecfed0c5ef@mail.gmail.com>
References: <fb6fbf560608160726j3abf6237m8250b9483ecff011@mail.gmail.com>
	<ca471dc20608160745t4c662158pbcd05ecfed0c5ef@mail.gmail.com>
Message-ID: <20060816090147.19DA.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> On 8/16/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> > That is one reason I wonder whether all annotations/modifications have
> > to actually be part of the prologue, or whether they could be applied
> > to the Signature afterwards.
> 
> And how would that reduce the clutter? The information still has to be
> entered by the user, presumably with the same disambiguating tags, and
> some punctuation.

I'd say that pulling out annotations from the function signature, which
was argued to be the most important piece of information about a
function during the decorator discussion, could do at least as much to
reduce clutter and increase readability and understandability, as
anything else discussed with regards to the PEP so far.

To pull back out that 9 line function...

> @docstring
> @typechecker
> @constrain_values
> def foo(a: {'doc': "Frobnication count",
>            'type': Number,
>            'constrain_values': range(3, 9)},
>        b: {'type': Number,
>             # This can be only 4, 8 or 12
> >            'constrain_values': [4, 8, 12]}) -> {'type': Number}:

First cleaning up the annotations to not use a dictionary:


@docstring
@typechecker
@constrain_values
def foo(a: [doc("frobination count"),
            type(Number),
            constrain_values(range(3,9))],
        b: [type(Number),
            # This can be only 4, 8 or 12
            constrain_values([4,8,12])]) -> type(Number):

Now let's pull those annotations out of the signature...

@docstring
@typechecker
@constrain_values
@__signature__([doc("frobination count"),
                type(Number),
                constrain_values(range(3,9))],
               [type(Number),
                # This can be only 4, 8 or 12
                constrain_values((4,8,12))], returns=type(Number))
def foo(a, b):


Ultimately the full function definition (including decorators) is just
as cluttered, but now we can see that we have a function that takes two
arguments, without having to scan for 'name:'.  If it is necessary for
someone to know what kinds of values, types, docs, etc., then they can
use the documentation-producing tool that will hopefully come with their
annotation consumer(s).


 - Josiah

P.S.
Then there is the blasphemous:

@docstring(a="frobination count")
@typechecker(a=type(Number), b=type(Number))
@constrain_values(a=range(3,9), b=(4,8,12), returns=type(Number))
def foo(a, b):
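
The "blasphemous" style needs no new language support at all. A minimal
sketch of one such decorator (a hypothetical helper, not an existing
library API):

```python
# Store per-parameter metadata as a plain function attribute,
# keyed by parameter name, via an ordinary decorator factory.
def docstring(**param_docs):
    def decorate(func):
        func.param_docs = param_docs
        return func
    return decorate

@docstring(a="frobination count")
def foo(a, b):
    return a * b

assert foo.param_docs == {"a": "frobination count"}
```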


From jimjjewett at gmail.com  Wed Aug 16 19:03:12 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Wed, 16 Aug 2006 13:03:12 -0400
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
	Conventions for annotation consumers)
In-Reply-To: <ca471dc20608160745t4c662158pbcd05ecfed0c5ef@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<44E27062.2040406@canterbury.ac.nz>
	<ca471dc20608152104m49517746yc6ef3340f6fc53f4@mail.gmail.com>
	<fb6fbf560608160726j3abf6237m8250b9483ecff011@mail.gmail.com>
	<ca471dc20608160745t4c662158pbcd05ecfed0c5ef@mail.gmail.com>
Message-ID: <fb6fbf560608161003m4d66371v4b44311b0f6e7c5e@mail.gmail.com>

On 8/16/06, Guido van Rossum <guido at python.org> wrote:
> On 8/16/06, Jim Jewett <jimjjewett at gmail.com> wrote:

> > I hope that needing more than one line per argument will be unusual,
> > but needing more than one line for a definition may not be.

> I expect the latter will be too, as it would only matter for code that
> somehow straddles two or more frameworks.

    >>> def f(position:[int, "negative possible"]): ...

"int" and the comment are both documentation which doesn't really need
any framework.  They are both things I would like to see when
introspecting that particular function, though perhaps not when just
scanning function defs.  Together, they're already long enough that I
would prefer to see any second argument on its own line.

> > That is one reason I wonder whether all annotations/modifications have
> > to actually be part of the prologue, or whether they could be applied
> > to the Signature afterwards.

> And how would that reduce the clutter? The information still has to be
> entered by the user, presumably with the same disambiguating tags, and
> some punctuation.

The summary of a function shows up in its prologue, but the details
span the next several lines (the full docstring and body suite).

My feeling is that when annotations start to get long, they're details
that should no longer be in the summary.

-jJ

From guido at python.org  Wed Aug 16 19:13:42 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 16 Aug 2006 10:13:42 -0700
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
	Conventions for annotation consumers)
In-Reply-To: <20060816090147.19DA.JCARLSON@uci.edu>
References: <fb6fbf560608160726j3abf6237m8250b9483ecff011@mail.gmail.com>
	<ca471dc20608160745t4c662158pbcd05ecfed0c5ef@mail.gmail.com>
	<20060816090147.19DA.JCARLSON@uci.edu>
Message-ID: <ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>

On 8/16/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> "Guido van Rossum" <guido at python.org> wrote:
> > On 8/16/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> > > That is one reason I wonder whether all annotations/modifications have
> > > to actually be part of the prologue, or whether they could be applied
> > > to the Signature afterwards.
> >
> > And how would that reduce the clutter? The information still has to be
> > entered by the user, presumably with the same disambiguating tags, and
> > some punctuation.
>
> I'd say that pulling out annotations from the function signature, which
> was argued to be the most important piece of information about a
> function during the decorator discussion, could do at least as much to
> reduce clutter and increase readability and understandability, as
> anything else discussed with regards to the PEP so far.
>
> To pull back out that 9 line function...
>
> > @docstring
> > @typechecker
> > @constrain_values
> > def foo(a: {'doc': "Frobnication count",
> >            'type': Number,
> >            'constrain_values': range(3, 9)},
> >        b: {'type': Number,
> >             # This can be only 4, 8 or 12
> >            'constrain_values': [4, 8, 12]}) -> {'type': Number}:
>
> First cleaning up the annotations to not use a dictionary:
>
>
> @docstring
> @typechecker
> @constrain_values
> def foo(a: [doc("frobination count"),
>             type(Number),
>             constrain_values(range(3,9))],
>         b: [type(Number),
>             # This can be only 4, 8 or 12
>             constrain_values([4,8,12])]) -> type(Number):
>
> Now let's pull those annotations out of the signature...
>
> @docstring
> @typechecker
> @constrain_values
> @__signature__([doc("frobination count"),
>                 type(Number),
>                 constrain_values(range(3,9))],
>                [type(Number),
>                 # This can be only 4, 8 or 12
>                 constrain_values((4,8,12))], returns=type(Number))
> def foo(a, b):

I think you have just disproved your point. Apart from losing a few
string quotes this is just as unreadable as the example you started
with, and those string quotes were due to a different convention for
multiple annotations, not due to moving the information into a
descriptor.

> Ultimately the full function definition (including decorators) is just
> as cluttered, but now we can see that we have a function that takes two
> arguments, without having to scan for 'name:' .  If it is necessary for
> someone to know what kinds of values, types, docs, etc., then they can
> use the documentation-producing tool that will hopefully come with their
> annotation consumer(s).

The whole point of putting decorators up front was so that they share
prime real estate ("above the fold" if you will :-) with the function
signature. Claiming that what's in the decorators doesn't distract
from the def itself doesn't make it true.

But, as I said 15 minutes ago, please stop worrying about this so
much. Try to implement Collin's PEP (which doesn't have any
constraints on the semantics or use of annotations). There's a Py3k
sprint at Google (MV and NY) next week -- perhaps we can work on it
there!

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 16 19:18:47 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 16 Aug 2006 10:18:47 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <43aa6ff70608160941j724324b2kd8653df2374778be@mail.gmail.com>
References: <mailman.34618.1155684147.27774.python-3000@python.org>
	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<5.1.1.6.0.20060816123318.02406018@sparrow.telecommunity.com>
	<43aa6ff70608160941j724324b2kd8653df2374778be@mail.gmail.com>
Message-ID: <ca471dc20608161018k2fa6a2a2x37389588578ee79a@mail.gmail.com>

On 8/16/06, Collin Winter <collinw at gmail.com> wrote:
> Sorry, I meant "restrict" as in having it stated that the annotations
> are for typechecking, rather than attempting to support a dozen
> different uses simultaneously. The annotations would still be
> free-form, with the semantics up to whoever's implementing the
> __typecheck__ function, and Python itself wouldn't take any steps to
> enforce what can or can't go in the annotations.

-1. The annotations should be available for whatever the user wants to
use them for. Remember, lots of folks do *not* use shared frameworks
-- the only framework they care about is the one they write for
themselves, and they should not feel guilty about using annotations
for whatever metadata they need. To take up an old rule from the X11
world, the language should provide mechanism without policy.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)
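
Mechanism without policy in practice: the language merely stores the
annotation expressions, and a private one-off "framework" is just whatever
code reads them back. A sketch assuming annotations are exposed via an
`__annotations__` attribute (which is how Python 3 ultimately did expose
them):

```python
# The annotations here carry ad-hoc, user-defined metadata; Python
# evaluates and stores the expressions but attaches no meaning to them.
def frob(count: "how many times to frob", dry_run: bool = False) -> "a status string":
    return "ok"

# Reading them back is the whole "framework":
assert frob.__annotations__["count"] == "how many times to frob"
assert frob.__annotations__["dry_run"] is bool
assert frob.__annotations__["return"] == "a status string"
```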

From jcarlson at uci.edu  Wed Aug 16 20:12:20 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 16 Aug 2006 11:12:20 -0700
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
	Conventions for annotation consumers)
In-Reply-To: <ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>
References: <20060816090147.19DA.JCARLSON@uci.edu>
	<ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>
Message-ID: <20060816102652.19E3.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> On 8/16/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > @docstring
> > @typechecker
> > @constrain_values
> > @__signature__([doc("frobination count"),
> >                 type(Number),
> >                 constrain_values(range(3,9))],
> >                [type(Number),
> >                 # This can be only 4, 8 or 12
> >                 constrain_values((4,8,12))], returns=type(Number))
> > def foo(a, b):
> 
> I think you have just disproved your point. Apart from losing a few
> string quotes this is just as unreadable as the example you started
> with, and those string quotes were due to a different convention for
> multiple annotations, not due to moving the information into a
> descriptor.
> 
> > Ultimately the full function definition (including decorators) is just
> > as cluttered, but now we can see that we have a function that takes two
> > arguments, without having to scan for 'name:' .  If it is necessary for
> > someone to know what kinds of values, types, docs, etc., then they can
> > use the documentation-producing tool that will hopefully come with their
> > annotation consumer(s).
> 
> The whole point of putting decorators up front was so that they share
> prime real estate ("above the fold" if you will :-) with the function
> signature. Claiming that what's in the decorators doesn't distract
> from the def itself doesn't make it true.

From using Python, my brain has become trained to look for new indent
levels, so when I'm looking for function definitions, this is what I see...

@CRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAP
@CRAPCRAPCRAPCRAPCRAPCRAPCRAP
@CRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAP
def foo(...):
    #stuff
    a = ...
    b = ...
    for ...:
        ...
    ...

In my opinion, decorators that don't include their own indentation for
readability do not distract from the def.  I would imagine that many
people (not just me) have trained themselves to look for new indent
levels, and would agree at some level with this.

Indents within decorators generally induce false positives during visual
scanning, but aside from including a line in the Python style guide
about not using multi-line decorators (and people being kind to readers
of their code), there's not much we can do.


> But, as I said 15 minutes ago, please stop worrying about this so
> much. Try to implement Collin's PEP (which doesn't have any
> constraints on the semantics or use of annotations). There's a Py3k
> sprint at Google (MV and NY) next week -- perhaps we can work on it
> there!

I'm trying to keep function *signatures* readable. Including one *small*
annotation per argument isn't a big deal, but when simple function
signatures (from the def to the suite colon) start spanning multiple
lines, they are getting both ungreppable and unreadable.  My primary
concern is users grepping, reading, and understanding.  If annotations
detract from any of those three, then the annotation is a waste of time
(in my opinion).

This was one of the concerns brought up in the decorator discussion, and
why none of the decorator proposals that sat between the def and the
closing paren even have typed-out examples listed as contenders on the
PythonDecorators wiki (they each get a bullet list as to why they suck).

But maybe I'm misremembering the discussion, maybe decorators make it
very difficult to visually scan for function definitions, and maybe
people want all that garbage in their function signature.


 - Josiah


From guido at python.org  Wed Aug 16 20:17:33 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 16 Aug 2006 11:17:33 -0700
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
	Conventions for annotation consumers)
In-Reply-To: <20060816102652.19E3.JCARLSON@uci.edu>
References: <20060816090147.19DA.JCARLSON@uci.edu>
	<ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>
	<20060816102652.19E3.JCARLSON@uci.edu>
Message-ID: <ca471dc20608161117h62bdaa96ra59eb2dc9bef5f78@mail.gmail.com>

On 8/16/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> "Guido van Rossum" <guido at python.org> wrote:
> > On 8/16/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > > @docstring
> > > @typechecker
> > > @constrain_values
> > > @__signature__([doc("frobination count"),
> > >                 type(Number),
> > >                 constrain_values(range(3,9))],
> > >                [type(Number),
> > >                 # This can be only 4, 8 or 12
> > >                 constrain_values((4,8,12))], returns=type(Number))
> > > def foo(a, b):
> >
> > I think you have just disproved your point. Apart from losing a few
> > string quotes this is just as unreadable as the example you started
> > with, and those string quotes were due to a different convention for
> > multiple annotations, not due to moving the information into a
> > descriptor.
> >
> > > Ultimately the full function definition (including decorators) is just
> > > as cluttered, but now we can see that we have a function that takes two
> > > arguments, without having to scan for 'name:' .  If it is necessary for
> > > someone to know what kinds of values, types, docs, etc., then they can
> > > use the documentation-producing tool that will hopefully come with their
> > > annotation consumer(s).
> >
> > The whole point of putting decorators up front was so that they share
> > prime real estate ("above the fold" if you will :-) with the function
> > signature. Claiming that what's in the decorators doesn't distract
> > from the def itself doesn't make it true.
>
> From using Python, my brain has become trained to look for new indent
> levels, so when I'm looking for function definitions, this is what I see...
>
> @CRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAP
> @CRAPCRAPCRAPCRAPCRAPCRAPCRAP
> @CRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAPCRAP
> def foo(...):
>     #stuff
>     a = ...
>     b = ...
>     for ...:
>         ...
>     ...

Well, then the problem becomes finding the tiny 'def' between all that CRAP.

> In my opinion, decorators that don't include their own indentation for
> readability do not distract from the def.  I would imagine that many
> people (not just me) have trained themselves to look for new indent
> levels, and would agree at some level with this.

But notice that the example *did* include multi-line decorators with
indented continuation lines.

> Indents within decorators generally induce false positives during visual
> scanning, but aside from including a line in the Python style guide
> about not using multi-line decorators (and people being kind to readers
> of their code), there's not much we can do.

There's another style:

type_a = {"foo": some_type_for_framework_foo, "bar": some_other_type, etc.}
type_b = {...similar...}

def my_fun(a: type_a, b: type_b) -> type_c:
    ...

This works just as well for the list style of having multiple annotations.

If you write a lot of code that uses multiple annotations, I'd be very
surprised if there weren't a bunch of common combinations that could
be shared like this.

> > But, as I said 15 minutes ago, please stop worrying about this so
> > much. Try to implement Collin's PEP (which doesn't have any
> > constraints on the semantics or use of annotations). There's a Py3k
> > sprint at Google (MV and NY) next week -- perhaps we can work on it
> > there!
>
> I'm trying to keep function *signatures* readable. Including one *small*
> annotation per argument isn't a big deal, but when simple function
> signatures (from the def to the suite colon) start spanning multiple
> lines, they are getting both ungreppable and unreadable.  My primary
> concern is users grepping, reading, and understanding.  If annotations
> detract from any of those three, then the annotation is a waste of time
> (in my opinion).

What exactly are you grepping for where a multi-line arglist would get
in the way? The most complicated pattern for which I grep is probably
something along the lines of '^def \w+\('.

> This was one of the concerns brought up in the decorator discussion, and
> why none of the decorator proposals that sat between the def and the
> closing paren even have typed-out examples listed as contenders on the
> PythonDecorators wiki (they each get a bullet list as to why they suck).
>
> But maybe I'm misremembering the discussion, maybe decorators make it
> very difficult to visually scan for function definitions, and maybe
> people want all that garbage in their function signature.

They don't want it, but if they're forced to have it occasionally
they'll cope. I still think you're way overestimating the importance
of this use case.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)
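
The factored-out style sketched above is just ordinary module-level data
shared between signatures (the names are hypothetical):

```python
# A common annotation combination, defined once so that the def line
# itself stays short and greppable.
number_3_to_9 = {"type": "Number", "constrain_values": range(3, 9)}

def my_fun(a: number_3_to_9, b: number_3_to_9) -> "Number":
    return a + b

# Both parameters refer to the same shared annotation object:
assert my_fun.__annotations__["a"] is number_3_to_9
assert my_fun.__annotations__["a"] is my_fun.__annotations__["b"]
```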

From paul at prescod.net  Thu Aug 17 02:11:34 2006
From: paul at prescod.net (Paul Prescod)
Date: Wed, 16 Aug 2006 17:11:34 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers (was:
	Re: Draft pre-PEP: function annotations)
In-Reply-To: <ca471dc20608160957v1934bd7bi9b0ed3ecfb2bcef7@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
	<76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
	<43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
	<1cb725390608152222j32727946ob3c07e43fd004299@mail.gmail.com>
	<ca471dc20608160957v1934bd7bi9b0ed3ecfb2bcef7@mail.gmail.com>
Message-ID: <1cb725390608161711v7c0a93b9i3f9e2032da9254af@mail.gmail.com>

Okay, you're the boss. The conversation did go pretty far afield but the
main thing I wanted was just that if a user wanted to have annotations from
framework 1 and framework 2 they could reliably express that as

def foo(a: [Anno1, Anno2]):

All that that requires is a statement in the spec saying: "If you're
processing annotations and you see an annotation you don't understand, skip
it. And if you see a list, look inside it rather than processing it in some
proprietary fashion."

It kind of seemed obvious to me, but I guess everyone's ideas seem obvious
to them. There were other secondary things I would have liked but this
seemed like the minimum required to protect programmers from "greedy
frameworks" that don't play nice in the face of unfamiliar annotations.
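
The convention is easy to state in code. A sketch with two hypothetical
frameworks' marker classes and one well-behaved consumer that looks inside
lists and skips annotations it doesn't recognize:

```python
class TypeAnno:                      # framework 1's marker (hypothetical)
    def __init__(self, t):
        self.t = t

class DocAnno:                       # framework 2's marker (hypothetical)
    def __init__(self, text):
        self.text = text

def collect_types(func):
    """Typechecking consumer: walks list annotations, ignores foreign items."""
    types = {}
    for name, ann in func.__annotations__.items():
        items = ann if isinstance(ann, list) else [ann]
        for item in items:
            if isinstance(item, TypeAnno):
                types[name] = item.t
    return types

def foo(a: [TypeAnno(int), DocAnno("first operand")]) -> None:
    pass

assert collect_types(foo) == {"a": int}   # the DocAnno is silently skipped
```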

On 8/16/06, Guido van Rossum <guido at python.org> wrote:
>
> There's much in this thread that I haven't followed, for lack of time.
>
> But it seems clear to me that you've wandered off the path now that
> you're discussing what should go into the annotations and how to make
> it so that multiple frameworks can coexist.
>
> I don't see how any of that can be analyzed up front -- you have to
> build an implementation and try to use it and *then* perhaps you can
> think about the problems that occur.
>
> Collin wrote a great PEP that doesn't commit to any kind of semantics
> for annotations. (I still have to read it more closely, but from
> skimming, it looks fine.) Let's focus some efforts on implementing
> that first, and see how we can use it, before we consider the use case
> of a framework for frameworks.
>
> --
> --Guido van Rossum (home page: http://www.python.org/~guido/)
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060816/b8481a6c/attachment.html 

From greg.ewing at canterbury.ac.nz  Thu Aug 17 03:37:04 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Thu, 17 Aug 2006 13:37:04 +1200
Subject: [Python-3000] Function annotations considered obfuscatory (Re:
 Conventions for annotation consumers)
In-Reply-To: <ca471dc20608160745t4c662158pbcd05ecfed0c5ef@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<fb6fbf560608141522j64f611e9ndecd696214b2088c@mail.gmail.com>
	<43aa6ff70608141551o2db297d8ue30c552a5eff5a95@mail.gmail.com>
	<43aa6ff70608150706q1ecea414ob6b339ceef95a4d9@mail.gmail.com>
	<44E27062.2040406@canterbury.ac.nz>
	<ca471dc20608152104m49517746yc6ef3340f6fc53f4@mail.gmail.com>
	<fb6fbf560608160726j3abf6237m8250b9483ecff011@mail.gmail.com>
	<ca471dc20608160745t4c662158pbcd05ecfed0c5ef@mail.gmail.com>
Message-ID: <44E3C840.1000602@canterbury.ac.nz>

Guido van Rossum wrote:

> And how would that reduce the clutter? The information still has to be
> entered by the user, presumably with the same disambiguating tags, and
> some punctuation.

But at least the function header itself would retain
its wysiwyt[1] character of being mostly just a list of
parameter names.

--
[1] What You See Is What You Type

From greg.ewing at canterbury.ac.nz  Thu Aug 17 03:46:43 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Thu, 17 Aug 2006 13:46:43 +1200
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <ca471dc20608160957v1934bd7bi9b0ed3ecfb2bcef7@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608151128u720b59ecq7d6831177452ebea@mail.gmail.com>
	<1cb725390608151207g36a4c692v2c2d49c1a3e821fa@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
	<76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
	<43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
	<1cb725390608152222j32727946ob3c07e43fd004299@mail.gmail.com>
	<ca471dc20608160957v1934bd7bi9b0ed3ecfb2bcef7@mail.gmail.com>
Message-ID: <44E3CA83.6030801@canterbury.ac.nz>

Guido van Rossum wrote:

> Collin wrote a great PEP that doesn't commit to any kind of semantics
> for annotations.

I think the argument started because Collin's PEP actually
went further than that, and asserted that there wouldn't
be any problems created by this lack of specification,
for reasons which are highly debatable. Not surprisingly,
a great deal of debate on that point ensued.

If the PEP simply said something like "We'll look at
this again after we've had some experience", it might
not have been so controversial.

--
Greg

From talin at acm.org  Thu Aug 17 08:21:44 2006
From: talin at acm.org (Talin)
Date: Wed, 16 Aug 2006 23:21:44 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <43aa6ff70608160809qb8882e1m6b471fda3eee8d10@mail.gmail.com>
References: <mailman.34618.1155684147.27774.python-3000@python.org>	<5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<43aa6ff70608160809qb8882e1m6b471fda3eee8d10@mail.gmail.com>
Message-ID: <44E40AF8.8060400@acm.org>

Collin Winter wrote:
> On 8/15/06, Phillip J. Eby <pje at telecommunity.com> wrote:
>> Personally, I thought Guido's original proposal for function annotations,
>> which included a __typecheck__ operator that was replaceable on a
>> per-module basis (and defaulted to a no-op), was the perfect thing --
>> neither too much semantics nor too-little.  I'd like to have it back,
>> please.  :)
> 
> I'd be perfectly happy to go back to talking about "type annotations",
> rather than the more general "function annotations", especially since
> most of the discussion thus far has been about how to do multiple things
> with annotations at the same time. Restricting annotations to type
> information would be fine by me.

I'd be happy to do that as well :)

So far, there has been a great deal of confusion and disagreement about 
this proposal. Some people might be surprised by that - however, my 
point from the beginning has been that this confusion and disagreement is 
*inherent* in the concept of function annotations as currently envisioned.

What the current discussion demonstrates is that the number of different 
ways in which function annotations can be used is far larger and more 
diverse than anticipated ("Never underestimate the creative power of an 
infinite number of monkeys".) Normally, this wouldn't be seen as a 
problem, but rather a strength of the design. Whenever you have a broad 
and diverse set of use cases for a given feature, that's usually an 
indication that the feature has been designed well.

However, having a vast set of use cases only works if those use cases 
can have some degree of isolation from one another. If I write a 
decorator, I'm not too concerned about what other decorator classes may 
exist; I may not even be too concerned about what other decorators are 
applied to the same function as mine are.

However, function annotations are a little different from the usual case. 
Specifically, they need to be fairly concise, otherwise they are 
obfuscatory (as someone pointed out). One of the ways of achieving this 
conciseness is to remove the requirement to explicitly identify each 
annotation, and instead allow the meanings of the annotations to be 
implicit. (i.e. the use of built-in types rather than a dictionary of 
key/value pairs.)

The problem with implicit identification is that the category boundaries 
for each annotation are no longer clearly defined. This wouldn't be a 
problem if the number of use cases were small and widely separated. 
However, as the recent discussion has shown, the number of use cases is 
vast and diverse. This means that the implicitly defined categories are 
inevitably going to collide.

What I and others are worried about is that it appears that we are 
heading in a direction in which different users of function annotations 
will be forced to jostle elbows with each other - where each consumer of 
annotations, instead of being able to develop their annotation system in 
private, will be forced to consider the other annotation systems that 
exist already. For someone who is developing an annotation library that 
is intended for widespread use, the *mere existence* of other annotation 
libraries impacts their design and must be taken into account. I feel 
that this is an intolerable burden on the designers of such systems.

Some have proposed resolving this by going back to explicit 
identification of annotations, either by keyword or by unique types. 
However, this destroys some of the conciseness and simplicity of the 
annotations, something which others have objected to.

Personally, I think that the function annotation concept suffers from 
being too ambitious, attempting to be all things to all people. I don't 
think we really need docstring annotations - there are other ways to 
achieve the same effect. The same goes for type checkers and lint 
checkers and most of the other ideas for using annotations. All those 
things are nice, but if they never get done I'm not going to worry about 
it -- and none of these things are worth the level of madness and 
confusion generated by an N-way collision of incompatible frameworks.

I'm going to take a somewhat hard line here, and say that if it were up 
to me, I would ask Phillip Eby exactly what annotation features he needs 
to make his overload dispatching mechanism work, and then I would 
restrict the PEP to just that. In other words, rather than saying 
"annotations can be anything the programmer wants", I would instead say 
"This set of annotations is used for dispatching, any other use of 
annotations is undefined." Which is not to say that a programmer can't 
make up their own -- but that programmer should have no expectations 
that their code is going to be able to interoperate with anyone else's.

-- Talin


From collinw at gmail.com  Thu Aug 17 15:01:13 2006
From: collinw at gmail.com (Collin Winter)
Date: Thu, 17 Aug 2006 08:01:13 -0500
Subject: [Python-3000] Fwd: Conventions for annotation consumers (was:
	Re: Draft pre-PEP: function annotations)
In-Reply-To: <1cb725390608161711v7c0a93b9i3f9e2032da9254af@mail.gmail.com>
References: <43aa6ff70608141459r434f7170sb725468c117ff080@mail.gmail.com>
	<43aa6ff70608151213v1ced455btdaadb51e1761738d@mail.gmail.com>
	<1cb725390608151230i298c1889gd93233db2b6f980a@mail.gmail.com>
	<43aa6ff70608151313h2b945032wa556903f3f9a44c3@mail.gmail.com>
	<76fd5acf0608151519k564723f9q2d409e3285a7918f@mail.gmail.com>
	<76fd5acf0608151520n7692824asf06a849ac3114e5b@mail.gmail.com>
	<43aa6ff70608151529q18748348g3dce7c193450a0fb@mail.gmail.com>
	<1cb725390608152222j32727946ob3c07e43fd004299@mail.gmail.com>
	<ca471dc20608160957v1934bd7bi9b0ed3ecfb2bcef7@mail.gmail.com>
	<1cb725390608161711v7c0a93b9i3f9e2032da9254af@mail.gmail.com>
Message-ID: <43aa6ff70608170601v4ef2435eq7824f35867767c7d@mail.gmail.com>

On 8/16/06, Paul Prescod <paul at prescod.net> wrote:
> Okay, you're the boss. The conversation did go pretty far afield but the
> main thing I wanted was just that if a user wanted to have annotations from
> framework 1 and framework 2 they could reliably express that as
>
> def foo(a: [Anno1, Anno2]):
>
> All that that requires is a statement in the spec saying: "If you're
> processing annotations and you see an annotation you don't understand, skip
> it. And if you see a list, look inside it rather than processing it in some
> proprietary fashion."

So, time for an embarrassing confession: I had a bit of a eureka
moment this morning, and I think I finally understand where you were
coming from with this idea. I honestly don't know what I thought you
were proposing, but now that I get it, my old conception seems like
rubbish. Consider the dict-based proposal withdrawn.
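The convention quoted above can be sketched in a few lines. This is a hypothetical illustration, not anything from the PEP: the `Anno1` marker class, `consume_annotations` helper, and `foo` are made up for the example.

```python
# A sketch of the skip-unknown convention: a consumer handles only the
# annotation types it recognises, looks inside lists rather than
# treating them as opaque, and silently skips everything else.

class Anno1:
    """Hypothetical marker class belonging to one framework."""
    def __init__(self, note):
        self.note = note

def consume_annotations(func, handled_type):
    """Collect annotations of handled_type, skipping all others."""
    found = {}
    for name, anno in getattr(func, "__annotations__", {}).items():
        # A list means "annotations from several frameworks": look inside.
        items = anno if isinstance(anno, list) else [anno]
        for item in items:
            if isinstance(item, handled_type):
                found[name] = item
            # else: some other framework's annotation -- skip it
    return found

def foo(a: [Anno1("count"), "some other framework's marker"],
        b: "not ours"):
    return a
```

Each consumer sees only its own annotations; the other entries in the list pass through untouched.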

Apologies for my part in dragging this discussion into a triple-digit
comment count : )

Collin Winter

From bmx007 at gmail.com  Fri Aug 18 11:59:49 2006
From: bmx007 at gmail.com (bmx007)
Date: Fri, 18 Aug 2006 10:59:49 +0100
Subject: [Python-3000] Fwd: Conventions for annotation consumers
Message-ID: <3f2f9e8c0608180259ybfdf102r48eda9daafdebedf@mail.gmail.com>

Hi,
I haven't read the whole thread because it's pretty long, but if I have
understood Paul correctly, my opinion (and the reason I use docstrings
in my own typechecker module) is that it's a good idea not to mix a
function's definition with its type.

I think the difference between languages is not what they allow you to do,
but how easy it is to write something and how easy it is to READ it (the
read factor is why I switched from p... to python).
So the separation of semantics and types is a good thing, because as
readers we don't usually need to know both at the same time; we can read
only what we need.

As an example, consider a function

def find(token, line):
    ...

with strings as parameters and a boolean as return value.

In almost all cases what we need to know is only the order of the
parameters: do I call find(token, line) or find(line, token)?

In this case I should be able to find the semantics of the parameters
easily, and I don't care about their types (because I work in a context
where I expect them). That is obviously easiest when there is no extra
information, so the old way, def find(token, line), is best.

And it's the same for types: when I care about the type it is usually
a question of consistency, and I don't care about the semantics. This
occurs for example in "template" functions such as

def max(x, y):
   ...
where max could be
int, int -> int
float, float ->float
string, string -> string

and something like (or any equivalent)
def max(x, y):
   :: int, int -> int
   :: float, float -> float
   :: string, string -> string

is (I think) easy to read and write, and we can easily skip the
information we don't care about.
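Since no `::` syntax exists today, the separate-line idea can already be approximated by putting the signature lines in the docstring and parsing them out. The parser below is a hypothetical sketch, not part of any proposal:

```python
def max2(x, y):
    """Return the larger of x and y.

    :: int, int -> int
    :: float, float -> float
    :: string, string -> string
    """
    return x if x >= y else y

def signatures(func):
    """Pull the '::' signature lines out of a function's docstring."""
    doc = func.__doc__ or ""
    return [line.strip()[2:].strip()
            for line in doc.splitlines()
            if line.strip().startswith("::")]
```

A reader who only cares about the parameter order reads the `def` line; a type checker reads the `::` lines; neither gets in the other's way.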

Maxime

From ncoghlan at gmail.com  Fri Aug 18 17:14:16 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 19 Aug 2006 01:14:16 +1000
Subject: [Python-3000] Bound and unbound methods
In-Reply-To: <ca471dc20608160848o9812ed1jb83f94e4ec013c09@mail.gmail.com>
References: <44DF0D38.6070507@acm.org>
	<20060813102036.1985.JCARLSON@uci.edu>	<44DF86AA.7050207@acm.org>
	<44DFE092.8030604@canterbury.ac.nz>	<ca471dc20608152113n66471411qf72144022e88f04d@mail.gmail.com>	<76fd5acf0608152248j76f38d2x88ba241a8c66c835@mail.gmail.com>
	<ca471dc20608160848o9812ed1jb83f94e4ec013c09@mail.gmail.com>
Message-ID: <44E5D948.7070503@gmail.com>

Guido van Rossum wrote:
>> Would a possible special method name __methodcall__ be accepted, where
>> if it exists on a callable, you can expect to use it as __call__ but
>> with the understanding that it accepts <expr> as self when called in
>> an optimizable form? This would reduce the method call to two
>> attribute lookups before the call instead of an instansiation and all
>> the heavy lifting currently done. For normal functions,
>> 'f.__methodcall__ is f.__call__' may be true, but the existance of
>> that __methodcall__ name just gives you an extra contract.
> 
> I'd like to answer "no" (since I think this whole idea is not a very
> fruitful avenue) but frankly, I have no idea what you are trying to
> describe. Are you even aware of the descriptor protocol (__get__) and
> how it's used to create a bound method (or something else)?
> 
> No reply is needed.

If I understand Calvin right, the best speed up we could get for the status 
quo is for the "METHOD_CALL" opcode to:
   1. Do a lookup that bypasses the descriptor machinery (i.e. any __get__ 
methods are not called at this point)
   2. If the object is a function object, invoke __call__ directly, supplying 
the instance as the first argument
   3. If the object is a classmethod object, invoke __call__ directly, 
supplying the class as the first argument
   4. If the object is a staticmethod object, invoke __call__ directly, 
without supplying any extra arguments
   5. If the object has a __get__ method, call it and invoke __call__ on the 
result
   6. Otherwise, invoke __call__ on the object

(Caveat: this omits details of the lookup process regarding how descriptors 
are handled that an actual implementation would need to deal with).

I think what Calvin is suggesting is, instead of embedding all those special 
cases in the op code, allow a descriptor to define __methodcall__ as an 
optimised combination of calling __get__ and then invoking __call__ on the 
result. Then the sequence of events in the op code would be to:

   1. Do a lookup that bypasses the descriptor machinery
   2. If the object defines it, invoke __methodcall__ directly, supplying the 
instance as the first argument and the class as the second argument (similar 
to __get__), followed by the args tuple as the 3rd argument and the keyword 
dictionary as the 4th argument.
   3. If the object doesn't define __methodcall__, but has a __get__ method, 
then call it and invoke __call__ on the result
   4. Otherwise, invoke __call__ on the object

For example, on a function object, __methodcall__ would look like:

   def __methodcall__(self, obj, cls, args, kwds):
       if obj is None:
           raise TypeError("Cannot call unbound method")
       return self(obj, *args, **kwds)

On a class method descriptor:

   def __methodcall__(self, obj, cls, args, kwds):
       return self._function(cls, *args, **kwds)

On a static method descriptor:

   def __methodcall__(self, obj, cls, args, kwds):
       return self._function(*args, **kwds)
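Putting the pieces together, the whole lookup-and-dispatch sequence can be sketched in pure Python. This is only an illustration of the proposal, not the actual opcode: `vars()` stands in for "a lookup that bypasses the descriptor machinery", and the `__methodcall__` lookup on the type is the hypothetical fast path.

```python
def method_call(obj, name, args, kwds):
    # 1. Look the attribute up on the class, bypassing descriptors.
    for klass in type(obj).__mro__:
        if name in vars(klass):
            attr = vars(klass)[name]
            break
    else:
        raise AttributeError(name)
    # 2. Prefer the proposed __methodcall__ fast path, if defined.
    methodcall = getattr(type(attr), "__methodcall__", None)
    if methodcall is not None:
        return methodcall(attr, obj, type(obj), args, kwds)
    # 3/4. Otherwise fall back to the usual __get__ then __call__ dance.
    get = getattr(type(attr), "__get__", None)
    if get is not None:
        return get(attr, obj, type(obj))(*args, **kwds)
    return attr(*args, **kwds)
```

With today's types nothing defines `__methodcall__`, so plain functions, classmethods and staticmethods all go through the fallback branch.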

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Fri Aug 18 18:18:39 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 19 Aug 2006 02:18:39 +1000
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
Message-ID: <44E5E85F.6080508@gmail.com>

Phillip J. Eby wrote:
> I'm frankly baffled by the amount of "protect users from incompatibility" 
> ranting that this issue has generated.  If I wanted to use Java, I'd know 
> where to find it.  Guido has said time and again that Python's balance 
> favors the individual developer at the expense of the group where 
> "consenting adults" is concerned, and Py3K isn't intended to change that 
> balance.

I actually thought Collin's approach in the PEP was reasonable (deferring the 
details of combining annotations until we had some more experience with how 
they could be made useful in practice). Some of the wording was a little 
strong (suggesting that the conventions would *never* be developed), but the 
idea was sound.

To try and put this in perspective:

1. I believe argument annotations have the most potential to be beneficial 
when used in conjunction with a single decorator chosen or written by the 
developer to support things like Foreign Function Interface type mapping 
(PyObjC, ctypes, XML-RPC, etc), or function overloading (RuleDispatch, etc).

2. If a developer wishes to use multiple annotations together, they can define 
their own annotation processing decorator that invokes the necessary 
operations using non-annotation based APIs provided by the appropriate 
framework, many of which already exist, and will continue to exist in Py3k due 
to the need to be able to process functions which have not been annotated at 
all (such as functions written in C).

3. The question has been raised as to whether or not there is a practical way 
for a developer to use annotations that make sense to a *static* analysis tool 
that doesn't actually execute the Python code.

If someone figures out a way to handle the last point *without* compromising 
the ease of use for annotations designed to handle point 1, all well and good. 
Otherwise, I'd call YAGNI. OK, annotations wouldn't be useful for tools like 
pychecker in that case. So be it - to be really useful for a tool like 
pychecker they'd have to be ubiquitous, and that's really not Python any more.

All that said, I'm still not entirely convinced that function annotations are 
a good idea in the first place - I'm inclined to believe that signature 
objects providing a "bind" method that returns a dictionary mapping the method 
call's arguments to the function's named parameters will prove far more 
useful. With this approach, the 'annotations' would continue to be supplied as 
arguments to decorator factories instead of as expressions directly in the 
function header. IOW, I've yet to see any use case that is significantly 
easier to write with function annotations instead of decorator arguments, and 
several cases where function annotations are significantly worse.

For one thing, function annotations are useless for decorating a function that 
was defined elsewhere, whereas it doesn't matter where the function came from 
when using decorator arguments. The latter also has a major benefit in 
unambiguously associating each annotation with the decorator that is the 
intended consumer.

Consider an extreme example Josiah used elsewhere in this discussion:

 > @docstring
 > @typechecker
 > @constrain_values
 > def foo(a: [doc("frobination count"),
 >             type(Number),
 >             constrain_values(range(3,9))],
 >         b: [type(Number),
 >             # This can be only 4, 8 or 12
 >             constrain_values([4,8,12])]) -> type(Number):

Here's how it looks with decorator factories instead:

# Using keyword arguments
@docstring(a="frobination count")
@typechecker(a=Number, b=Number, _return=Number)
@constrain_values(a=range(3,9), b=[4,8,12])
def foo(a, b):
     # the code

# Using positional arguments
@docstring("frobination count")
@typechecker(Number, Number, _return=Number)
@constrain_values(range(3,9), [4,8,12])
def foo(a, b):
     # the code

All the disambiguation cruft is gone, the association between the decorators 
and the values they are processing is clear, the expressions are split 
naturally across the different decorator lines, and the basic signature is 
found easily by scanning for the last line before the indented section. The 
_return=Number is a bit ugly, but that could be handled by syntactic sugar 
that processed a "->expr" in a function call as equivalent to "return=expr" 
(i.e. adding the result of the expression to the keywords dictionary under the 
key "return").

Another advantage of the decorator-with-arguments approach is that you can 
call the decorator factory once, store the result in a variable, and then 
reuse that throughout your module, which is harder with annotations directly 
in the function header (which means that you can only share single 
annotations, not combinations of annotations). For example:

floats2_to_float2tuple = typechecker(float, float, _return=(float, float))

@floats2_to_float2tuple
def cartesian_to_polar(x, y):
     return math.sqrt(x*x + y*y), math.atan2(y, x)

@floats2_to_float2tuple
def polar_to_cartesian(r, theta):
     return r*math.cos(theta), r*math.sin(theta)

Cheers,
Nick.


-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From rrr at ronadam.com  Sat Aug 19 12:29:31 2006
From: rrr at ronadam.com (Ron Adam)
Date: Sat, 19 Aug 2006 05:29:31 -0500
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <44E5E85F.6080508@gmail.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<44E5E85F.6080508@gmail.com>
Message-ID: <ec6pbl$7iv$1@sea.gmane.org>

Nick Coghlan wrote:

[Clipped other good points.]

> 3. The question has been raised as to whether or not there is a practical way 
> for a developer to use annotations that make sense to a *static* analysis tool 
> that doesn't actually execute the Python code
> 
> If someone figures out a way to handle the last point *without* compromising 
> the ease of use for annotations designed to handle point 1, all well and good. 
> Otherwise, I'd call YAGNI. OK, annotations wouldn't be useful for tools like 
> pychecker in that case. So be it - to be really useful for a tool like 
> pychecker they'd have to be ubiquitous, and that's really not Python any more.

Something I've been looking for is an alternate way to generate function 
signatures that are closer to those used in the documentation.

Where help(str.find) gives:

     find(...)
         S.find(sub [,start [,end]]) -> int

         Return the lowest index in S where substring sub is found,
         such that sub is contained within s[start,end].  Optional
         arguments start and end are interpreted as in slice notation.

         Return -1 on failure.

But I am wondering if the annotations could help with both pydoc and 
pychecker.  Then maybe function specifications could be generated and 
look more like ...

    str.find(sub:IsString [,start:IsInt [,end:IsInt]]) -> IsInt

instead of just...

    find(...)


[See below where I'm going with this.]



> All that said, I'm still not entirely convinced that function annotations are 
> a good idea in the first place - I'm inclined to believe that signature 
> objects providing a "bind" method that returns a dictionary mapping the method 
> call's arguments to the function's named parameters will prove far more 
> useful. With this approach, the 'annotations' would continue to be supplied as 
> arguments to decorator factories instead of as expressions directly in the 
> function header. IOW, I've yet to see any use case that is significantly 
> easier to write with function annotations instead of decorator arguments, and 
> several cases where function annotations are significantly worse.
 >
> For one thing, function annotations are useless for decorating a function that 
> was defined elsewhere, whereas it doesn't matter where the function came from 
> when using decorator arguments. The latter also has a major benefit in 
> unambiguously associating each annotation with the decorator that is the 
> intended consumer.


I've been thinking about this also.  It seems maybe there is an effort 
to separate the "meta-data" and the "use of meta-data" a bit too finely. 
What you then get is a lock-and-key effect where the decorators that 
use the meta-data and the meta-data itself are separate, but at the same 
time strongly associated by location (module) and developer.  This may 
be a bit overstated in order to describe it, but I do think it's a 
real concern. But it is also probably more a style-of-use issue 
than an issue with annotations themselves.

The meta-data can also *be* the validator.  So instead of just using 
Float, Int, Long, etc... and writing a smart validator to read and check 
each of those, you can just call the meta-data directly with each 
related argument to validate/modify/or do whatever to it.

So this ...

  > @docstring
  > @typechecker
  > @constrain_values
  > def foo(a: [doc("frobination count"),
  >             type(Number),
  >             constrain_values(range(3,9))],
  >         b: [type(Number),
  >             # This can be only 4, 8 or 12
  >             constrain_values([4,8,12])]) -> type(Number):


could be reduced to ...    (removing redundant checks as well)

     from metalib import *

     @callmeta
     def foo( a: [ SetDoc("frobination count"), InRange(3,9) ],
              b: InSet([4,8,12]) ) -> IsNumber:
         # code


Which isn't too bad. Or even as positional decorator arguments...

     from metalib import *

     @callmeta( [SetDoc("frobination count"), InRange(3,9)],
                InSet([4,8,12]),
                IsNumber )
     def foo(a, b):
         # code


Both of these are very similar.  The callmeta decorator would be 
implemented differently in each case, but by using the validators as the 
meta-data, it makes both versions easier to read and use. IMHO of course.


The metalib routines could be something (roughly) like...

     def IsNumber(arg):
         return type(arg) in (float, int, long)

     def IsString(arg):
         return type(arg) in (str, unicode)

     def InSet(list_):
         def inset(arg):
             return arg in list_
         return inset

     def InRange(start, stop):
         def inrange(arg):
             return start <= arg <= stop
         return inrange

     etc...


(Or it might be better for them to be objects.)
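A matching callmeta decorator might look roughly like this. Everything here is a hypothetical sketch under Ron's assumptions: validators are plain callables stored as (Py3k-style) annotations, and a False return means the check failed.

```python
import functools
import inspect

def callmeta(func):
    """Call each annotation as a validator on the matching argument."""
    params = list(inspect.signature(func).parameters)

    @functools.wraps(func)
    def wrapper(*args, **kwds):
        bound = dict(zip(params, args), **kwds)
        for name, value in bound.items():
            checks = func.__annotations__.get(name)
            if checks is None:
                continue
            if not isinstance(checks, list):
                checks = [checks]
            for check in checks:
                # A validator returning False rejects the argument;
                # any other return value (e.g. from SetDoc) is ignored.
                if check(value) is False:
                    raise TypeError("%s=%r failed %s"
                                    % (name, value, check))
        return func(*args, **kwds)
    return wrapper

@callmeta
def foo(a: lambda v: 3 <= v < 9,
        b: lambda v: v in (4, 8, 12)):
    return a + b
```

Validators that only attach meta-data (like SetDoc) simply return None and are ignored by the check.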


Anyway, it's very late and I'm probably overlooking something, and I 
haven't actually tried any of these, so your mileage may vary.  ;-)

Cheers,
   Ron






From pedronis at strakt.com  Sat Aug 19 12:54:19 2006
From: pedronis at strakt.com (Samuele Pedroni)
Date: Sat, 19 Aug 2006 12:54:19 +0200
Subject: [Python-3000] signature annotation in the function signature or a
	separate line
In-Reply-To: <ca471dc20608161117h62bdaa96ra59eb2dc9bef5f78@mail.gmail.com>
References: <20060816090147.19DA.JCARLSON@uci.edu>	<ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>	<20060816102652.19E3.JCARLSON@uci.edu>
	<ca471dc20608161117h62bdaa96ra59eb2dc9bef5f78@mail.gmail.com>
Message-ID: <44E6EDDB.9070604@strakt.com>

Guido van Rossum wrote:
>>But maybe I'm misremembering the discussion, maybe decorators make it
>>very difficult to visually scan for function definitions, and maybe
>>people want all that garbage in their function signature.
> 
> 
> They don't want it, but if they're forced to have it occasionally
> they'll cope. I still think you're way overestimating the importance
> of this use case.
> 

Given that the meaning of annotations is deliberately not predefined,
given that people are coming up with arbitrarily verbose examples
thereof, and given the precedent of type-inferred languages
that use a separate line for optional type information, I think
devising a way to have the annotation on a different line,
with a decorator-like introduction instead of mixed into
the function head, would be saner:

One possibility would be to have a syntax for signature expressions
and then allow them as decorators with the obvious effect of attaching
themselves:

@sig int,int -> int
def f(a,b):
     return a+b

or with optional argument names:

@sig a: int,b: int -> int
def f(a,b):
     return a+b

sig expressions (possibly with parens) would be first class
and be able to appear anywhere an expression is allowed;
they would produce an object embedding the signature information.

So both of these would be possible:

@typecheck
@sig int,int -> int
def f(a,b):
     return a+b

@typecheck(sig int,int -> int)
def f(a,b):
     return a+b

For example having first-class signatures would help express nicely 
reflective queries on overloaded/generic functions, etc...

regards.






From guido at python.org  Sat Aug 19 17:09:53 2006
From: guido at python.org (Guido van Rossum)
Date: Sat, 19 Aug 2006 08:09:53 -0700
Subject: [Python-3000] int-long unification
Message-ID: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>

Martin,

I've thought about it more, and I think it's fine to use a single
type. It will surely simplify many things, and that alone might help
us win back some of the inefficiency this introduces. And it is best
for Python-level users.

Are you interested in doing this at the Google sprint next week?

Here's how I would approach it:

0. Benchmark. (Py3k is slower than 2.5 at the moment, I don't know
why.) I would pick the benchmark that showed the biggest sensitivity
in your recent comparisons.

1. Completely gut intobject.[ch], making all PyInt APIs equivalent to
the corresponding PyLong APIs (through macros if possible). The PyInt
macros become functions. I'm not sure whether it would be better for
PyInt_Check() to always return False or to always return True. In
bltinmodule, export "int" as an alias for "long".

2. Bang on the rest of the code until it compiles and passes all unit
tests (except the 5 that I haven't managed to fix yet -- test_class,
test_descr, test_minidom, and the two etree tests). (Right now many
more are broken due to the elimination of has_key; I'll fix these over
the weekend.)

3. Go over much of the C code where it special-cases PyInt and PyLong
separately, and change this to only use the PyLong calls. Keep the
unittests working.

4. Benchmark.

5. Introduce some optimizations into longobject.c, e.g. a cache for
small ints (like we had in intobject.c), and perhaps a special
representation for values less than maxint (or for anything that fits
in a long long). Or anything else you can think of.

6. Benchmark.

7. Repeat from 5 until satisfied.

At this point I wouldn't rip out the PyInt APIs; leaving them in
aliased to PyLong APIs for a while will let us put off the work on
some of the more obscure extension modules.
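Step 5's small-int cache can be illustrated with a toy model. CPython would do this in C inside longobject.c; the Python below is only a sketch of the idea (the `Long` class and cache bounds are illustrative, not the real implementation):

```python
class Long:
    """Toy stand-in for the unified int/long type."""
    def __init__(self, value):
        self.value = value

# Preallocate the commonly used small values once, up front.
_CACHE_MIN, _CACHE_MAX = -5, 257
_cache = [Long(i) for i in range(_CACHE_MIN, _CACHE_MAX)]

def long_from_value(value):
    """Return a shared object for small values, a fresh one otherwise."""
    if _CACHE_MIN <= value < _CACHE_MAX:
        return _cache[value - _CACHE_MIN]
    return Long(value)
```

The win is that the small values that dominate real programs (loop counters, indices) never trigger an allocation.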

What do you think?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From ncoghlan at gmail.com  Sat Aug 19 19:54:00 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 20 Aug 2006 03:54:00 +1000
Subject: [Python-3000] signature annotation in the function signature or
 a	separate line
In-Reply-To: <44E6EDDB.9070604@strakt.com>
References: <20060816090147.19DA.JCARLSON@uci.edu>	<ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>	<20060816102652.19E3.JCARLSON@uci.edu>	<ca471dc20608161117h62bdaa96ra59eb2dc9bef5f78@mail.gmail.com>
	<44E6EDDB.9070604@strakt.com>
Message-ID: <44E75038.1090007@gmail.com>

Samuele Pedroni wrote:
> Guido van Rossum wrote:
>>> But maybe I'm misremembering the discussion, maybe decorators make it
>>> very difficult to visually scan for function definitions, and maybe
>>> people want all that garbage in their function signature.
>>
>> They don't want it, but if they're forced to have it occasionally
>> they'll cope. I still think you're way overestimating the importance
>> of this use case.
>>
> 
> Given that the meaning of annotations is deliberately not predefined,
> given that people are coming up with arbitrarily verbose examples
> thereof, and given the precedent of type-inferred languages
> that use a separate line for optional type information, I think
> devising a way to have the annotation on a different line,
> with a decorator-like introduction instead of mixed into
> the function head, would be saner:
> 
> One possibility would be to have a syntax for signature expressions
> and then allow them as decorators with the obvious effect of attaching
> themself:
> 
> @sig int,int -> int
> def f(a,b):
>      return a+b
> 
> or with optional argument names:
> 
> @sig a: int,b: int -> int
> def f(a,b):
>      return a+b
> 
> sig expressions (possibly with parens) would be first class
> and be able to appear anywhere an expression is allowed,
> they would produce an object embedding the signature information.

What would a separate sig expression buy you over defining "->expr" as a 
special form of keyword argument that binds to the keyword name "return" in 
the dictionary for storing extra keyword arguments?

With the argument based approach, the two above examples would look like:

@sig(int, int, ->int)
def f(a,b):
      return a+b

@sig(a=int, b=int, ->int)
def f(a,b):
      return a+b

The implementation of sig might look something like:
   def sig(*args, **kwds):
       def annotator(f):
           # Assume bind() is defined to pass through any
           # 'return' binding into the returned mapping
           # Otherwise, it uses normal parameter binding
           notes = f.__signature__.bind(*args, **kwds)
           f.__signature__.annotations = notes
           return f
       return annotator

The longer this discussion goes on, the more convinced I become that making it 
easier to write decorator factories that produce decorators that map the 
factory's arguments to the decorated function's parameters is a better idea 
than adding function annotations directly to the function signature.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From pedronis at strakt.com  Sat Aug 19 19:59:23 2006
From: pedronis at strakt.com (Samuele Pedroni)
Date: Sat, 19 Aug 2006 19:59:23 +0200
Subject: [Python-3000] signature annotation in the function signature or
 a	separate line
In-Reply-To: <44E75038.1090007@gmail.com>
References: <20060816090147.19DA.JCARLSON@uci.edu>	<ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>	<20060816102652.19E3.JCARLSON@uci.edu>	<ca471dc20608161117h62bdaa96ra59eb2dc9bef5f78@mail.gmail.com>
	<44E6EDDB.9070604@strakt.com> <44E75038.1090007@gmail.com>
Message-ID: <44E7517B.9020300@strakt.com>

Nick Coghlan wrote:
> 
> What would a separate sig expression buy you over defining "->expr" as a 
> special form of keyword argument that binds to the keyword name "return" 
> in the dictionary for storing extra keyword arguments?

It seems to me a quirky addition of sugar, and it could not stay limited; I 
prefer going the full length and supporting argument-name introduction 
with ':' etc., as shown in the example.

But it seems we agree that interspersing the annotations in the main
head of the function is not such a great idea after all.

From pedronis at strakt.com  Sat Aug 19 20:08:08 2006
From: pedronis at strakt.com (Samuele Pedroni)
Date: Sat, 19 Aug 2006 20:08:08 +0200
Subject: [Python-3000] signature annotation in the function signature or
 a	separate line
In-Reply-To: <44E7517B.9020300@strakt.com>
References: <20060816090147.19DA.JCARLSON@uci.edu>	<ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>	<20060816102652.19E3.JCARLSON@uci.edu>	<ca471dc20608161117h62bdaa96ra59eb2dc9bef5f78@mail.gmail.com>	<44E6EDDB.9070604@strakt.com>
	<44E75038.1090007@gmail.com> <44E7517B.9020300@strakt.com>
Message-ID: <44E75388.2080907@strakt.com>

Samuele Pedroni wrote:
> Nick Coghlan wrote:
> 
>>What would a separate sig expression buy you over defining "->expr" as a 
>>special form of keyword argument that binds to the keyword name "return" 
>>in the dictionary for storing extra keyword arguments?
> 
> 
> It seems to me a quirky addition of sugar, and it could not stay limited; I 
> prefer going the full length and supporting argument-name introduction 
> with ':' etc., as shown in the example.
> 

to be more precise, I find:

@sig a: int, b: int -> int

more readable and to the point than: @sig(a=int,b=int,->int).
First-class sig expressions could have rules for leaving out the parens,
as genexps do, etc.

Also it can be extended to support attaching annotations to * and ** 
args. It would be hard to devise separate sugar for those.

> But it seems we agree that interspersing the annotations in the main
> head of the function is not such a great idea after all.
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/pedronis%40strakt.com


From brett at python.org  Sat Aug 19 21:06:02 2006
From: brett at python.org (Brett Cannon)
Date: Sat, 19 Aug 2006 12:06:02 -0700
Subject: [Python-3000] int-long unification
In-Reply-To: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
References: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
Message-ID: <bbaeab100608191206q7b60048cg6c9cf46fe71b309a@mail.gmail.com>

On 8/19/06, Guido van Rossum <guido at python.org> wrote:
>
> Martin,
>
> I've thought about it more, and I think it's fine to use a single
> type. It will surely simplify many things, and that alone might help
> us win back some of the inefficiency this introduces. And it is best
> for Python-level users.


Woohoo!  I totally support this idea (along with anything else that comes up
to simplify the C API; I almost feel like we need a dumbed-down API along
with the full-powered API behind it).  I also support Martin doing the work
=)  (but that's mostly because I know he is in a good position to do it
well).


-Brett
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060819/c6bd844a/attachment.htm 

From paul at prescod.net  Sat Aug 19 21:19:54 2006
From: paul at prescod.net (Paul Prescod)
Date: Sat, 19 Aug 2006 12:19:54 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <ec6pbl$7iv$1@sea.gmane.org>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<44E5E85F.6080508@gmail.com> <ec6pbl$7iv$1@sea.gmane.org>
Message-ID: <1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>

On 8/19/06, Ron Adam <rrr at ronadam.com> wrote:
>
>      @callmeta
>      def foo( a: [ SetDoc("frobination count"), InRange(3,9) ],
>               b: InSet([4,8,12]) ) -> IsNumber:
>         # code


What extra information or value does the callmeta decorator provide? For the
sake of argument, I'll presume it has some useful function. Even so, it
doesn't make sense to explicitly attach it to every function.

Imagine a hundred such functions in a module. Would it be better to do this:

@callmeta
def func1(..): ...

@callmeta
def func2(..): ...

@callmeta
def func3(..): ...

@callmeta
def func4(..): ...

@callmeta
def func5(..): ...

Or to do this:

func1(...):...

func2(...):...

func3(...):...

func4(...):...

func5(...):...

callmeta()
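A minimal sketch of what such a module-level callmeta() might look like, assuming (this is an assumption, not anything defined in the thread) that the metadata lives in a `meta` attribute holding callable checks:

```python
import functools
import types

def callmeta(namespace):
    """Wrap every function in `namespace` that carries callable metadata
    in a (hypothetical) `meta` attribute, so the checks run before each call."""
    def activate(func):
        @functools.wraps(func)
        def wrapper(*args, **kwds):
            for check in func.meta:   # each item is assumed to be callable
                check(*args, **kwds)
            return func(*args, **kwds)
        return wrapper
    for name, obj in list(namespace.items()):
        if isinstance(obj, types.FunctionType) and hasattr(obj, 'meta'):
            namespace[name] = activate(obj)
```

Calling `callmeta(globals())` at the bottom of a module would then activate every annotated function in one step, skipping those without metadata.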

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060819/c0ac90cc/attachment.html 

From rrr at ronadam.com  Sun Aug 20 00:27:06 2006
From: rrr at ronadam.com (Ron Adam)
Date: Sat, 19 Aug 2006 17:27:06 -0500
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>	<44E5E85F.6080508@gmail.com>
	<ec6pbl$7iv$1@sea.gmane.org>
	<1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>
Message-ID: <ec83d2$o69$1@sea.gmane.org>

Paul Prescod wrote:
> On 8/19/06, *Ron Adam* <rrr at ronadam.com <mailto:rrr at ronadam.com>> wrote:
> 
>          @callmeta
>          def foo( a: [ SetDoc("frobination count"), InRange(3,9) ],
>                   b: InSet([4,8,12]) ) -> IsNumber:
>             # code
> 
> 
> What extra information or value does the callmeta decorator provide? For 
> the sake of argument, I'll presume it has some useful function. Even so, 
> it doesn't make sense to explicitly attach it to every function.

The callmeta decorator wouldn't provide any extra information itself; 
all it does is decorate (wrap) the functions so that the metadata gets 
called.  It activates the metadata calls.



> Imagine a hundred such functions in a module. Would it be better to do this:
> 
> @callmeta
> def func1(..): ...
> 
> @callmeta
> def func2(..): ...
> 
> @callmeta
> def func3(..): ...
> 
> @callmeta
> def func4(..): ...
> 
> @callmeta
> def func5(..): ...

Isn't this the same?


> Or to do this:
> 
> func1(...):...
> 
> func2(...):...
> 
> func3(...):...
> 
> func4(...):...
> 
> func5(...):...
> 
> callmeta()

So here callmeta() wraps all the functions to activate the metadata? 
That should also work if you want to activate all the functions, or a 
large list of functions with metadata.  It could just skip those 
without callable metadata.



>  Paul Prescod




From bob at redivi.com  Sun Aug 20 01:57:47 2006
From: bob at redivi.com (Bob Ippolito)
Date: Sat, 19 Aug 2006 16:57:47 -0700
Subject: [Python-3000] int-long unification
In-Reply-To: <bbaeab100608191206q7b60048cg6c9cf46fe71b309a@mail.gmail.com>
References: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
	<bbaeab100608191206q7b60048cg6c9cf46fe71b309a@mail.gmail.com>
Message-ID: <6a36e7290608191657h36645421u3b0859dc504c40b3@mail.gmail.com>

On 8/19/06, Brett Cannon <brett at python.org> wrote:
>
> On 8/19/06, Guido van Rossum <guido at python.org> wrote:
> > Martin,
> >
> > I've thought about it more, and I think it's fine to use a single
> > type. It will surely simplify many things, and that alone might help
> > us win back some of the inefficiency this introduces. And it is best
> > for Python-level users.
>
>
> Woohoo!  I totally support this idea (along with anything else that comes up
> to simplify the C API; I almost feel like we need a dumbed-down API along
> with the full-powered API behind it).  I also support Martin doing the work
> =)  (but that's mostly because I know he is in a good position to do it
> well).

The easiest thing we could do to simplify extension writing would be
to supply a script that generates extension source and a setup.py from
a generic template. The template would demonstrate the current best
practices for defining a function, a constant, an Exception subclass,
and a class that wraps a C struct with a method or two.

-bob

From paul at prescod.net  Sun Aug 20 02:06:23 2006
From: paul at prescod.net (Paul Prescod)
Date: Sat, 19 Aug 2006 17:06:23 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <ec83d2$o69$1@sea.gmane.org>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<44E5E85F.6080508@gmail.com> <ec6pbl$7iv$1@sea.gmane.org>
	<1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>
	<ec83d2$o69$1@sea.gmane.org>
Message-ID: <1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>

On 8/19/06, Ron Adam <rrr at ronadam.com> wrote:
>
>
> The callmeta decorator wouldn't provide any extra information itself,
> all it does is decorate(wrap) the functions so that the meta data gets
> called.  It activates the meta data calls.


I think we're using the word "metadata" differently. In my universe,
metadata is a form of data and you don't "call" data. You just assert it. I
think that what you are trying to do is USE metadata as a form of runtime
precondition. That's totally fine as long as we are clear that there are
many uses for metadata that do not require anything to "happen" during the
function's instantiation. A documentation annotation or annotation to map to
a foreign type system are examples. So the decorator is allowed but
optional. Given that that's the case, I guess I don't understand the virtue
of bringing decorators into the picture. Yes, they are one consumer of
metadata. Module-scoped functions are another. Application scoped functions
are another. Third party data extraction programs are another. Decorators
working with metadata are just special cases of runtime processors of it.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060819/e291e9b7/attachment.html 

From rrr at ronadam.com  Sun Aug 20 05:17:28 2006
From: rrr at ronadam.com (Ron Adam)
Date: Sat, 19 Aug 2006 22:17:28 -0500
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>	<44E5E85F.6080508@gmail.com>
	<ec6pbl$7iv$1@sea.gmane.org>	<1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>	<ec83d2$o69$1@sea.gmane.org>
	<1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>
Message-ID: <ec8kdh$o86$1@sea.gmane.org>

Paul Prescod wrote:
> On 8/19/06, *Ron Adam* <rrr at ronadam.com <mailto:rrr at ronadam.com>> wrote:
> 
> 
>     The callmeta decorator wouldn't provide any extra information itself,
>     all it does is decorate(wrap) the functions so that the meta data gets
>     called.  It activates the meta data calls.
> 
> 
> I think we're using the word "metadata" differently. In my universe, 
> metadata is a form of data and you don't "call" data. You just assert 
> it. I think that what you are trying to do is USE metadata as a form of 
> runtime precondition. 

Yes, I am extending the term in this case to include the details of 
implementing the metadata.  If you describe something in enough detail, 
it might as well be Python code.  And if it is Python code, well, why not 
make use of that?


Each of these describes "some info" in greater detail.

(1) "some info"


(2) some_info = "Brief description of some_info."


(3) some_info =
        """
           Detailed description of what
           some info is, and how to use it.

           Pseudo code to how to implement some_info property.
              (pseudo code ...)
        """

(4) some_info =
        """
           Detailed description of what
           some info is, and how to use it.

           # example python code to implement the
           # some_info property of x.
           def some_info_foo(x):
               ...
        """

(5) def some_info(x):
        """
           some info - description of what some_info is.
        """
        <Python code as the exact description of some_info>



This last one describes it so precisely that it can actually be called, 
if desired.  So why not use it?


> That's totally fine as long as we are clear that 
> there are many uses for metadata that do not require anything to 
> "happen" during the function's instantiation. A documentation annotation 
> or annotation to map to a foreign type system are examples. So the 
> decorator is allowed but optional. 

Yes, it is optional in this case too.  Just because it's callable, 
doesn't mean it has to be called to be used.  I could just as well use 
the doc attribute of the meta function, or the function name in any way 
I want and ignore the code completely.


> Given that that's the case, I guess I 
> don't understand the virtue of bringing decorators into the picture. 
> Yes, they are one consumer of metadata. Module-scoped functions are 
> another. Application scoped functions are another. Third party data 
> extraction programs are another. Decorators working with metadata are 
> just special cases of runtime processors of it.
> 
>  Paul Prescod

Decorators reduce repetition and put their labels before a function 
instead of after it.  They aren't strictly required at *any* time, because 
you can do the same thing without them, but they can help make the code 
more readable.

    @decorator
    def foo(x):
       # code

is the same as...

    def foo(x):
       # code
    foo = decorator(foo)

The name is repeated three times in the non-decorator version and, 
because it is located after the function, it might not be noticed.  Other 
than that, they are never required.  (Unless I'm unaware of special cases.)

(Not meaning to start a decorator discussion, just clarifying.)

Cheers,
    Ron






From ncoghlan at gmail.com  Sun Aug 20 06:45:43 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 20 Aug 2006 14:45:43 +1000
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>	<44E5E85F.6080508@gmail.com>
	<ec6pbl$7iv$1@sea.gmane.org>	<1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>	<ec83d2$o69$1@sea.gmane.org>
	<1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>
Message-ID: <44E7E8F7.30101@gmail.com>

Paul Prescod wrote:
> Given that that's the case, I guess I 
> don't understand the virtue of bringing decorators into the picture. 
> Yes, they are one consumer of metadata. Module-scoped functions are 
> another. Application scoped functions are another. Third party data 
> extraction programs are another. Decorators working with metadata are 
> just special cases of runtime processors of it.

The reason I believe decorators are relevant is because the question that has 
caused this discussion to go on for so long is one of *disambiguation*. That 
is, there are *lots* of different reasons for annotating a function signature, 
so how does a programmer indicate which particular interpretation is the one 
they mean? Obviously, you can say, "I'm using the signature annotations in my 
module for purpose X". However, a later maintainer of your module may go "but 
I wanted to use those annotations for purpose Y!".

Without function signature annotations in the syntax, *this is not a problem*. 
The One Obvious Way to implement both purpose X and purpose Y is as decorator 
factories that accept as arguments the information corresponding to each of 
the function parameters. Multiple decorators can already be stacked on a 
single function, and the names of the different decorators allow the different 
uses to be easily distinguished using the full power of Python's variable 
namespaces.
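The stacked-decorator style described above can be sketched concretely (the decorator names, the `docinfo`/`typeinfo` attributes, and purposes X and Y are all invented for illustration):

```python
def docinfo(**notes):
    """Hypothetical 'purpose X': attach documentation notes per parameter."""
    def deco(f):
        f.docinfo = notes
        return f
    return deco

def typeinfo(**notes):
    """Hypothetical 'purpose Y': attach type notes per parameter."""
    def deco(f):
        f.typeinfo = notes
        return f
    return deco

# Both purposes annotate the same function without competing for a
# single per-parameter annotation slot; the decorator names disambiguate.
@docinfo(a="first operand", b="second operand")
@typeinfo(a=int, b=int)
def f(a, b):
    return a + b
```

Each consumer then reads only its own attribute, so no conflict-resolution mechanism is needed.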

If signature annotations are added to the language, however, you have a new 
way of doing things: put the information in the signature annotations and 
write a decorator that consumes the signature information. And if two 
different utilities do that, then you have a conflict, and have to invent a 
mechanism for resolving it. And this disambiguation has to happen for each 
individual signature annotation instead of being done once for the whole 
function as would be the case with using separate decorators.

So, as far as I can see, adding signature annotations doesn't let us do 
anything that can't already be done with less ambiguity using decorator 
factories that accept the appropriate arguments.

Samuele's idea of "signature expressions" (i.e. a literal or builtin function 
for producing objects that describe a function's signature) seems like a 
*much* more fruitful avenue for exploration, as it would provide a genuine 
increase in expressiveness (decorator factories would be able to accept a 
single signature argument instead of separate arguments that then need to be 
mapped to the relevant function parameter).

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From guido at python.org  Sun Aug 20 17:04:33 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 20 Aug 2006 08:04:33 -0700
Subject: [Python-3000] signature annotation in the function signature or
	a separate line
In-Reply-To: <44E6EDDB.9070604@strakt.com>
References: <20060816090147.19DA.JCARLSON@uci.edu>
	<ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>
	<20060816102652.19E3.JCARLSON@uci.edu>
	<ca471dc20608161117h62bdaa96ra59eb2dc9bef5f78@mail.gmail.com>
	<44E6EDDB.9070604@strakt.com>
Message-ID: <ca471dc20608200804w5ffe2bf2t4c15f2c715a861a8@mail.gmail.com>

On 8/19/06, Samuele Pedroni <pedronis at strakt.com> wrote:
> Given that the meaning of annotations is meant not to be predefined,

Not sure what that has to do with it.

> given that people are coming up with arbitrarily verbose examples
> thereof,

Which I believe are worst-case scenarios and not what we'll see in practice.

> given the precedent of type-inferred languages
> that use a separate line for optional type information

Can you show us an example or two?

> I think
> devising a way to have the annotation on a different line
> with a decorator-like introduction instead of mixed with
> the function head would be saner:
>
> One possibility would be to have a syntax for signature expressions
> and then allow them as decorators with the obvious effect of attaching
> themself:
>
> @sig int,int -> int
> def f(a,b):
>      return a+b

One problem with this is that for larger argument lists it's hard for the
(human) reader to match types up with arguments. In general I don't like
having two parallel lists of things that must be matched up; I'd much rather
have a single list containing all the info packed together.

> or with optional argument names:
>
> @sig a: int,b: int -> int
> def f(a,b):
>      return a+b

This seems like it would merely move the problem to the previous line; it
doesn't solve the problem that the signature becomes unreadable when the
type expressions are long lists or dicts.

My own recommended solution for long signatures is to generously use the
Python equivalent of 'typedef'; instead of writing

def f(a: [PEAK("some peakish expression here"),
          Zope("some zopeish expression here")],
      b: [...more of the same...]) -> [PEAK("..."), Zope("...")]:
    return a+b

I think most cases can be made a lot more readable by saying

type_a = [PEAK("some peakish expression here"),
          Zope("some zopeish expression here")]
type_b = [...more of the same...]
type_f = [PEAK("..."), Zope("...")]

def f(a: type_a, b: type_b) -> type_f:
    return a+b

especially since I expect that in many cases there will be typedefs that can
be shared between multiple signatures.
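For the record, this typedef style is exactly what the annotation syntax eventually adopted in Python 3 (PEP 3107) supports; a runnable sketch with plain lists of strings standing in for the PEAK/Zope annotation objects:

```python
# Shared "typedefs": name the verbose annotation objects once, reuse them.
# (Plain strings stand in for PEAK(...)/Zope(...) objects here.)
type_a = ["some peakish expression here", "some zopeish expression here"]
type_b = ["...more of the same..."]
type_f = ["..."]

def f(a: type_a, b: type_b) -> type_f:
    return a + b
```

The annotations land in `f.__annotations__`, keyed by parameter name, with the return annotation stored under the key `'return'`.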

> sig expressions (possibly with parens) would be first class
> and be able to appear anywhere an expression is allowed;
> they would produce an object embedding the signature information.

I think it's a good idea to have a way to produce a signature object without
tying it to a function definition; but I'd rather not introduce any new
syntax for just this purpose. For purely positional signatures, this could
be done using a built-in function, e.g.

  s = sig(int, int, returns=int)

I'm not sure what to do to create signatures that include the variable
names, the best I can come up with is

  s = sig(('a', int), ('b', int), returns=int)

(Note that you can't use keyword parameters because that would lose the
ordering of the parameters. Possibly signatures could be limited to
describing parameters that are purely positional and parameters that are
purely keyword but no mixed-mode parameters? Nah, too restrictive.)

But I still don't want to introduce new syntax just for this. In extreme
cases you can always define a dummy function and extract its __signature__
object.
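The `inspect.Signature` API that Python later grew provides exactly this kind of first-class signature object without new syntax (a sketch; this API did not exist in 2006, and the discussed `__signature__` attribute is its ancestor):

```python
import inspect

# Build a signature object for "(a: int, b: int) -> int" directly,
# preserving parameter order and names, without defining a dummy function.
P = inspect.Parameter
s = inspect.Signature(
    parameters=[P('a', P.POSITIONAL_OR_KEYWORD, annotation=int),
                P('b', P.POSITIONAL_OR_KEYWORD, annotation=int)],
    return_annotation=int)
```

Because parameters are passed as an ordered list rather than keyword arguments, the ordering problem Guido mentions does not arise.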

> So both of these would be possible:
>
> @typecheck
> @sig int,int -> int
> def f(a,b):
>      return a+b
>
> @typecheck(sig int,int -> int)
> def f(a,b):
>      return a+b

I'm not sure we need more ways to express the same thing. :-)

> For example having first-class signatures would help express nicely
> reflective queries on overloaded/generic functions, etc...

Agreed. But I think there's a way without forcing the annotations out of the
'def' line.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060820/84237306/attachment.html 

From g.brandl at gmx.net  Sun Aug 20 17:21:35 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Sun, 20 Aug 2006 17:21:35 +0200
Subject: [Python-3000] raise with traceback?
Message-ID: <ec9um0$pqs$1@sea.gmane.org>

Hi,

as

   raise ValueError, "something went wrong"

is going to go away, how will one raise with a custom traceback?
The obvious

   raise ValueError("something went wrong"), traceback

or something more esoteric like

   raise ValueError("something went wrong") with traceback

?

Georg


From guido at python.org  Sun Aug 20 17:53:49 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 20 Aug 2006 08:53:49 -0700
Subject: [Python-3000] raise with traceback?
In-Reply-To: <ec9um0$pqs$1@sea.gmane.org>
References: <ec9um0$pqs$1@sea.gmane.org>
Message-ID: <ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a@mail.gmail.com>

The 'with' syntax is attractive because it will flag all unconverted
code as a syntax error.
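For reference, a traceback can also be attached via a method on the exception object rather than new statement syntax, which is the route Python 3 ultimately took (a sketch of that approach):

```python
import sys

def reraise():
    try:
        1 / 0
    except ZeroDivisionError:
        tb = sys.exc_info()[2]
        # Re-raise a different exception carrying the captured traceback.
        raise ValueError("something went wrong").with_traceback(tb)
```

The re-raised ValueError then carries the original traceback in its `__traceback__` attribute.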

I wonder if "raise ValueError" should still be allowed (as equivalent
to "raise ValueError()") or that it should be disallowed.

--Guido

On 8/20/06, Georg Brandl <g.brandl at gmx.net> wrote:
> Hi,
>
> as
>
>    raise ValueError, "something went wrong"
>
> is going to go away, how will one raise with a custom traceback?
> The obvious
>
>    raise ValueError("something went wrong"), traceback
>
> or something more esoteric like
>
>    raise ValueError("something went wrong") with traceback
>
> ?
>
> Georg
>
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From tjreedy at udel.edu  Sun Aug 20 18:08:55 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Sun, 20 Aug 2006 12:08:55 -0400
Subject: [Python-3000] raise with traceback?
References: <ec9um0$pqs$1@sea.gmane.org>
	<ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a@mail.gmail.com>
Message-ID: <eca1em$16m$1@sea.gmane.org>


"Guido van Rossum" <guido at python.org> wrote in message 
news:ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a at mail.gmail.com...
> I wonder if "raise ValueError" should still be allowed (as equivalent
> to "raise ValueError()") or that it should be disallowed.

+1 for disallow.

raise <exception class instance> is a simple rule to remember.

Having VE == VE() in certain contexts is/would be like having s.len == 
s.len() or func == func() (a moderately frequent newbie request).

Plus, why encourage less-helpful, no-message exceptions ;-)

Terry Jan Reedy




From guido at python.org  Sun Aug 20 18:10:55 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 20 Aug 2006 09:10:55 -0700
Subject: [Python-3000] Google Sprint Ideas
Message-ID: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>

I've created a wiki page with some ideas for Python 3000 things we
could do at the Google sprint (starting Monday). See:

  http://wiki.python.org/moin/GoogleSprintPy3k

For general info about this sprint -- it's not too late to come! -- see:

  http://wiki.python.org/moin/GoogleSprint

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From barry at python.org  Sun Aug 20 18:11:32 2006
From: barry at python.org (Barry Warsaw)
Date: Sun, 20 Aug 2006 12:11:32 -0400
Subject: [Python-3000] raise with traceback?
In-Reply-To: <ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a@mail.gmail.com>
References: <ec9um0$pqs$1@sea.gmane.org>
	<ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a@mail.gmail.com>
Message-ID: <FFAD60C5-079B-4F34-8D7B-A91D1EAF59E9@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Aug 20, 2006, at 11:53 AM, Guido van Rossum wrote:

> The 'with' syntax is attractive because it will flag all unconverted
> code as a syntax error.
>
> I wonder if "raise ValueError" should still be allowed (as equivalent
> to "raise ValueError()") or that it should be disallowed.

I say keep it.  I don't see much value in requiring empty  
parentheses, except maybe to keep my left pinkie limber.

- -Barry

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (Darwin)

iQCVAwUBROiJtXEjvBPtnXfVAQKkQwP/WTYvfFYYlA5ukmDmvTg3G5BVCYEyC8hQ
8jZXfnzm0j8PdCGJp2ym16ux0+MIRsMx1taU0VGRpULF4hPfRPHG92EQm/YDRGBm
1X5fXNmQ2sbMAb84GqO6HiQxbUkP70Zu5DbgQj3pCqCO3oJLuqXie1gj5neezBoR
lj2yQHiUnP8=
=JFG+
-----END PGP SIGNATURE-----

From g.brandl at gmx.net  Sun Aug 20 18:12:48 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Sun, 20 Aug 2006 18:12:48 +0200
Subject: [Python-3000] raise with traceback?
In-Reply-To: <eca1em$16m$1@sea.gmane.org>
References: <ec9um0$pqs$1@sea.gmane.org>	<ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a@mail.gmail.com>
	<eca1em$16m$1@sea.gmane.org>
Message-ID: <eca1m1$1ec$1@sea.gmane.org>

Terry Reedy wrote:
> "Guido van Rossum" <guido at python.org> wrote in message 
> news:ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a at mail.gmail.com...
>> I wonder if "raise ValueError" should still be allowed (as equivalent
>> to "raise ValueError()") or that it should be disallowed.
>
> +1 for disallow.
> 
> raise <exception class instance> is a simple rule to remember.
> 
> Having VE == VE() in certain contexts is/would be like having s.len == 
> s.len() or func == func() (a moderately frequent newbie request).
> 
> Plus, why encourage less-helpful, no message exceptions ;-)

Some exceptions don't need a message, such as StopIteration, and other
possibly user-defined ones meant to be caught immediately in surrounding
code.

Though I agree that it makes explanations (and probably some bits of code)
easier to only allow instances after raise.

Georg


From martin at v.loewis.de  Sun Aug 20 18:30:23 2006
From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=)
Date: Sun, 20 Aug 2006 18:30:23 +0200
Subject: [Python-3000] int-long unification
In-Reply-To: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
References: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
Message-ID: <44E88E1F.6010607@v.loewis.de>

Guido van Rossum schrieb:
> Are you interested in doing this at the Google sprint next week?

Sure; I hadn't any special plans so far.

> What do you think?

Sounds good. There are two problems I see:

- how to benchmark?

- there are subtle details in the API that require changes
  to extension code. In particular, PyInt_AsLong currently
  cannot fail, but can fail with a range error after the
  unification.

However, to evaluate the performance, it is possible to work
around that.

For this specific problem, I would propose to introduce
another API, say

int PyLong_ToLong(PyObject* val, long* result);

which will return true(1) for success, and set an exception
in case of a failure. Then, we get

long PyLong_AsLong(PyObject *val)
{
  long result;
  if(!PyLong_ToLong(val, &result))return -1;
  return result;
}

and perhaps

long PyInt_AsLong(PyObject *val)
{
  long result;
  if(!PyLong_ToLong(val, &result))
    Py_FatalError("old-style integer conversion failed");
  return result;
}

Regards,
Martin

From guido at python.org  Sun Aug 20 18:43:05 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 20 Aug 2006 09:43:05 -0700
Subject: [Python-3000] int-long unification
In-Reply-To: <44E88E1F.6010607@v.loewis.de>
References: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
	<44E88E1F.6010607@v.loewis.de>
Message-ID: <ca471dc20608200943w9118b6el8fec43d2e539352c@mail.gmail.com>

On 8/20/06, "Martin v. L?wis" <martin at v.loewis.de> wrote:
> Guido van Rossum schrieb:
> > Are you interested in doing this at the Google sprint next week?
>
> Sure; I hadn't any special plans so far.
>
> > What do you think?
>
> Sounds good. There are two problems I see:
>
> - how to benchmark?

We could possibly do a lot of int allocations and deallocations in a
temporary extension module.

> - there are subtle details in the API that require changes
>   to extension code. In particular, PyInt_AsLong currently
>   cannot fail, but can fail with a range error after the
>   unification.
>
> However, to evaluate the performance, it is possible to work
> around that.
>
> For this specific problem, I would propose to introduce
> another API, say
>
> int PyLong_ToLong(PyObject* val, long* result);
>
> which will return true(1) for success, and set an exception
> in case of a failure. Then, we get
>
> long PyLong_AsLong(PyObject *val)
> {
>   long result;
>   if(!PyLong_ToLong(val, &result))return -1;
>   return result;
> }
>
> and perhaps
>
> long PyInt_AsLong(PyObject *val)
> {
>   long result;
>   if(!PyLong_ToLong(val, &result))
>     Py_FatalError("old-style integer conversion failed");
>   return result;
> }

The fatal error strikes me as unpleasant. Perhaps PyInt_Check[Exact]
should return false if the value won't fit in a C long? Or perhaps we
could just return -sys.maxint-1?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jimjjewett at gmail.com  Sun Aug 20 20:12:29 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Sun, 20 Aug 2006 14:12:29 -0400
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<44E5E85F.6080508@gmail.com> <ec6pbl$7iv$1@sea.gmane.org>
	<1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>
	<ec83d2$o69$1@sea.gmane.org>
	<1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>
Message-ID: <fb6fbf560608201112j4c0b2b97k1f3acaa73a7c90d9@mail.gmail.com>

On 8/19/06, Paul Prescod <paul at prescod.net> wrote:
> On 8/19/06, Ron Adam <rrr at ronadam.com> wrote:

> ... don't understand the virtue of bringing
> decorators into the picture. Yes, they are
> one consumer of metadata.

They aren't being brought in as sample *consumers*; they are being
suggested as *producers* of metadata.

The following works to assert the data

>>> def f(a, b):
...
>>> f.a=int

We're discussing the alternative of

>>> def f(a:int, b):

which is better for some things -- but much worse for others; if the
metadata is any longer than int, it is almost certainly worse.  So (I
believe) he is suggesting that we just reuse decorator syntax

>>> @sig(a=int)
... def f(a, b):

This keeps the single function declaration line short and sweet,
reflecting (modulo "self" and a colon) how it is actually called.  It
gets the annotations (including type information) up where they should
be, but they don't overwhelm the variable names.

Whether to also add signature expressions (to make @sig decorators
easier to write) is a separate question; the key point is not to mess
with the one-line function summary.
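A minimal sketch of such a @sig decorator (the stored attribute name
'__signature__' is an assumption, not a settled spec):

```python
def sig(**annotations):
    # Attach per-parameter metadata without touching the def line.
    # The '__signature__' attribute name is illustrative only.
    def decorate(func):
        func.__signature__ = {'annotations': annotations}
        return func
    return decorate

@sig(a=int)
def f(a, b):
    return a + b
```

The def line stays short; consumers read f.__signature__ instead of parsing
the parameter list.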

-jJ

From osantana at gmail.com  Sun Aug 20 21:03:36 2006
From: osantana at gmail.com (Osvaldo Santana)
Date: Sun, 20 Aug 2006 16:03:36 -0300
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
Message-ID: <b674ca220608201203p66bc1f8ch9d3c02bb1186856d@mail.gmail.com>

Hi Guido,

On 8/20/06, Guido van Rossum <guido at python.org> wrote:
> I've created a wiki page with some ideas for Python 3000 things we
> could do at the Google sprint (starting Monday). See:
>
>   http://wiki.python.org/moin/GoogleSprintPy3k

I'm interested in contributing to the task "Rewrite import in Python
(Brett Cannon)".

I've started to study the Python import mechanism at interpreter
startup to understand how it works
(http://pythonologia.org/python_import/) and I have some ideas for
this rewrite too.

I'll have full time at my job to work on this.

Thanks,
Osvaldo

-- 
Osvaldo Santana Neto
Python for Maemo developer
icq, url = (11287184, "http://www.pythonbrasil.com.br")

From guido at python.org  Sun Aug 20 21:59:01 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 20 Aug 2006 12:59:01 -0700
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <b674ca220608201203p66bc1f8ch9d3c02bb1186856d@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
	<b674ca220608201203p66bc1f8ch9d3c02bb1186856d@mail.gmail.com>
Message-ID: <ca471dc20608201259m35adc454i129d23a043812b5a@mail.gmail.com>

Excellent! I'm adding Brett to the CC's. Can you update the wiki page
adding your name to that task? Are you coming to the sprint in person
or are you just going to be sprinting at your own place?

--Guido

On 8/20/06, Osvaldo Santana <osantana at gmail.com> wrote:
> Hi Guido,
>
> On 8/20/06, Guido van Rossum <guido at python.org> wrote:
> > I've created a wiki page with some ideas for Python 3000 things we
> > could do at the Google sprint (starting Monday). See:
> >
> >   http://wiki.python.org/moin/GoogleSprintPy3k
>
> I'm interested in contributing to the task "Rewrite import in Python
> (Brett Cannon)".
>
> I've started to study the Python import mechanism at interpreter
> startup to understand how it works
> (http://pythonologia.org/python_import/) and I have some ideas for
> this rewrite too.
>
> I'll have full time at my job to work on this.
>
> Thanks,
> Osvaldo
>
> --
> Osvaldo Santana Neto
> Python for Maemo developer
> icq, url = (11287184, "http://www.pythonbrasil.com.br")
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul at prescod.net  Sun Aug 20 22:07:18 2006
From: paul at prescod.net (Paul Prescod)
Date: Sun, 20 Aug 2006 13:07:18 -0700
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <fb6fbf560608201112j4c0b2b97k1f3acaa73a7c90d9@mail.gmail.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>
	<44E5E85F.6080508@gmail.com> <ec6pbl$7iv$1@sea.gmane.org>
	<1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>
	<ec83d2$o69$1@sea.gmane.org>
	<1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>
	<fb6fbf560608201112j4c0b2b97k1f3acaa73a7c90d9@mail.gmail.com>
Message-ID: <1cb725390608201307i2b4a2711y7679279b8b2fc871@mail.gmail.com>

On 8/20/06, Jim Jewett <jimjjewett at gmail.com> wrote:
>
> We're discussing the alternative of
>
> >>> def f(a:int, b):
>
> which is better for some things -- but much worse for others; if the
> metadata is any longer than int, it is almost certainly worse.  So (I
> believe) he is suggesting that we just reuse decorator syntax
>
> >>> @sig(a=int)
> ... def f(a, b):


I don't believe that's true, because this is the syntax he showed:

>          @callmeta
>          def foo( a: [ SetDoc("frobination count"), InRange(3,9) ],
>                   b: InSet([4,8,12]) )
>                   -> IsNumber:

I guess I still don't really understand what he's getting at or what the
value of @callmeta is in that example. It just seems like extra noise with
no value to me...

Ron: what *precisely* does the @callmeta decorator do? If you can express it
in code, so much the better.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060820/e8257c01/attachment.htm 

From osantana at gmail.com  Sun Aug 20 22:27:31 2006
From: osantana at gmail.com (Osvaldo Santana)
Date: Sun, 20 Aug 2006 17:27:31 -0300
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <ca471dc20608201259m35adc454i129d23a043812b5a@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
	<b674ca220608201203p66bc1f8ch9d3c02bb1186856d@mail.gmail.com>
	<ca471dc20608201259m35adc454i129d23a043812b5a@mail.gmail.com>
Message-ID: <b674ca220608201327r31202893w25e29d7cc1834ee8@mail.gmail.com>

On 8/20/06, Guido van Rossum <guido at python.org> wrote:
> Excellent! I'm adding Brett to the CC's.

Cool. Has Brett planned anything for this rewrite?

> Can you update the wiki page adding your name to that task?

Done.

> Are you coming to the sprint in person
> or are you just going to be sprinting at your own place?

I'll sprint at my job. I can access IRC from there.

Thanks,
Osvaldo

-- 
Osvaldo Santana Neto (aCiDBaSe)
icq, url = (11287184, "http://www.pythonbrasil.com.br")

From nnorwitz at gmail.com  Sun Aug 20 22:51:40 2006
From: nnorwitz at gmail.com (Neal Norwitz)
Date: Sun, 20 Aug 2006 16:51:40 -0400
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <b674ca220608201327r31202893w25e29d7cc1834ee8@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
	<b674ca220608201203p66bc1f8ch9d3c02bb1186856d@mail.gmail.com>
	<ca471dc20608201259m35adc454i129d23a043812b5a@mail.gmail.com>
	<b674ca220608201327r31202893w25e29d7cc1834ee8@mail.gmail.com>
Message-ID: <ee2a432c0608201351h6dec136cie7ef8d49481c031a@mail.gmail.com>

On 8/20/06, Osvaldo Santana <osantana at gmail.com> wrote:
> On 8/20/06, Guido van Rossum <guido at python.org> wrote:
> > Excellent! I'm adding Brett to the CC's.
>
> Cool. Has Brett planned something to this rewrite?

I'm not sure exactly what you are asking.  It's mostly planned to be a
re-implementation of the current behaviour in Python.  Hopefully
various corner cases will be cleaned up/documented, and some of the
differences between importing a file from the file system and from a
zip package will be smoothed over.   I don't think he's started any of
this yet, beyond looking at the PyPy implementation.

It helps him in his work to sandbox Python.  Also, various
optimizations or experiments with different semantics become much
easier if import is implemented in Python.

n

From p.f.moore at gmail.com  Sun Aug 20 22:52:56 2006
From: p.f.moore at gmail.com (Paul Moore)
Date: Sun, 20 Aug 2006 21:52:56 +0100
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
Message-ID: <79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>

On 8/20/06, Guido van Rossum <guido at python.org> wrote:
> I've created a wiki page with some ideas for Python 3000 things we
> could do at the Google sprint (starting Monday). See:
>
>   http://wiki.python.org/moin/GoogleSprintPy3k

I notice that one of the items on there is "Work on the new I/O
library (I have much interest in this but need help -- Guido)". I also
have an interest in this, although I won't be at the sprint (and in
general have very little time for coding these days, unfortunately).

Is there any description of the plans for the new I/O library
anywhere? I assume that ultimately there will be a PEP, but in the
meantime, I recall very little in the way of details having been
discussed.

Paul.

From rrr at ronadam.com  Sun Aug 20 23:01:46 2006
From: rrr at ronadam.com (Ron Adam)
Date: Sun, 20 Aug 2006 16:01:46 -0500
Subject: [Python-3000] Fwd: Conventions for annotation consumers
In-Reply-To: <1cb725390608201307i2b4a2711y7679279b8b2fc871@mail.gmail.com>
References: <5.1.1.6.0.20060815200505.026038e8@sparrow.telecommunity.com>	<44E5E85F.6080508@gmail.com>
	<ec6pbl$7iv$1@sea.gmane.org>	<1cb725390608191219q71cf34dfi7a6a892c1fd9eddf@mail.gmail.com>	<ec83d2$o69$1@sea.gmane.org>	<1cb725390608191706l113558f7h98e6810ddc422d2a@mail.gmail.com>	<fb6fbf560608201112j4c0b2b97k1f3acaa73a7c90d9@mail.gmail.com>
	<1cb725390608201307i2b4a2711y7679279b8b2fc871@mail.gmail.com>
Message-ID: <ecaip3$fmq$1@sea.gmane.org>

Paul Prescod wrote:

> I guess I still don't really understand what he's getting at or what the 
> value of @callmeta is in that example. It just seems like extra noise 
> with no value to me...
> 
> Ron: what *precisely* does the @callmeta decorator do? If you can 
> express it in code, so much the better.
> 
>  Paul Prescod
> 


Here's a working example.  @callmeta could be named something else like 
@asserter, @checker, or whatever.  And it should do more checks to avoid 
non-callable annotations and to keep from writing over pre-existing 
annotations, etc...

As I said, this could all be put in a module, and it's easy to create new 
assert tests without having to know about decorators or any special classes.

    Ron



# ----- Some assert test functions.

def IsAny(arg): pass

def IsNumber(arg):
     assert type(arg) in (int, long, float), \
            "%r is not a number" % arg

def IsInt(arg):
     assert type(arg) in (int, long), \
            "%r is not an Int" % arg

def IsFloat(arg):
     assert isinstance(arg, float), \
            "%r is not a float" % arg

def InRange(start, stop):
     def inrange(arg):
         assert start <= arg <= stop, \
                "%r is not in range %r through %r" % (arg, start, stop)
     return inrange

def InSet(list_):
     s = set(list_)
     def inset(arg):
         assert arg in s, \
                "%r is not in %r" % (arg, s)
     return inset


# ------- The add-annotation decorator.

def annotate(**kwds):
     def setter(func):
         # Plain attribute assignment is the idiomatic spelling.
         func.__signature__ = {'annotations': kwds}
         return func
     return setter


# ------ The do-asserts decorator.

def callmeta(f):
     def new_f(*args, **kwds):
         d = dict(zip(f.func_code.co_varnames, args))
         d.update(kwds)
         tests = f.__signature__['annotations']
         for key in d:
             if key != 'returns':
                 tests[key](d[key])
         result = f(*args, **kwds)
         if 'returns' in tests:
             tests['returns'](result)
         return result
     new_f.func_name = f.func_name
     return new_f


# --------- Examples of using callable annotations.

@callmeta
@annotate(a=IsAny, b=IsInt, returns=IsInt)
def add(a, b):
     return a + b

print add(1, 4)

@callmeta
@annotate(a=IsInt, b=IsInt, returns=IsInt)
def add(a, b):
     return a + b

print add(1, 4.1)    # assertion error here.



# which could also be...

"""
@callmeta
def add(a: IsInt, b: IsInt) -> IsInt:
     return a + b
"""



From qrczak at knm.org.pl  Sun Aug 20 23:06:28 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Sun, 20 Aug 2006 23:06:28 +0200
Subject: [Python-3000] int-long unification
In-Reply-To: <ca471dc20608200943w9118b6el8fec43d2e539352c@mail.gmail.com>
	(Guido van Rossum's message of "Sun, 20 Aug 2006 09:43:05 -0700")
References: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
	<44E88E1F.6010607@v.loewis.de>
	<ca471dc20608200943w9118b6el8fec43d2e539352c@mail.gmail.com>
Message-ID: <871wrbyv57.fsf@qrnik.zagroda>

"Guido van Rossum" <guido at python.org> writes:

> The fatal error strikes me as unpleasant. Perhaps PyInt_Check[Exact]
> should return false if the value won't fit in a C long?

Maybe.

> Or perhaps we could just return -sys.maxint-1?

This would be a bad idea: some errors in user programs would yield
nonsensical results or be masked instead of being signalled with
exceptions.

I made C macros for the following patterns of extracting C integers
from my language:

1. If the object is an integer with its value in the given range,
   put the value into a C integer variable. Otherwise fail with an
   exception which tells that the value is out of range (includes
   the value, the range, and a string explaining what this
   value represents), or that it is not an integer.

2. As above, but the range is the full range of the C type.

3. As above, but the low end is 0 or given explicitly and the high end
   is the range of the C type.

Only in rare cases did I need to separate checking whether the number
is in the given range from extracting the value under the assumption
that it has been checked earlier. Sometimes the action performed when
out of range is something other than throwing an exception, but this
is rare too.

The C type can be smaller or larger than the threshold which separates
the representations of small integers and big integers in my runtime
(which in my case is 1 bit smaller than some C type, so it never
matches exactly). This is handled transparently by these C macros.

I always try to find out the maximum sensible range of the given
parameter. For example:

- bzip2, compression parameters (verbosity 0..4, compression level 1..9,
  work factor 1..250), gzip similarly - case 1
- Python's unichr(): character code 0..0x10FFFF - case 1
- conversions int<->str, base 2..36 - case 1
- seeking into files - cases 2 and 3
- curses, color pair number 0..PAIR_NUMBER(A_COLOR) - case 1
- curses, screen coordinates and character counts - case 3
- curses, KEY_F(n) 0..63 - case 1
- sockets, address family code 0..AF_MAX or 0..255 - case 1
- sockets, port number 0..65535 - case 1
- sockets, socket type code and protocol number - case 3
- readline, function code in keymap 0..255 (or 0..KEYMAP_SIZE-2,
  but KEYMAP_SIZE is always 257) - case 1
- readline, repetition count of commands - case 2
- readline, rl_display_match_list, screen width 0..INT_MAX-2 - case 1
- readline, history entry positions - case 3
- readline, terminal width & height - case 3
- kill() and waitpid(), pid - case 3 (starting from 1 for an
  individual process or 2 for process group)
- kill(), signal number 0..NSIG-1 or 0.._NSIG-1 or 0..32 - case 1

The effect when writing a C extension is that the same C code works
no matter what the relation is between the ranges of the target C type
and int / size_t. Python had to code extraction of the seek offset
specially because off_t may be larger, and it silently assumes that the
sensible ranges of pid_t, uid_t etc. are the same as those of C int.

The visible effect is that Python has inconsistent exceptions:
>>> unichr(0x123456)
ValueError: unichr() arg not in range(0x110000) (wide Python build)
>>> unichr(0x1234567890)
OverflowError: long int too large to convert to int

Kogut is consistent here:
> Char 0x123456
Value out of range: character code must be between 0 and 1114111, but 1193046 was given
> Char 0x1234567890
Value out of range: character code must be between 0 and 1114111, but 78187493520 was given

Python:
>>> posix.kill(0, 128)
OSError: [Errno 22] Invalid argument
>>> posix.kill(0, 2**32)
OverflowError: long int too large to convert to int

Kogut:
> SignalProcess #group (SystemSignal 128)
Value out of range: signal number must be between 0 and 64, but 128 was given
> SignalProcess #group (SystemSignal (2 %Power 32))
Value out of range: signal number must be between 0 and 64, but 4294967296 was given

The same applies in the other direction, converting from C.

C in Python:
#ifdef HAVE_LARGEFILE_SUPPORT
        PyStructSequence_SET_ITEM(v, 1,
                                  PyLong_FromLongLong((PY_LONG_LONG)st.st_ino));
#else
        PyStructSequence_SET_ITEM(v, 1, PyInt_FromLong((long)st.st_ino));
#endif

C in Kogut:
   KO_INT(ko_value_of_file_status(this)->st_ino)
This is a C expression returning the equivalent of PyObject *,
taking sizeof the argument into account.

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From martin at v.loewis.de  Sun Aug 20 23:10:41 2006
From: martin at v.loewis.de ("Martin v. Löwis")
Date: Sun, 20 Aug 2006 23:10:41 +0200
Subject: [Python-3000] Ctypes as cross-interpreter C calling interface
In-Reply-To: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
References: <1cb725390608092219v695b7f24t92534d3aa444ca8c@mail.gmail.com>
Message-ID: <44E8CFD1.9090403@v.loewis.de>

Paul Prescod schrieb:
> Thanks for everyone who contributed. It seems that the emerging
> consensus (bar a security question from Guido) is that ctypes it the way
> forward for calling C code in Python 3000. 

I don't think that can ever work (so I don't participate in that
consensus). There are too many issues with C that make ctypes not
general enough.
a) it requires code to be packaged in a DLL; static libraries
   are not supported (conceptually)
b) it requires you to know the layout of data structures, or
   at least to duplicate declarations in Python. As the layout
   of the same structure may change over time or across
   implementations (e.g. FILE in stdio), you can never get good
   platform coverage.
c) A good deal of C API is through macros, for various usages
   (symbolic constants, function inlining,
    customization/configuration/conditional compilation)
d) No real support for C++ (where there are even more ABI
   issues: (multiple) inheritance, vtables, constructors,
   operator overload, templates, ...)

To access a C API, the only "right" way is to use a C compiler.
ctypes is for people who want to avoid using a C compiler at
all costs.

Regards,
Martin

From seojiwon at gmail.com  Sun Aug 20 23:52:32 2006
From: seojiwon at gmail.com (Jiwon Seo)
Date: Sun, 20 Aug 2006 14:52:32 -0700
Subject: [Python-3000] Keyword Only Argument
Message-ID: <b008462b0608201452y4b8e37b6gfcf098832ea840ad@mail.gmail.com>

For the implementation of PEP 3102 - Keyword-Only Arguments, it would
be nice to have an (abstract) data structure representing the
signature of a function. Currently, the code object only has the # of
arguments and the # of default values, so if we want to allow
something like,

def foo(a,b=10,*,c,d):
    ...

or,

def foo(a,b=10,*,c,d=20):
    ...

a signature data structure will be very helpful.

Signature data structure is roughly described in
http://mail.python.org/pipermail/python-3000/2006-April/001249.html ,
but has anyone got a detailed idea or implemented it (it doesn't
matter how naive the implementation is)? Brett, is that document the
most recent one describing the signature data structure?
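A bare-bones sketch of what such a structure might hold (the class and field
names are guesses, not the proposed spec):

```python
class Signature(object):
    # Minimal signature record for keyword-only arguments: what the
    # code object's co_argcount/defaults cannot express today.
    def __init__(self, args, defaults, kwonlyargs, kwonlydefaults):
        self.args = list(args)                    # positional names
        self.defaults = dict(defaults)            # their defaults
        self.kwonlyargs = list(kwonlyargs)        # names after bare '*'
        self.kwonlydefaults = dict(kwonlydefaults)

# def foo(a, b=10, *, c, d=20) could then be described as:
foo_sig = Signature(['a', 'b'], {'b': 10}, ['c', 'd'], {'d': 20})
```

With something like this attached to functions, argument binding for both
forms of foo above becomes a plain lookup rather than counting trickery.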

-Jiwon

From jcarlson at uci.edu  Mon Aug 21 00:47:52 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 20 Aug 2006 15:47:52 -0700
Subject: [Python-3000] signature annotation in the function signature or
	a separate line
In-Reply-To: <ca471dc20608200804w5ffe2bf2t4c15f2c715a861a8@mail.gmail.com>
References: <44E6EDDB.9070604@strakt.com>
	<ca471dc20608200804w5ffe2bf2t4c15f2c715a861a8@mail.gmail.com>
Message-ID: <20060820152716.1A09.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> > given the precedent of type inferenced languages
> > that use a separate line for optional type information
> 
> Can you show us an example or two?

C/C++ probably doesn't count, being that type information is required,
but one can relocate type information to other lines...

void
cross(inp1, inp2, inpl1, inpl2, outp)
double* inp1;
double* inp2;
long    inpl1;
long    inpl2;
double* outp;
{
    /* body goes here */
}


 - Josiah


From free.condiments at gmail.com  Mon Aug 21 01:26:57 2006
From: free.condiments at gmail.com (Sam Pointon)
Date: Mon, 21 Aug 2006 00:26:57 +0100
Subject: [Python-3000] signature annotation in the function signature or
	a separate line
In-Reply-To: <ca471dc20608200804w5ffe2bf2t4c15f2c715a861a8@mail.gmail.com>
References: <20060816090147.19DA.JCARLSON@uci.edu>
	<ca471dc20608161013h18be025dr2f413f3226b70819@mail.gmail.com>
	<20060816102652.19E3.JCARLSON@uci.edu>
	<ca471dc20608161117h62bdaa96ra59eb2dc9bef5f78@mail.gmail.com>
	<44E6EDDB.9070604@strakt.com>
	<ca471dc20608200804w5ffe2bf2t4c15f2c715a861a8@mail.gmail.com>
Message-ID: <b1c02c610608201626l2c17f16cnd0f2be590874e1db@mail.gmail.com>

On 20/08/06, Guido van Rossum <guido at python.org> wrote:
> On 8/19/06, Samuele Pedroni <pedronis at strakt.com> wrote:
> > given the precedent of type inferenced languages
> > that use a separate line for optional type information
>
> Can you show us an example or two?

Haskell:

map :: (a -> b) -> [a] -> [b]
map f xs = ...

Note that type information can also be contained in an expression (and
by extension on the same line), though the convention for defined
functions is to have it on a separate line. This type information is
not quite 100% optional - there are some corner-cases where the
typechecker needs a shove in the correct direction, or the inferred
type could be too general.

--Sam

From guido at python.org  Mon Aug 21 01:27:15 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 20 Aug 2006 16:27:15 -0700
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>
Message-ID: <ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>

On 8/20/06, Paul Moore <p.f.moore at gmail.com> wrote:
> On 8/20/06, Guido van Rossum <guido at python.org> wrote:
> > I've created a wiki page with some ideas for Python 3000 things we
> > could do at the Google sprint (starting Monday). See:
> >
> >   http://wiki.python.org/moin/GoogleSprintPy3k
>
> I notice that one of the items on there is "Work on the new I/O
> library (I have much interest in this but need help -- Guido)". I also
> have an interest in this, although I won't be at the sprint (and in
> general have very little time for coding these days, unfortunately).
>
> Is there any description of the plans for the new  I/O library
> anywhere? I assume that ultimately there will be a PEP, but in the
> meantime, I recall very little in the way of details having been
> discussed.

Without endorsing every detail of his design, tomer filiba has written
several blog (?) entries about this, the latest being
http://sebulba.wikispaces.com/project+iostack+v2 . You can also look
at sandbox/sio/sio.py in svn.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From talin at acm.org  Mon Aug 21 01:42:08 2006
From: talin at acm.org (Talin)
Date: Sun, 20 Aug 2006 16:42:08 -0700
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>
	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>
Message-ID: <44E8F350.8070509@acm.org>

Guido van Rossum wrote:
> On 8/20/06, Paul Moore <p.f.moore at gmail.com> wrote:

> Without endorsing every detail of his design, tomer filiba has written
> several blog (?) entries about this, the latest being
> http://sebulba.wikispaces.com/project+iostack+v2 . You can also look
> at sandbox/sio/sio.py in svn.

One comment after reading this: If we're going to re-invent the Java/C# 
i/o library, could we at least use the same terminology? In particular, 
the term "Layer" has connotations which may be confusing in this context 
- I would prefer something like "Adapter" or "Filter".

Also, I notice that this proposal removes what I consider to be a nice 
feature of Python, which is that you can take a plain file object and 
iterate over the lines of the file -- it would require a separate line 
buffering adapter to be created. I think I understand the reasoning 
behind this - in a world with multiple text encodings, the definition of 
"line" may not be so simple. However, I would assume that the "built-in" 
streams would support the most basic, least-common-denominator encodings 
for convenience.

-- Talin

From greg.ewing at canterbury.ac.nz  Mon Aug 21 03:03:27 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 21 Aug 2006 13:03:27 +1200
Subject: [Python-3000] raise with traceback?
In-Reply-To: <eca1em$16m$1@sea.gmane.org>
References: <ec9um0$pqs$1@sea.gmane.org>
	<ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a@mail.gmail.com>
	<eca1em$16m$1@sea.gmane.org>
Message-ID: <44E9065F.7030802@canterbury.ac.nz>

Terry Reedy wrote:
> "Guido van Rossum" <guido at python.org> wrote in message 
> news:ca471dc20608200853i318d1051kc8cc8cfff1b7eb0a at mail.gmail.com...
> 
>>I wonder if "raise ValueError" should still be allowed (as equivalent
>>to "raise ValueError()") or that it should be disallowed.
> 
> +1 for disallow.

Seems like that would break a lot of code with no
obvious way of flagging things which need to be
changed.

Also it would preclude the possibility of any
future optimisation to avoid instantiating the
exception when its value isn't needed.

--
Greg

From guido at python.org  Mon Aug 21 03:06:02 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 20 Aug 2006 18:06:02 -0700
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <44E8F350.8070509@acm.org>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>
	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>
	<44E8F350.8070509@acm.org>
Message-ID: <ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>

On 8/20/06, Talin <talin at acm.org> wrote:
> Guido van Rossum wrote:
> > On 8/20/06, Paul Moore <p.f.moore at gmail.com> wrote:
>
> > Without endorsing every detail of his design, tomer filiba has written
> > several blog (?) entries about this, the latest being
> > http://sebulba.wikispaces.com/project+iostack+v2 . You can also look
> > at sandbox/sio/sio.py in svn.
>
> One comment after reading this: If we're going to re-invent the Java/C#
> i/o library, could we at least use the same terminology? In particular,
> the term "Layer" has connotations which may be confusing in this context
> - I would prefer something like "Adapter" or "Filter".

That's an example of what I meant when I said "without endorsing every detail".

I don't know which terminology C++ uses beyond streams. I think Java
uses Streams for the lower-level stuff and Reader/Writer for the
higher-level stuff -- or is it the other way around?

> Also, I notice that this proposal removes what I consider to be a nice
> feature of Python, which is that you can take a plain file object and
> iterate over the lines of the file -- it would require a separate line
> buffering adapter to be created. I think I understand the reasoning
> behind this - in a world with multiple text encodings, the definition of
> "line" may not be so simple. However, I would assume that the "built-in"
> streams would support the most basic, least-common-denominator encodings
> for convenience.

First time I noticed that. But perhaps it's the concept of "plain file
object" that changed? My own hierarchy (which I arrived at without
reading tomer's proposal) is something like this:

(1) Basic level (implemented in C) -- open, close, read, write, seek,
tell. Completely unbuffered, maps directly to system calls. Does
binary I/O only.

(2) Buffering. Implements the same API as (1) but adds buffering. This
is what one normally uses for binary file I/O. It builds on (1), but
can also be built on raw sockets instead. It adds an API to inquire
about the amount of buffered data, a flush() method, and ways to
change the buffer size.

(3) Encoding and line endings. Implements a somewhat different API,
for reading/writing text files; the API resembles Python 2's I/O
library more. This is where readline() and next() giving the next line
are implemented. It also does newline translation to/from the
platform's native convention (CRLF or LF, or perhaps CR if anyone
still cares about Mac OS <= 9) and Python's convention (always \n). I
think I want to put these two features (encoding and line endings) in
the same layer because they are both text related. Of course you can
specify ASCII or Latin-1 to effectively disable the encoding part.
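The three layers above might be sketched roughly like this (all class and
method names here are illustrative only, not a proposed API):

```python
import os

class RawIO(object):
    # Layer 1: unbuffered binary I/O mapping straight to system calls.
    def __init__(self, fd):
        self.fd = fd
    def read(self, n):
        return os.read(self.fd, n)
    def write(self, data):
        return os.write(self.fd, data)

class BufferedIO(object):
    # Layer 2: same binary API, plus an internal read buffer.
    def __init__(self, raw, bufsize=8192):
        self.raw, self.bufsize = raw, bufsize
        self._buf = b""
    def read(self, n):
        while len(self._buf) < n:
            chunk = self.raw.read(self.bufsize)
            if not chunk:  # EOF
                break
            self._buf += chunk
        data, self._buf = self._buf[:n], self._buf[n:]
        return data

class TextIO(object):
    # Layer 3: decoding plus newline translation and readline().
    def __init__(self, buffered, encoding="ascii"):
        self.buffered, self.encoding = buffered, encoding
    def readline(self):
        out = b""
        while True:
            ch = self.buffered.read(1)
            if not ch:
                break
            out += ch
            if ch == b"\n":
                break
        return out.decode(self.encoding).replace("\r\n", "\n")
```

A text file would then be TextIO(BufferedIO(RawIO(fd))), while a socket
could plug its own raw object into the same buffering layer.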

Does this make more sense?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From talin at acm.org  Mon Aug 21 03:34:28 2006
From: talin at acm.org (Talin)
Date: Sun, 20 Aug 2006 18:34:28 -0700
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>	
	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>	
	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>	
	<44E8F350.8070509@acm.org>
	<ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>
Message-ID: <44E90DA4.1040203@acm.org>

Guido van Rossum wrote:
> On 8/20/06, Talin <talin at acm.org> wrote:
>> Guido van Rossum wrote:
>> > On 8/20/06, Paul Moore <p.f.moore at gmail.com> wrote:
>>
>> > Without endorsing every detail of his design, tomer filiba has written
>> > several blog (?) entries about this, the latest being
>> > http://sebulba.wikispaces.com/project+iostack+v2 . You can also look
>> > at sandbox/sio/sio.py in svn.
>>
>> One comment after reading this: If we're going to re-invent the Java/C#
>> i/o library, could we at least use the same terminology? In particular,
>> the term "Layer" has connotations which may be confusing in this context
>> - I would prefer something like "Adapter" or "Filter".
> 
> That's an example of what I meant when I said "without endorsing every 
> detail".
> 
> I don't know which terminology C++ uses beyond streams. I think Java
> uses Streams for the lower-level stuff and Reader/Writer for the
> higher-level stuff -- or is it the other way around?

Well, the situation with Java is kind of complex. There are two sets of 
stream classes, but rather than classifying them as "low-level" and 
"high-level", a better classification is "old" and "new". The old 
classes (InputStream/OutputStream) are byte-oriented, whereas the newer 
ones (Reader/Writer) are character-oriented. It is not the case, 
however, that the character-oriented interface sits on top of the 
byte-oriented interface - rather, both interfaces are implemented by a 
number of different back ends.

For purposes of Python, it probably makes more sense to look at the .Net 
System.IO.Stream. (As a general rule, the .Net classes are refactored 
versions of the Java classes, which is both good and bad. It's best to 
study both if one is looking for inspiration.)

Hmmm, apparently the .Net documentation *does* use the term 'layer' to 
describe one stream wrapping another - which I still find strange. To my 
mind, the term 'layer' can either describe a particular design stratum 
within an architecture - such as the 'device layer' of an operating 
system - or it can describe a portion of a document, such as a drawing 
layer in a CAD program. I don't normally think of a single instance of a 
class wrapping another instance as constituting a "layer" - I usually 
use the term "adapter" or "proxy" to describe that case.

(OK, so I'm pedantic about naming. Now you know why one of my side 
projects is writing an online programmer's thesaurus -- using 
Python/TurboGears of course!)

>> Also, I notice that this proposal removes what I consider to be a nice
>> feature of Python, which is that you can take a plain file object and
>> iterate over the lines of the file -- it would require a separate line
>> buffering adapter to be created. I think I understand the reasoning
>> behind this - in a world with multiple text encodings, the definition of
>> "line" may not be so simple. However, I would assume that the "built-in"
>> streams would support the most basic, least-common-denominator encodings
>> for convenience.
> 
> First time I noticed that. But perhaps it's the concept of "plain file
> object" that changed? My own hierarchy (which I arrived at without
> reading tomer's proposal) is something like this:
> 
> (1) Basic level (implemented in C) -- open, close, read, write, seek,
> tell. Completely unbuffered, maps directly to system calls. Does
> binary I/O only.
> 
> (2) Buffering. Implements the same API as (1) but adds buffering. This
> is what one normally uses for binary file I/O. It builds on (1), but
> can also be built on raw sockets instead. It adds an API to inquire
> about the amount of buffered data, a flush() method, and ways to
> change the buffer size.
> 
> (3) Encoding and line endings. Implements a somewhat different API,
> for reading/writing text files; the API resembles Python 2's I/O
> library more. This is where readline() and next() giving the next line
> are implemented. It also does newline translation to/from the
> platform's native convention (CRLF or LF, or perhaps CR if anyone
> still cares about Mac OS <= 9) and Python's convention (always \n). I
> think I want to put these two features (encoding and line endings) in
> the same layer because they are both text related. Of course you can
> specify ASCII or Latin-1 to effectively disable the encoding part.
> 
> Does this make more sense?

I understood that much -- this is pretty much the way everyone does 
things these days (our own custom stream library at work looks pretty 
much like this too.)

The question I was wondering is, will the built-in 'file' function 
return an object of level 3?

-- Talin

From alexander.belopolsky at gmail.com  Mon Aug 21 05:36:09 2006
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Mon, 21 Aug 2006 03:36:09 +0000 (UTC)
Subject: [Python-3000] Google Sprint Ideas
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>
	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>
	<44E8F350.8070509@acm.org>
	<ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>
Message-ID: <loom.20060821T051839-686@post.gmane.org>

Guido van Rossum <guido <at> python.org> writes:

[snip]
>>> Without endorsing every detail of his design, tomer filiba has written
>>> several blog (?) entries about this, the latest being
>>> http://sebulba.wikispaces.com/project+iostack+v2 . You can also look
>>> at sandbox/sio/sio.py in svn.
[snip]
> 
> That's an example of what I meant when I said "without endorsing every
>  detail".

Here is another detail that I would like to see addressed. 
The new API does not seem to provide for a way to read
data directly into an existing object without creating
an intermediate bytes object.

Python 2.x has an undocumented readinto method that
allows reading data directly into an object that supports
the buffer protocol.

For Py3k, I would like to suggest a buffer protocol modelled
after iovec structure that is used by the readv system call.
On many systems readv is more efficient than repeated calls
to read and I think Py3k will benefit from a direct access to
that feature.
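[A sketch, in modern terms, of what readinto() buys: the file's bytes land
directly in a preallocated mutable buffer instead of an intermediate
string object. The filename here is made up for illustration.]

```python
import array

with open('data.bin', 'wb') as f:        # create a small sample file
    f.write(b'\x01\x02\x03\x04')

buf = array.array('b', [0] * 4096)       # preallocated mutable byte buffer
with open('data.bin', 'rb') as f:
    n = f.readinto(buf)                  # fills buf in place, returns byte count
```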


From martin at v.loewis.de  Mon Aug 21 06:01:00 2006
From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=)
Date: Mon, 21 Aug 2006 06:01:00 +0200
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <loom.20060821T051839-686@post.gmane.org>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>	<44E8F350.8070509@acm.org>	<ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>
	<loom.20060821T051839-686@post.gmane.org>
Message-ID: <44E92FFC.9080407@v.loewis.de>

Alexander Belopolsky schrieb:
> For Py3k, I would like to suggest a buffer protocol modelled
> after iovec structure that is used by the readv system call.
> On many systems readv is more efficient than repeated calls
> to read and I think Py3k will benefit from a direct access to
> that feature.

-1. It's difficult to use, and I question that there is any
benefit. I believe readv is there primarily for symmetry with
writev and hasn't any sensible uses on its own. writev is
there so you can add additional headers/trailers around data
blocks you received from higher layers. I even doubt that
exposing writev in Python would make a measurable performance
difference.

Regards,
Martin

From guido at python.org  Mon Aug 21 06:32:18 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 20 Aug 2006 21:32:18 -0700
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <44E90DA4.1040203@acm.org>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>
	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>
	<44E8F350.8070509@acm.org>
	<ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>
	<44E90DA4.1040203@acm.org>
Message-ID: <ca471dc20608202132y52ecd4fbs24d7212689f2df03@mail.gmail.com>

On 8/20/06, Talin <talin at acm.org> wrote:
> Guido van Rossum wrote:
> > On 8/20/06, Talin <talin at acm.org> wrote:
> >> Guido van Rossum wrote:
> >> > On 8/20/06, Paul Moore <p.f.moore at gmail.com> wrote:
> >>
> >> > Without endorsing every detail of his design, tomer filiba has written
> >> > several blog (?) entries about this, the latest being
> >> > http://sebulba.wikispaces.com/project+iostack+v2 . You can also look
> >> > at sandbox/sio/sio.py in svn.
> >>
> >> One comment after reading this: If we're going to re-invent the Java/C#
> >> i/o library, could we at least use the same terminology? In particular,
> >> the term "Layer" has connotations which may be confusing in this context
> >> - I would prefer something like "Adapter" or "Filter".
> >
> > That's an example of what I meant when I said "without endorsing every
> > detail".
> >
> > I don't know which terminology C++ uses beyond streams. I think Java
> > uses Streams for the lower-level stuff and Reader/Writer for the
> > higher-level stuff -- or is it the other way around?
>
> Well, the situation with Java is kind of complex. There are two sets of
> stream classes, but rather than classifying them as "low-level" and
> "high-level", a better classification is "old" and "new". The old
> classes (InputStream/OutputStream) are byte-oriented, whereas the newer
> >> ones (Reader/Writer) are character-oriented. It is not the case,
> however, that the character-oriented interface sits on top of the
> byte-oriented interface - rather, both interfaces are implemented by a
> number of different back ends.

How sure are you of all that? I always thought that these have about
the same age, and that the main distinction is byte vs. char
orientation. Also, the InputStreamReader class clearly sits on top of
the InputStream class (but surprisingly recommends that for efficiency
you do buffering on the reader side instead of on the stream side --
should we consider this for Python too?). And FileReader is a subclass
of InputStreamReader. (OK, further investigation does show that
FileInputStream exists since JDK 1.0 while InputStreamReader exists
since JDK 1.1. But there's much newer Java I/O in the "nio" package,
and there's work going on for "nio2", JSR 203.)

> For purposes of Python, it probably makes more sense to look at the .Net
> System.IO.Stream. (As a general rule, the .Net classes are refactored
> versions of the Java classes, which is both good and bad. It's best to
> study both if one is looking for inspiration.)

Perhaps you can tell us more about that? I've used the Java I/O system
sufficiently to have a feel for how it is actually used, which helps
me find my way in the docs; but for .NET I fear that I would have to
go on a sabbatical to make sense of it. And I don't have time for
that.

> Hmmm, apparently the .Net documentation *does* use the term 'layer' to
> describe one stream wrapping another - which I still find strange. To my
> mind, the term 'layer' can either describe a particular design stratum
> within an architecture - such as the 'device layer' of an operating
> system - or it can describe a portion of a document, such as a drawing
> layer in a CAD program.

It's used whenever you could draw a diagram of several layers of
software sitting on top of each other. Perhaps usually layers are
bigger (like device layers) but I see nothing wrong with declaring
that Python I/O consists of three layers.

> I don't normally think of a single instance of a
> class wrapping another instance as constituting a "layer" - I usually
> use the term "adapter" or "proxy" to describe that case.
>
> (OK, so I'm pedantic about naming. Now you know why one of my side
> projects is writing an online programmer's thesaurus -- using
> Python/TurboGears of course!)

Wouldn't it make more sense to contribute to wikipedia at this point?

> >> Also, I notice that this proposal removes what I consider to be a nice
> >> feature of Python, which is that you can take a plain file object and
> >> iterate over the lines of the file -- it would require a separate line
> >> buffering adapter to be created. I think I understand the reasoning
> >> behind this - in a world with multiple text encodings, the definition of
> >> "line" may not be so simple. However, I would assume that the "built-in"
> >> streams would support the most basic, least-common-denominator encodings
> >> for convenience.
> >
> > First time I noticed that. But perhaps it's the concept of "plain file
> > object" that changed? My own hierarchy (which I arrived at without
> > reading tomer's proposal) is something like this:
> >
> > (1) Basic level (implemented in C) -- open, close, read, write, seek,
> > tell. Completely unbuffered, maps directly to system calls. Does
> > binary I/O only.
> >
> > (2) Buffering. Implements the same API as (1) but adds buffering. This
> > is what one normally uses for binary file I/O. It builds on (1), but
> > can also be built on raw sockets instead. It adds an API to inquire
> > about the amount of buffered data, a flush() method, and ways to
> > change the buffer size.
> >
> > (3) Encoding and line endings. Implements a somewhat different API,
> > for reading/writing text files; the API resembles Python 2's I/O
> > library more. This is where readline() and next() giving the next line
> > are implemented. It also does newline translation to/from the
> > platform's native convention (CRLF or LF, or perhaps CR if anyone
> > still cares about Mac OS <= 9) and Python's convention (always \n). I
> > think I want to put these two features (encoding and line endings) in
> > the same layer because they are both text related. Of course you can
> > specify ASCII or Latin-1 to effectively disable the encoding part.
> >
> > Does this make more sense?
>
> I understood that much -- this is pretty much the way everyone does
> things these days (our own custom stream library at work looks pretty
> much like this too.)

So you have the buffering between the binary I/O and the text I/O too?

> The question I was wondering is, will the built-in 'file' function
> return an object of level 3?

I am hoping to get rid of 'file' altogether. Instead, I want to go
back to 'open'. Calling open() with a binary mode argument would
return a layer 2 or layer 1 (if unbuffered) object; calling it with a
text mode would return a layer 3 object. open() would grow additional
keyword parameters to specify the encoding, the desired newline
translation, and perhaps other aspects of the layering that might need
control.
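[A hypothetical sketch of what the proposed open() behaviour could look
like; the keyword parameter names here are guesses, not a settled API.]

```python
with open('data.bin', 'wb') as f:            # sample file for the demo
    f.write(b'abc\n')

raw = open('data.bin', 'rb', buffering=0)    # layer 1: unbuffered binary
buffered = open('data.bin', 'rb')            # layer 2: buffered binary
text = open('data.bin', 'r', encoding='ascii', newline=None)  # layer 3: text
line = text.readline()                       # decoded, newline-translated
raw.close(); buffered.close(); text.close()
```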

BTW in response to Alexander Belopolsky: yes, I would like to continue
support for something like readinto() by layer 1 and maybe 2 (perhaps
even more flexible, e.g. specifying a buffer and optional start and
end indices). I don't think it makes sense for layer 3 since strings
are immutable. I agree with Martin von Loewis that a readv() style API
would be impractical (and I note that Alexander doesn't provide any
use case beyond "it's more efficient").

A use case that I do think is important is reading encoded text data
asynchronously from a socket. This might mean that layers 2 and 3 may
have to be aware of the asynchronous (non-blocking or timeout-driven)
nature of the I/O; reading from layer 3 should give as many characters
as possible without blocking for I/O more than the specified timeout.
We should also decide how asynchronous I/O calls report "no more data"
-- exceptions are inefficient and cause clumsy code, but if we return
"", how can we tell that apart from EOF? Perhaps we can use None to
indicate "no more data available without blocking", continuing "" to
indicate EOF. (The other way around makes just as much sense but would
be a bigger break with Python's past than this particular issue is
worth to me.)
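[A sketch of the convention proposed above: read() returns None when no
data is available without blocking, and "" only at EOF. FakeStream is a
stand-in for a hypothetical non-blocking layer-3 object.]

```python
def drain(stream):
    """Collect whatever is available right now without blocking."""
    chunks = []
    while True:
        data = stream.read()
        if data is None:          # would block -- come back later
            break
        if data == "":            # EOF
            break
        chunks.append(data)
    return "".join(chunks)

class FakeStream(object):
    """Replays a scripted sequence of read() results."""
    def __init__(self, script):
        self.script = list(script)
    def read(self):
        return self.script.pop(0)
```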

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From alexander.belopolsky at gmail.com  Mon Aug 21 06:43:53 2006
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Mon, 21 Aug 2006 00:43:53 -0400
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <44E92FFC.9080407@v.loewis.de>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>	<44E8F350.8070509@acm.org>	<ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>
	<loom.20060821T051839-686@post.gmane.org>
	<44E92FFC.9080407@v.loewis.de>
Message-ID: <C41AE1E8-F6F6-492E-A1DD-AEA6ED3A0E86@local>


On Aug 21, 2006, at 12:01 AM, Martin v. Löwis wrote:

> Alexander Belopolsky schrieb:
>> For Py3k, I would like to suggest a buffer protocol modelled
>> after iovec structure that is used by the readv system call.
>> On many systems readv is more efficient than repeated calls
>> to read and I think Py3k will benefit from a direct access to
>> that feature.
>
> -1

What is this -1 for:

a) buffer protocol in Py3k?
b) multisegment buffer protocol?
c) readinto that supports multisegment buffers?

Note that in 2.x the buffer protocol is multisegment, but readinto
only supports single-segment buffers.

> It's difficult to use, and I question that there is any
> benefit.

I often deal with the system (kx.com) that represents matrices as
nested lists (1d lists of floats are contiguous).  My matrices are
stored on disk as C-style 2d arrays.  If readinto supported
multisegment buffers, I would be able to update the in-memory data
from files on disk with a single call.  Currently I have to do it
in a loop.
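[A sketch of the per-row workaround described above: a matrix stored
row-major on disk is read into per-row array buffers one readinto()
call at a time. A multisegment readinto could collapse this into a
single call. The filename is made up for illustration.]

```python
import array
import struct

rows, cols = 2, 3
with open('matrix.bin', 'wb') as f:          # sample C-style 2d array
    for i in range(rows * cols):
        f.write(struct.pack('d', float(i)))  # native-order doubles

matrix = [array.array('d', [0.0] * cols) for _ in range(rows)]
with open('matrix.bin', 'rb') as f:
    for row in matrix:
        f.readinto(row)                      # one call (and syscall) per row
```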

> I believe readv is there primarily for symmetry with
> writev and hasn't any sensible uses on its own. writev is
> there so you can add additional headers/trailers around data
> blocks you received from higher layers. I even doubt that
> exposing writev in Python would make a measurable performance
> difference.

I did not suggest exposing anything in Python.  AFAIK, the buffer
protocol is a C API only.

From talin at acm.org  Mon Aug 21 07:41:11 2006
From: talin at acm.org (Talin)
Date: Sun, 20 Aug 2006 22:41:11 -0700
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <ca471dc20608202132y52ecd4fbs24d7212689f2df03@mail.gmail.com>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>	
	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>	
	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>	
	<44E8F350.8070509@acm.org>	
	<ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>	
	<44E90DA4.1040203@acm.org>
	<ca471dc20608202132y52ecd4fbs24d7212689f2df03@mail.gmail.com>
Message-ID: <44E94777.9010601@acm.org>

Guido van Rossum wrote:
> On 8/20/06, Talin <talin at acm.org> wrote:
>> Guido van Rossum wrote:
> How sure are you of all that? I always thought that these have about
> the same age, and that the main distinction is byte vs. char
> orientation. Also, the InputStreamReader class clearly sits on top of
> the InputStream class (but surprisingly recommends that for efficiency
> you do buffering on the reader side instead of on the stream side --
> should we consider this for Python too?). And FileReader is a subclass
> of InputStreamReader. (OK, further investigation does show that
> FileInputStream exists since JDK 1.0 while InputStreamReader exists
> since JDK 1.1. But there's much newer Java I/O in the "nio" package,
> and there's work going on for "nio2", JSR 203.)

Admittedly my Java knowledge is somewhat old - I spent 2 years 
programming Java in the ".com era" (2000 - 2001). I remember when the 
new reader classes came out in JDK 1.1. So "old" and "new" are somewhat 
relative here. From the point of view of JDK1.5 they are probably 
indistinguishable as to age :)

>> For purposes of Python, it probably makes more sense to look at the .Net
>> System.IO.Stream. (As a general rule, the .Net classes are refactored
>> versions of the Java classes, which is both good and bad. It's best to
>> study both if one is looking for inspiration.)
> 
> Perhaps you can tell us more about that? I've used the Java I/O system
> sufficiently to have a feel for how it is actually used, which helps
> me find my way in the docs; but for .NET I fear that I would have to
> go on a sabbatical to make sense of it. And I don't have time for
> that.

Try this page. This will at least give you a start:

http://msdn2.microsoft.com/en-us/library/system.io.streamreader_members.aspx

Here's an excerpt from the "Read" method (reformatted by me):

StreamReader.Read () -- Reads the next character from the input stream 
and advances the character position by one character.

StreamReader.Read( Char[], Int32, Int32 ) -- Reads a maximum of count 
characters from the current stream into buffer, beginning at index.
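[A rough Python analogue, in modern terms, of the second Read overload:
filling part of an existing buffer starting at a given offset, rather
than allocating a new object per call.]

```python
import io

buf = bytearray(16)                          # preallocated destination
stream = io.BytesIO(b'hello world')
n = stream.readinto(memoryview(buf)[4:9])   # at most 5 bytes, at offset 4
```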

>> Hmmm, apparently the .Net documentation *does* use the term 'layer' to
>> describe one stream wrapping another - which I still find strange. To my
>> mind, the term 'layer' can either describe a particular design stratum
>> within an architecture - such as the 'device layer' of an operating
>> system - or it can describe a portion of a document, such as a drawing
>> layer in a CAD program.
> 
> It's used whenever you could draw a diagram of several layers of
> software sitting on top of each other. Perhaps usually layers are
> bigger (like device layers) but I see nothing wrong with declaring
> that Python I/O consists of three layers.
> 
>> I don't normally think of a single instance of a
>> class wrapping another instance as constituting a "layer" - I usually
>> use the term "adapter" or "proxy" to describe that case.
>>
>> (OK, so I'm pedantic about naming. Now you know why one of my side
>> projects is writing an online programmer's thesaurus -- using
>> Python/TurboGears of course!)
> 
> Wouldn't it make more sense to contribute to wikipedia at this point?

Off topic :)

Seriously, though, what I am doing is very different from Wikipedia, and 
much more like WordNet - that is, I have a database that represents 
semantic relations between words, and an AJAX GUI that allows editing of 
those relationships. Mostly it works, but I still need a way for people 
to create accounts.

(Source browsable at http://www.viridia.org/hg/ if interested.)

>> >> Also, I notice that this proposal removes what I consider to be a nice
>> >> feature of Python, which is that you can take a plain file object and
>> >> iterate over the lines of the file -- it would require a separate line
>> >> buffering adapter to be created. I think I understand the reasoning
>> >> behind this - in a world with multiple text encodings, the 
>> definition of
>> >> "line" may not be so simple. However, I would assume that the 
>> "built-in"
>> >> streams would support the most basic, least-common-denominator 
>> encodings
>> >> for convenience.
>> >
>> > First time I noticed that. But perhaps it's the concept of "plain file
>> > object" that changed? My own hierarchy (which I arrived at without
>> > reading tomer's proposal) is something like this:
>> >
>> > (1) Basic level (implemented in C) -- open, close, read, write, seek,
>> > tell. Completely unbuffered, maps directly to system calls. Does
>> > binary I/O only.
>> >
>> > (2) Buffering. Implements the same API as (1) but adds buffering. This
>> > is what one normally uses for binary file I/O. It builds on (1), but
>> > can also be built on raw sockets instead. It adds an API to inquire
>> > about the amount of buffered data, a flush() method, and ways to
>> > change the buffer size.
>> >
>> > (3) Encoding and line endings. Implements a somewhat different API,
>> > for reading/writing text files; the API resembles Python 2's I/O
>> > library more. This is where readline() and next() giving the next line
>> > are implemented. It also does newline translation to/from the
>> > platform's native convention (CRLF or LF, or perhaps CR if anyone
>> > still cares about Mac OS <= 9) and Python's convention (always \n). I
>> > think I want to put these two features (encoding and line endings) in
>> > the same layer because they are both text related. Of course you can
>> > specify ASCII or Latin-1 to effectively disable the encoding part.
>> >
>> > Does this make more sense?
>>
>> I understood that much -- this is pretty much the way everyone does
>> things these days (our own custom stream library at work looks pretty
>> much like this too.)
> 
> So you have the buffering between the binary I/O and the text I/O too?

Theoretically, yes - you can plug in a buffer in-between them if you 
want. It doesn't do this by default however (our needs are somewhat 
specialized.)

>> The question I was wondering is, will the built-in 'file' function
>> return an object of level 3?
> 
> I am hoping to get rid of 'file' altogether. Instead, I want to go
> back to 'open'. Calling open() with a binary mode argument would
> return a layer 2 or layer 1 (if unbuffered) object; calling it with a
> text mode would return a layer 3 object. open() would grow additional
> keyword parameters to specify the encoding, the desired newline
> translation, and perhaps other aspects of the layering that might need
> control.
> 
> BTW in response to Alexander Belopolsky: yes, I would like to continue
> support for something like readinto() by layer 1 and maybe 2 (perhaps
> even more flexible, e.g. specifying a buffer and optional start and
> end indices). I don't think it makes sense for layer 3 since strings
> are immutable. I agree with Martin von Loewis that a readv() style API
> would be impractical (and I note that Alexander doesn't provide any
> use case beyond "it's more efficient").

Note that the .Net API in the example above supports this.

> A use case that I do think is important is reading encoded text data
> asynchronously from a socket. This might mean that layers 2 and 3 may
> have to be aware of the asynchronous (non-blocking or timeout-driven)
> nature of the I/O; reading from layer 3 should give as many characters
> as possible without blocking for I/O more than the specified timeout.
> We should also decide how asynchronous I/O calls report "no more data"
> -- exceptions are inefficient and cause clumsy code, but if we return
> "", how can we tell that apart from EOF? Perhaps we can use None to
> indicate "no more data available without blocking", continuing "" to
> indicate EOF. (The other way around makes just as much sense but would
> be a bigger break with Python's past than this particular issue is
> worth to me.)
> 

From ncoghlan at gmail.com  Mon Aug 21 12:03:46 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 21 Aug 2006 20:03:46 +1000
Subject: [Python-3000] int-long unification
In-Reply-To: <44E88E1F.6010607@v.loewis.de>
References: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
	<44E88E1F.6010607@v.loewis.de>
Message-ID: <44E98502.5000203@gmail.com>

Martin v. Löwis wrote:
> Guido van Rossum schrieb:
>> Are you interested in doing this at the Google sprint next week?
> 
> Sure; I hadn't any special plans so far.
> 
>> What do you think?
> 
> Sounds good. There are two problems I see:
> 
> - how to benchmark?
> 
> - there are subtle details in the API that require changes
>   to extension code. In particular, PyInt_AsLong currently
>   cannot fail, but can fail with a range error after the
>   unification.

PyInt_AsLong can already fail with OverflowError - pass it a PyLong object and 
it will try to convert it using the nb_int slot and PyLong_AsLong.

PyInt_AsLong is actually somewhat misnamed - it is really PyNumber_AsLong, 
since it accepts arbitrary objects and coerces them to integers via __int__, 
instead of just accepting PyInt instances.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From qrczak at knm.org.pl  Mon Aug 21 13:11:12 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Mon, 21 Aug 2006 13:11:12 +0200
Subject: [Python-3000] int-long unification
In-Reply-To: <44E98502.5000203@gmail.com> (Nick Coghlan's message of "Mon,
	21 Aug 2006 20:03:46 +1000")
References: <ca471dc20608190809x76320b6ctff62cc44f30574ec@mail.gmail.com>
	<44E88E1F.6010607@v.loewis.de> <44E98502.5000203@gmail.com>
Message-ID: <8764gmpcmn.fsf@qrnik.zagroda>

Nick Coghlan <ncoghlan at gmail.com> writes:

> PyInt_AsLong can already fail with OverflowError

> it accepts arbitrary objects and coerces them to integers via
> __int__, instead of just accepting PyInt instances.

If it calls __int__, it can fail with any exception resulting from
user code.

Grepping sources (2.4.2) reveals that usages are split into 4 groups:

1. Calling PyInt_AsLong only after PyInt_Check succeeds.

2. Handling the case when PyInt_AsLong returns -1 and PyErr_Occurred(),
   or just when PyErr_Occurred().

3. Doing both (e.g. Modules/mmapmodule.c). The test is superfluous
   but harmless.

4. Doing neither (e.g. Modules/parsermodule.c, Modules/posixmodule.c,
   Modules/selectmodule.c and possibly more). This is potentially buggy.

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From krstic at solarsail.hcs.harvard.edu  Mon Aug 21 13:16:03 2006
From: krstic at solarsail.hcs.harvard.edu (=?UTF-8?B?SXZhbiBLcnN0acSH?=)
Date: Mon, 21 Aug 2006 07:16:03 -0400
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <loom.20060821T051839-686@post.gmane.org>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>	<44E8F350.8070509@acm.org>	<ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>
	<loom.20060821T051839-686@post.gmane.org>
Message-ID: <44E995F3.3090208@solarsail.hcs.harvard.edu>

Alexander Belopolsky wrote:
> The new API does not seem to provide for a way to read
> data directly into an existing object without creating
> an intermediate bytes object.

This is among the several things that Itamar Shtull-Trauring mentioned
during his PyCon 2005 talk on 'Fast Networking with Python':

 http://ln-s.net/D+u

While they do not affect the new I/O stack design directly, the other
ways Itamar lists for improving Python's network efficiency (deep
support for buffers, non-copying split(), array.array extensions,
etc.) are things we should probably discuss here.

-- 
Ivan Krstić <krstic at solarsail.hcs.harvard.edu> | GPG: 0x147C722D

From g.brandl at gmx.net  Mon Aug 21 21:07:44 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Mon, 21 Aug 2006 21:07:44 +0200
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <20060821191023.31522.47467@ximinez.python.org>
References: <20060821191023.31522.47467@ximinez.python.org>
Message-ID: <ecd0a0$eli$1@sea.gmane.org>

python.org Webmaster wrote:
> Dear Wiki user,
> 
> You have subscribed to a wiki page or wiki category on "PythonInfo Wiki" for change notification.
> 
> The following page has been changed by 65.57.245.11:
> http://wiki.python.org/moin/GoogleSprintPy3k
> 
> ------------------------------------------------------------------------------
>   
>    * See PEP PEP:3100 for more ideas
>   
> -  * Make zip() an iterator (like itertools.zip())
> +  * Make zip() an iterator (like itertools.izip())
> + 
> +  * Make map() and filter() iterators and make them stop at the end of the shortest input (like zip()) instead of at the end of the longest input

May I suggest an additional keyword(-only?) argument to get the old behavior,
stopping at the end of the longest input?

Georg


From collinw at gmail.com  Mon Aug 21 21:12:02 2006
From: collinw at gmail.com (Collin Winter)
Date: Mon, 21 Aug 2006 14:12:02 -0500
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <ecd0a0$eli$1@sea.gmane.org>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
Message-ID: <43aa6ff70608211212o271f4c7bxca5108107931e077@mail.gmail.com>

On 8/21/06, Georg Brandl <g.brandl at gmx.net> wrote:
> python.org Webmaster wrote:
> > -  * Make zip() an iterator (like itertools.zip())
> > +  * Make zip() an iterator (like itertools.izip())
> > +
> > +  * Make map() and filter() iterators and make them stop at the end of the shortest input (like zip()) instead of at the end of the longest input
>
> May I suggest an additional keyword(-only?) argument to get the old behavior,
> stopping at the end of the longest input?

I thought map() and filter() were going away in Py3k? Did that change?

Collin Winter

From guido at python.org  Mon Aug 21 21:14:54 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 21 Aug 2006 12:14:54 -0700
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <ecd0a0$eli$1@sea.gmane.org>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
Message-ID: <ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>

On 8/21/06, Georg Brandl <g.brandl at gmx.net> wrote:
> > +  * Make map() and filter() iterators and make them stop at the end of the shortest input (like zip()) instead of at the end of the longest input
>
> May I suggest an additional keyword(-only?) argument to get the old behavior,
> stopping at the end of the longest input?

I'd rather not. Why, apart from backwards compatibility?

I'd like map(f, a, b) to be the same as (f(*x) for x in zip(a, b))
so we have to explain less. (And I think even map(f, *args) === (f(*x)
for x in zip(*args)).)

The right way to write code that works in 2.6 and 3.0 is to only use
inputs of the same length.

Perhaps there could be (or is there already?) a helper in itertools
that iterates over multiple iterables padding the shorter inputs with
None to the length of the longest one.
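Both the equivalence and the padding helper Guido asks about can be sketched concretely. `pad_zip` is a hypothetical name used purely for illustration here; itertools shipped no such helper at the time (the idea is what later grew into a longest-input zip):

```python
def pad_zip(*iterables, fillvalue=None):
    """Hypothetical helper: like zip(), but pad shorter inputs with
    fillvalue until the longest input is exhausted."""
    iterators = [iter(it) for it in iterables]
    sentinel = object()
    while True:
        exhausted = 0
        row = []
        for it in iterators:
            value = next(it, sentinel)
            if value is sentinel:
                exhausted += 1
                value = fillvalue
            row.append(value)
        if exhausted == len(iterators):
            return
        yield tuple(row)

# The identity Guido wants map() to satisfy, in modern spelling:
a, b = [2, 3, 4], [1, 2, 3]
assert list(map(pow, a, b)) == [pow(*x) for x in zip(a, b)]

# Padding the shorter input instead of truncating at it:
assert list(pad_zip([1, 2, 3], "x")) == [(1, "x"), (2, None), (3, None)]
```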

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Mon Aug 21 21:16:21 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 21 Aug 2006 12:16:21 -0700
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <43aa6ff70608211212o271f4c7bxca5108107931e077@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<43aa6ff70608211212o271f4c7bxca5108107931e077@mail.gmail.com>
Message-ID: <ca471dc20608211216r6090aa2ei2b60d0188f5d71c2@mail.gmail.com>

On 8/21/06, Collin Winter <collinw at gmail.com> wrote:
> I thought map() and filter() were going away in Py3k? Did that change?

I still find them useful when using a built-in function, and unlike
reduce(), I have no trouble reading and understanding such code.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From collinw at gmail.com  Mon Aug 21 21:20:15 2006
From: collinw at gmail.com (Collin Winter)
Date: Mon, 21 Aug 2006 14:20:15 -0500
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <ca471dc20608211216r6090aa2ei2b60d0188f5d71c2@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<43aa6ff70608211212o271f4c7bxca5108107931e077@mail.gmail.com>
	<ca471dc20608211216r6090aa2ei2b60d0188f5d71c2@mail.gmail.com>
Message-ID: <43aa6ff70608211220i28bc20a5r4d5fe3b66740873d@mail.gmail.com>

On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> On 8/21/06, Collin Winter <collinw at gmail.com> wrote:
> > I thought map() and filter() were going away in Py3k? Did that change?
>
> I still find them useful when using a built-in function, and unlike
> reduce(), I have no trouble reading and understanding such code.

You might want to remove them from PEP 3100, as it still lists them
under "To be removed".

From guido at python.org  Mon Aug 21 21:21:20 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 21 Aug 2006 12:21:20 -0700
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <43aa6ff70608211220i28bc20a5r4d5fe3b66740873d@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<43aa6ff70608211212o271f4c7bxca5108107931e077@mail.gmail.com>
	<ca471dc20608211216r6090aa2ei2b60d0188f5d71c2@mail.gmail.com>
	<43aa6ff70608211220i28bc20a5r4d5fe3b66740873d@mail.gmail.com>
Message-ID: <ca471dc20608211221m241dbe5k6f7661da024001ff@mail.gmail.com>

On 8/21/06, Collin Winter <collinw at gmail.com> wrote:
> On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> > On 8/21/06, Collin Winter <collinw at gmail.com> wrote:
> > > I thought map() and filter() were going away in Py3k? Did that change?
> >
> > I still find them useful when using a built-in function, and unlike
> > reduce(), I have no trouble reading and understanding such code.
>
> You might want to remove them from PEP 3100, as it still lists them
> under "To be removed".

With three question marks. :-)

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From fredrik.johansson at gmail.com  Mon Aug 21 21:28:30 2006
From: fredrik.johansson at gmail.com (Fredrik Johansson)
Date: Mon, 21 Aug 2006 21:28:30 +0200
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
Message-ID: <3d0cebfb0608211228i2369dc8dq431d8c94216b8d60@mail.gmail.com>

On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> Perhaps there could be (or is there already?) a helper in itertools
> that iterates over multiple iterables padding the shorter inputs with
> None to the length of the longest one.

I think the most convenient solution would be to handle this with a
keyword argument to zip(), i.e., zip(a, b, pad=True).

Fredrik Johansson

From guido at python.org  Mon Aug 21 21:53:23 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 21 Aug 2006 12:53:23 -0700
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <3d0cebfb0608211228i2369dc8dq431d8c94216b8d60@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
	<3d0cebfb0608211228i2369dc8dq431d8c94216b8d60@mail.gmail.com>
Message-ID: <ca471dc20608211253tb131a78v7cce01180be88d11@mail.gmail.com>

On 8/21/06, Fredrik Johansson <fredrik.johansson at gmail.com> wrote:
> On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> > Perhaps there could be (or is there already?) a helper in itertools
> > that iterates over multiple iterables padding the shorter inputs with
> > None to the length of the longest one.
>
> I think the most convenient solution would be to handle this with a
> keyword argument to zip(), i.e., zip(a, b, pad=True).

First you'll have to show me a real use case where this behavior is
actually needed.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From martin at v.loewis.de  Mon Aug 21 21:57:26 2006
From: martin at v.loewis.de (martin at v.loewis.de)
Date: Mon, 21 Aug 2006 21:57:26 +0200
Subject: [Python-3000] Google Sprint Ideas
In-Reply-To: <C41AE1E8-F6F6-492E-A1DD-AEA6ED3A0E86@local>
References: <ca471dc20608200910s34ef36f6ueb694af487bfbfa4@mail.gmail.com>
	<79990c6b0608201352i74e5def4t16e944db7de22768@mail.gmail.com>
	<ca471dc20608201627n23d4b98m59dbca1c561121e4@mail.gmail.com>
	<44E8F350.8070509@acm.org>
	<ca471dc20608201806x2356cd42i75112ca3850bab01@mail.gmail.com>
	<loom.20060821T051839-686@post.gmane.org>
	<44E92FFC.9080407@v.loewis.de>
	<C41AE1E8-F6F6-492E-A1DD-AEA6ED3A0E86@local>
Message-ID: <1156190246.44ea1026ae2a3@www.domainfactory-webmail.de>

Zitat von Alexander Belopolsky <alexander.belopolsky at gmail.com>:

> > Alexander Belopolsky schrieb:
> >> For Py3k, I would like to suggest a buffer protocol modelled
> >> after iovec structure that is used by the readv system call.
> >
> > -1
>
> What is this -1 for:
>
> a) buffer protocol in Py3k?
> b) multisegment buffer protocol?
> c) readinto that supports multisegment buffers?

b and c; I don't have an opinion on a.

> I did not suggest to expose anything in Python.  AFAIK, the buffer
> protocol is a C API only.

Ah; now that the IO library will likely be 100% pure Python,
this needs thought.

Regards,
Martin





From fredrik.johansson at gmail.com  Mon Aug 21 22:35:37 2006
From: fredrik.johansson at gmail.com (Fredrik Johansson)
Date: Mon, 21 Aug 2006 22:35:37 +0200
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <ca471dc20608211253tb131a78v7cce01180be88d11@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
	<3d0cebfb0608211228i2369dc8dq431d8c94216b8d60@mail.gmail.com>
	<ca471dc20608211253tb131a78v7cce01180be88d11@mail.gmail.com>
Message-ID: <3d0cebfb0608211335h38ddfc87hc582e086e3b03f93@mail.gmail.com>

On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> On 8/21/06, Fredrik Johansson <fredrik.johansson at gmail.com> wrote:
> > On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> > > Perhaps there could be (or is there already?) a helper in itertools
> > > that iterates over multiple iterables padding the shorter inputs with
> > > None to the length of the longest one.
> >
> > I think the most convenient solution would be to handle this with a
> > keyword argument to zip(), i.e., zip(a, b, pad=True).
>
> First you'll have to show me a real use case where this behavior is
> actually needed.

I didn't suggest that this feature is needed. But if it is, extending
zip() to handle both cases hardly seems to add more cruft to the
language than adding a whole new function (stuffed away in a library
where not even the language's creator remembers whether it exists :-).

Fredrik Johansson

From guido at python.org  Mon Aug 21 22:40:47 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 21 Aug 2006 13:40:47 -0700
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <3d0cebfb0608211335h38ddfc87hc582e086e3b03f93@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
	<3d0cebfb0608211228i2369dc8dq431d8c94216b8d60@mail.gmail.com>
	<ca471dc20608211253tb131a78v7cce01180be88d11@mail.gmail.com>
	<3d0cebfb0608211335h38ddfc87hc582e086e3b03f93@mail.gmail.com>
Message-ID: <ca471dc20608211340w5ca9597bs6b8308a8a0a74695@mail.gmail.com>

On 8/21/06, Fredrik Johansson <fredrik.johansson at gmail.com> wrote:
> On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> > On 8/21/06, Fredrik Johansson <fredrik.johansson at gmail.com> wrote:
> > > On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> > > > Perhaps there could be (or is there already?) a helper in itertools
> > > > that iterates over multiple iterables padding the shorter inputs with
> > > > None to the length of the longest one.
> > >
> > > I think the most convenient solution would be to handle this with a
> > > keyword argument to zip(), i.e., zip(a, b, pad=True).
> >
> > First you'll have to show me a real use case where this behavior is
> > actually needed.
>
> I didn't suggest that this feature is needed. But if it is, extending
> zip() to handle both cases hardly seems to add more cruft to the
> language than adding a whole new function (stuffed away in a library
> where not even the language's creator remembers whether it exists :-).

I beg to disagree. In general I don't like flag arguments that modify
the behavior of a call, when in practice the flag value passed will
nearly always be a constant. That's why we have e.g. find() and
rfind(), not find(..., fromright=False).

Also, I'd like to call YAGNI (and stop wasting everybody's time)
unless a good use case is brought up.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Mon Aug 21 23:21:30 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Mon, 21 Aug 2006 14:21:30 -0700
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <44E950B2.4060305@acm.org>
References: <ca471dc20608200824l2129308ne26e5332ce7585ce@mail.gmail.com>
	<44E950B2.4060305@acm.org>
Message-ID: <20060821081944.1A0F.JCARLSON@uci.edu>


Talin <talin at acm.org> wrote:
[snip]
> I've been thinking about the transition to unicode strings, and I want 
> to put forward a notion that might allow the transition to be done 
> gradually instead of all at once.
> 
> The idea would be to temporarily introduce a new name for 8-bit strings 
> - let's call it "ascii". An "ascii" object would be exactly the same as 
> today's 8-bit strings.

There are two parts to the unicode conversion; all literals are unicode,
and we don't have strings anymore, we have bytes.  Without offering the
bytes object, then people can't really convert their code.  String
literals can be handled with the -U command line option (and perhaps
having the interpreter do the str=unicode assignment during startup).


In any case, as I look at Py3k and the future of Python, in each release,
I ask "what are the compelling features that make me want to upgrade?"
In each of the 1.5-2.5 releases that I've looked at, there has been some
compelling feature or another that has basically required that I upgrade,
or seriously consider upgrading (bugfixes for stuff that has bitten me,
new syntax that I use, significant increases in speed, etc.).

As we approach Py3k, I again ask, "what are the compelling features?"
Wholesale breakage of anything that uses ascii strings as text or binary
data? A completely changed IO stack (requiring re-learning of everything
known about Python IO)?  Dictionary .keys(), .values(), and .items()
being their .iter*() equivalents (making it just about impossible to
optimize for Py3k dictionary behavior now)?

I understand getting rid of the cruft, really I do (you should see some
cruft I've been replacing lately). But some of that cruft is useful, or
really, some of that cruft has no alternative currently, which will
require significant rewrites of user code when Py3k is released.  When
everyone has to rewrite their code, they are going to ask, "Why don't I
just stick with the maintenance 2.x? It's going to be maintained for a
few more years yet, and I don't need to rewrite all of my disk IO,
strings in dictionary code, etc."  I will be right along with them (no
offense intended to those currently working towards py3k).

I can code defensively against buffer-saturating DOS attacks with my
socket code, but I can't code defensively to handle some (never mind all)
of the changes and incompatibilities that Py3k will bring.

Here's my suggestion: every feature, syntax, etc., that is slated for
Py3k, let us release bit by bit in the 2.x series.  That lets the 2.x
series evolve into the 3.x series in a somewhat more natural way than
the currently proposed *everything breaks*.  If it takes 1, 2, 3, or 10
more releases in the 2.x series to get to all of the 3.x features, great.
At least people will have a chance to convert, or at least write correct
code for the future.

Say 2.6 gets bytes and special factories (or a special encoding argument)
for file/socket to return bytes instead of strings, and only accept
bytes objects to .write() methods (unless an encoding on the file, etc.,
was previously given). Given these bytes objects, it may even make sense
to offer the .readinto() method that Alex B has been asking for (which
would make 3 built-in objects that could reasonably support readinto:
bytes, array, mmap).

If the IO library is available for 2.6, toss that in there, or offer it
in PyPI as an evolving library.

I would suggest pushing off the dict changes until 2.7 or later, as
there are 340+ examples of dict.keys() in the Python 2.5b2 standard
library, at least half of which are going to need to be changed to
list(dict.keys()) or otherwise.  The breakage in user code will likely
be at least as substantial.
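The rewrite in question is mechanical but pervasive. Under the proposed semantics keys() returns an iterator/view rather than a list, so any code that sorts or indexes the result needs an explicit list() call; that spelling also happens to work unchanged in 2.x:

```python
d = {"b": 2, "a": 1}

# 2.x idiom: d.keys() is a list, so keys = d.keys(); keys.sort() works.
# The portable spelling wraps it in list() first:
keys = list(d.keys())
keys.sort()
assert keys == ["a", "b"]

# values() and items() need the same treatment wherever a real
# list is required:
assert sorted(d.items()) == [("a", 1), ("b", 2)]
```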


Those are just examples that come to mind now, but I'm sure there are
others changes with similar issues.

 - Josiah


From exarkun at divmod.com  Mon Aug 21 23:38:17 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Mon, 21 Aug 2006 17:38:17 -0400
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <20060821081944.1A0F.JCARLSON@uci.edu>
Message-ID: <20060821213817.1717.1725966885.divmod.quotient.28023@ohm>

On Mon, 21 Aug 2006 14:21:30 -0700, Josiah Carlson <jcarlson at uci.edu> wrote:
>
>Talin <talin at acm.org> wrote:
>[snip]
>> I've been thinking about the transition to unicode strings, and I want
>> to put forward a notion that might allow the transition to be done
>> gradually instead of all at once.
>>
>> The idea would be to temporarily introduce a new name for 8-bit strings
>> - let's call it "ascii". An "ascii" object would be exactly the same as
>> today's 8-bit strings.
>
>There are two parts to the unicode conversion; all literals are unicode,
>and we don't have strings anymore, we have bytes.  Without offering the
>bytes object, then people can't really convert their code.  String
>literals can be handled with the -U command line option (and perhaps
>having the interpreter do the str=unicode assignment during startup).
>

A third step would ease this transition significantly: a unicode_literals __future__ import.

>
>Here's my suggestion: every feature, syntax, etc., that is slated for
>Py3k, let us release bit by bit in the 2.x series.  That lets the 2.x
>series evolve into the 3.x series in a somewhat more natural way than
>the currently proposed *everything breaks*.  If it takes 1, 2, 3, or 10
>more releases in the 2.x series to get to all of the 3.x features, great.
>At least people will have a chance to convert, or at least write correct
>code for the future.

This really seems like the right idea.  "Shoot the moon" upgrades are
almost always worse than incremental upgrades.

The incremental path is better for everyone involved.  For developers of
Python, it gets more people using and providing feedback on the new
features being developed.  For developers with Python, it keeps the scope
of a particular upgrade more manageable, letting them focus on a
much smaller set of changes to be made to their application.

Jean-Paul

From guido at python.org  Tue Aug 22 02:36:41 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 21 Aug 2006 17:36:41 -0700
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <20060821081944.1A0F.JCARLSON@uci.edu>
References: <ca471dc20608200824l2129308ne26e5332ce7585ce@mail.gmail.com>
	<44E950B2.4060305@acm.org> <20060821081944.1A0F.JCARLSON@uci.edu>
Message-ID: <ca471dc20608211736h5f8903cctc92c60c5bd6e538e@mail.gmail.com>

On 8/21/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> As we approach Py3k, I again ask, "what are the compelling features?"
> Wholesale breakage of anything that uses ascii strings as text or binary
> data? A completely changed IO stack (requiring re-learning of everything
> known about Python IO)?  Dictionary .keys(), .values(), and .items()
> being their .iter*() equivalents (making it just about impossible to
> optimize for Py3k dictionary behavior now)?

I guess py3k is not for you yet. That's a totally defensible point of
view, and that's why there will be Python 2.6, 2.7, 2.8 and 2.9
(probably) which will gradually close the gap, after which you will
have the choice of maintaining 2.9 yourself or making the switch. :-)

> I understand getting rid of the cruft, really I do (you should see some
> cruft I've been replacing lately). But some of that cruft is useful, or
> really, some of that cruft has no alternative currently, which will
> require significant rewrites of user code when Py3k is released.  When
> everyone has to rewrite their code, they are going to ask, "Why don't I
> just stick with the maintenance 2.x? It's going to be maintained for a
> few more years yet, and I don't need to rewrite all of my disk IO,
> strings in dictionary code, etc."  I will be right along with them (no
> offense intended to those currently working towards py3k).

And yet offense is taken. Have you watched the video of my Py3k talk?
Search for it on Google Video.

> I can code defensively against buffer-saturating DOS attacks with my
> socket code, but I can't code defensively to handle some (never mind all)
> of the changes and incompatibilities that Py3k will bring.

And that's why there will be conversion tools and aids.

> Here's my suggestion: every feature, syntax, etc., that is slated for
> Py3k, let us release bit by bit in the 2.x series.  That lets the 2.x
> series evolve into the 3.x series in a somewhat more natural way than
> the currently proposed *everything breaks*.  If it takes 1, 2, 3, or 10
> more releases in the 2.x series to get to all of the 3.x features, great.
> At least people will have a chance to convert, or at least write correct
> code for the future.

That will happen, whenever possible. For other features it is infeasible.

> Say 2.6 gets bytes and special factories (or a special encoding argument)
> for file/socket to return bytes instead of strings, and only accept
> bytes objects to .write() methods (unless an encoding on the file, etc.,
> was previously given). Given these bytes objects, it may even make sense
> to offer the .readinto() method that Alex B has been asking for (which
> would make 3 built-in objects that could reasonably support readinto:
> bytes, array, mmap).
>
> If the IO library is available for 2.6, toss that in there, or offer it
> in PyPI as an evolving library.

Could do.

> I would suggest pushing off the dict changes until 2.7 or later, as
> there are 340+ examples of dict.keys() in the Python 2.5b2 standard
> library, at least half of which are going to need to be changed to
> list(dict.keys()) or otherwise.  The breakage in user code will likely
> be at least as substantial.

Perhaps you want to help write the transition PEP?

> Those are just examples that come to mind now, but I'm sure there are
> others changes with similar issues.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From murman at gmail.com  Tue Aug 22 05:07:04 2006
From: murman at gmail.com (Michael Urman)
Date: Mon, 21 Aug 2006 22:07:04 -0500
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
Message-ID: <dcbbbb410608212007l7710c284wc98e91b903b4051e@mail.gmail.com>

On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> I'd like map(f, a, b) to be the same as (f(*x) for x in zip(a, b))
> so we have to explain less. (And I think even map(f, *args) === (f(*x)
> for x in zip(*args)).)

Should map(None, a, b) == zip(a, b), leaving python with multiple ways
to do one thing? Or should the surprising but useful map(None, ...)
behavior disappear or become even more surprising by padding? Is there
any reason at all for map to take multiple sequences now that we have
starmap and (i)zip?
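The overlap Michael points at can be shown directly. In modern spelling (in 2006 the iterator forms lived in itertools as imap/izip), map() over multiple sequences is exactly starmap() over zip():

```python
from itertools import starmap

a, b = [1, 2, 3], [10, 20, 30]

# map() with multiple sequences...
multi = list(map(pow, a, b))

# ...is the same as starmap() applied to the zipped pairs:
via_starmap = list(starmap(pow, zip(a, b)))

assert multi == via_starmap == [1, 1048576, 205891132094649]
```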
-- 
Michael Urman  http://www.tortall.net/mu/blog

From collinw at gmail.com  Tue Aug 22 05:16:58 2006
From: collinw at gmail.com (Collin Winter)
Date: Mon, 21 Aug 2006 22:16:58 -0500
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <dcbbbb410608212007l7710c284wc98e91b903b4051e@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
	<dcbbbb410608212007l7710c284wc98e91b903b4051e@mail.gmail.com>
Message-ID: <43aa6ff70608212016m683c3b8ci9803c31858c937e7@mail.gmail.com>

On 8/21/06, Michael Urman <murman at gmail.com> wrote:
> On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> > I'd like map(f, a, b) to be the same as (f(*x) for x in zip(a, b))
> > so we have to explain less. (And I think even map(f, *args) === (f(*x)
> > for x in zip(*args)).)
>
> Should map(None, a, b) == zip(a, b), leaving python with multiple ways
> to do one thing? Or should the surprising but useful map(None, ...)
> behavior disappear or become even more surprising by padding? Is there
> any reason at all for map to take multiple sequences now that we have
> starmap and (i)zip?

FWIW, I'm ambivalent as to whether map() accepts multiple sequences,
but I'm strongly in favor of map(None, ....) disappearing. Similarly,
I'd want to see filter(None, ...) go away, too; fastpathing the case
of filter(bool, ....) will achieve the same performance benefit.
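The equivalence Collin relies on is that filter(None, seq) and filter(bool, seq) both keep exactly the truthy items, so dropping the None spelling loses no expressiveness:

```python
data = [0, 1, "", "x", None, [], [3], False, True]

# Both predicates keep exactly the truthy elements:
assert list(filter(None, data)) == list(filter(bool, data)) == [1, "x", [3], True]
```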

Collin Winter

From tjreedy at udel.edu  Tue Aug 22 05:19:30 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Mon, 21 Aug 2006 23:19:30 -0400
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
References: <ca471dc20608200824l2129308ne26e5332ce7585ce@mail.gmail.com><44E950B2.4060305@acm.org>
	<20060821081944.1A0F.JCARLSON@uci.edu>
	<ca471dc20608211736h5f8903cctc92c60c5bd6e538e@mail.gmail.com>
Message-ID: <ecdt42$j5$1@sea.gmane.org>


"Guido van Rossum" <guido at python.org> wrote in message 
news:ca471dc20608211736h5f8903cctc92c60c5bd6e538e at mail.gmail.com...

> On 8/21/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>>  When
>> everyone has to rewrite their code, they are going to ask, "Why don't I
>> just stick with the maintenance 2.x? It's going to be maintained for a
>> few more years yet, and I don't need to rewrite all of my disk IO,
>> strings in dictionary code, etc."  I will be right along with them

Many apps never will be converted, just as there are still things running 
under 1.5 and all versions since.  The changeover to writing new stuff in 
3.x will be at least somewhat gradual, as such things always are, and that 
is a good thing, lest the issue tracker be flooded with more items than can 
be dealt with.

> Have you watched the video of my Py3k talk?
> Search for it on Google Video.

Searching for "Guido Python" returns
http://video.google.com/videoplay?docid=-6459339159268485356
It pretty well summarizes the results of discussion here up to a month ago.

Terry Jan Reedy




From guido at python.org  Tue Aug 22 05:55:49 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 21 Aug 2006 20:55:49 -0700
Subject: [Python-3000] [PythonInfo Wiki] Update of "GoogleSprintPy3k" by
	65.57.245.11
In-Reply-To: <43aa6ff70608212016m683c3b8ci9803c31858c937e7@mail.gmail.com>
References: <20060821191023.31522.47467@ximinez.python.org>
	<ecd0a0$eli$1@sea.gmane.org>
	<ca471dc20608211214j49fdb7b5ta4aaa845785c7a77@mail.gmail.com>
	<dcbbbb410608212007l7710c284wc98e91b903b4051e@mail.gmail.com>
	<43aa6ff70608212016m683c3b8ci9803c31858c937e7@mail.gmail.com>
Message-ID: <ca471dc20608212055k342f917vfa78dc900abbb557@mail.gmail.com>

On 8/21/06, Collin Winter <collinw at gmail.com> wrote:
> On 8/21/06, Michael Urman <murman at gmail.com> wrote:
> > On 8/21/06, Guido van Rossum <guido at python.org> wrote:
> > > I'd like map(f, a, b) to be the same as (f(*x) for x in zip(a, b))
> > > so we have to explain less. (And I think even map(f, *args) === (f(*x)
> > > for x in zip(*args)).)
> >
> > Should map(None, a, b) == zip(a, b), leaving python with multiple ways
> > to do one thing? Or should the surprising but useful map(None, ...)
> > behavior disappear or become even more surprising by padding? Is there
> > any reason at all for map to take multiple sequences now that we have
> > starmap and (i)zip?
>
> FWIW, I'm ambivalent as to whether map() accepts multiple sequences,
> but I'm strongly in favor of map(None, ....) disappearing. Similarly,
> I'd want to see filter(None, ...) go away, too; fastpathing the case
> of filter(bool, ....) will achieve the same performance benefit.

I think map(f, a, b, ...) and filter(p, a, b, ...) should stay, but
the None cases should be gotten rid of. I don't want to move starmap()
out of itertools into builtins.

I expect that filter(bool, a) is fast enough without greasing the
tracks, but if you don't, feel free to benchmark it.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 23 03:32:39 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 22 Aug 2006 18:32:39 -0700
Subject: [Python-3000] Droping find/rfind?
Message-ID: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>

At today's sprint, one of the volunteers completed a patch to rip out
find() and rfind(), replacing all calls with index()/rindex(). But now
I'm getting cold feet -- is this really a good idea? (It's been listed
in PEP 3100 for a long time, but I haven't thought about it much,
really.)

What do people think?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From tim.peters at gmail.com  Wed Aug 23 03:47:18 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Tue, 22 Aug 2006 21:47:18 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
Message-ID: <1f7befae0608221847t55c09c57r64cd65511b51f6d4@mail.gmail.com>

[Guido van Rossum]
> At today's sprint, one of the volunteers completed a patch to rip out
> find() and rfind(), replacing all calls with index()/rindex(). But now
> I'm getting cold feet -- is this really a good idea? (It's been listed
> in PEP 3100 for a long time, but I haven't thought about it much,
> really.)
>
> What do people think?

I'd rather toss index/rindex myself, although I understand that
[r]find's -1 return value for "not found" can trip up newbies.  Like I
care ;-)

If you decide to toss [r]find anyway, I'd rather see "not found" be
spelled with an exception more specific than ValueError (who knows
what all "except ValueError:" is going to catch?  /Just/ that the
substring wasn't found?  Ya, that's something to bet your life on
;-)).
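The newbie trap Tim alludes to is that -1 is itself a valid index, so a failed find() can feed a slice without raising anything:

```python
s = "hello world"

# find() signals failure with -1, which slices happily accept:
head = s[:s.find(",")]          # no comma, so find() returns -1
assert head == "hello worl"     # last character silently dropped

# index() raises instead, so the failure cannot pass unnoticed:
try:
    head = s[:s.index(",")]
except ValueError:
    head = s                    # explicit handling of "not found"
assert head == "hello world"
```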

From jcarlson at uci.edu  Wed Aug 23 04:38:47 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 22 Aug 2006 19:38:47 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
Message-ID: <20060822191712.1A39.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> At today's sprint, one of the volunteers completed a patch to rip out
> find() and rfind(), replacing all calls with index()/rindex(). But now
> I'm getting cold feet -- is this really a good idea? (It's been listed
> in PEP 3100 for a long time, but I haven't thought about it much,
> really.)
> 
> What do people think?

I have code for Python 2.x that uses [r]find, but have been
transitioning some of it to use [r]partition instead (writing
implementations based on [r]find, but it could have just as easily used
[r]split). Ultimately I think that an unambiguous 'find without slicing'
is useful.

One of the issues with the -1 return on find failure is that it is
ambiguous, one must really check for a -1 return. Here's an API that is
non-ambiguous:
    x.search(y, start=0, stop=sys.maxint, count=sys.maxint)

Which will return a list of up to count non-overlapping examples of y in
x from start to stop.  On failure, it returns an empty list.  This
particular API is at least as powerful as the currently existing [r]find
one, is unambiguous, etc.  It also has a not accidental similarity to
x.split(y, count=sys.maxint), which has served Python for quite a while,
though this would differ in that rather than always returning a list of
at least 1, it could return an empty list.

Its functionality is somewhat mirrored by re.finditer, but the above
search function can be easily turned into rsearch, whereas re is
forward-only.
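A minimal sketch of the API Josiah describes, built on find() purely for illustration (the name and signature are his proposal, nothing that shipped; sys.maxsize stands in for 2.x's sys.maxint):

```python
import sys

def search(haystack, needle, start=0, stop=sys.maxsize, count=sys.maxsize):
    """Return up to `count` start offsets of non-overlapping occurrences
    of needle in haystack[start:stop]; an empty list means not found."""
    hits = []
    pos = start
    stop = min(stop, len(haystack))
    while len(hits) < count:
        i = haystack.find(needle, pos, stop)
        if i == -1:
            break
        hits.append(i)
        pos = i + max(len(needle), 1)   # non-overlapping; never stall
    return hits

assert search("abcabcabc", "abc") == [0, 3, 6]
assert search("aaaa", "aa") == [0, 2]            # non-overlapping
assert search("abc", "x") == []                  # unambiguous failure
assert search("abcabcabc", "abc", count=2) == [0, 3]
```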

If I were in a position to suggest a change, I would agree with Tim's
feeling that [r]index should go before [r]find, but I also think that 
[r]find could be made unambiguous; the above being an example of such,
but one that I'm not going to push for except as an example unambiguous
implementation.

 - Josiah


From jack at psynchronous.com  Wed Aug 23 06:41:48 2006
From: jack at psynchronous.com (Jack Diederich)
Date: Wed, 23 Aug 2006 00:41:48 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
Message-ID: <20060823044148.GR5772@performancedrivers.com>

On Tue, Aug 22, 2006 at 06:32:39PM -0700, Guido van Rossum wrote:
> At today's sprint, one of the volunteers completed a patch to rip out
> find() and rfind(), replacing all calls with index()/rindex(). But now
> I'm getting cold feet -- is this really a good idea? (It's been listed
> in PEP 3100 for a long time, but I haven't thought about it much,
> really.)
> 
> What do people think?

Looking at my own code I use find() in two cases

1) in an "if" clause where "in" or startswith() would be appropriate
   This code was written when I started with python and is closer to
   C++ or perl or was a literal translation of a snippet of C++ or perl

2) where try/except around index() would work just fine and partition
   would be even better.  eg/
   try:
     parts.append(text[text.index('himom')])
   except ValueError: pass

This is 50 uses of find/rfind in 70 KLOCs of Python.  Considering I would
be better off not using find() in the places I do use it, I would be happy
to see it go.

-Jack

From g.brandl at gmx.net  Wed Aug 23 08:45:00 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Wed, 23 Aug 2006 08:45:00 +0200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <1f7befae0608221847t55c09c57r64cd65511b51f6d4@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<1f7befae0608221847t55c09c57r64cd65511b51f6d4@mail.gmail.com>
Message-ID: <ecgthe$454$1@sea.gmane.org>

Tim Peters wrote:
> [Guido van Rossum]
>> At today's sprint, one of the volunteers completed a patch to rip out
>> find() and rfind(), replacing all calls with index()/rindex(). But now
>> I'm getting cold feet -- is this really a good idea? (It's been listed
>> in PEP 3100 for a long time, but I haven't thought about it much,
>> really.)
>>
>> What do people think?
> 
> I'd rather toss index/rindex myself, although I understand that
> [r]find's -1 return value for "not found" can trip up newbies.  Like I
> care ;-)

Perhaps a search() method, like Josiah proposed, makes sense.

> If you decide to toss [r]find anyway, I'd rather see "not found" be
> spelled with an exception more specific than ValueError (who knows
> what all "except ValueError:" is going to catch?  /Just/ that the
> substring wasn't found?  Ya, that's something to bet your life on
> ;-)).

Seriously, this is something I have thought of from time to time:
an exception's "source", so that you could say

try:
     x = int(some expression)
except ValueError from int:
     do something

Obviously, it's too much work to add such a thing though.
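No `except ... from ...` source-matching exists; the closest approximation in
today's Python is to re-raise with a distinguishable subclass (the
`IntParseError` / `parse_int` / `doubled_or_none` names here are invented for
illustration):

```python
class IntParseError(ValueError):
    """A ValueError known to have come from int() parsing."""

def parse_int(s):
    try:
        return int(s)
    except ValueError as e:
        raise IntParseError(str(e)) from e

def doubled_or_none(s):
    try:
        return parse_int(s) * 2
    except IntParseError:
        # Only int() parse failures land here; any other
        # ValueError raised in the try block propagates.
        return None
```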

Georg


From holmesbj.dev at gmail.com  Wed Aug 23 08:46:22 2006
From: holmesbj.dev at gmail.com (Brian Holmes)
Date: Tue, 22 Aug 2006 23:46:22 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060823044148.GR5772@performancedrivers.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
Message-ID: <e3c648160608222346g4587d55eiff521787ca4d915f@mail.gmail.com>

On 8/22/06, Jack Diederich <jack at psynchronous.com> wrote:
>
> On Tue, Aug 22, 2006 at 06:32:39PM -0700, Guido van Rossum wrote:
> > At today's sprint, one of the volunteers completed a patch to rip out
> > find() and rfind(), replacing all calls with index()/rindex(). But now
> > I'm getting cold feet -- is this really a good idea? (It's been listed
> > in PEP 3100 for a long time, but I haven't thought about it much,
> > really.)
> >
> > What do people think?
>
> Looking at my own code I use find() in two cases
>
> 1) in an "if" clause where "in" or startswith() would be appropriate
>    This code was written when I started with python and is closer to
>    C++ or perl or was a literal translation of a snippet of C++ or perl
>
> 2) where try/except around index() would work just fine and partition
>    would be even better.  eg/
>    try:
>      parts.append(text[text.index('himom')])
>    except ValueError: pass
>
> This is 50 uses of find/rfind in 70 KLOCs of python.  Considering I would
> be better off not using find() in the places I do use it I would be happy
> to see it go.
>
> -Jack

Even after reading Terry Reedy's arguments, I don't see why we need to
remove this option.  Let both exist.  I'd prefer grandfathering something
like this and leaving it in, even if it wouldn't be there had we known
everything from the start.

I just don't think it's worth causing people grief in porting to Py3k for
something so trivial.  I support fixing things in Py3k that are real
improvements, but this doesn't really seem like it's worth the trade-off.

- Brian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060822/4ba3633a/attachment.html 

From holmesbj.dev at gmail.com  Wed Aug 23 08:50:53 2006
From: holmesbj.dev at gmail.com (Brian Holmes)
Date: Tue, 22 Aug 2006 23:50:53 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060822191712.1A39.JCARLSON@uci.edu>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060822191712.1A39.JCARLSON@uci.edu>
Message-ID: <e3c648160608222350w5f01caebv368ff9adb10a8690@mail.gmail.com>

On 8/22/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
>
> "Guido van Rossum" <guido at python.org> wrote:
> > At today's sprint, one of the volunteers completed a patch to rip out
> > find() and rfind(), replacing all calls with index()/rindex(). But now
> > I'm getting cold feet -- is this really a good idea? (It's been listed
> > in PEP 3100 for a long time, but I haven't thought about it much,
> > really.)
> >
> > What do people think?


[snip]

One of the issues with the -1 return on find failure is that it is
> ambiguous, one must really check for a -1 return. Here's an API that is
> non-ambiguous:
>     x.search(y, start=0, stop=sys.maxint, count=sys.maxint)
>
> Which will return a list of up to count non-overlapping examples of y in
> x from start to stop.  On failure, it returns an empty list.  This
> particular API is at least as powerful as the currently existing [r]find
> one, is unambiguous, etc.  It also has a not accidental similarity to
> x.split(y, count=sys.maxint), which has served Python for quite a while,
> though this would differ in that rather than always returning a list of
> at least 1, it could return an empty list.
>
> Its functionality is somewhat mirrored by re.finditer, but the above
> search function can be easily turned into rsearch, whereas re is
> forward-only.


[snip]

- Josiah

+1

I think that would make a great addition to Py3k, or even 2.6.

- Brian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060822/051f8950/attachment.htm 

From greg.ewing at canterbury.ac.nz  Wed Aug 23 09:35:00 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 23 Aug 2006 19:35:00 +1200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060822191712.1A39.JCARLSON@uci.edu>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060822191712.1A39.JCARLSON@uci.edu>
Message-ID: <44EC0524.2060206@canterbury.ac.nz>

Josiah Carlson wrote:

> One of the issues with the -1 return on find failure is that it is
> ambiguous, one must really check for a -1 return. Here's an API that is
> non-ambiguous:

An alternative would be to return None for not found.
It wouldn't solve the problem of people using the
return value as a boolean, but at least you'd get
an exception if you tried to use the not-found value
as an index.

Or maybe it could return index values as a special
int subclass that always tests true even when it's
zero...
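That int subclass is easy to sketch (the `FoundIndex` / `find_true` names are
invented; `__bool__` is the Python 3 spelling of 2.x's `__nonzero__`):

```python
class FoundIndex(int):
    """An int that is always true, so a match at index 0 isn't
    mistaken for 'not found' in a boolean test."""
    def __bool__(self):
        return True

def find_true(text, sub):
    """find() variant: FoundIndex on success, None on failure."""
    i = text.find(sub)
    return FoundIndex(i) if i != -1 else None
```

With this, `if find_true(s, sub):` behaves correctly even when the match is at
index 0, and using the None "not found" value as an index raises immediately.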

--
Greg

From jjl at pobox.com  Wed Aug 23 13:04:56 2006
From: jjl at pobox.com (John J Lee)
Date: Wed, 23 Aug 2006 12:04:56 +0100 (GMT Standard Time)
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44EC0524.2060206@canterbury.ac.nz>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060822191712.1A39.JCARLSON@uci.edu>
	<44EC0524.2060206@canterbury.ac.nz>
Message-ID: <Pine.WNT.4.64.0608231204010.1204@shaolin>

On Wed, 23 Aug 2006, Greg Ewing wrote:

> Josiah Carlson wrote:
>
>> One of the issues with the -1 return on find failure is that it is
>> ambiguous, one must really check for a -1 return. Here's an API that is
>> non-ambiguous:
>
> An alternative would be to return None for not found.
> It wouldn't solve the problem of people using the
> return value as a boolean, but at least you'd get
> an exception if you tried to use the not-found value
> as an index.
>
> Or maybe it could return index values as a special
> int subclass that always tests true even when it's
> zero...

How about returning a str.NotFound object?


John

From ncoghlan at gmail.com  Wed Aug 23 13:43:12 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 23 Aug 2006 21:43:12 +1000
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
Message-ID: <44EC3F50.3040609@gmail.com>

Guido van Rossum wrote:
> At today's sprint, one of the volunteers completed a patch to rip out
> find() and rfind(), replacing all calls with index()/rindex(). But now
> I'm getting cold feet -- is this really a good idea? (It's been listed
> in PEP 3100 for a long time, but I haven't thought about it much,
> really.)
> 
> What do people think?
> 
I'd be more interested in a patch that replaced standard library uses of
find()/rfind() with either "if sub in string" or partition()/rpartition(). 
Replacing usage of find() for slicing purposes is one of the big reasons the 
latter methods were added, after all.

I also like Josiah's idea of replacing find() with a search() method that 
returned an iterator of indices, so that you can do:

for idx in string.search(sub):
    # Process the indices (if any)

Then you have 5 substring searching mechanisms for different use cases:

   sub in s          (simple containment test)
   s.index(sub)      (first index, exception if not found)
   s.search(sub)     (iterator of indices, empty if not found)
   s.partition(sep)  (split on first occurrence of substring)
   s.split(sep)      (split on all occurrences of substring)
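Four of those five already exist (search() is the proposal); with re.finditer
standing in for it, the toolbox looks like this:

```python
import re

s = "spam, eggs, spam"

assert "spam" in s                            # 1: containment test
assert s.index("spam") == 0                   # 2: first index (ValueError if absent)
# 3: stand-in for the proposed s.search(sub) -- indices of all matches
starts = [m.start() for m in re.finditer(re.escape("spam"), s)]
assert starts == [0, 12]
# 4: split on the first occurrence of the separator
assert s.partition(", ") == ("spam", ", ", "eggs, spam")
# 5: split on every occurrence
assert s.split(", ") == ["spam", "eggs", "spam"]
```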

Cheers,
Nick.


-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From barry at python.org  Wed Aug 23 14:37:08 2006
From: barry at python.org (Barry Warsaw)
Date: Wed, 23 Aug 2006 08:37:08 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060823044148.GR5772@performancedrivers.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
Message-ID: <0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>

I agree with Tim -- if we have to get rid of one of them, let's get  
rid of index/rindex and keep find/rfind.  Catching the exception is  
much less convenient than testing for -1.

-Barry


From guido at python.org  Wed Aug 23 16:20:54 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 23 Aug 2006 07:20:54 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
Message-ID: <ca471dc20608230720w20503b76n55c2e9ae7c95695b@mail.gmail.com>

On 8/23/06, Barry Warsaw <barry at python.org> wrote:
> I agree with Tim -- if we have to get rid of one of them, let's get
> rid of index/rindex and keep find/rfind.  Catching the exception is
> much less convenient than testing for -1.

But the -1 is very error-prone, as many have experienced. Also, many
uses of find() should be replaced by 'in' (long ago, 'in' only
accepted one-character strings on the left and find() was the best
alternative) or partition().
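A minimal before/after illustration of those two replacements:

```python
line = "key: value"

# Before: find() as a containment test -- error-prone, since -1 is truthy.
has_colon_old = line.find(":") != -1
# After: 'in' says the same thing directly.
assert (":" in line) == has_colon_old is True

# Before: find() used to locate a slice point.
i = line.find(":")
before, after = line[:i], line[i + 1:]
# After: partition() does the test and the slicing in one step.
head, sep, tail = line.partition(":")
assert (head, tail) == (before, after) == ("key", " value")
```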

To the folks asking for it to stay because it's harmless: in py3k I
want to rip out lots of "harmless" features to make the language smaller. A
smaller language is also a feature, and a very important one -- a
frequent complaint I hear is that over time the language has lost some
of its original smallness, which reduces some of the reasons why
people were attracted to it in the first place. (Also, removing
features makes room for new ones -- Bertrand Meyer, Eiffel's creator,
often asks users demanding a new feature to point out which feature
they are willing to drop to make room.)

I don't want Python to become like Emacs, which I still use, but
generally don't recommend to new developers any more... If you haven't
grown up with it, its current state is hard to understand and hard to
defend.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From steven.bethard at gmail.com  Wed Aug 23 16:31:55 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Wed, 23 Aug 2006 08:31:55 -0600
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608230720w20503b76n55c2e9ae7c95695b@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<ca471dc20608230720w20503b76n55c2e9ae7c95695b@mail.gmail.com>
Message-ID: <d11dcfba0608230731x8843da9j179d12cec70ffc76@mail.gmail.com>

On 8/23/06, Guido van Rossum <guido at python.org> wrote:
> On 8/23/06, Barry Warsaw <barry at python.org> wrote:
> > I agree with Tim -- if we have to get rid of one of them, let's get
> > rid of index/rindex and keep find/rfind.  Catching the exception is
> > much less convenient than testing for -1.
>
> But the -1 is very error-prone, as many have experienced. Also, many
> uses of find() should be replaced by 'in' (long ago, 'in' only
> accepted one-character strings on the left and find() was the best
> alternative) or partition().

FWLIW, I only started using Python at the tail end of 2.2, so the 'in'
started working with substrings pretty early for me.  I do a fair bit
of work with text (my research is in natural language processing) and
yet I have exactly zero instances of [r]find() in my code. So I at
least wouldn't miss them if they were gone.

STeVe
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From phd at mail2.phd.pp.ru  Wed Aug 23 16:44:33 2006
From: phd at mail2.phd.pp.ru (Oleg Broytmann)
Date: Wed, 23 Aug 2006 18:44:33 +0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608230720w20503b76n55c2e9ae7c95695b@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<ca471dc20608230720w20503b76n55c2e9ae7c95695b@mail.gmail.com>
Message-ID: <20060823144432.GA10709@phd.pp.ru>

On Wed, Aug 23, 2006 at 07:20:54AM -0700, Guido van Rossum wrote:
> in py3k I
> want to rip out lots of "harmless" to make the language smaller. A
> smaller language is also a feature, and a very important one -- a
> frequent complaint I hear is that over time the language has lost some
> of its original smallness, which reduces some of the reasons why
> people were attracted to it in the first place.

   IMHO find() is not a part of the language - it is a part of the standard
library. When people complain about the *language* they AFAIU mean "print >>",
[list comprehension], iterators, generators and (generator expressions),
@decorators, "with", "case"...

Oleg.
-- 
     Oleg Broytmann            http://phd.pp.ru/            phd at phd.pp.ru
           Programmers don't die, they just GOSUB without RETURN.

From guido at python.org  Wed Aug 23 17:18:03 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 23 Aug 2006 08:18:03 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060823144432.GA10709@phd.pp.ru>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<ca471dc20608230720w20503b76n55c2e9ae7c95695b@mail.gmail.com>
	<20060823144432.GA10709@phd.pp.ru>
Message-ID: <ca471dc20608230818x63624cd0uaf7a356f1e883593@mail.gmail.com>

That's too narrow a view on the language. Surely the built-in types
(especially those with direct compiler support, like literal
notations) are part of the language. The people who complain most
frequently about Python getting too big aren't language designers,
they are users (e.g. scientists) and to them it doesn't matter what
technically is or isn't in the language -- it's the complete set of
tools they have to deal with. That doesn't include all of the standard
library, but it surely includes the built-in types and their behavior!
Otherwise the int/long and str/unicode unifications wouldn't be
language changes either...

-Guido

On 8/23/06, Oleg Broytmann <phd at oper.phd.pp.ru> wrote:
> On Wed, Aug 23, 2006 at 07:20:54AM -0700, Guido van Rossum wrote:
> > in py3k I
> > want to rip out lots of "harmless" to make the language smaller. A
> > smaller language is also a feature, and a very important one -- a
> > frequent complaint I hear is that over time the language has lost some
> > of its original smallness, which reduces some of the reasons why
> > people were attracted to it in the first place.
>
>    IMHO find() is not a part of the language - it is a part of the standard
> library. When people complain about the *language* they AFAIU mean "print >>",
> [list comprehension], iterators, generators and (generator expressions),
> @decorators, "with", "case"...
>
> Oleg.
> --
>      Oleg Broytmann            http://phd.pp.ru/            phd at phd.pp.ru
>            Programmers don't die, they just GOSUB without RETURN.


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From phd at mail2.phd.pp.ru  Wed Aug 23 17:28:15 2006
From: phd at mail2.phd.pp.ru (Oleg Broytmann)
Date: Wed, 23 Aug 2006 19:28:15 +0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608230818x63624cd0uaf7a356f1e883593@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<ca471dc20608230720w20503b76n55c2e9ae7c95695b@mail.gmail.com>
	<20060823144432.GA10709@phd.pp.ru>
	<ca471dc20608230818x63624cd0uaf7a356f1e883593@mail.gmail.com>
Message-ID: <20060823152815.GA17442@phd.pp.ru>

On Wed, Aug 23, 2006 at 08:18:03AM -0700, Guido van Rossum wrote:
> That's too narrow a view on the language.

   I narrowed it on purpose for this discussion.

> Surely the built-in types
> (especially those with direct compiler support, like literal
> notations) are part of the language.

   And still I believe they are two different markets, and you cannot trade
features between them. I am sure it would be hard to buy space for new
language (in that narrow sense) features by removing methods from the
standard types.

Oleg.
-- 
     Oleg Broytmann            http://phd.pp.ru/            phd at phd.pp.ru
           Programmers don't die, they just GOSUB without RETURN.

From barry at python.org  Wed Aug 23 17:52:35 2006
From: barry at python.org (Barry Warsaw)
Date: Wed, 23 Aug 2006 11:52:35 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608230818x63624cd0uaf7a356f1e883593@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<ca471dc20608230720w20503b76n55c2e9ae7c95695b@mail.gmail.com>
	<20060823144432.GA10709@phd.pp.ru>
	<ca471dc20608230818x63624cd0uaf7a356f1e883593@mail.gmail.com>
Message-ID: <13DEBA81-AE71-4E2C-BD5C-AC152747BFF2@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Wed, 23 Aug 2006 08:18:03 -0700
"Guido van Rossum" <guido at python.org> wrote:

> That's too narrow a view on the language. Surely the built-in types
> (especially those with direct compiler support, like literal
> notations) are part of the language. The people who complain most
> frequently about Python getting too big aren't language designers,
> they are users (e.g. scientists) and to them it doesn't matter what
> technically is or isn't in the language -- it's the complete set of
> tools they have to deal with. That doesn't include all of the standard
> library, but it surely includes the built-in types and their behavior!
> Otherwise the int/long and str/unicode unifications wouldn't be
> language changes either...

Oleg has a point though.  Speaking generally, the perception of
"bigness" comes down to how much you can -- and /have/ to -- keep in
your head at one time while programming or reading code.  Python's
traditionally made excellent choices here.  The language is small
enough to keep in your head but the library is huge.  I don't know
about anybody else, but my aging brain can't keep much of the library
in its RAM so I'm highly dependent on help() and the library reference
manual to find things when I need them.

But I almost never have to look up a particular language feature, and
this was one of the primary reasons I switch from Perl to Python over a
decade ago.  To me, Python's growth with the last few releases is felt
more deeply with language features than with library improvements.
Features like list comprehensions, generators and generator
expressions, and decorators have all become ingrained; while they
originally felt "big", they are now common tools I reach for and
intuitively understand.  Some of the 2.5 features such as 'with',
relative imports, and conditional expressions haven't reached that
level of comfort yet and make Python feel "big" to me again.

There are some counterexamples: built-in sets, while making a library
feature a built-in type, makes Python feel a bit smaller because sets
are such a natural concept and code using them looks cleaner.  For
Python 3000, integrating ints and longs will definitely do this, as
will (I suspect) making all strings unicode with a (probably rarely
used) byte type.

So the question is where string methods like index and find fall.  To
me, they don't feel like language features. Built-in types fall
somewhere in between language features and library.  Their /presence/
is a language feature but what you can do with them seems more
library-ish to me.  For me, the reason is that I can easily keep in my
head that I have strings to represent text, ints, longs, etc. to
represent numbers, sets, dicts, lists, and tuples to represent
collections, etc.  But I may not remember exactly how to use str.find()
or dict.setdefault() because I use them more rarely (which doesn't
mean they're unimportant!). I know they're there and I vaguely remember
how to use them, so when I need them, it's off to the library reference
or help() for a quick refresher.

This suggests to me that a guiding principle ought to be reducing
language features without losing important functionality, just as the
int/long, str/unicode, all-newstyle classes work is doing.  Here you're
trying to polish the conceptual edges off the language, compound-W'ing
the language warts, and generally streamlining the language so it can
more easily fit in your head.  Where it comes to the library, I think we
ought to concentrate on reducing duplication.  TOOWTDI.  Get rid of the
User* modules.  If I need to do web-stuff, do I need urllib, urllib2,
urlparse, or what? etc.

As for the built-in types, let's reduce duplication here too, so if
there's a better way of e.g. doing what find, rfind, index, and rindex
do, then let's remove them and encourage the other uses.
dict.has_key() is a perfect example here.  'in' replaces many
of the use cases for str.find and friends, but not all.  Maybe
str.partition completes the picture, though I don't have enough
experience with them to know.

Anyway, enough blathering.  Those are my thoughts.  For this
specific case, maybe we really don't need any of [r]find() and [r]index(),
but if the choice comes down to one or the other, I still find catching
the exception less convenient than checking a return value.

- -Barry
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (Darwin)

iQCVAwUBROx5w3EjvBPtnXfVAQI03QP/X9KyJabidsid1Vu01PWQZ0Op2ZvoMWyg
b9VQrS94auA/AQD9zg6SoBQaPIIGLAWg6Oh4FjkiuuCwhsb96YHjGdiSE510VfjW
R6qXg9beWTaafJVtzkjCLn0Gu+H5R9EdWnLGvwdVvF2ASPwfrZ2N0G6k/daQlCNk
3G5ucal/Jug=
=vwWM
-----END PGP SIGNATURE-----

From steven.bethard at gmail.com  Wed Aug 23 18:05:11 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Wed, 23 Aug 2006 10:05:11 -0600
Subject: [Python-3000] DictMixin (WAS: Droping find/rfind?)
Message-ID: <d11dcfba0608230905k72822c05w239175ad319a811b@mail.gmail.com>

On 8/23/06, Barry Warsaw <barry at python.org> wrote:
> Where it comes to the library, I think we ought to concentrate on
> reducing duplication.  TOOWTDI.  Get rid of the User* modules.

Generally a good idea, but we still need somewhere to put DictMixin.
It's too bad you can't just use the unbound methods like::

    dict.update(dict-like-object, *args, **kwargs)

or we could drop DictMixin entirely.

Steve
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From jimjjewett at gmail.com  Wed Aug 23 19:08:57 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Wed, 23 Aug 2006 13:08:57 -0400
Subject: [Python-3000] DictMixin (WAS: Droping find/rfind?)
In-Reply-To: <d11dcfba0608230905k72822c05w239175ad319a811b@mail.gmail.com>
References: <d11dcfba0608230905k72822c05w239175ad319a811b@mail.gmail.com>
Message-ID: <fb6fbf560608231008s795ee95ax5f1d128e1653098e@mail.gmail.com>

On 8/23/06, Barry Warsaw <barry at python.org> wrote:
> Where it comes to the library, I think we ought to concentrate on
> reducing duplication.  TOOWTDI.  Get rid of the User* modules.

Until it is possible to inherit from multiple extension types, there
will be a need to mimic inheritance with delegation; User* provides a
useful pattern.

-jJ

From jjl at pobox.com  Wed Aug 23 19:47:14 2006
From: jjl at pobox.com (John J Lee)
Date: Wed, 23 Aug 2006 18:47:14 +0100 (GMT Standard Time)
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <Pine.WNT.4.64.0608231204010.1204@shaolin>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060822191712.1A39.JCARLSON@uci.edu>
	<44EC0524.2060206@canterbury.ac.nz>
	<Pine.WNT.4.64.0608231204010.1204@shaolin>
Message-ID: <Pine.WNT.4.64.0608231845570.2916@shaolin>

On Wed, 23 Aug 2006, John J Lee wrote:
[...]
>> An alternative would be to return None for not found.
>> It wouldn't solve the problem of people using the
>> return value as a boolean, but at least you'd get
>> an exception if you tried to use the not-found value
>> as an index.
>>
>> Or maybe it could return index values as a special
>> int subclass that always tests true even when it's
>> zero...
>
> How about returning a str.NotFound object?

Whoops, scratch that, doesn't solve anything more than returning None.


John

From steven.bethard at gmail.com  Wed Aug 23 20:29:26 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Wed, 23 Aug 2006 12:29:26 -0600
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
Message-ID: <d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>

On 8/23/06, Barry Warsaw <barry at python.org> wrote:
> I agree with Tim -- if we have to get rid of one of them, let's get
> rid of index/rindex and keep find/rfind.  Catching the exception is
> much less convenient than testing for -1.

Could you post a simple example or two?  I keep imagining things like::

    index = text.index(...)
    if 0 <= index:
        ... do something with index ...
    else:
        ...

which looks about the same as::

    try:
        index = text.index(...)
        ... do something with index ...
    except ValueError:
        ...

Is it just that a lot of the else clauses are empty?

STeVe
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From jcarlson at uci.edu  Wed Aug 23 20:52:54 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 11:52:54 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>
References: <0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>
Message-ID: <20060823114629.1A57.JCARLSON@uci.edu>


"Steven Bethard" <steven.bethard at gmail.com> wrote:
> 
> On 8/23/06, Barry Warsaw <barry at python.org> wrote:
> > I agree with Tim -- if we have to get rid of one of them, let's get
> > rid of index/rindex and keep find/rfind.  Catching the exception is
> > much less convenient than testing for -1.
> 
> Could you post a simple example or two?  I keep imagining things like::
> 
>     index = text.index(...)
>     if 0 <= index:
>         ... do something with index ...
>     else:
>         ...

A more-often-used style is...

    index = text.find(...)
    if index >= 0:
        ...

Compare this with the use of index:

    try:
        index = text.index(...)
    except ValueError:
        pass
    else:
        ...


or even

    index = 0
    while 1:
        index = text.find(..., index)
        if index == -1:
            break
        ...


compared with

    index = 0
    while 1:
        try:
            index = text.index(..., index)
        except ValueError:
            break
        ...

>     try:
>         index = text.index(...)
>         ... do something with index ...
>     except ValueError:
>         ...

In these not uncommon cases, the use of str.index and having to catch
ValueError is cumbersome (in terms of typing, indentation, etc.), and is
about as susceptible to bugs as str.find, which you have shown by
putting "... do something with index ..." in the try clause, rather than
the else clause.
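Filled in as concrete, runnable loops (the collect-every-index body and the
one-character step to allow overlaps are my assumptions; the original
snippets elide them with "..."), the comparison above becomes:

```python
def all_indices_find(text, sub):
    """Collect every match index with find(), as in the first loop."""
    indices = []
    index = 0
    while 1:
        index = text.find(sub, index)
        if index == -1:
            break
        indices.append(index)
        index += 1  # advance one char, allowing overlapping matches
    return indices

def all_indices_index(text, sub):
    """The same loop written with index() and exception handling."""
    indices = []
    index = 0
    while 1:
        try:
            index = text.index(sub, index)
        except ValueError:
            break
        indices.append(index)
        index += 1
    return indices
```

Both return the same result; the index() version just spends four lines and
an extra indentation level on detecting the end of the matches.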

 - Josiah


From steven.bethard at gmail.com  Wed Aug 23 21:07:49 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Wed, 23 Aug 2006 13:07:49 -0600
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060823114629.1A57.JCARLSON@uci.edu>
References: <0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>
	<20060823114629.1A57.JCARLSON@uci.edu>
Message-ID: <d11dcfba0608231207o43fd1237i19520b57c85673e1@mail.gmail.com>

Steven Bethard wrote:
> Could you post a simple example or two?

Josiah Carlson wrote:
>     index = text.find(...)
>     if index >= 0:
>         ...
>
[snip]
>     index = 0
>     while 1:
>         index = text.find(..., index)
>         if index == -1:
>             break
>         ...
>

Thanks.  So with your search() function, these would be something like:

    indices = text.search(pattern, count=1)
    if indices:
        index, = indices
        ...

and

    for index in text.search(pattern):
        ...

if I understood the proposal right.

Steve
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From paul at prescod.net  Wed Aug 23 21:12:31 2006
From: paul at prescod.net (Paul Prescod)
Date: Wed, 23 Aug 2006 12:12:31 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>
Message-ID: <1cb725390608231212x1fbd3492jb4e9e3f0fcccee77@mail.gmail.com>

Just throwing it out but what about something like:

found, index = text.index("abc")

if found:
   doSomething(index)

If you were confident that the substring was in there, you would do
something more like this:

something = text[text.index("abc")[1]:]

(although there are clearer ways to do that)
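As a sketch, the tuple-returning behaviour can be prototyped today with an ordinary helper (find_pair is an invented name; no such str method exists):

```python
def find_pair(text, sub):
    """Hypothetical helper mimicking the proposed tuple-returning
    index(): (found, index) instead of an exception or a bare -1."""
    i = text.find(sub)
    return (i >= 0, i)

found, index = find_pair("spam abc eggs", "abc")
if found:
    print(index)  # position of "abc"
```

A miss yields (False, -1), so the found flag, rather than the index value, carries the error signal.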

On 8/23/06, Steven Bethard <steven.bethard at gmail.com> wrote:
>
> On 8/23/06, Barry Warsaw <barry at python.org> wrote:
> > I agree with Tim -- if we have to get rid of one of them, let's get
> > rid of index/rindex and keep find/rfind.  Catching the exception is
> > much less convenient than testing for -1.
>
> Could you post a simple example or two?  I keep imagining things like::
>
>     index = text.index(...)
>     if 0 <= index:
>         ... do something with index ...
>     else:
>         ...
>
> which looks about the same as::
>
>     try:
>         index = text.index(...)
>         ... do something with index ...
>     except ValueError:
>         ...
>
> Is it just that a lot of the else clauses are empty?
>
> STeVe
> --
> I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
> tiny blip on the distant coast of sanity.
>         --- Bucky Katt, Get Fuzzy
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe:
> http://mail.python.org/mailman/options/python-3000/paul%40prescod.net
>

From g.brandl at gmx.net  Wed Aug 23 21:36:12 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Wed, 23 Aug 2006 21:36:12 +0200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <d11dcfba0608231207o43fd1237i19520b57c85673e1@mail.gmail.com>
References: <0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>	<d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>	<20060823114629.1A57.JCARLSON@uci.edu>
	<d11dcfba0608231207o43fd1237i19520b57c85673e1@mail.gmail.com>
Message-ID: <ecianc$526$1@sea.gmane.org>

Steven Bethard wrote:
> Steven Bethard wrote:
>> Could you post a simple example or two?
> 
> Josiah Carlson wrote:
>>     index = text.find(...)
>>     if index >= 0:
>>         ...
>>
> [snip]
>>     index = 0
>>     while 1:
>>         index = text.find(..., index)
>>         if index == -1:
>>             break
>>         ...
>>
> 
> Thanks.  So with your search() function, these would be something like:
> 
>     indices = text.search(pattern, count=1)
>     if indices:
>         index, = indices
>         ...

Or even

indices = text.search(pattern, count=1)
for index in indices:
     ...

Georg


From g.brandl at gmx.net  Wed Aug 23 21:36:50 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Wed, 23 Aug 2006 21:36:50 +0200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <1cb725390608231212x1fbd3492jb4e9e3f0fcccee77@mail.gmail.com>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>	<20060823044148.GR5772@performancedrivers.com>	<0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>	<d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>
	<1cb725390608231212x1fbd3492jb4e9e3f0fcccee77@mail.gmail.com>
Message-ID: <eciaoj$526$2@sea.gmane.org>

Paul Prescod wrote:
> Just throwing it out but what about something like:
> 
> found, index = text.index("abc")
> 
> if found:
>    doSomething(index)

-1. str.index()'s semantics should not be different from list.index().

Georg


From jcarlson at uci.edu  Wed Aug 23 21:56:21 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 12:56:21 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <d11dcfba0608231207o43fd1237i19520b57c85673e1@mail.gmail.com>
References: <20060823114629.1A57.JCARLSON@uci.edu>
	<d11dcfba0608231207o43fd1237i19520b57c85673e1@mail.gmail.com>
Message-ID: <20060823123719.1A5D.JCARLSON@uci.edu>


"Steven Bethard" <steven.bethard at gmail.com> wrote:
> Steven Bethard wrote:
> > Could you post a simple example or two?
> 
> Josiah Carlson wrote:
> >     index = text.find(...)
> >     if index >= 0:
> >         ...
> >
> [snip]
> >     index = 0
> >     while 1:
> >         index = text.find(..., index)
> >         if index == -1:
> >             break
> >         ...
> 
> Thanks.  So with your search() function, these would be something like:
> 
>     indices = text.search(pattern, count=1)
>     if indices:
>         index, = indices
>         ...
> 
> and
> 
>     for index in text.search(pattern):
>         ...
> 
> if I understood the proposal right.

Yes, you understood my (strawman) proposal correctly.  The former could
even be shortened to:

    for index in text.search(pattern, count=1):
        ...

... if there wasn't an else clause in the original search.  Note that my
point in the proposing of search was to say:
1. [r]index is cumbersome
2. [r]find can be error-prone for newbies due to the -1 return
3. the functionality seems to be useful (otherwise neither would exist)
4. let us unambiguate [r]find if possible, because it is the better of
the two (in my opinion)
5. or instead of 4, replace both of them with search

People seem to like the #5 option, even though it was not my intent by
posting search originally.  Given that some people like it, I'm now of
the opinion that if [r]find is going, then certainly [r]index should go
because it suffers from being more cumbersome to use and has a similar
class of bugs, and if both go, then we should have something to replace
them.  As a replacement, search lacks the exception annoyance of index,
has an unambiguous return value, and naturally supports iterative find
calls.

Given search as a potential replacement, about the only question is
whether count should default to sys.maxint or 1.  The original
description included count=sys.maxint, but if we want to use it as a
somewhat drop-in replacement for find and index, then it would make more
sense for it to have count=1 as a default, with some easy to access
count argument to make it find all of them.
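A minimal sketch of such a search(), assuming it is a generator built on find (the signature is a guess at the strawman proposal; sys.maxsize stands in for the sys.maxint mentioned above):

```python
import sys

def search(text, pattern, start=0, count=sys.maxsize):
    """Sketch of the proposed str.search(): yield up to `count`
    indices where `pattern` occurs in `text`, resuming one position
    past each hit."""
    yielded = 0
    while yielded < count:
        i = text.find(pattern, start)
        if i == -1:
            return
        yield i
        start = i + 1
        yielded += 1
```

With count=1 it covers the single-hit idioms above, and with the default it is exactly the iterative-find loop.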


 - Josiah


From jcarlson at uci.edu  Wed Aug 23 22:22:42 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 13:22:42 -0700
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <ca471dc20608211736h5f8903cctc92c60c5bd6e538e@mail.gmail.com>
References: <20060821081944.1A0F.JCARLSON@uci.edu>
	<ca471dc20608211736h5f8903cctc92c60c5bd6e538e@mail.gmail.com>
Message-ID: <20060823125951.1A60.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:

> And yet offense is taken. Have you watched the video of my Py3k talk?
> Search for it on Google Video.

I spent some time yesterday and watched it.  All I was proposing is
that, as with Perl 5 and 6, users of Python 2.x may not feel an
overwhelming desire to move to Python 3.x, because there will be so many
incompatibilities.  I understand that the point of Python 3.x is to
allow for a one-time (at least for now) breakage of the backwards
compatibility of the language to get rid of the crap; "Backwards
incompatible changes are allowed in Python 3000, but not to excess."
While each individual change to the language is relatively minor by
itself, putting them all together is effectively one big backwards
incompatible change.

Take the standard library reorganization, for example.  I am 100% in
favor of reorganizing it, but if it is all moved at once, then people
can't write code for the future until it arrives.  But if we were to
create a mapping of new names -> old names, then an import hook could be
written, and people could start using the new package names in 2.6.
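As a rough sketch of the idea (the new name textutils is invented purely for illustration, and a real hook would resolve names lazily via sys.meta_path rather than eagerly like this):

```python
import importlib
import sys

# Invented new-name -> current-name mapping; a real table would come
# from the actual reorganization proposal.
_RENAMED = {"textutils": "string"}

def install_aliases():
    """Register each old module in sys.modules under its new name,
    so `import newname` resolves to the existing module."""
    for new_name, old_name in _RENAMED.items():
        sys.modules[new_name] = importlib.import_module(old_name)

install_aliases()
import textutils  # actually the stdlib `string` module
```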

The intent of my post was to say that all of us want Py3k to succeed,
but I believe that in order for it to succeed, breakage from the 2.x
series should be gradual, in a similar way to how 2.x -> 2.x+1 breakage
has been gradual.  I believe we agree on this basic point except for one
thing: according to your talk and your posts here, you want a Py3k alpha
in the next year or two, while I'm thinking that the Py3k alpha should
come somewhere after 2.6 and probably 2.7, maybe even after 2.8 or 2.9,
depending on how quickly the 2.x series is transitioned.  Having Py3k
in development really just makes maintenance (bug fixing, etc.) more of
a burden.


> Perhaps you want to help write the transition PEP?

I'll see what I can hack up next week (I have an advancement talk
tomorrow that I really should be preparing for).

 - Josiah


From bjourne at gmail.com  Wed Aug 23 23:01:23 2006
From: bjourne at gmail.com (=?ISO-8859-1?Q?BJ=F6rn_Lindqvist?=)
Date: Wed, 23 Aug 2006 23:01:23 +0200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060823114629.1A57.JCARLSON@uci.edu>
References: <0C9E2042-0040-4123-BA6D-2780FD49F194@python.org>
	<d11dcfba0608231129m7320051cs8bb0f4c5715c7068@mail.gmail.com>
	<20060823114629.1A57.JCARLSON@uci.edu>
Message-ID: <740c3aec0608231401q18ca271o72157213855e7e17@mail.gmail.com>

On 8/23/06, Josiah Carlson <jcarlson at uci.edu> wrote:

> or even
>
>     index = 0
>     while 1:
>         index = text.find(..., index)
>         if index == -1:
>             break
>         ...
> compared with
>
>     index = 0
>     while 1:
>         try:
>             index = text.index(..., index)
>         except ValueError:
>             break
>         ...

You are supposed to use the in operator:

index = 0
while 1:
    if not "something" in text[index:]:
        break

IMHO, removing find() is good because index() does the same job
without violating the Samurai Principle
(http://c2.com/cgi/wiki?SamuraiPrinciple). It would be interesting to
see the patch that replaced find() with index(), did it really make
the code more cumbersome?

-- 
mvh Björn

From guido at python.org  Wed Aug 23 23:18:59 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 23 Aug 2006 14:18:59 -0700
Subject: [Python-3000] find -> index patch
Message-ID: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>

Here's the patch (by Hasan Diwan, BTW) for people's perusal. It just
gets rid of all *uses* of find/rfind from Lib; it doesn't actually
modify stringobject.c or unicodeobject.c. It doesn't use
[r]partition()'; someone could look for opportunities to use that
separately.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)
-------------- next part --------------
A non-text attachment was scrubbed...
Name: rfind2rindex_find2index.pat
Type: application/octet-stream
Size: 80147 bytes
Desc: not available
Url : http://mail.python.org/pipermail/python-3000/attachments/20060823/a8f58010/attachment-0001.obj 

From jack at psynchronous.com  Wed Aug 23 23:39:25 2006
From: jack at psynchronous.com (Jack Diederich)
Date: Wed, 23 Aug 2006 17:39:25 -0400
Subject: [Python-3000] find -> index patch
In-Reply-To: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
Message-ID: <20060823213924.GS5772@performancedrivers.com>

On Wed, Aug 23, 2006 at 02:18:59PM -0700, Guido van Rossum wrote:
> Here's the patch (by Hasan Diwan, BTW) for people's perusal. It just
> gets rid of all *uses* of find/rfind from Lib; it doesn't actually
> modify stringobject.c or unicodeobject.c. It doesn't use
> [r]partition()'; someone could look for opportunities to use that
> separately.
> 

Is this a machine-generated patch?  Replacing all calls to find with
  try: i = text.index(sep)
  except: i = -1
has a Yuck factor of -1000.  Some of the excepts specify ValueError,
but still.

-Jack

From jcarlson at uci.edu  Wed Aug 23 23:48:40 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 14:48:40 -0700
Subject: [Python-3000] find -> index patch
In-Reply-To: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
Message-ID: <20060823143606.1A66.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> Here's the patch (by Hasan Diwan, BTW) for people's perusal. It just
> gets rid of all *uses* of find/rfind from Lib; it doesn't actually
> modify stringobject.c or unicodeobject.c. It doesn't use
> [r]partition()'; someone could look for opportunities to use that
> separately.
> 
> -- 
> --Guido van Rossum (home page: http://www.python.org/~guido/)

There's a bug in the Lib/idlelib/configHandler.py patch, likely 6
unintended bugs exposed in Lib/idlelib/PyParse.py (which are made worse
by the patch), Lib/idlelib/CallTips.py is broken, 4 examples in
Lib/ihooks.py don't require the try/except clause (each is prefixed with
a containment test), Lib/cookielib.py has two new bugs, ...

I stopped at Lib/string.py

Also, there are inconsistent uses of bare except and except ValueError
clauses.

The patch shouldn't be applied, for many reasons, not the least of which
is that it breaks currently working code; it offers poorly-styled code
of the form:
    try:... = str.index(...)
    except:...=-1

...that looks to have been done by a script; it has inconsistent style
compared to the code it replaces; etc.

 - Josiah


From jcarlson at uci.edu  Wed Aug 23 23:53:05 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 14:53:05 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <740c3aec0608231401q18ca271o72157213855e7e17@mail.gmail.com>
References: <20060823114629.1A57.JCARLSON@uci.edu>
	<740c3aec0608231401q18ca271o72157213855e7e17@mail.gmail.com>
Message-ID: <20060823143116.1A63.JCARLSON@uci.edu>


"Björn Lindqvist" <bjourne at gmail.com> wrote:
> 
> On 8/23/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> 
> > or even
> >
> >     index = 0
> >     while 1:
> >         index = text.find(..., index)
> >         if index == -1:
> >             break
> >         ...
> > compared with
> >
> >     index = 0
> >     while 1:
> >         try:
> >             index = text.index(..., index)
> >         except ValueError:
> >             break
> >         ...
> 
> You are supposed to use the in operator:
> 
> index = 0
> while 1:
>     if not "something" in text[index:]:
>         break

This can also lead to O(n^2) running time, causes unnecessary string
allocation, memory copies, etc.  If I saw that in real code, I'd
probably lose respect for the author of that module and/or package.
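The quadratic cost above comes from text[index:] copying the remaining tail on every pass; find's start argument scans the same string with no copies at all. A sketch (the helper name is invented):

```python
def count_occurrences(text, sub):
    """Iterative scan using find's start offset: no text[index:]
    slices, hence no per-iteration tail copies."""
    count = 0
    start = 0
    while True:
        i = text.find(sub, start)
        if i == -1:
            return count
        count += 1
        start = i + len(sub)  # resume past the match: no overlap
```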


> IMHO, removing find() is good because index() does the same job
> without violating the Samurai Principle
> (http://c2.com/cgi/wiki?SamuraiPrinciple). It would be interesting to
> see the patch that replaced find() with index(), did it really make
> the code more cumbersome?

Every test of the result of str.find(...) needs to be replaced with a
try/except clause.  That's a cumbersome translation if there ever was
one.

 - Josiah


From hasan.diwan at gmail.com  Thu Aug 24 00:09:35 2006
From: hasan.diwan at gmail.com (Hasan Diwan)
Date: Wed, 23 Aug 2006 15:09:35 -0700
Subject: [Python-3000] find -> index patch
In-Reply-To: <20060823143606.1A66.JCARLSON@uci.edu>
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
	<20060823143606.1A66.JCARLSON@uci.edu>
Message-ID: <2cda2fc90608231509n7dc5a47bg5adfd2b790e29681@mail.gmail.com>

On 23/08/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> ...that looks to have been done by a script, it has inconsistent style
> compared to the code it replaces, etc.
>

I made the minimal change that implements the suggested functionality;
in terms of find/rfind, they return -1. The least painful way to replace
that with index is:

try:
    i=str.index(foo)
except ValueError:
    i = -1

As for the plain except clauses, that was just laziness on my part. It's
not meant to be stylistically consistent or beautiful; rather, it is
meant to be functional and a starting point. Feel free to change/rewrite
the patch. The GENERAL CASE, i.e. the one applicable throughout the
code, is the try/except clause shown above.
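That general case could equally be factored into one small shim instead of repeating the try/except at every call site; a sketch (the helper name is invented):

```python
def index_or_minus_one(text, sub):
    """Hypothetical shim giving str.index the old str.find
    contract: -1 on a miss instead of ValueError."""
    try:
        return text.index(sub)
    except ValueError:
        return -1
```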
-- 
Cheers,
Hasan Diwan <hasan.diwan at gmail.com>

From g.brandl at gmx.net  Thu Aug 24 00:52:16 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 24 Aug 2006 00:52:16 +0200
Subject: [Python-3000] find -> index patch
In-Reply-To: <20060823143606.1A66.JCARLSON@uci.edu>
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
	<20060823143606.1A66.JCARLSON@uci.edu>
Message-ID: <ecim70$aia$1@sea.gmane.org>

Josiah Carlson wrote:
> "Guido van Rossum" <guido at python.org> wrote:
>> Here's the patch (by Hasan Diwan, BTW) for people's perusal. It just
>> gets rid of all *uses* of find/rfind from Lib; it doesn't actually
>> modify stringobject.c or unicodeobject.c. It doesn't use
>> [r]partition()'; someone could look for opportunities to use that
>> separately.
>> 
>> -- 
>> --Guido van Rossum (home page: http://www.python.org/~guido/)
> 
> There's a bug in the Lib/idlelib/configHandler.py patch, likely 6
> unintend bugs exposed in Lib/idlelib/PyParse.py (which are made worse by
> the patch),

Are the bugs there in the current code too? If so, you should report them.

> Lib/idlelib/CallTips.py is broken, 4 examples in
> Lib/ihooks.py don't require the try/except clause (it is prefixed with a
> containment test), Lib/cookielib.py has two new bugs, ...
> 
> I stopped at Lib/string.py
> 
> > Also, there are inconsistent uses of bare except and except ValueError
> clauses.

Not speaking of the inconsistent use of spaces vs. tabs ;)

Another newly-introduced bug:

-                p = str.rfind('\n', 0, p-1) + 1
+                try:p = str.rindex('\n', 0, p-1) + 1
+		except:p=-1

Georg


From jcarlson at uci.edu  Thu Aug 24 01:30:37 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 16:30:37 -0700
Subject: [Python-3000] find -> index patch
In-Reply-To: <ecim70$aia$1@sea.gmane.org>
References: <20060823143606.1A66.JCARLSON@uci.edu> <ecim70$aia$1@sea.gmane.org>
Message-ID: <20060823162756.1A6C.JCARLSON@uci.edu>


Georg Brandl <g.brandl at gmx.net> wrote:
> 
> Josiah Carlson wrote:
> > "Guido van Rossum" <guido at python.org> wrote:
> >> Here's the patch (by Hasan Diwan, BTW) for people's perusal. It just
> >> gets rid of all *uses* of find/rfind from Lib; it doesn't actually
> >> modify stringobject.c or unicodeobject.c. It doesn't use
> >> [r]partition()'; someone could look for opportunities to use that
> >> separately.
> >> 
> >> -- 
> >> --Guido van Rossum (home page: http://www.python.org/~guido/)
> > 
> > There's a bug in the Lib/idlelib/configHandler.py patch, likely 6
> > unintend bugs exposed in Lib/idlelib/PyParse.py (which are made worse by
> > the patch),
> 
> Are the bugs there in current code too? You should then report them.

Maybe, maybe not.  I'll have to look (but not today).

> > Lib/idlelib/CallTips.py is broken, 4 examples in
> > Lib/ihooks.py don't require the try/except clause (it is prefixed with a
> > containment test), Lib/cookielib.py has two new bugs, ...
> > 
> > I stopped at Lib/string.py
> > 
> > > Also, there are inconsistent uses of bare except and except ValueError
> > clauses.
> 
> Not speaking of the inconsistent use of spaces vs. tabs ;)
> 
> Another newly-introduced bug:
> 
> -                p = str.rfind('\n', 0, p-1) + 1
> +                try:p = str.rindex('\n', 0, p-1) + 1
> +		except:p=-1

That was the "likely 6 unintended bugs in Lib/idlelib/PyParse.py".

 - Josiah


From jcarlson at uci.edu  Thu Aug 24 01:39:03 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 16:39:03 -0700
Subject: [Python-3000] find -> index patch
In-Reply-To: <2cda2fc90608231509n7dc5a47bg5adfd2b790e29681@mail.gmail.com>
References: <20060823143606.1A66.JCARLSON@uci.edu>
	<2cda2fc90608231509n7dc5a47bg5adfd2b790e29681@mail.gmail.com>
Message-ID: <20060823163043.1A6F.JCARLSON@uci.edu>


"Hasan Diwan" <hasan.diwan at gmail.com> wrote:
> On 23/08/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> >
> > > ...that looks to have been done by a script, it has inconsistent style
> > compared to the code it replaces, etc.
> >
> 
> I made the minimal change that implements the functionality suggested, in
> terms of find/rfind, they return -1. The least painful way to replace it
> with index is:
> 
> try:
>     i=str.index(foo)
> except ValueError:
>     i = -1
> 
> As for the plain except clauses, that was just laziness on my part. It's not
> meant to be stylistically consistent or beautiful, rather it is meant to be
> functional and as a starting point. Feel free to change/rewrite the patch.
> The GENERAL CASE, i.e. one that is applicable throughout the code is the
> try/except clauses shown above.

If find is to be replaced, it should be replaced with something that
isn't as cumbersome to use as index, and it shouldn't be done in a bulk
replacement attempt; as you have shown, doing so can lead to unintended
new bugs and the perpetuation of old ones.

When Raymond Hettinger did the same thing, replacing some examples of
find with partition, a similar first-pass skim of his proposed patch
also turned up a handful of new and perpetuated bugs.

I'm also not going to fix the patch because I don't believe that
replacing find with index is the correct course of action, for the few
reasons I've laid out in the current and previous messages on the topic.

 - Josiah


From tjreedy at udel.edu  Thu Aug 24 02:03:38 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Wed, 23 Aug 2006 20:03:38 -0400
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
References: <20060821081944.1A0F.JCARLSON@uci.edu><ca471dc20608211736h5f8903cctc92c60c5bd6e538e@mail.gmail.com>
	<20060823125951.1A60.JCARLSON@uci.edu>
Message-ID: <eciqcp$lra$1@sea.gmane.org>


"Josiah Carlson" <jcarlson at uci.edu> wrote in message 
news:20060823125951.1A60.JCARLSON at uci.edu...
> The intent of my post was to say that all of us want Py3k to succeed,

I should hope that we all do.

> but I believe that in order for it to succeed that breakage from the 2.x
> series should be gradual, in a similar way to how 2.x -> 2.x+1 breakage
> has been gradual.

Given that the rate of intentional breakage in the core language (including 
builtins) has been very minimal, this would take a couple of decades, which 
to my mind would be a failure.

> I believe we agree on this basic point

To the contrary, you seem to have a basic disagreement with the plan to
make all the core language changes at once and to clear the decks of old
baggage so we can move forward with a leaner language that is a bit
easier to learn and remember.

> according to your talk and your posts here, you want Py3k alpha
> in the next year or two, while I'm thinking that Py3k alpha should come
> somewhere after 2.6 and probably 2.7, maybe even after 2.8 or 2.9,

Whereas I wish it were already out and would be delighted to see it early 
next year.  Some of the changes have already been put off for at least five 
years and, to me, are overdue.

Terry Jan Reedy






From tjreedy at udel.edu  Thu Aug 24 02:27:26 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Wed, 23 Aug 2006 20:27:26 -0400
Subject: [Python-3000] Droping find/rfind?
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com><20060823044148.GR5772@performancedrivers.com>
	<e3c648160608222346g4587d55eiff521787ca4d915f@mail.gmail.com>
Message-ID: <ecirpd$p3t$1@sea.gmane.org>


"Brian Holmes" <holmesbj.dev at gmail.com> wrote in message 
news:e3c648160608222346g4587d55eiff521787ca4d915f at mail.gmail.com...

> Even after reading Terry Reedy's arguments, I don't see why we need to
> remove this option.

Since this is my first post in this current thread, you either meant 
someone else or are remembering my posts about in- and out-of-band error 
signaling from the last time we discussed this.

> Let both exist.  I'd prefer grandfathering something like this and
> leaving it in, even if it wouldn't be there had we known everything
> from the start.

One point of the 3.0 cleanup is to remove or change things that we 
definitely would not do today.  When I learned Python, both the 
find/index duplication and the in-band same-type Unix/C-ism -1 return 
stuck out to me like sore thumbs.  So I would either

1. just remove find() and leave index(); or
2. change find()'s error return to None, and remove index();

 or possibly consider Josiah's idea of
3. remove both in favor of an index generator.

I am strongly -1 on leaving both as are.

Terry Jan Reedy




From jack at psynchronous.com  Thu Aug 24 02:39:48 2006
From: jack at psynchronous.com (Jack Diederich)
Date: Wed, 23 Aug 2006 20:39:48 -0400
Subject: [Python-3000] find -> index patch
In-Reply-To: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
Message-ID: <20060824003948.GT5772@performancedrivers.com>

On Wed, Aug 23, 2006 at 02:18:59PM -0700, Guido van Rossum wrote:
> Here's the patch (by Hasan Diwan, BTW) for people's perusal. It just
> gets rid of all *uses* of find/rfind from Lib; it doesn't actually
> modify stringobject.c or unicodeobject.c. It doesn't use
> [r]partition()'; someone could look for opportunities to use that
> separately.

I made a go at an idiomatic conversion of the first few modules
tagged by 'grep find( *.py' in Lib; patch attached.

WOW, I love partition.  In all the instances that weren't a simple "in"
test I ended up using [r]partition.  In some cases one of the returned
strings gets thrown away but in those cases it is guaranteed to be small.
The new code is usually smaller than the old and generally clearer.
ex/ cgi.py
-        i = p.find('=')
-        if i >= 0:
-            name = p[:i].strip().lower()
-            value = p[i+1:].strip()
+        (name, sep_found, value) = p.partition('=')
+        if (sep_found):
+            name = name.strip().lower()
+            value = value.strip()

If folks like the way this partial set looks I'll convert the rest.

-Jack
-------------- next part --------------
Index: Lib/CGIHTTPServer.py
===================================================================
--- Lib/CGIHTTPServer.py	(revision 51530)
+++ Lib/CGIHTTPServer.py	(working copy)
@@ -106,16 +106,11 @@
     def run_cgi(self):
         """Execute a CGI script."""
         dir, rest = self.cgi_info
-        i = rest.rfind('?')
-        if i >= 0:
-            rest, query = rest[:i], rest[i+1:]
-        else:
-            query = ''
-        i = rest.find('/')
-        if i >= 0:
-            script, rest = rest[:i], rest[i:]
-        else:
-            script, rest = rest, ''
+        (rest, found, query) = rest.rpartition('?')
+        if not found:
+            rest, query = query, ''
+        (script, sep, rest) = rest.partition('/')
+        rest = sep + rest # keep the slash
         scriptname = dir + '/' + script
         scriptfile = self.translate_path(scriptname)
         if not os.path.exists(scriptfile):
Index: Lib/asynchat.py
===================================================================
--- Lib/asynchat.py	(revision 51530)
+++ Lib/asynchat.py	(working copy)
@@ -125,14 +125,13 @@
                 #    collect data to the prefix
                 # 3) end of buffer does not match any prefix:
                 #    collect data
-                terminator_len = len(terminator)
-                index = self.ac_in_buffer.find(terminator)
-                if index != -1:
+                (data, term_found, more_data) = self.ac_in_buffer.partition(terminator)
+                if term_found:
                     # we found the terminator
-                    if index > 0:
+                    if data:
                         # don't bother reporting the empty string (source of subtle bugs)
-                        self.collect_incoming_data (self.ac_in_buffer[:index])
-                    self.ac_in_buffer = self.ac_in_buffer[index+terminator_len:]
+                        self.collect_incoming_data(data)
+                    self.ac_in_buffer = more_data
                     # This does the Right Thing if the terminator is changed here.
                     self.found_terminator()
                 else:
Index: Lib/cookielib.py
===================================================================
--- Lib/cookielib.py	(revision 51530)
+++ Lib/cookielib.py	(working copy)
@@ -531,8 +531,10 @@
         return True
     if not is_HDN(A):
         return False
-    i = A.rfind(B)
-    if i == -1 or i == 0:
+    if (not B):
+        return False
+    (before_B, sep, after_B) = A.rpartition(B)
+    if not sep or not before_B:
         # A does not have form NB, or N is the empty string
         return False
     if not B.startswith("."):
@@ -595,7 +597,7 @@
 
     """
     erhn = req_host = request_host(request)
-    if req_host.find(".") == -1 and not IPV4_RE.search(req_host):
+    if "." not in req_host and not IPV4_RE.search(req_host):
         erhn = req_host + ".local"
     return req_host, erhn
 
@@ -616,16 +618,12 @@
 
 def request_port(request):
     host = request.get_host()
-    i = host.find(':')
-    if i >= 0:
-        port = host[i+1:]
-        try:
-            int(port)
-        except ValueError:
-            _debug("nonnumeric port: '%s'", port)
-            return None
-    else:
-        port = DEFAULT_HTTP_PORT
+    port = host.partition(':')[-1] or DEFAULT_HTTP_PORT
+    try:
+        int(port)
+    except ValueError:
+        _debug("nonnumeric port: '%s'", port)
+        return None
     return port
 
 # Characters in addition to A-Z, a-z, 0-9, '_', '.', and '-' that don't
@@ -676,13 +674,9 @@
     '.local'
 
     """
-    i = h.find(".")
-    if i >= 0:
-        #a = h[:i]  # this line is only here to show what a is
-        b = h[i+1:]
-        i = b.find(".")
-        if is_HDN(h) and (i >= 0 or b == "local"):
-            return "."+b
+    (a, sep, b) = h.partition(".")
+    if sep and is_HDN(h) and ("." in b or b == "local"):
+        return "."+b
     return h
 
 def is_third_party(request):
@@ -986,11 +980,9 @@
                 # XXX This should probably be compared with the Konqueror
                 # (kcookiejar.cpp) and Mozilla implementations, but it's a
                 # losing battle.
-                i = domain.rfind(".")
-                j = domain.rfind(".", 0, i)
-                if j == 0:  # domain like .foo.bar
-                    tld = domain[i+1:]
-                    sld = domain[j+1:i]
+                (extra, dot, tld) = domain.rpartition(".")
+                (extra, dot, sld) = extra.rpartition(".")
+                if dot and not extra:  # domain like .foo.bar
                     if sld.lower() in ("co", "ac", "com", "edu", "org", "net",
                        "gov", "mil", "int", "aero", "biz", "cat", "coop",
                        "info", "jobs", "mobi", "museum", "name", "pro",
@@ -1002,7 +994,7 @@
                 undotted_domain = domain[1:]
             else:
                 undotted_domain = domain
-            embedded_dots = (undotted_domain.find(".") >= 0)
+            embedded_dots = ("." in undotted_domain)
             if not embedded_dots and domain != ".local":
                 _debug("   non-local domain %s contains no embedded dot",
                        domain)
@@ -1024,8 +1016,7 @@
             if (cookie.version > 0 or
                 (self.strict_ns_domain & self.DomainStrictNoDots)):
                 host_prefix = req_host[:-len(domain)]
-                if (host_prefix.find(".") >= 0 and
-                    not IPV4_RE.search(req_host)):
+                if ("." in host_prefix and not IPV4_RE.search(req_host)):
                     _debug("   host prefix %s for domain %s contains a dot",
                            host_prefix, domain)
                     return False
@@ -1462,13 +1453,13 @@
         else:
             path_specified = False
             path = request_path(request)
-            i = path.rfind("/")
-            if i != -1:
+            (path, sep, dummy) = path.rpartition("/")
+            if sep:
                 if version == 0:
                     # Netscape spec parts company from reality here
-                    path = path[:i]
+                    pass
                 else:
-                    path = path[:i+1]
+                    path = path + sep
             if len(path) == 0: path = "/"
 
         # set default domain
Index: Lib/cgi.py
===================================================================
--- Lib/cgi.py	(revision 51530)
+++ Lib/cgi.py	(working copy)
@@ -340,10 +340,10 @@
     key = plist.pop(0).lower()
     pdict = {}
     for p in plist:
-        i = p.find('=')
-        if i >= 0:
-            name = p[:i].strip().lower()
-            value = p[i+1:].strip()
+        (name, sep_found, value) = p.partition('=')
+        if (sep_found):
+            name = name.strip().lower()
+            value = value.strip()
             if len(value) >= 2 and value[0] == value[-1] == '"':
                 value = value[1:-1]
                 value = value.replace('\\\\', '\\').replace('\\"', '"')
Index: Lib/ConfigParser.py
===================================================================
--- Lib/ConfigParser.py	(revision 51530)
+++ Lib/ConfigParser.py	(working copy)
@@ -468,9 +468,9 @@
                         if vi in ('=', ':') and ';' in optval:
                             # ';' is a comment delimiter only if it follows
                             # a spacing character
-                            pos = optval.find(';')
-                            if pos != -1 and optval[pos-1].isspace():
-                                optval = optval[:pos]
+                            (new_optval, sep, comment) = optval.partition(';')
+                            if (sep and new_optval[-1:].isspace()):
+                                optval = new_optval
                         optval = optval.strip()
                         # allow empty values
                         if optval == '""':
@@ -599,14 +599,13 @@
         if depth > MAX_INTERPOLATION_DEPTH:
             raise InterpolationDepthError(option, section, rest)
         while rest:
-            p = rest.find("%")
-            if p < 0:
+            (before, sep, after) = rest.partition('%')
+            if (not sep):
                 accum.append(rest)
                 return
-            if p > 0:
-                accum.append(rest[:p])
-                rest = rest[p:]
-            # p is no longer used
+            elif (after):
+                accum.append(before)
+                rest = sep + after
             c = rest[1:2]
             if c == "%":
                 accum.append("%")

From jimjjewett at gmail.com  Thu Aug 24 03:10:40 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Wed, 23 Aug 2006 21:10:40 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ecirpd$p3t$1@sea.gmane.org>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<e3c648160608222346g4587d55eiff521787ca4d915f@mail.gmail.com>
	<ecirpd$p3t$1@sea.gmane.org>
Message-ID: <fb6fbf560608231810k6ad48da4t7f880d7718d1bb2d@mail.gmail.com>

On 8/23/06, Terry Reedy <tjreedy at udel.edu> wrote:
> 2. change find()'s error return to None, and remove index();

+1

It is particularly unfortunate that the error code of -1 is a valid index.

    >>> substring = string[string.find(marker):]

will silently produce garbage.
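
Jim's point fits in a couple of lines (the marker text here is purely illustrative):

```python
text = "no marker here"
# find() returns -1 when the substring is absent, and since -1 is a
# valid index, the slice quietly keeps only the last character
# instead of raising an error.
substring = text[text.find(">>"):]
print(substring)  # -> "e", silent garbage rather than an exception
```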

>  or possibly consider Josiah's idea of
> 3. remove both in favor of an index generator.

The strawman seemed clumsy, but maybe it will grow on me.

-jJ

From greg.ewing at canterbury.ac.nz  Thu Aug 24 03:36:25 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Thu, 24 Aug 2006 13:36:25 +1200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060823123719.1A5D.JCARLSON@uci.edu>
References: <20060823114629.1A57.JCARLSON@uci.edu>
	<d11dcfba0608231207o43fd1237i19520b57c85673e1@mail.gmail.com>
	<20060823123719.1A5D.JCARLSON@uci.edu>
Message-ID: <44ED0299.7040204@canterbury.ac.nz>

Josiah Carlson wrote:

> Given search as a potential replacement, about the only question is
> whether count should default to sys.maxint or 1.

Do you think that there will be many use cases for
count values *other* than 1 or sys.maxint? If not,
it might be more sensible to have two functions,
search() and searchall().

And while we're on this, what about list.index?
Should it also be replaced with list.search or
whatever as well?

--
Greg

From tdelaney at avaya.com  Thu Aug 24 04:05:08 2006
From: tdelaney at avaya.com (Delaney, Timothy (Tim))
Date: Thu, 24 Aug 2006 12:05:08 +1000
Subject: [Python-3000] Droping find/rfind?
Message-ID: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>

Nick Coghlan wrote:

> I also like Josiah's idea of replacing find() with a search() method
> that returned an iterator of indices, so that you can do:
> 
> for idx in string.search(sub):
>     # Process the indices (if any)

Need to be careful with this - the original search proposal returned a
list, which could be tested for a boolean value - hence:

    if not string.search(sub):
        pass

but if an iterator were returned, I think we would want to be able to
perform the same test i.e. search would have to return an iterator that
had already performed the initial search, with __nonzero__ reflecting
the result of that search. I do think that returning an iterator is
better due to the fact that most uses of search() would only care about
the first returned index.
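
One way to get both behaviours — an eager first lookup so truth-testing works, lazy iteration afterwards — is sketched below; `SearchIter` and its semantics are purely hypothetical, not an agreed API:

```python
import sys

class SearchIter:
    """Hypothetical str.search() result: the first match is computed
    eagerly so bool() reflects it; remaining matches are found lazily."""
    def __init__(self, s, sub, count=sys.maxsize):
        self._s, self._sub, self._count = s, sub, count
        self._first = s.find(sub)  # eager initial search

    def __bool__(self):  # __nonzero__ in the Python 2.x of this thread
        return self._first >= 0

    def __iter__(self):
        pos, emitted = self._first, 0
        while pos >= 0 and emitted < self._count:
            yield pos
            emitted += 1
            pos = self._s.find(self._sub, pos + 1)

hits = SearchIter("banana", "an")
print(bool(hits), list(hits))  # -> True [1, 3]
```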

Tim Delaney

From jcarlson at uci.edu  Thu Aug 24 04:14:08 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 19:14:08 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
Message-ID: <20060823191222.1A76.JCARLSON@uci.edu>


"Delaney, Timothy (Tim)" <tdelaney at avaya.com> wrote:
> 
> Nick Coghlan wrote:
> 
> > I also like Josiah's idea of replacing find() with a search() method
> > that returned an iterator of indices, so that you can do:
> > 
> > for idx in string.search(sub):
> >     # Process the indices (if any)
> 
> Need to be careful with this - the original search proposal returned a
> list, which could be tested for a boolean value - hence:
> 
>     if not string.search(sub):
>         pass
> 
> but if an iterator were returned, I think we would want to be able to
> perform the same test i.e. search would have to return an iterator that
> had already performed the initial search, with __nonzero__ reflecting
> the result of that search. I do think that returning an iterator is
> better due to the fact that most uses of search() would only care about
> the first returned index.

... which is why there is a count argument, which I have recently
suggested should default to 1.


 - Josiah


From jcarlson at uci.edu  Thu Aug 24 04:21:22 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 19:21:22 -0700
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <eciqcp$lra$1@sea.gmane.org>
References: <20060823125951.1A60.JCARLSON@uci.edu> <eciqcp$lra$1@sea.gmane.org>
Message-ID: <20060823185143.1A73.JCARLSON@uci.edu>


"Terry Reedy" <tjreedy at udel.edu> wrote:
> "Josiah Carlson" <jcarlson at uci.edu> wrote in message 
> news:20060823125951.1A60.JCARLSON at uci.edu...
> > The intent of my post was to say that all of us want Py3k to succeed,
> 
> I should hope that we all do.
> 
> > but I believe that in order for it to succeed that breakage from the 2.x
> > series should be gradual, in a similar way to how 2.x -> 2.x+1 breakage
> > has been gradual.
> 
> Given that the rate of intentional breakage in the core language (including 
> builtins) has been very minimal, this would take a couple of decades, which 
> to my mind would be a failure.

If we could stick with a 12-18 month release schedule, using deprecation
and removal in subsequent releases, every removal could happen in 2-3
years. 2.6 could offer every feature of 3.0 (except for
backwards-incompatible syntax), warning of removal or relocation (in the
case of stdlib reorganization), 3.0 could handle all of the actual
syntax changes.


> > I believe we agree on this basic point
> 
> To the contrary, you seem to have a basic disagreement with the plan to 
> make all the core language changes at once and to clear the decks of old 
> baggage so we can move forward with a learner language that is a bit easier 
> to learn and remember.

I disagree with the "all the changes at once", but if Guido didn't agree
with a gradual upgrade path, then the 2.6-2.9 series wouldn't even be
considered as options, and we'd be looking at 3.0 coming out after 2.5,
and there not being a 2.6 .  Since 2.6 is planned, and other 2.x
releases are at least possible (if not expected), then I must agree with
someone, as my desires haven't previously been sufficient to change
Python release expectations.


> > according to your talk and your posts here, you want Py3k alpha
> > in the next year or two, while I'm thinking that Py3k alpha should come
> > somewhere after 2.6 and probably 2.7, maybe even after 2.8 or 2.9,
> 
> Whereas I wish it were already out and would be delighted to see it early 
> next year.  Some of the changes have already been put off for at least five 
> years and, to me, are overdue.

As a daily abuser of Python, I've not found the language to be lacking
in any area significant enough, or even having too many overlapping
features sufficient to warrant such widespread language breakage.  We
disagree on this point, and that's fine, as long as Guido agrees that
2.6+ make sense, which he does, and states as much in his talk and all
relevant postings I've seen, then I don't need to drug him.

He also agrees that 3.0 should come out sooner rather than later, but
that's not going to stop me from attempting to make the case that 3.0 is
going to be generally unused until later gradual 2.6+ releases close the
gap and make the transition more natural.

But hey, I'm just a guy who writes software who is going to have to
transition and maintain it. Obviously there can't be too many of us, go
ahead and break the language, I'm sure everyone will be happy to
upgrade to 3.0, you won't even need to maintain the 2.x series, really.

 - Josiah


From martin at v.loewis.de  Thu Aug 24 04:24:17 2006
From: martin at v.loewis.de (martin at v.loewis.de)
Date: Thu, 24 Aug 2006 04:24:17 +0200
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <20060823185143.1A73.JCARLSON@uci.edu>
References: <20060823125951.1A60.JCARLSON@uci.edu> <eciqcp$lra$1@sea.gmane.org>
	<20060823185143.1A73.JCARLSON@uci.edu>
Message-ID: <1156386257.44ed0dd1cf737@www.domainfactory-webmail.de>

Zitat von Josiah Carlson <jcarlson at uci.edu>:

> > To the contrary, you seem to have a basic disagreement with the plan to
> > make all the core language changes at once and to clear the decks of old
> > baggage so we can move forward with a learner language that is a bit easier
> > to learn and remember.
>
> I disagree with the "all the changes at once", but if Guido didn't agree
> with a gradual upgrade path, then the 2.6-2.9 series wouldn't even be
> considered as options, and we'd be looking at 3.0 coming out after 2.5,
> and there not being a 2.6 .  Since 2.6 is planned, and other 2.x
> releases are at least possible (if not expected), then I must agree with
> someone, as my desires haven't previously been sufficient to change
> Python release expectations.

That conclusion is invalid. 2.6, 2.7, ... are not made to gradually
move towards 3.0, but because it is anticipated that 3.0 will not
be adopted immediately, but, say, 3.2 might be. To provide new
features for 2.x users, new 2.x releases need to be made
(of course, the features added to, say, 2.7 will likely also
be added to, say, 3.3).

Regards,
Martin


From guido at python.org  Thu Aug 24 04:39:29 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 23 Aug 2006 19:39:29 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060823191222.1A76.JCARLSON@uci.edu>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
	<20060823191222.1A76.JCARLSON@uci.edu>
Message-ID: <ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>

I don't find the current attempts to come up with a better substring
search API useful.

We did a lot of thinking about this not too long ago, and the result
was the addition of [r]partition() to 2.5 and the intent to drop
[r]find() from py3k as both redundant with [r]index() and error-prone
(I think I just found another bug in logging.__init__.py:

    def _fixupChildren(self, ph, alogger):
        """
        Ensure that children of the placeholder ph are connected to the
        specified logger.
        """
        #for c in ph.loggers:
        for c in ph.loggerMap.keys():
            if string.find(c.parent.name, alogger.name) <> 0:
                alogger.parent = c.parent
                c.parent = alogger

This is either a really weird way of writing "if not
c.parent.name.startswith(alogger.name):", or a bug which was intending
to write "if alogger.name in c.parent.name:" .
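
For what it's worth, the two spellings do agree: `find()` returns 0 exactly when the argument is a prefix, so `find(...) != 0` and `not startswith(...)` give the same answer in all three cases (prefix, later match, no match):

```python
cases = [("a.b.c", "a.b"),   # prefix      -> find() == 0
         ("x.a.b", "a.b"),   # later match -> find() > 0
         ("xyz",   "a.b")]   # no match    -> find() == -1
for name, prefix in cases:
    assert (name.find(prefix) != 0) == (not name.startswith(prefix))
```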

I appreciate the criticism on the patch -- clearly it's not ready to
go in, and more work needs to be put in to actually *improve* the
code, using [r]partition() where necessary, etc. But I'm strengthened
in the conclusion that find() is way overused and we don't need yet
another search primitive. TOOWTDI.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From holmesbj.dev at gmail.com  Thu Aug 24 05:38:08 2006
From: holmesbj.dev at gmail.com (Brian Holmes)
Date: Wed, 23 Aug 2006 20:38:08 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ecirpd$p3t$1@sea.gmane.org>
References: <ca471dc20608221832s313d02a2gf01a5532eceedebf@mail.gmail.com>
	<20060823044148.GR5772@performancedrivers.com>
	<e3c648160608222346g4587d55eiff521787ca4d915f@mail.gmail.com>
	<ecirpd$p3t$1@sea.gmane.org>
Message-ID: <e3c648160608232038w7107263fx80938c0908298b82@mail.gmail.com>

On 8/23/06, Terry Reedy <tjreedy at udel.edu> wrote:
>
>
> "Brian Holmes" <holmesbj.dev at gmail.com> wrote in message
> news:e3c648160608222346g4587d55eiff521787ca4d915f at mail.gmail.com...
>
> >Even after reading Terry Reedy's arguments, I don't see why we need to
> > >remove this option.
>
> Since this is my first post in this current thread, you either meant
> someone else or are remembering my posts about in- and out-of-band error
> signaling from the last time we discussed this.
>

My reference was to this post:

http://mail.python.org/pipermail/python-dev/2005-August/055717.html

- Brian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060823/7848eb65/attachment.html 

From talin at acm.org  Thu Aug 24 05:38:06 2006
From: talin at acm.org (Talin)
Date: Wed, 23 Aug 2006 20:38:06 -0700
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <20060823185143.1A73.JCARLSON@uci.edu>
References: <20060823125951.1A60.JCARLSON@uci.edu> <eciqcp$lra$1@sea.gmane.org>
	<20060823185143.1A73.JCARLSON@uci.edu>
Message-ID: <44ED1F1E.1080307@acm.org>

Josiah Carlson wrote:
> "Terry Reedy" <tjreedy at udel.edu> wrote:
>> "Josiah Carlson" <jcarlson at uci.edu> wrote in message 
>> news:20060823125951.1A60.JCARLSON at uci.edu...
>>> The intent of my post was to say that all of us want Py3k to succeed,
>> I should hope that we all do.
>>
>>> but I believe that in order for it to succeed that breakage from the 2.x
>>> series should be gradual, in a similar way to how 2.x -> 2.x+1 breakage
>>> has been gradual.
>> Given that the rate of intentional breakage in the core language (including 
>> builtins) has been very minimal, this would take a couple of decades, which 
>> to my mind would be a failure.
> 
> If we could stick with a 12-18 month release schedule, using deprecation
> and removal in subsequent releases, every removal could happen in 2-3
> years. 2.6 could offer every feature of 3.0 (except for
> backwards-incompatible syntax), warning of removal or relocation (in the
> case of stdlib reorganization), 3.0 could handle all of the actual
> syntax changes.

2.6 should also include a powerful 'lint' option that detects use of 
features not compatible with 3.0. Something like "from __future__ import 
pedantic" or something along those lines.

-- Talin



From jcarlson at uci.edu  Thu Aug 24 07:07:29 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 22:07:29 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44ED0299.7040204@canterbury.ac.nz>
References: <20060823123719.1A5D.JCARLSON@uci.edu>
	<44ED0299.7040204@canterbury.ac.nz>
Message-ID: <20060823220213.1A7C.JCARLSON@uci.edu>


Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Josiah Carlson wrote:
> > Given search as a potential replacement, about the only question is
> > whether count should default to sys.maxint or 1.
> 
> Do you think that there will be many use cases for
> count values *other* than 1 or sys.maxint? If not,
> it might be more sensible to have two functions,
> search() and searchall().

I have used str.split with counts != 1 or sys.maxint, and I would guess
that there would be similar use-cases.

> And while we're on this, what about list.index?
> Should it also be replaced with list.search or
> whatever as well?

To be consistent from a sequence operation perspective, I would say yes,
though I have so rarely used list.index(), I'm hard-pressed to have much
of an opinion.

 - Josiah


From jcarlson at uci.edu  Thu Aug 24 07:20:43 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 23 Aug 2006 22:20:43 -0700
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <1156386257.44ed0dd1cf737@www.domainfactory-webmail.de>
References: <20060823185143.1A73.JCARLSON@uci.edu>
	<1156386257.44ed0dd1cf737@www.domainfactory-webmail.de>
Message-ID: <20060823203502.1A79.JCARLSON@uci.edu>


martin at v.loewis.de wrote:
> 
> Zitat von Josiah Carlson <jcarlson at uci.edu>:
> 
> > > To the contrary, you seem to have a basic disagreement with the plan to
> > > make all the core language changes at once and to clear the decks of old
> > > baggage so we can move forward with a learner language that is a bit easier
> > > to learn and remember.
> >
> > I disagree with the "all the changes at once", but if Guido didn't agree
> > with a gradual upgrade path, then the 2.6-2.9 series wouldn't even be
> > considered as options, and we'd be looking at 3.0 coming out after 2.5,
> > and there not being a 2.6 .  Since 2.6 is planned, and other 2.x
> > releases are at least possible (if not expected), then I must agree with
> > someone, as my desires haven't previously been sufficient to change
> > Python release expectations.
> 
> That conclusion is invalid. 2.6, 2.7, ... are not made to gradually
> move towards 3.0, but because it is anticipated that 3.0 will not
> be adopted immediately, but, say, 3.2 might be. To provide new
> features for 2.x users, new 2.x releases need to be made
> (of course, the features added to, say, 2.7 will likely also
> be added to, say, 3.3).

See Guido's reply here:
http://mail.python.org/pipermail/python-3000/2006-August/003105.html

Specifically his response to the "Here's my suggestion:" paragraph. 
Unless I completely misunderstood his response, and his later asking
whether I want to help author the transition PEP (presumably for at
least dict.keys(), but more likely from 2.x to 3.x), I can't help but
believe that he also wants at least an attempt at some gradual change
for users with cold feet about breaking everything in one go.

Also, in the talk he gave at Google on July 21, somewhere around the
7:45-11 minute mark, he talks about how 3.x features are to be
backported to 2.7 or so, specifically so that there is a larger subset
of Python that will run in both 2.x and 3.x .  Smells like an attempt at
gradual migration to me.


 - Josiah


From steven.bethard at gmail.com  Thu Aug 24 08:31:54 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Thu, 24 Aug 2006 00:31:54 -0600
Subject: [Python-3000] find -> index patch
In-Reply-To: <20060824003948.GT5772@performancedrivers.com>
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
	<20060824003948.GT5772@performancedrivers.com>
Message-ID: <d11dcfba0608232331k2fe9819t659bc219c0685590@mail.gmail.com>

On 8/23/06, Jack Diederich <jack at psynchronous.com> wrote:
> On Wed, Aug 23, 2006 at 02:18:59PM -0700, Guido van Rossum wrote:
> > Here's the patch (by Hasan Diwan, BTW) for people's perusal. It just
> > gets rid of all *uses* of find/rfind from Lib; it doesn't actually
> > modify stringobject.c or unicodeobject.c. It doesn't use
> > [r]partition()'; someone could look for opportunities to use that
> > separately.
>
> I made a go at doing an idiomatic conversion of the first few modules
> tagged by 'grep find( *.py' in Lib, patch attached.
>
> WOW, I love partition.

After looking at your patch, I have to agree.  The new code is *way*
more readable.

Nice work!

STeVe
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From ncoghlan at gmail.com  Thu Aug 24 11:38:44 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 24 Aug 2006 19:38:44 +1000
Subject: [Python-3000] find -> index patch
In-Reply-To: <20060824003948.GT5772@performancedrivers.com>
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
	<20060824003948.GT5772@performancedrivers.com>
Message-ID: <44ED73A4.40208@gmail.com>

Jack Diederich wrote:
> If folks like the way this partial set looks I'll convert the rest.

+1 from here (beautifying the standard lib was one of the justifications for 
partition, after all).

> ------------------------------------------------------------------------
> 
> Index: Lib/CGIHTTPServer.py
> ===================================================================
> --- Lib/CGIHTTPServer.py	(revision 51530)
> +++ Lib/CGIHTTPServer.py	(working copy)
> @@ -106,16 +106,9 @@
>      def run_cgi(self):
>          """Execute a CGI script."""
>          dir, rest = self.cgi_info
> -        i = rest.rfind('?')
> -        if i >= 0:
> -            rest, query = rest[:i], rest[i+1:]
> -        else:
> -            query = ''
> -        i = rest.find('/')
> -        if i >= 0:
> -            script, rest = rest[:i], rest[i:]
> -        else:
> -            script, rest = rest, ''
> +        (rest, sep, query) = rest.rpartition('?')
> +        (rest, sep, script) = rest.partition('/')
> +        rest = sep + rest # keep the slash

rest & script are back to front on the second line of the new bit.
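
With the names swapped back, the new hunk would behave as intended (a sketch; the sample path is made up for illustration):

```python
rest = "script.py/pathinfo?key=val"     # hypothetical cgi_info remainder
rest, sep, query = rest.rpartition('?')
script, sep, rest = rest.partition('/')  # script before rest this time
rest = sep + rest                        # keep the slash
print(script, rest, query)  # -> script.py /pathinfo key=val
```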

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Thu Aug 24 11:45:58 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 24 Aug 2006 19:45:58 +1000
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <20060823203502.1A79.JCARLSON@uci.edu>
References: <20060823185143.1A73.JCARLSON@uci.edu>	<1156386257.44ed0dd1cf737@www.domainfactory-webmail.de>
	<20060823203502.1A79.JCARLSON@uci.edu>
Message-ID: <44ED7556.7010306@gmail.com>

Josiah Carlson wrote:
> Also, in the talk he gave at Google on July 21, somewhere around the
> 7:45-11 minute mark, he talks about how 3.x features are to be
> backported to 2.7 or so, specifically so that there is a larger subset
> of Python that will run in both 2.x and 3.x .  Smells like an attempt at
> gradual migration to me.

He also said that he doesn't expect Python 3.0 to see widespread usage, with a 
relatively rapid evolution to 3.1 (and possibly even 3.2).

I don't think there's really that much disagreement here - the difference is 
that Guido wants to get 3.0 out early so that we *know* what the eventual 
target is for later 2.x releases.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From fredrik at pythonware.com  Thu Aug 24 12:35:54 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Thu, 24 Aug 2006 12:35:54 +0200
Subject: [Python-3000] find -> index patch
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
Message-ID: <ecjvea$jk9$1@sea.gmane.org>

Guido van Rossum wrote:

> Here's the patch (by Hasan Diwan, BTW) for people's perusal. It just
> gets rid of all *uses* of find/rfind from Lib; it doesn't actually
> modify stringobject.c or unicodeobject.c. It doesn't use
> [r]partition()'; someone could look for opportunities to use that
> separately.

since most of the changes appear to be variations of the pattern

    - index = foo.find(bar)
    + try:
    +     index = foo.index(bar)
    + except:
    +     index = -1

it sure looks like the "get rid of find; it's the same thing as index" idea might
be somewhat misguided.  I think I'm "idea".find("good") on this one.  better
use this energy on partitionifying the 2.6 standard library instead.

</F> 




From fredrik at pythonware.com  Thu Aug 24 12:51:20 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Thu, 24 Aug 2006 12:51:20 +0200
Subject: [Python-3000] Droping find/rfind?
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com><20060823191222.1A76.JCARLSON@uci.edu>
	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
Message-ID: <eck0b8$mno$1@sea.gmane.org>

Guido van Rossum wrote:

>        for c in ph.loggerMap.keys():
>            if string.find(c.parent.name, alogger.name) <> 0:
>                alogger.parent = c.parent
>                c.parent = alogger
>
> This is either a really weird way of writing "if not
> c.parent.name.startswith(alogger.name):"

weird, indeed, but it could be a premature attempt to optimize away the slicing
for platforms that don't have "startswith" (it doesn't look like a bug, afaict).

(on the other hand, "s[:len(t)] == t" is usually faster than "s.startswith(t)" for short
prefixes, so maybe someone should have done a bit more benchmarking...)
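
The claim is easy to check with `timeit` (results vary by interpreter and prefix length, so no numbers are asserted here):

```python
import timeit

setup = "s = 'logging.handlers'; t = 'logging'"
for expr in ("s[:len(t)] == t", "s.startswith(t)"):
    seconds = timeit.timeit(expr, setup=setup, number=1_000_000)
    print(expr, seconds)
```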

(which reminds me that speeding up handling of optional arguments to C functions
would be an even better use of this energy)

</F> 




From walter at livinglogic.de  Thu Aug 24 12:56:37 2006
From: walter at livinglogic.de (Walter Dörwald)
Date: Thu, 24 Aug 2006 12:56:37 +0200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>
	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
Message-ID: <44ED85E5.1000005@livinglogic.de>

Guido van Rossum wrote:

> I don't find the current attempts to come up with a better substring
> search API useful.
> 
> [...]
>
> I appreciate the criticism on the patch -- clearly it's not ready to
> go in, and more work needs to be put in to actually *improve* the
> code, using [r]partition() where necessary, etc. But I'm strengthened
> in the conclusion that find() is way overused and we don't need yet
> another search primitive. TOOWTDI.

I don't see what's wrong with find() per se. IMHO in the following use
case find() is the best option: Find the occurrences of "{foo bar}"
patterns in the string and return both parts as a tuple. Return (None,
"text") for the parts between the patterns, i.e. for
   'foo{spam eggs}bar{foo bar}'
return
   [(None, 'foo'), ('spam', 'eggs'), (None, 'bar'), ('foo', 'bar')]

Using find(), the code looks like this:

def splitfind(s):
    pos = 0
    while True:
        posstart = s.find("{", pos)
        if posstart < 0:
            break
        posarg = s.find(" ", posstart)
        if posarg < 0:
            break
        posend = s.find("}", posarg)
        if posend < 0:
            break
        prefix = s[pos:posstart]
        if prefix:
            yield (None, prefix)
        yield (s[posstart+1:posarg], s[posarg+1:posend])
        pos = posend+1
    rest = s[pos:]
    if rest:
        yield (None, rest)

Using index() looks worse to me. The code is buried under the exception
handling:

def splitindex(s):
    pos = 0
    while True:
        try:
            posstart = s.index("{", pos)
        except ValueError:
            break
        try:
            posarg = s.index(" ", posstart)
        except ValueError:
            break
        try:
            posend = s.index("}", posarg)
        except ValueError:
            break
        prefix = s[pos:posstart]
        if prefix:
            yield (None, prefix)
        yield (s[posstart+1:posarg], s[posarg+1:posend])
        pos = posend+1
    rest = s[pos:]
    if rest:
        yield (None, rest)

Using partition() might have a performance problem if the input string
is long.
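
For comparison, a partition()-based version of the same generator (a sketch: each partition() call copies the remainder, which is the cost mentioned above, and it silently drops a malformed tail instead of re-emitting it like splitfind() does):

```python
def splitpartition(s):
    rest = s
    while True:
        prefix, brace, rest = rest.partition("{")
        if prefix:
            yield (None, prefix)
        if not brace:
            return
        first, space, rest = rest.partition(" ")
        if not space:
            return
        second, close, rest = rest.partition("}")
        if not close:
            return
        yield (first, second)
```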

Servus,
   Walter


From ncoghlan at gmail.com  Thu Aug 24 13:48:22 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 24 Aug 2006 21:48:22 +1000
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44ED85E5.1000005@livinglogic.de>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
	<44ED85E5.1000005@livinglogic.de>
Message-ID: <44ED9206.1080306@gmail.com>

Walter Dörwald wrote:
> Guido van Rossum wrote:
> 
>> I don't find the current attempts to come up with a better substring
>> search API useful.
>>
>> [...]
>>
>> I appreciate the criticism on the patch -- clearly it's not ready to
>> go in, and more work needs to be put in to actually *improve* the
>> code, using [r]partition() where necessary, etc. But I'm strengthened
>> in the conclusion that find() is way overused and we don't need yet
>> another search primitive. TOOWTDI.
> 
> I don't see what's wrong with find() per se. IMHO in the following use
> case find() is the best option: Find the occurrences of "{foo bar}"
> patterns in the string and return both parts as a tuple. Return (None,
> "text") for the parts between the patterns, i.e. for
>    'foo{spam eggs}bar{foo bar}'
> return
>    [(None, 'foo'), ('spam', 'eggs'), (None, 'bar'), ('foo', 'bar')]

With a variety of "view types", that work like the corresponding builtin type, 
but reference the original data structure instead of creating copies, then you 
could use partition without having to worry about poor performance on large 
strings:

def splitview(s):
    rest = strview(s)
    while True:
        prefix, found, rest = rest.partition("{")
        if prefix:
            yield (None, str(prefix))
        if not found:
            break
        first, found, rest = rest.partition(" ")
        if not found:
            break
        second, found, rest = rest.partition("}")
        if not found:
            break
        yield (str(first), str(second))
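The same three-way partition loop can be run today with plain str (dropping the hypothetical strview, at the cost of copying the tail on each call); splitting on "{", " " and "}" in turn reproduces Walter's expected output:

```python
def splitview_str(s):
    # Plain-str sketch of the partition-based loop; each partition()
    # call copies the remaining tail of the string.
    rest = s
    while True:
        prefix, found, rest = rest.partition("{")
        if prefix:
            yield (None, prefix)
        if not found:
            break
        first, found, rest = rest.partition(" ")
        if not found:
            break
        second, found, rest = rest.partition("}")
        if not found:
            break
        yield (first, second)

print(list(splitview_str('foo{spam eggs}bar{foo bar}')))
# [(None, 'foo'), ('spam', 'eggs'), (None, 'bar'), ('foo', 'bar')]
```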

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From fredrik at pythonware.com  Thu Aug 24 14:33:02 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Thu, 24 Aug 2006 14:33:02 +0200
Subject: [Python-3000] Droping find/rfind?
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com><44ED85E5.1000005@livinglogic.de>
	<44ED9206.1080306@gmail.com>
Message-ID: <eck69v$b6n$1@sea.gmane.org>

Nick Coghlan wrote:

> With a variety of "view types", that work like the corresponding builtin type,
> but reference the original data structure instead of creating copies

support for string views would require some serious interpreter surgery, though,
and probably break quite a few extensions...

</F> 




From mcherm at mcherm.com  Thu Aug 24 14:44:50 2006
From: mcherm at mcherm.com (Michael Chermside)
Date: Thu, 24 Aug 2006 05:44:50 -0700
Subject: [Python-3000] find -> index patch
Message-ID: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>

Jack Diederich writes:
> I make a go at doing an idiomatic conversion [...] patch attached.
>
> WOW, I love partition.  In all the instances that weren't a simple "in"
> test I ended up using [r]partition.  In some cases one of the returned
> strings gets thrown away but in those cases it is guaranteed to be small.
> The new code is usually smaller than the old and generally clearer.

Wow. That's just beautiful. This has now convinced me that dumping
[r]find() (at least!) and pushing people toward using partition will
result in pain in the short term (of course), and beautiful, readable
code in the long term.

> If folks like the way this partial set looks I'll convert the rest.

Please do! Even if we *retain* [r]find(), this is still better code.
And I'm personally going to stop using [r]find() in my own code
starting today.

-- Michael Chermside


From fredrik at pythonware.com  Thu Aug 24 15:48:57 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Thu, 24 Aug 2006 15:48:57 +0200
Subject: [Python-3000] find -> index patch
References: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>
Message-ID: <eckao9$rq9$1@sea.gmane.org>

Michael Chermside wrote:

>> WOW, I love partition.  In all the instances that weren't a simple "in"
>> test I ended up using [r]partition.  In some cases one of the returned
>> strings gets thrown away but in those cases it is guaranteed to be small.
>> The new code is usually smaller than the old and generally clearer.
>
> Wow. That's just beautiful. This has now convinced me that dumping
> [r]find() (at least!) and pushing people toward using partition will
> result in pain in the short term (of course), and beautiful, readable
> code in the long term.

note that partition provides an elegant solution to an important *subset* of all
problems addressed by find/index.

just like lexical scoping vs. default arguments and map vs. list comprehensions,
it doesn't address all problems right out of the box, and shouldn't be advertised
as doing that.

</F> 




From gmccaughan at synaptics-uk.com  Thu Aug 24 16:21:11 2006
From: gmccaughan at synaptics-uk.com (Gareth McCaughan)
Date: Thu, 24 Aug 2006 15:21:11 +0100
Subject: [Python-3000] find -> index patch
In-Reply-To: <eckao9$rq9$1@sea.gmane.org>
References: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>
	<eckao9$rq9$1@sea.gmane.org>
Message-ID: <200608241521.13007.gmccaughan@synaptics-uk.com>

Fredrik Lundh wrote:

> note that partition provides an elegant solution to an important *subset* of all
> problems addressed by find/index.
> 
> just like lexical scoping vs. default arguments and map vs. list comprehensions,
> it doesn't address all problems right out of the box, and shouldn't be advertised
> as doing that.

Sure, but partition + "in" (now that it works as an arbitrary substring test)
seem to cover a very large subset of the things you'd want to do with find:
enough that having only index available for the remaining cases is unlikely
to hurt much (apart from the important issue of backward compatibility, but
this *is* py3k). I'm having trouble thinking of any plausible counterexamples,
though I'm sure there must be some.
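As an illustration of the subset this covers, here is a common find() idiom next to its partition() equivalent (a hypothetical example, not from the patch):

```python
s = "key=value"

# find()-based idiom: index arithmetic plus a -1 sentinel check.
i = s.find("=")
if i >= 0:
    key, value = s[:i], s[i + 1:]
else:
    key, value = s, ""

# partition()-based equivalent: one call, no sentinel, no slicing.
key2, sep, value2 = s.partition("=")

assert (key, value) == (key2, value2)
```

When the separator is absent, partition() returns (s, '', ''), which matches the fallback branch of the find() version.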

-- 
g



From nnorwitz at gmail.com  Thu Aug 24 16:25:04 2006
From: nnorwitz at gmail.com (Neal Norwitz)
Date: Thu, 24 Aug 2006 10:25:04 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <eck0b8$mno$1@sea.gmane.org>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
	<20060823191222.1A76.JCARLSON@uci.edu>
	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
	<eck0b8$mno$1@sea.gmane.org>
Message-ID: <ee2a432c0608240725g2d23d45bm958f8135be20029d@mail.gmail.com>

On 8/24/06, Fredrik Lundh <fredrik at pythonware.com> wrote:
>
> (which reminds me that speeding up handling of optional arguments to C functions
> would be an even better use of this energy)

If this patch:  http://python.org/sf/1107887 is integrated with some
of my current work, it should do the job nicely.  IIRC the patch uses
a big switch which sped things up, but Raymond didn't like it (I think
more on a conceptual basis).  I don't think it slowed things down
measurably.

My new approach has been to add a C function pointer to PyCFunction
and some other 'function' objects that can dispatch to an appropriate
function in ceval.c that does the right thing.  I define a bunch of
little methods that are determined when the function is created and
only do what's necessary depending on the ml_flags.  It could be
expanded to look at other things.  The current work hasn't produced
any measurable changes in perf, but I've only gotten rid of a few
comparisons and/or a possible function call (if it isn't inlined).  If
I merge these two approaches, I should be able to speed up
cases like you describe.

n

From guido at python.org  Thu Aug 24 16:27:11 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 24 Aug 2006 07:27:11 -0700
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <20060823203502.1A79.JCARLSON@uci.edu>
References: <20060823185143.1A73.JCARLSON@uci.edu>
	<1156386257.44ed0dd1cf737@www.domainfactory-webmail.de>
	<20060823203502.1A79.JCARLSON@uci.edu>
Message-ID: <ca471dc20608240727h896db39j79a636e5e5c81dff@mail.gmail.com>

On 8/23/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> Specifically his response to the "Here's my suggestion:" paragraph.
> Unless I completely misunderstood his response, and his later asking
> whether I want to help author the transition PEP (presumably for at
> least dict.keys(), but more likely from 2.x to 3.x), I can't help but
> believe that he also wants at least an attempt at some gradual change
> for users with cold feet about breaking everything in one go.
>
> Also, in the talk he gave at Google on July 21, somewhere around the
> 7:45-11 minute mark, he talks about how 3.x features are to be
> backported to 2.7 or so, specifically so that there is a larger subset
> of Python that will run in both 2.x and 3.x .  Smells like an attempt at
> gradual migration to me.

Since you're trying to channel me, and I'm right here listening to you
(and annoyed that you are wasting my time), I need to clarify. What I
*don't* want to happen is that Python 2.6, 2.7, and so on keep changing
the language from under users' feet, requiring constant code changes
to keep up, so that by the time the 2.9 -> 3.0 transition comes it
will feel pretty much the same as 2.4 -> 2.5. That would be bad
because it would mean that for every transition users would have to
make a lot of changes. (Pretty much the only changes like that planned
are increasing deprecation warnings for string exceptions, and making
'with' and 'as' unconditional keywords in 2.6.) 3.0 (or 3.2) will feel
like a big change and will require a combination of automatic and
manual explicit conversion, sometimes guided by warnings produced by
Python 2.x in "future-proof-lint" mode (see (a) below).

What I *do* want to do is:

(a) Add an option to Python 2.6 or 2.7 that starts spewing out
warnings about certain things that will change semantics in 3.0 and
are hard to detect by source code inspection alone, just like the
current -Q option. This could detect uses of range(), zip() or
dict.keys() result values incompatible with the iterators or views
that these will return in 3.0. But there will be no pressure to change
such code before the 3.0 transition, and those warnings will be off by
default.

(b) Provide access to the new syntax, without dropping the old syntax,
whenever it can be done without introducing new keywords, or through
__future__ syntax.

But these approaches alone cannot cover all cases. While we can
probably backport the new I/O library, there won't be a way to test it
in a world where str and unicode are the same (unless your app runs on
Jython or IronPython). The str/unicode unification and the int/long
unification, taking just two examples, just can't be backported to
Python 2.x, since they require pervasive and deep changes to the
implementation everywhere.

Another change that is unlikely to be available in 2.x is the
rationalization of comparisons. In 3.0, "1 < 'abc'" will raise a
TypeError; there's just no way to backport this behavior, since again
it requires pervasive changes to the implementation.

I know that you are dreaming of a world where all transitions are
easy. But it's just a dream. 3.0 will require hard work and for many
large apps it will take years to migrate -- the best approach is
probably to make it coincide with a planned major rewrite of the app.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From martin at v.loewis.de  Thu Aug 24 16:33:12 2006
From: martin at v.loewis.de (martin at v.loewis.de)
Date: Thu, 24 Aug 2006 16:33:12 +0200
Subject: [Python-3000] find -> index patch
In-Reply-To: <20060824003948.GT5772@performancedrivers.com>
References: <ca471dc20608231418v21e93634o44139017227d1a2b@mail.gmail.com>
	<20060824003948.GT5772@performancedrivers.com>
Message-ID: <1156429992.44edb8a8e5794@www.domainfactory-webmail.de>

Zitat von Jack Diederich <jack at psynchronous.com>:


> +        if (sep_found):

This should be

           if sep_found:

> If folks like the way this partial set looks I'll convert the rest.

Otherwise, it looks fine.

Martin





From thomas at python.org  Thu Aug 24 16:55:51 2006
From: thomas at python.org (Thomas Wouters)
Date: Thu, 24 Aug 2006 16:55:51 +0200
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <ca471dc20608240727h896db39j79a636e5e5c81dff@mail.gmail.com>
References: <20060823185143.1A73.JCARLSON@uci.edu>
	<1156386257.44ed0dd1cf737@www.domainfactory-webmail.de>
	<20060823203502.1A79.JCARLSON@uci.edu>
	<ca471dc20608240727h896db39j79a636e5e5c81dff@mail.gmail.com>
Message-ID: <9e804ac0608240755x1d5b4406r902d3154157f9fd9@mail.gmail.com>

On 8/24/06, Guido van Rossum <guido at python.org> wrote:

> I know that you are dreaming of a world where all transitions are
> easy. But it's just a dream. 3.0 will require hard work and for many
> large apps it will take years to migrate -- the best approach is
> probably to make it coincide with a planned major rewrite of the app.


I agree with everything you said, except this. Yes, Python 2.x -> 3.x will
always be a large step, no matter which 'x' you take. That shouldn't (and
doesn't, so far) mean you can't write code that works fine in both 2.x and
3.x, and transitioning applications from 2.x-only code to 2.x-and-3.x code
could then be done incrementally. It would probably need support from future
2.x releases in order to make that possible, but it shouldn't affect 3.x. It
will still be a rather big effort from applications, but not any bigger than
porting to 3.x in the first place.

-- 
Thomas Wouters <thomas at python.org>

Hi! I'm a .signature virus! copy me into your .signature file to help me
spread!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060824/00c0f085/attachment.htm 

From jcarlson at uci.edu  Thu Aug 24 18:01:27 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Thu, 24 Aug 2006 09:01:27 -0700
Subject: [Python-3000] [Python-Dev] What should the focus for 2.6 be?
In-Reply-To: <ca471dc20608240727h896db39j79a636e5e5c81dff@mail.gmail.com>
References: <20060823203502.1A79.JCARLSON@uci.edu>
	<ca471dc20608240727h896db39j79a636e5e5c81dff@mail.gmail.com>
Message-ID: <20060824084759.1A82.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> What I *do* want to do is:
> 
> (a) Add an option to Python 2.6 or 2.7 that starts spewing out
> warnings about certain things that will change semantics in 3.0 and
> are hard to detect by source code inspection alone, just like the
> current -Q option. This could detect uses of range(), zip() or
> dict.keys() result values incompatible with the iterators or views
> that these will return in 3.0. But there will be no pressure to change
> such code before the 3.0 transition, and those warnings will be off by
> default.
> 
> (b) Provide access to the new syntax, without dropping the old syntax,
> whenever it can be done without introducing new keywords, or through
> __future__ syntax.

Both of these things are also what I want.


> But these approaches alone cannot cover all cases. While we can
> probably backport the new I/O library, there won't be a way to test it
> in a world where str and unicode are the same (unless your app runs on
> Jython or IronPython). The str/unicode unification and the int/long
> unification, taking just two examples, just can't be backported to
> Python 2.x, since they require pervasive and deep changes to the
> implementation everywhere.
> 
> Another change that is unlikely to be available in 2.x is the
> rationalization of comparisons. In 3.0, "1 < 'abc'" will raise a
> TypeError; there's just no way to backport this behavior, since again
> it requires pervasive changes to the implementation.
> 
> I know that you are dreaming of a world where all transitions are
> easy. But it's just a dream. 3.0 will require hard work and for many
> large apps it will take years to migrate -- the best approach is
> probably to make it coincide with a planned major rewrite of the app.
> 

Easy change would be nice, but working towards everyone having an easy
transition would take quite a bit of time and effort, more time and
effort than I think *anyone* is really willing to put forward.

What I want is for the transition not to be hard.  Backporting new
modules is one way of doing this; offering an import hook to gain access
to a new standard library organization (wxPython uses a method of
renaming objects that has worked quite well in their wx namespace
transition, which might be usable here), deprecation warnings,
__future__, etc., are others.  All of these are mechanisms I see as
steps towards making the 2.x -> 3.x transition not quite so hard.

Ultimately the features/syntax/semantics that cannot be backported will
make the last transition hill a bit tougher to climb than the previous
2.x->2.x+1 ones, but people should have had ample warning for the most
part, and I hope they won't have terrible difficulties with the final set
of changes necessary to go from 2.x to 3.x.

 - Josiah


From jimjjewett at gmail.com  Thu Aug 24 18:37:35 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Thu, 24 Aug 2006 12:37:35 -0400
Subject: [Python-3000] sort vs order (was: What should the focus for 2.6 be?)
Message-ID: <fb6fbf560608240937p4532197cycf65d99862b13a75@mail.gmail.com>

On 8/24/06, Guido van Rossum <guido at python.org> wrote:
> Another change that is unlikely to be available in 2.x is the
> rationalization of comparisons. In 3.0, "1 < 'abc'" will raise a
> TypeError; there's just no way to backport this behavior, since again
> it requires pervasive changes to the implementation.

I still believe that this breaks an important current use case for
sorting, but maybe the right answer is a different (but similar) API.

Given an arbitrary collection of objects, I want to be able to order
them in a consistent manner, at least within a single interpreter
session.  (Consistency across sessions/machines/persistence/etc would
be even better, but isn't essential.)

The current sort method works pretty well; the new one wouldn't.  It
would be enough (and arguably an improvement, because of broken
objects) if there were a consistent_order equivalent that just caught
the TypeError and then tried a fallback for you until it found an
answer.

-jJ

From guido at python.org  Thu Aug 24 18:44:48 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 24 Aug 2006 09:44:48 -0700
Subject: [Python-3000] sort vs order (was: What should the focus for 2.6
	be?)
In-Reply-To: <fb6fbf560608240937p4532197cycf65d99862b13a75@mail.gmail.com>
References: <fb6fbf560608240937p4532197cycf65d99862b13a75@mail.gmail.com>
Message-ID: <ca471dc20608240944u3f171882r641d76c1682b618@mail.gmail.com>

For doctests etc., it's easy to create a consistent order:

  sorted(X, key=lambda x: (str(type(x)), x))

This sorts by the name of the type first, then by value within each
type. This is assuming the type itself is sortable -- in 3.0, many
types won't be sortable, e.g. dicts. (Even in 2.x, sets implement < so
differently that a list of sets is likely to cause problems when
sorting.)
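Running the key function above on a small mixed list (in a modern Python) shows the grouping; note that ints and floats land in separate groups because their type names differ:

```python
# Guido's consistent-order key: sort by type name first, value second.
X = [3, 'b', 1, 'a', 2.5]
result = sorted(X, key=lambda x: (str(type(x)), x))
print(result)  # [2.5, 1, 3, 'a', 'b'] -- 'float' < 'int' < 'str'
```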

--Guido

On 8/24/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 8/24/06, Guido van Rossum <guido at python.org> wrote:
> > Another change that is unlikely to be available in 2.x is the
> > rationalization of comparisons. In 3.0, "1 < 'abc'" will raise a
> > TypeError; there's just no way to backport this behavior, since again
> > it requires pervasive changes to the implementation.
>
> I still believe that this breaks an important current use case for
> sorting, but maybe the right answer is a different (but similar) API.
>
> Given an arbitrary collection of objects, I want to be able to order
> them in a consistent manner, at least within a single interpreter
> session.  (Consistency across sessions/machines/persistence/etc would
> be even better, but isn't essential.)
>
> The current sort method works pretty well; the new one wouldn't.  It
> would be enough (and arguably an improvement, because of broken
> objects) if there were a consistent_order equivalent that just caught
> the TypeError and then tried a fallback for you until it found an
> answer.
>
> -jJ
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From david.nospam.hopwood at blueyonder.co.uk  Thu Aug 24 22:41:43 2006
From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood)
Date: Thu, 24 Aug 2006 21:41:43 +0100
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44ED85E5.1000005@livinglogic.de>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
	<44ED85E5.1000005@livinglogic.de>
Message-ID: <44EE0F07.5030005@blueyonder.co.uk>

Walter D?rwald wrote:
[...]
> Using find(), the code looks like this:
> 
> def splitfind(s):
>     pos = 0
>     while True:
>         posstart = s.find("{", pos)
>         if posstart < 0:
>             break
>         posarg = s.find(" ", posstart)
>         if posarg < 0:
>             break
>         posend = s.find("}", posarg)
>         if posend < 0:
>             break
>         prefix = s[pos:posstart]
>         if prefix:
>             yield (None, prefix)
>         yield (s[posstart+1:posarg], s[posarg+1:posend])
>         pos = posend+1
>     rest = s[pos:]
>     if rest:
>         yield (None, rest)
> 
> Using index() looks worse to me. The code is buried under the exception
> handling:
> 
> def splitindex(s):
>     pos = 0
>     while True:
>         try:
>             posstart = s.index("{", pos)
>         except ValueError:
>             break
>         try:
>             posarg = s.index(" ", posstart)
>         except ValueError:
>             break
>         try:
>             posend = s.index("}", posarg)
>         except ValueError:
>             break

          try:
              posstart = s.index("{", pos)
              posarg = s.index(" ", posstart)
              posend = s.index("}", posarg)
          except ValueError:
              break

is shorter and clearer than the version using 'find'.
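Folding the loop together this way, the complete index()-based generator becomes (a sketch, using index() throughout):

```python
def splitindex(s):
    # Walter's generator with one try block per iteration: any missing
    # delimiter ends the scan of "{arg value}" patterns.
    pos = 0
    while True:
        try:
            posstart = s.index("{", pos)
            posarg = s.index(" ", posstart)
            posend = s.index("}", posarg)
        except ValueError:
            break
        prefix = s[pos:posstart]
        if prefix:
            yield (None, prefix)
        yield (s[posstart+1:posarg], s[posarg+1:posend])
        pos = posend + 1
    rest = s[pos:]
    if rest:
        yield (None, rest)

print(list(splitindex('foo{spam eggs}bar{foo bar}')))
# [(None, 'foo'), ('spam', 'eggs'), (None, 'bar'), ('foo', 'bar')]
```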

-- 
David Hopwood <david.nospam.hopwood at blueyonder.co.uk>




From mcherm at mcherm.com  Thu Aug 24 23:45:24 2006
From: mcherm at mcherm.com (Michael Chermside)
Date: Thu, 24 Aug 2006 14:45:24 -0700
Subject: [Python-3000] sort vs order (was: What should the focus for	2.6
	be?)
Message-ID: <20060824144524.cz3o2mv4iv40w40k@login.werra.lunarpages.com>

Jim Jewett writes:
> Given an arbitrary collection of objects, I want to be able to order
> them in a consistent manner, at least within a single interpreter
> session.

I think this meets your specifications:

>>> myList = [2.5, 17, object(), 3+4j, 'abc']
>>> myList.sort(key=id)

I prefer Guido's suggestion (key=lambda x: (str(type(x)), x)), but it
doesn't handle types that are not comparable (like the complex
number I included to be perverse). Frankly, I don't know why
you have an "arbitrary collection of objects" -- the only things
I have ever dealt with that handled truly _arbitrary_ collections
of objects were garbage collectors and generic caching mechanisms.
In either case you really *wouldn't* care how things sorted so
long as it was consistent, and then sorting by id works nicely.

Of course, I doubt this is what you're doing because if you
REALLY had arbitrary objects (including uncomparable things like
complex numbers) then you would already need to be doing this
today and your code wouldn't even need to be modified when you
upgraded to 3.0.

-- Michael Chermside


From greg.ewing at canterbury.ac.nz  Fri Aug 25 02:24:06 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 25 Aug 2006 12:24:06 +1200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <eck0b8$mno$1@sea.gmane.org>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
	<20060823191222.1A76.JCARLSON@uci.edu>
	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
	<eck0b8$mno$1@sea.gmane.org>
Message-ID: <44EE4326.1070604@canterbury.ac.nz>

Fredrik Lundh wrote:

> (on the other hand, "s[:len(t)] == t" is usually faster than "s.startswith(t)" for short
> prefixes,

That's surprising. Any idea why this might be?
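One way to check on a given build is timeit; absolute numbers are machine- and version-dependent, so this is only a measurement sketch:

```python
import timeit

# Compare slicing+comparison against the startswith() method call for a
# short prefix; the gap (if any) depends on argument-handling overhead.
setup = "s = 'hello world'; t = 'he'"
slice_time = timeit.timeit("s[:len(t)] == t", setup=setup, number=100000)
method_time = timeit.timeit("s.startswith(t)", setup=setup, number=100000)
print("slice+compare: %.4fs  startswith: %.4fs" % (slice_time, method_time))
```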

--
Greg

From thomas at python.org  Fri Aug 25 02:46:08 2006
From: thomas at python.org (Thomas Wouters)
Date: Thu, 24 Aug 2006 20:46:08 -0400
Subject: [Python-3000] Removing 'old-style' ('simple') slices from Py3K.
Message-ID: <9e804ac0608241746n7de7c161yd40f6bb4c3061ab6@mail.gmail.com>

I spent my time at the Google sprint working on removing simple slices from
Py3k, in the p3yk-noslice branch. The work is pretty much done, except for
some minor details and finishing touches. There are a few items that should
probably be discussed, though.

The state of the tree:
 - The SLICE, STORE_SLICE and DELETE_SLICE opcodes (all 4 versions of each)
are eradicated. This even freed up a local (register) variable in
PyEval_EvalFrameEx(), and probably resulted in a speedup of the bytecode
loop. I didn't measure it, though.
 - Various types that didn't support extended slicing had such support
added:
    - UserList, UserString, MutableUserString
    - structseq (what os.stat and time.localtime and such return)
    - sre_parse.SubPattern (well, more or less)
    - buffer
    - bytes
    - mmap.mmap
 - Various types that supported extended slicing now specialcase simple
slicing, for extra speed (list, string, unicode, array, tuple)
 - the ctypes 'Array' and 'Pointer' types support slicing with
slice-objects, but only with step = 1
 - The __getslice__, __setslice__ and __delslice__ slots aren't created
anymore, for C types.
 - The PySequence_GetSlice, PySequence_SetSlice and PySequence_DelSlice
functions no longer try to access the sq_slice and sq_ass_slice PySequenceMethods
members. They did already fall back to the mp_subscript and mp_ass_subscript
PyMappingMethods members.
 - All tests pass, with only the expected changes to any tests.
 - The PySequenceMethods struct's 'sq_slice' and 'sq_ass_slice' members are
unused and have been renamed
 - PyMapping_Check() now returns true for any type with a
PyMappingMethods.mp_subscript filled, not just those without a
PySequence.sq_slice. One test had to be adjusted for that -- execfile("",
{}, ()) now raises a different error, so it now tests execfile("", {}, 42)
 - There's no way to figure out the size of a Py_ssize_t from Python code,
now. test_support was using a simple-slice to figure it out. I'm not sure if
there's really a reason to do it -- I don't quite understand the use of it.
 - It's still lacking tests for the extended-slicing abilities of buffer,
mmap.mmap, structseq, UserList and UserString.

I think the extended-slicing support as well as the simpleslice
specialcasing should be ported to 2.6. Are there any objections to that? It
means, in some cases, a bit of code duplication, but it would make 's[::]'
almost as fast as 's[:]' for those types.

I also think it may be worthwhile to switch to always using slice objects in
Python 2.6 or 2.7. It would mean we can remove the 12 bytecodes for slicing,
plus the associated code in the main bytecode loop. We can still call
sq_slice/sq_ass_slice if step is None. The main issue is that it might be a
net slowdown for slicing (but speedup for all other operations), and that it
is no longer possible to see the difference between obj[:] and obj[::]. I
personally think code that treats those two (significantly) differently is
insane.
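Under the unified behaviour described above (and in today's Python 3), the two spellings are already indistinguishable to a type that implements only __getitem__:

```python
class ShowSlice:
    # With only __getitem__ (no __getslice__), every slicing expression
    # arrives as a slice object rather than as separate indices.
    def __getitem__(self, item):
        return item

s = ShowSlice()
print(s[:])   # slice(None, None, None)
print(s[::])  # slice(None, None, None) -- identical to s[:]
```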

Now that all those types have mp_subscript defined, we could remove sq_item
and sq_ass_item as well. I'm not entirely sure I see all the implications of
that, though. The C code does quite a lot of indexing of tuples and lists,
and those are indexed using Py_ssize_t's directly. Going through a PyObject
for that may be too cumbersome.

-- 
Thomas Wouters <thomas at python.org>

Hi! I'm a .signature virus! copy me into your .signature file to help me
spread!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060824/c60c127a/attachment.htm 

From tim.peters at gmail.com  Fri Aug 25 03:01:20 2006
From: tim.peters at gmail.com (Tim Peters)
Date: Thu, 24 Aug 2006 21:01:20 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44EE4326.1070604@canterbury.ac.nz>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
	<20060823191222.1A76.JCARLSON@uci.edu>
	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
	<eck0b8$mno$1@sea.gmane.org> <44EE4326.1070604@canterbury.ac.nz>
Message-ID: <1f7befae0608241801y3b285a12wc27cda5d25949fe0@mail.gmail.com>

[Fredrik Lundh]
>> (on the other hand, "s[:len(t)] == t" is usually faster than
"s.startswith(t)" for short
>> prefixes,

[Greg Ewing]
> That's surprising. Any idea why this might be?

Perhaps it has to do with the rest of his message ;-):

>> (which reminds me that speeding up handling of optional arguments
>> to C functions would be an even better use of this energy)

From greg.ewing at canterbury.ac.nz  Fri Aug 25 03:15:01 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Fri, 25 Aug 2006 13:15:01 +1200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <1f7befae0608241801y3b285a12wc27cda5d25949fe0@mail.gmail.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
	<20060823191222.1A76.JCARLSON@uci.edu>
	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
	<eck0b8$mno$1@sea.gmane.org> <44EE4326.1070604@canterbury.ac.nz>
	<1f7befae0608241801y3b285a12wc27cda5d25949fe0@mail.gmail.com>
Message-ID: <44EE4F15.2070301@canterbury.ac.nz>

Tim Peters wrote:

> Perhaps it has to do with the rest of his message ;-):
> 
>>>(which reminds me that speeding up handling of optional arguments
>>>to C functions would be an even better use of this energy)

Until a few moments ago, I didn't know that str.startswith()
had any optional arguments, so I missed the significance of
that.

In any case, I still find it surprising that this would
make enough difference to outweigh a Python-level indexing
and comparison...

--
Greg

From martin at v.loewis.de  Fri Aug 25 03:49:55 2006
From: martin at v.loewis.de (martin at v.loewis.de)
Date: Fri, 25 Aug 2006 03:49:55 +0200
Subject: [Python-3000] long/int unification
Message-ID: <1156470595.44ee57436b03d@www.domainfactory-webmail.de>

Here is a quick status of the int_unification branch,
summarizing what I did at the Google sprint in NYC.

- the int type has been dropped; the builtins int and long
  now both refer to the long type
- all PyInt_* API is forwarded to the PyLong_* API. Little
  changes to the C code are necessary; the most common offender
  is PyInt_AS_LONG((PyIntObject*)v) since I completely removed
  PyIntObject.
- Much of the test suite passes, although it still has a number
  of bugs.
- There are timing tests for allocation and for addition.
  On allocation, the current implementation is about a factor
  of 2 slower; the integer addition is about 1.5 times slower;
  the initial slowdowns was by a factor of 3. The pystones
  dropped about 10% (pybench fails to run on p3yk).

A couple of interesting observations:
- bool was a subtype of int, and is now a subtype of long. In
  order to avoid knowing the internal representation of long,
  the bool type compares addresses against Py_True and Py_False,
  instead of looking at ob_ival.
- to add the small ints cache, an array of statically allocated
  longs is used, rather than heap-allocating them.
- after adding the small ints cache, a lot of things broke, e.g.
  for code like
  py> x = 4
  py> x = -4
  py> x
  -4
  py> 4
  -4
  This happened because long methods just toggle the sign
  of the object they got, messing up the small ints cache.
- to further speedup the implementation, I added special
  casing for one-digit numbers. As they are always in
  range(-32767,32768), the arithmetic operations don't
  need overflow checking anymore (even multiplication
  won't overflow 32-bit int).
- I found that in 2.x, long objects overallocate 2 byte
  on a 32-bit machine, and 6 bytes on a 64-bit machine,
  because sizeof(PyLongObject) rounds up.
- pickle and marshal have been changed to deal with
  the loss of int; pickle generates INT codes even
  for longs now provided the value is in the range
  for the code.
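The cache-corruption bug in the third bullet (long methods toggling the sign of a shared object) can be modelled with a small sketch; all names here are illustrative, not from the branch:

```python
# Toy model of an interning cache plus an operation that wrongly
# mutates its argument in place.
_cache = {}

class Num:
    def __init__(self, value):
        self.value = value

def make(value):
    # Small values share a single cached object, like CPython's small ints.
    if -5 <= value <= 256:
        if value not in _cache:
            _cache[value] = Num(value)
        return _cache[value]
    return Num(value)

def buggy_neg(n):
    n.value = -n.value  # in-place sign flip: wrong for a shared object
    return n

four = make(4)
buggy_neg(make(4))
print(four.value)  # -4: every holder of the cached "4" now sees -4
```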

I'm not sure whether this performance change is
acceptable; at this point, I'm running out of ideas
how to further improve the performance. Using a plain
32-bit int as the representation could be another
try, but I somewhat doubt it helps given that
the supposedly-simpler single-digit case is so
slow.

Regards,
Martin






From fredrik at pythonware.com  Fri Aug 25 07:50:58 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Fri, 25 Aug 2006 07:50:58 +0200
Subject: [Python-3000] long/int unification
In-Reply-To: <1156470595.44ee57436b03d@www.domainfactory-webmail.de>
References: <1156470595.44ee57436b03d@www.domainfactory-webmail.de>
Message-ID: <ecm342$ab0$1@sea.gmane.org>

martin at v.loewis.de wrote:

> I'm not sure whether this performance change is
> acceptable; at this point, I'm running out of ideas
> how to further improve the performance.

without really digging into the patch, is it perhaps time to switch to 
unboxed integers for the CPython interpreter ?

(support for implementation subtypes could also be nice; I agree that
it would be nice if we had only one visible integer type, but I don't 
really see why the implementation has to be restricted to one type only. 
this applies to strings too, of course).

</F>


From fredrik at pythonware.com  Fri Aug 25 07:54:23 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Fri, 25 Aug 2006 07:54:23 +0200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <1f7befae0608241801y3b285a12wc27cda5d25949fe0@mail.gmail.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>	<eck0b8$mno$1@sea.gmane.org>
	<44EE4326.1070604@canterbury.ac.nz>
	<1f7befae0608241801y3b285a12wc27cda5d25949fe0@mail.gmail.com>
Message-ID: <ecm3af$aph$1@sea.gmane.org>

Tim Peters wrote:

> [Greg Ewing]
>> That's surprising. Any idea why this might be?
> 
> Perhaps it has to do with the rest of his message ;-):
> 
>>> (which reminds me that speeding up handling of optional arguments
>>> to C functions would be an even better use of this energy)

in my experience, the object allocator tends to be surprisingly fast, 
and the calling mechanism tends to be surprisingly slow.  and this is 
true even if you take this into account.

</F>


From jcarlson at uci.edu  Fri Aug 25 08:39:22 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Thu, 24 Aug 2006 23:39:22 -0700
Subject: [Python-3000] long/int unification
In-Reply-To: <ecm342$ab0$1@sea.gmane.org>
References: <1156470595.44ee57436b03d@www.domainfactory-webmail.de>
	<ecm342$ab0$1@sea.gmane.org>
Message-ID: <20060824232848.1A9F.JCARLSON@uci.edu>


Fredrik Lundh <fredrik at pythonware.com> wrote:
> 
> martin at v.loewis.de wrote:
> 
> > I'm not sure whether this performance change is
> > acceptable; at this point, I'm running out of ideas
> > how to further improve the performance.
> 
> without really digging into the patch, is it perhaps time to switch to 
> unboxed integers for the CPython interpreter ?
> 
> (support for implementation subtypes could also be nice; I agree that
> it would be nice if we had only one visible integer type, but I don't 
> really see why the implementation has to be restricted to one type only. 
> this applies to strings too, of course).

In the integer case, it reminds me of James Knight's tagged integer
patch to 2.3 [1].  If using long exclusively is 50% slower, why not try
the improved speed approach?  Also, depending on the objects, one may
consider a few other tagged objects, like perhaps None, True, and False
(they could all be special values with a single tag), or even just use
31/63 bits for the tagged integer value, with a 1 in the lowest bit
signifying it as a tagged integer.
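The low-bit tagging idea can be sketched like this (illustrative only: a
real implementation does this in C on machine words, relying on heap
pointers being aligned and therefore having a zero low bit; Python ints
are unbounded, so the 31/63-bit range limit is not modelled):

```python
# Tagged integers: store the value shifted left one bit with the low
# bit set; real object pointers are even, so the low bit disambiguates.
def tag_int(v):
    return (v << 1) | 1

def is_tagged(word):
    return word & 1 == 1

def untag_int(word):
    return word >> 1   # arithmetic shift restores the signed value

assert untag_int(tag_int(42)) == 42
assert untag_int(tag_int(-7)) == -7
```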


 - Josiah

[1] http://mail.python.org/pipermail/python-dev/2004-July/046139.html



From fredrik at pythonware.com  Fri Aug 25 11:15:37 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Fri, 25 Aug 2006 11:15:37 +0200
Subject: [Python-3000] long/int unification
References: <1156470595.44ee57436b03d@www.domainfactory-webmail.de><ecm342$ab0$1@sea.gmane.org>
	<20060824232848.1A9F.JCARLSON@uci.edu>
Message-ID: <ecmf3p$d1i$1@sea.gmane.org>

Josiah Carlson wrote:

> In the integer case, it reminds me of James Knight's tagged integer
> patch to 2.3 [1].  If using long exclusively is 50% slower, why not try
> the improved speed approach?

looks like GvR was -1000 on this idea at the time, though...

> Also, depending on the objects, one may consider a few other tagged
> objects, like perhaps None, True, and False (they could all be special
> values with a single tag), or even just use 31/63 bits for the tagged
> integer value, with a 1 in the lowest bit signifying it as a tagged integer.

iirc, my pytte1 experiment used tagged objects for integers and single-
character strings, which resulted in considerable speedups for the (small
set of) benchmarks I used.

(on the other hand, the dominating speedups in pytte1 were "true" GC,
and call-site caching combined with streamlined method lookup.  if we
really want to speed things up, we should probably start with call-site
caching and (explicit?) method inlining).

</F> 




From ncoghlan at gmail.com  Fri Aug 25 11:50:03 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 25 Aug 2006 19:50:03 +1000
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <eck69v$b6n$1@sea.gmane.org>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com><44ED85E5.1000005@livinglogic.de>	<44ED9206.1080306@gmail.com>
	<eck69v$b6n$1@sea.gmane.org>
Message-ID: <44EEC7CB.2090908@gmail.com>

Fredrik Lundh wrote:
> Nick Coghlan wrote:
> 
>> With a variety of "view types", that work like the corresponding builtin type,
>> but reference the original data structure instead of creating copies
> 
> support for string views would require some serious interpreter surgery, though,
> and probably break quite a few extensions...

Why do you say that? I'm thinking about a type written in Python, intended to 
be used exactly the way I did in my strawman example - you accept a normal 
string, make a view of it, do your manipulations, then make sure that anything 
you return or yield is a normal string so other code doesn't get any nasty 
surprises.

It would be strictly an optimisation technique to allow the normal string 
operations to be used without the performance penalties associated with 
slicing large strings. Otherwise you have to choose between "readable" and 
"scalable" which is an annoying choice to be forced to make.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From fredrik at pythonware.com  Fri Aug 25 12:06:43 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Fri, 25 Aug 2006 12:06:43 +0200
Subject: [Python-3000] Droping find/rfind?
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com><44ED85E5.1000005@livinglogic.de>	<44ED9206.1080306@gmail.com><eck69v$b6n$1@sea.gmane.org>
	<44EEC7CB.2090908@gmail.com>
Message-ID: <ecmi3j$m4m$1@sea.gmane.org>

Nick Coghlan wrote:

>> Nick Coghlan wrote:
>>
>>> With a variety of "view types", that work like the corresponding builtin type,
>>> but reference the original data structure instead of creating copies
>>
>> support for string views would require some serious interpreter surgery, though,
>> and probably break quite a few extensions...
>
> Why do you say that?

because I happen to know a lot about how Python's string types are
implemented ?

> make a view of it

so to make a view of a string, you make a view of it ?

</F> 




From ncoghlan at gmail.com  Fri Aug 25 12:20:02 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 25 Aug 2006 20:20:02 +1000
Subject: [Python-3000] Removing 'old-style' ('simple') slices from Py3K.
In-Reply-To: <9e804ac0608241746n7de7c161yd40f6bb4c3061ab6@mail.gmail.com>
References: <9e804ac0608241746n7de7c161yd40f6bb4c3061ab6@mail.gmail.com>
Message-ID: <44EECED2.2020206@gmail.com>

Thomas Wouters wrote:
>  - There's no way to figure out the size of a Py_ssize_t from Python 
> code, now. test_support was using a simple-slice to figure it out. I'm 
> not sure if there's really a reason to do it -- I don't quite understand 
> the use of it.

This isn't quite true, but I will admit that the only way I know how to do it 
is somewhat on the arcane side ;)

    try:
        double_width = 2*(sys.maxint + 1)**2 - 1
        slice(None).indices(double_width)
        pyssize_t_max = double_width   # ssize_t twice as wide as long
    except OverflowError:
        pyssize_t_max = sys.maxint     # ssize_t same width as long

It might make more sense to just include a "sys.maxindex" to parallel 
sys.maxint (even though both are technically misnomers, leaving out the 
'native' bit).

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Fri Aug 25 14:33:46 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 25 Aug 2006 22:33:46 +1000
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ecmi3j$m4m$1@sea.gmane.org>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com><44ED85E5.1000005@livinglogic.de>	<44ED9206.1080306@gmail.com><eck69v$b6n$1@sea.gmane.org>	<44EEC7CB.2090908@gmail.com>
	<ecmi3j$m4m$1@sea.gmane.org>
Message-ID: <44EEEE2A.9080509@gmail.com>

Fredrik Lundh wrote:
> Nick Coghlan wrote:
> 
>>> Nick Coghlan wrote:
>>>
>>>> With a variety of "view types", that work like the corresponding builtin type,
>>>> but reference the original data structure instead of creating copies
>>> support for string views would require some serious interpreter surgery, though,
>>> and probably break quite a few extensions...
>> Why do you say that?
> 
> because I happen to know a lot about how Python's string types are
> implemented ?

I believe you're thinking about something far more sophisticated than what I'm 
suggesting. I'm just talking about a Python data type in a standard library 
module that trades off slower performance with smaller strings (due to extra 
method call overhead) against improved scalability (due to avoidance of 
copying strings around).

>> make a view of it
> 
> so to make a view of a string, you make a view of it ?

Yep - by using all those "start" and "stop" optional arguments to builtin 
string methods to implement the methods of a string view in pure Python. By 
creating the string view all you would really be doing is a partial 
application of start and stop arguments on all of the relevant string methods.

I've included an example below that just supports __len__, __str__ and 
partition(). The source object survives for as long as the view does - the 
idea is that the view should only last while you manipulate the string, with 
only real strings released outside the function via return statements or yield 
expressions.

All that said, I think David Hopwood nailed the simplest answer to Walter's 
particular use case with:

def splitindex(s):
    pos = 0
    while True:
        try:
            posstart = s.index("{", pos)
            posarg = s.index(" ", posstart)
            posend = s.index("}", posarg)
        except ValueError:
            break
        prefix = s[pos:posstart]
        if prefix:
            yield (None, prefix)
        yield (s[posstart+1:posarg], s[posarg+1:posend])
        pos = posend + 1
    rest = s[pos:]
    if rest:
        yield (None, rest)

>>> list(splitindex('foo{spam eggs}bar{foo bar}'))
[(None, 'foo'), ('spam', 'eggs'), (None, 'bar'), ('foo', 'bar')]

Cheers,
Nick.

# Simple string view example
class strview(object):
    def __new__(cls, source, start=None, stop=None):
        self = object.__new__(cls)
        self.source = "%s" % source
        self.start = start if start is not None else 0
        self.stop = stop if stop is not None else len(source)
        return self
    def __str__(self):
        return self.source[self.start:self.stop]
    def __len__(self):
        return self.stop - self.start
    def partition(self, sep):
        _src = self.source
        try:
            startsep = _src.index(sep, self.start, self.stop)
        except ValueError:
            # Separator wasn't found!
            return self, _NULL_STR, _NULL_STR
        # Return new views of the three string parts
        endsep = startsep + len(sep)
        return (strview(_src, self.start, startsep),
                strview(_src, startsep, endsep),
                strview(_src, endsep, self.stop))

_NULL_STR = strview('')

def splitview(s):
    rest = strview(s)
    while 1:
        prefix, found, rest = rest.partition("{")
        if prefix:
            yield (None, str(prefix))
        if not found:
            break
        first, found, rest = rest.partition(" ")
        if not found:
            break
        second, found, rest = rest.partition("}")
        if not found:
            break
        yield (str(first), str(second))

>>> list(splitview('foo{spam eggs}bar{foo bar}'))
[(None, 'foo'), ('spam', 'eggs'), (None, 'bar'), ('foo', 'bar')]


-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From fredrik at pythonware.com  Fri Aug 25 15:06:13 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Fri, 25 Aug 2006 15:06:13 +0200
Subject: [Python-3000] Droping find/rfind?
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com><44ED85E5.1000005@livinglogic.de>	<44ED9206.1080306@gmail.com><eck69v$b6n$1@sea.gmane.org>	<44EEC7CB.2090908@gmail.com><ecmi3j$m4m$1@sea.gmane.org>
	<44EEEE2A.9080509@gmail.com>
Message-ID: <ecmsk5$orm$1@sea.gmane.org>

Nick Coghlan wrote:

> I believe you're thinking about something far more sophisticated than what I'm
> suggesting. I'm just talking about a Python data type in a standard library
> module that trades off slower performance with smaller strings (due to extra
> method call overhead) against improved scalability (due to avoidance of
> copying strings around).

have you done any benchmarking on this ?

</F> 




From exarkun at divmod.com  Fri Aug 25 15:14:51 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Fri, 25 Aug 2006 09:14:51 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ecmsk5$orm$1@sea.gmane.org>
Message-ID: <20060825131452.1717.999901437.divmod.quotient.30940@ohm>

On Fri, 25 Aug 2006 15:06:13 +0200, Fredrik Lundh <fredrik at pythonware.com> wrote:
>Nick Coghlan wrote:
>
>> I believe you're thinking about something far more sophisticated than what I'm
>> suggesting. I'm just talking about a Python data type in a standard library
>> module that trades off slower performance with smaller strings (due to extra
>> method call overhead) against improved scalability (due to avoidance of
>> copying strings around).
>
>have you done any benchmarking on this ?
>

I've benchmarked string copying via slicing against views implemented using
buffer().  For certain use patterns, views are absolutely significantly
faster.
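For readers without 2.x at hand, memoryview is the nearest modern analogue
of buffer(), and it shows the same no-copy slicing these benchmarks
exercise (a sketch of the mechanism, not the benchmark itself):

```python
# A memoryview slice is O(1) and shares the underlying bytes; a plain
# slice of the bytes object copies them.
data = b"0123456789" * 100000         # ~1 MB
view = memoryview(data)[100:1000]     # no byte copying happens here
assert view.obj is data               # still backed by the original
assert bytes(view) == data[100:1000]  # identical contents on demand
```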

Jean-Paul

From fredrik at pythonware.com  Fri Aug 25 15:31:49 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Fri, 25 Aug 2006 15:31:49 +0200
Subject: [Python-3000] Droping find/rfind?
References: <ecmsk5$orm$1@sea.gmane.org>
	<20060825131452.1717.999901437.divmod.quotient.30940@ohm>
Message-ID: <ecmu46$u4e$1@sea.gmane.org>

Jean-Paul Calderone wrote:

>>> I believe you're thinking about something far more sophisticated than what I'm
>>> suggesting. I'm just talking about a Python data type in a standard library
>>> module that trades off slower performance with smaller strings (due to extra
>>> method call overhead) against improved scalability (due to avoidance of
>>> copying strings around).
>>
>>have you done any benchmarking on this ?
>
> I've benchmarked string copying via slicing against views implemented using
> buffer().  For certain use patterns, views are absolutely significantly
> faster.

of course, but buffers don't support many string methods, so I'm not sure how
that's applicable to this case.

(and before anyone says "let's fix that, then", please read earlier messages).

</F> 




From jimjjewett at gmail.com  Fri Aug 25 16:22:36 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Fri, 25 Aug 2006 10:22:36 -0400
Subject: [Python-3000] sort vs order (was: What should the focus for 2.6
	be?)
In-Reply-To: <20060824144524.cz3o2mv4iv40w40k@login.werra.lunarpages.com>
References: <20060824144524.cz3o2mv4iv40w40k@login.werra.lunarpages.com>
Message-ID: <fb6fbf560608250722g41acf025n9a76fd174de68171@mail.gmail.com>

On 8/24/06, Michael Chermside <mcherm at mcherm.com> wrote:
> Jim Jewett writes:
> > Given an arbitrary collection of objects, I want to be able to order
> > them in a consistent manner, at least within a single interpreter
> > session.

> I think this meets your specifications:

> >>> myList = [2.5, 17, object(), 3+4j, 'abc']
> >>> myList.sort(key=id)

Yes; not nicely, but it does.  I would prefer that it be the fallback
after first trying a regular sort.  Now I'm wondering if the right
recipe is to try comparing the objects, then the types, then the id,
or whether that would sometimes be inconsistent even for sane objects
if only some classes know about each other.
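The compare-then-type-then-id recipe can be sketched as follows (a
hypothetical helper, not an existing API; under 3.x comparison rules,
which is the point of this thread, mixed comparisons raise TypeError):

```python
# Sort normally when the elements are mutually comparable; otherwise
# fall back to an arbitrary-but-consistent order by type name, then id.
def consistent_sort(items):
    try:
        return sorted(items)
    except TypeError:
        return sorted(items, key=lambda o: (type(o).__name__, id(o)))

mixed = [2.5, 17, object(), 3 + 4j, 'abc']
result = consistent_sort(mixed)
# Consistent within a session: input order doesn't change the result.
assert consistent_sort(mixed[::-1]) == result
```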

The end result is that even if I find a solution that works, I think
it will be common (and bug-prone) enough that it really ought to be in
the language, or at least the standard library -- as it is today for
objects that don't go out of their way to prevent it.

> Frankly, I don't know why you have an "arbitrary collection of objects"

mostly for debugging and tests.

> Of course, I doubt this is what you're doing because if you
> REALLY had arbitrary objects (including uncomparable things like
> complex numbers)

More precisely, my code is buggy when faced with complex numbers or
Numeric arrays -- but in practice, it isn't faced with those.  It *is*
faced with tuples, lists, strings, ints, floats, and instances of
arbitrary program-specific classes.  These all work fine today,
because sort either special cases or falls back to using id *without
throwing an exception*.

-jJ

From paul at prescod.net  Fri Aug 25 17:39:47 2006
From: paul at prescod.net (Paul Prescod)
Date: Fri, 25 Aug 2006 08:39:47 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44EE4F15.2070301@canterbury.ac.nz>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
	<20060823191222.1A76.JCARLSON@uci.edu>
	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
	<eck0b8$mno$1@sea.gmane.org> <44EE4326.1070604@canterbury.ac.nz>
	<1f7befae0608241801y3b285a12wc27cda5d25949fe0@mail.gmail.com>
	<44EE4F15.2070301@canterbury.ac.nz>
Message-ID: <1cb725390608250839s78cb4c46s378bb56313c1932a@mail.gmail.com>

On 8/24/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
>
> Tim Peters wrote:
>
> > Perhaps it has to do with the rest of his message ;-):
> >
> >>>(which reminds me that speeding up handling of optional arguments
> >>>to C functions would be an even better use of this energy)
>
> Until a few moments ago, I didn't know that str.startswith()
> had any optional arguments, so I missed the significance of
> that.


I also didn't know about the optional arguments to startswith and wonder if
they are much used or just cruft.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060825/026a0124/attachment.html 

From jcarlson at uci.edu  Fri Aug 25 17:47:25 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Fri, 25 Aug 2006 08:47:25 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ecmu46$u4e$1@sea.gmane.org>
References: <20060825131452.1717.999901437.divmod.quotient.30940@ohm>
	<ecmu46$u4e$1@sea.gmane.org>
Message-ID: <20060825080148.1AA8.JCARLSON@uci.edu>


"Fredrik Lundh" <fredrik at pythonware.com> wrote:
> Jean-Paul Calderone wrote:
> 
> >>> I believe you're thinking about something far more sophisticated than what I'm
> >>> suggesting. I'm just talking about a Python data type in a standard library
> >>> module that trades off slower performance with smaller strings (due to extra
> >>> method call overhead) against improved scalability (due to avoidance of
> >>> copying strings around).
> >>
> >>have you done any benchmarking on this ?
> >
> > I've benchmarked string copying via slicing against views implemented using
> > buffer().  For certain use patterns, views are absolutely significantly
> > faster.
> 
> of course, but buffers don't support many string methods, so I'm not sure how
> that's applicable to this case.
> 
> (and before anyone says "let's fix that, then", please read earlier messages).

Aside from the scheduled removal of buffer in 3.x, I see no particular
issue with offering a bytes view and str view in 3.x via two specific
bytes and str subtypes.  With care, very few changes if any would be
necessary in the str (unicode) implementation, and the bytesview
consistency updating is already being done with current buffer objects.

From there, the only question is when an operation on a bytes or str
object should return such a view, and the answer would be never.  Return
views from view objects, the non-views from non-view objects.  If you
want views, wrap your original object with a view, and call its methods. 
If you need a non-view, call the standard bytes/str constructor.

 - Josiah


From guido at python.org  Fri Aug 25 17:48:34 2006
From: guido at python.org (Guido van Rossum)
Date: Fri, 25 Aug 2006 08:48:34 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060825080148.1AA8.JCARLSON@uci.edu>
References: <20060825131452.1717.999901437.divmod.quotient.30940@ohm>
	<ecmu46$u4e$1@sea.gmane.org> <20060825080148.1AA8.JCARLSON@uci.edu>
Message-ID: <ca471dc20608250848l476d99f5w1dbb65c9518b1568@mail.gmail.com>

On 8/25/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> Aside from the scheduled removal of buffer in 3.x, I see no particular
> issue with offering a bytes view and str view in 3.x via two specific
> bytes and str subtypes.  With care, very few changes if any would be
> necessary in the str (unicode) implementation, and the bytesview
> consistency updating is already being done with current buffer objects.
>
> From there, the only question is when an operation on a bytes or str
> object should return such a view, and the answer would be never.  Return
> views from view objects, the non-views from non-view objects.  If you
> want views, wrap your original object with a view, and call its methods.
> If you need a non-view, call the standard bytes/str constructor.

For the record, I think this is a major case of YAGNI. You appear way
too obsessed with performance of some microscopic aspect of the
language. Please stop firing random proposals until you actually have
working code and proof that it matters. Speeding up microbenchmarks is
irrelevant.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From exarkun at divmod.com  Fri Aug 25 18:29:50 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Fri, 25 Aug 2006 12:29:50 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608250848l476d99f5w1dbb65c9518b1568@mail.gmail.com>
Message-ID: <20060825162950.1717.331562078.divmod.quotient.31042@ohm>

On Fri, 25 Aug 2006 08:48:34 -0700, Guido van Rossum <guido at python.org> wrote:
>On 8/25/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>> Aside from the scheduled removal of buffer in 3.x, I see no particular
>> issue with offering a bytes view and str view in 3.x via two specific
>> bytes and str subtypes.  With care, very few changes if any would be
>> necessary in the str (unicode) implementation, and the bytesview
>> consistency updating is already being done with current buffer objects.
>>
>> From there, the only question is when an operation on a bytes or str
>> object should return such a view, and the answer would be never.  Return
>> views from view objects, the non-views from non-view objects.  If you
>> want views, wrap your original object with a view, and call its methods.
>> If you need a non-view, call the standard bytes/str constructor.
>
>For the record, I think this is a major case of YAGNI. You appear way
>too obsessed with performance of some microscopic aspect of the
>language. Please stop firing random proposals until you actually have
>working code and proof that it matters. Speeding up microbenchmarks is
>irrelevant.

Twisted's core loop uses string views to avoid unnecessary copying.  This
has proven to be a real-world speedup.  This isn't a synthetic benchmark
or a micro-optimization.

I don't understand the resistance.  Is it really so earth-shatteringly
surprising that not copying memory unnecessarily is faster than copying
memory unnecessarily?

If the goal is to avoid speeding up Python programs because views are too
complex or unpythonic or whatever, fine.  But there isn't really any
question as to whether or not this is a real optimization.

Jean-Paul

From jimjjewett at gmail.com  Fri Aug 25 18:41:27 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Fri, 25 Aug 2006 12:41:27 -0400
Subject: [Python-3000] simplifying methods (was: Re:  Droping find/rfind?)
Message-ID: <fb6fbf560608250941q350b562ak4b325da78f1bd72@mail.gmail.com>

On 8/24/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Until a few moments ago, I didn't know that str.startswith()
> had any optional arguments

I just looked them up, and they turn out to just be syntactic sugar
for a slice.  (Even to the extent of handling omitted arguments as
None.)  The stop argument in particular is (almost) silly.

  s.startswith(prefix, start, stop) === s[start:stop].startswith(prefix)

Ignoring efficiency concerns, would dropping the optional arguments
and requiring an explicit slice be a valid Py3K simplification?
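A quick check of the claimed equivalence (assuming current CPython
semantics, including None standing in for an omitted argument):

```python
# s.startswith(prefix, start, stop) behaves like slicing first.
s = "abcdef"
for start, stop in [(None, None), (0, None), (2, 5), (1, 3), (-4, -1)]:
    assert (s.startswith("cd", start, stop)
            == s[start:stop].startswith("cd"))
```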

-jJ

From jimjjewett at gmail.com  Fri Aug 25 18:55:33 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Fri, 25 Aug 2006 12:55:33 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060825080148.1AA8.JCARLSON@uci.edu>
References: <20060825131452.1717.999901437.divmod.quotient.30940@ohm>
	<ecmu46$u4e$1@sea.gmane.org> <20060825080148.1AA8.JCARLSON@uci.edu>
Message-ID: <fb6fbf560608250955k4fae25a8u72691b51a458fcfc@mail.gmail.com>

On 8/25/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> From there, the only question is when an operation on a bytes or str
> object should return such a view, and the answer would be never.  Return
> views from view objects, the non-views from non-view objects.  If you
> want views, wrap your original object with a view, and call its methods.
> If you need a non-view, call the standard bytes/str constructor.

I do like the idea of permitting multiple string *implementations*,
some of which might store their characters elsewhere, as lists and
large tables do.

But this needs to be an automatic implementation detail, like the
distinction between int and long.  If the choice must be explicit, then
people who worry too much about speed will start wrapping all string
references in view().  This is worse (and more tempting) than the
default-argument len=len hack.

-jJ

From guido at python.org  Fri Aug 25 19:37:44 2006
From: guido at python.org (Guido van Rossum)
Date: Fri, 25 Aug 2006 10:37:44 -0700
Subject: [Python-3000] simplifying methods (was: Re: Droping find/rfind?)
In-Reply-To: <fb6fbf560608250941q350b562ak4b325da78f1bd72@mail.gmail.com>
References: <fb6fbf560608250941q350b562ak4b325da78f1bd72@mail.gmail.com>
Message-ID: <ca471dc20608251037l29fb3368yc8416d64a8fdd04b@mail.gmail.com>

Then you would have to drop the same style of optional arguments from
all string methods.

There is a method to this madness: the slice arguments let you search
through the string without actually making the slice copy. This
matters rarely, but when it does, it can matter a lot -- imagine s
being 100 MB long, and the specified slice being a large portion of
that.
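Concretely, the slice arguments let a search run over a region of a big
string without materializing the slice first:

```python
# find() with a start offset scans in place; slicing first copies ~1 MB
# of data just to search it.
big = "x" * 10**6 + "needle" + "y" * 10**6
in_slice = big[10**6:].find("needle")   # copies the tail, then searches
in_place = big.find("needle", 10**6)    # searches without copying
assert in_place == 10**6
assert in_slice + 10**6 == in_place
```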

(Yes, the string "views" that some folks would like to add could solve
this in a different way. But IMO the views make everybody pay because
basic usage of the string data type will be slower, and there are
horrible worst-case scenarios (such as keeping one word from many
10-MB strings). We've gone over this many times without anybody ever
showing a realistic bullet-proof implementation or performance figures
other than micro-benchmarks. Perhaps someone should write a PEP so I
can reject it. :-))

--Guido

On 8/25/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 8/24/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> > Until a few moments ago, I didn't know that str.startswith()
> > had any optional arguments
>
> I just looked them up, and they turn out to just be syntactic sugar
> for a slice.  (Even to the extent of handling omitted arguments as
> None.)  The stop argument in particular is (almost) silly.
>
>   s.startswith(prefix, start, stop) === s[start:stop].startswith(prefix)
>
> Ignoring efficiency concerns, would dropping the optional arguments
> and requiring an explicit slice be a valid Py3K simplification?
>
> -jJ
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Fri Aug 25 19:53:15 2006
From: guido at python.org (Guido van Rossum)
Date: Fri, 25 Aug 2006 10:53:15 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060825162950.1717.331562078.divmod.quotient.31042@ohm>
References: <ca471dc20608250848l476d99f5w1dbb65c9518b1568@mail.gmail.com>
	<20060825162950.1717.331562078.divmod.quotient.31042@ohm>
Message-ID: <ca471dc20608251053w75dd6bf5h5e0290524424e6dd@mail.gmail.com>

On 8/25/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
> >For the record, I think this is a major case of YAGNI. You appear way
> >too obsessed with performance of some microscopic aspect of the
> >language. Please stop firing random proposals until you actually have
> >working code and proof that it matters. Speeding up microbenchmarks is
> >irrelevant.
>
> Twisted's core loop uses string views to avoid unnecessary copying.  This
> has proven to be a real-world speedup.  This isn't a synthetic benchmark
> or a micro-optimization.

OK, that's the kind of data I was hoping for; if this was mentioned
before I apologize. Did they implement this in C or in Python? Can you
point us to the docs for their API?

> I don't understand the resistance.  Is it really so earth-shatteringly
> surprising that not copying memory unnecessarily is faster than copying
> memory unnecessarily?

It depends on how much bookkeeping is needed to properly free the
underlying buffer when it is no longer referenced, and whether the
application repeatedly takes short long-lived slices of long otherwise
short-lived buffers. Unless you have a heuristic for deciding to copy
at some point, you may waste a lot of space.

> If the goal is to avoid speeding up Python programs because views are too
> complex or unpythonic or whatever, fine.  But there isn't really any
> question as to whether or not this is a real optimization.

There are many ways to implement views. It has often been proposed to
make views an automatic feature of the basic string object. There the
optimization in one case has to be weighed against the pessimization
in another case (like the bookkeeping overhead everywhere and the
worst-case scenario I mentioned above). If views have to be explicitly
requested, that may not be a problem because the app author will
(hopefully) understand the issues. But even if it was just a standard
library module, I would worry that many inexperienced programmers
would complicate their code by using the string views module without
real benefits. Sort of the way some folks have knee-jerk habits to
write

  def foo(x, None=None):

if they use None anywhere in the body of the function. This should be
done only as a last resort when real-life measurements have shown that
foo() is a performance show-stopper.
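
The idiom being criticized pre-binds a global to a keyword default so the
function body can use a faster local lookup. A minimal sketch (using len
rather than None, since binding None this way is a syntax error in modern
Python; the function names here are invented for illustration):

```python
# The knee-jerk micro-optimization: pre-bind a builtin to a default
# argument so the body does a local lookup instead of a global one.
def total_length(items, _len=len):
    return sum(_len(s) for s in items)

# The plain version is almost always fast enough, and has an honest
# signature.
def total_length_plain(items):
    return sum(len(s) for s in items)
```

As the paragraph above says, the first form is only justified after
real-life measurements show the lookup actually matters.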

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From exarkun at divmod.com  Fri Aug 25 20:49:02 2006
From: exarkun at divmod.com (Jean-Paul Calderone)
Date: Fri, 25 Aug 2006 14:49:02 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608251053w75dd6bf5h5e0290524424e6dd@mail.gmail.com>
Message-ID: <20060825184902.1717.697934511.divmod.quotient.31126@ohm>

On Fri, 25 Aug 2006 10:53:15 -0700, Guido van Rossum <guido at python.org> wrote:
>On 8/25/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
>> >For the record, I think this is a major case of YAGNI. You appear way
> >too obsessed with performance of some microscopic aspect of the
>> >language. Please stop firing random proposals until you actually have
>> >working code and proof that it matters. Speeding up microbenchmarks is
>> >irrelevant.
>>
>>Twisted's core loop uses string views to avoid unnecessary copying.  This
>>has proven to be a real-world speedup.  This isn't a synthetic benchmark
>>or a micro-optimization.
>
>OK, that's the kind of data I was hoping for; if this was mentioned
>before I apologize. Did they implement this in C or in Python? Can you
>point us to the docs for their API?

One instance of this is an implementation detail which doesn't impact any application-level APIs:

http://twistedmatrix.com/trac/browser/trunk/twisted/internet/abstract.py?r=17451#L88

Another instance of this is implemented in C++:

http://twistedmatrix.com/trac/browser/sandbox/itamar/cppreactor/fusion

but doesn't interact a lot with Python code.  The C++ API uses char* with a length (a natural way to implement string views in C/C++).  The Python API just uses strings, because Twisted has always used str here, and passing in a buffer would break everything expecting something with str methods.

>>I don't understand the resistance.  Is it really so earth-shatteringly
>>surprising that not copying memory unnecessarily is faster than copying
>>memory unnecessarily?
>
>It depends on how much bookkeeping is needed to properly free the
>underlying buffer when it is no longer referenced, and whether the
>application repeatedly takes short long-lived slices of long otherwise
>short-lived buffers. Unless you have a heuristic for deciding to copy
>at some point, you may waste a lot of space.

Certainly.  The first link above includes an example of such a heuristic.
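
A hypothetical sketch of that style of heuristic (the class, methods and
threshold below are invented for illustration, not Twisted's actual code):
track an offset into the buffer instead of slicing off the sent prefix, and
copy only once the dead prefix dominates.

```python
class SendBuffer:
    """Outgoing byte buffer that delays copying until it pays off."""

    COMPACT_THRESHOLD = 4096  # assumed tuning constant

    def __init__(self):
        self.data = b""
        self.offset = 0  # bytes already sent but still physically present

    def append(self, chunk):
        self.data += chunk

    def sent(self, nbytes):
        # Record nbytes as sent; compact only when the dead prefix is
        # both large and a majority of the buffer.
        self.offset += nbytes
        if (self.offset > self.COMPACT_THRESHOLD
                and self.offset * 2 > len(self.data)):
            self.data = self.data[self.offset:]  # the one deliberate copy
            self.offset = 0

    def pending(self):
        # memoryview stands in here for the buffer() object of 2006:
        # a zero-copy view of the unsent tail.
        return memoryview(self.data)[self.offset:]
```

Short-lived slices never trigger a copy; only a large, mostly-dead buffer
pays the compaction cost, which bounds the wasted space Guido worries about.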

>>If the goal is to avoid speeding up Python programs because views are too
>>complex or unpythonic or whatever, fine.  But there isn't really any
>>question as to whether or not this is a real optimization.
>
>There are many ways to implement views. It has often been proposed to
>make views an automatic feature of the basic string object. There the
>optimization in one case has to be weighed against the pessimization
>in another case (like the bookkeeping overhead everywhere and the
>worst-case scenario I mentioned above).

I'm happy to see things progress one step at a time.  Having them _at
all_ (buffer) was a good place to start.  A view which has string methods
is a nice incremental improvement.  Maybe somewhere down the line there
can be a single type which magically knows how to behave optimally for all
programs, but I'm not asking for that yet. ;)

>If views have to be explicitly
>requested that may not be a problem because the app author will
>(hopefully) understand the issues. But even if it was just a standard
>library module, I would worry that many inexperienced programmers
>would complicate their code by using the string views module without
>real benefits. Sort of the way some folks have knee-jerk habits to
>write
>
>  def foo(x, None=None):
>
>if they use None anywhere in the body of the function. This should be
>done only as a last resort when real-life measurements have shown that
>foo() is a performance show-stopper.
>

I don't think we see people overusing buffer() in ways which damage
readability now, and buffer is even a builtin.  Tossing something off
into a module somewhere shouldn't really be a problem.  To most people
who don't actually know what they're doing, the idea to optimize code
by reducing memory copying usually just doesn't come up.

Jean-Paul

From rrr at ronadam.com  Fri Aug 25 20:59:46 2006
From: rrr at ronadam.com (Ron Adam)
Date: Fri, 25 Aug 2006 13:59:46 -0500
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44EEEE2A.9080509@gmail.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com><44ED85E5.1000005@livinglogic.de>	<44ED9206.1080306@gmail.com><eck69v$b6n$1@sea.gmane.org>	<44EEC7CB.2090908@gmail.com>	<ecmi3j$m4m$1@sea.gmane.org>
	<44EEEE2A.9080509@gmail.com>
Message-ID: <ecnhgg$3p3$1@sea.gmane.org>

Nick Coghlan wrote:
> Fredrik Lundh wrote:
>> Nick Coghlan wrote:
>>
>>>> Nick Coghlan wrote:
>>>>
>>>>> With a variety of "view types", that work like the corresponding builtin type,
>>>>> but reference the original data structure instead of creating copies
>>>> support for string views would require some serious interpreter surgery, though,
>>>> and probably break quite a few extensions...
>>> Why do you say that?
>> because I happen to know a lot about how Python's string types are
>> implemented ?
> 
> I believe you're thinking about something far more sophisticated than what I'm 
> suggesting. I'm just talking about a Python data type in a standard library 
> module that trades off slower performance with smaller strings (due to extra 
> method call overhead) against improved scalability (due to avoidance of 
> copying strings around).
> 
>>> make a view of it
>> so to make a view of a string, you make a view of it ?
> 
> Yep - by using all those "start" and "stop" optional arguments to builtin 
> string methods to implement the methods of a string view in pure Python. By 
> creating the string view all you would really be doing is a partial 
> application of start and stop arguments on all of the relevant string methods.
> 
> I've included an example below that just supports __len__, __str__ and 
> partition(). The source object survives for as long as the view does - the 
> idea is that the view should only last while you manipulate the string, with 
> only real strings released outside the function via return statements or yield 
> expressions.


   >>>  self.source = "%s" % source

I think this should be:

    self.source = source

Otherwise you are making a copy of the source, which is what you
are trying to avoid.  I'm not sure whether Python would reuse the self.source 
string, but I wouldn't count on it.


It might be nice if slice objects could be used in more ways in Python. 
That might work in most cases where you would want a string view.

An example of a slice version of partition would be:  (not tested)

   def slice_partition(s, sep, sub_slice=None):
     if sub_slice is None:
        sub_slice = slice(len(s))
     found_slice = find_slice(s, sep, sub_slice)
     prefix_slice = slice(sub_slice.start, found_slice.start)
     rest_slice = slice(found_slice.stop, sub_slice.stop)
     return ( prefix_slice,
              found_slice,
              rest_slice )

   # implementation of find_slice left to readers.
   def find_slice(s, sub, sub_slice=None):
      ...
      return found_slice

Of course this isn't needed for short strings, but might be worth while 
when used with very long strings.
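
One hedged way to fill in the find_slice() left to readers above, mirroring
str.partition's behaviour of returning an empty middle part when the
separator is missing:

```python
def find_slice(s, sub, sub_slice=None):
    """Return the slice covering the first occurrence of sub within
    s[sub_slice], or an empty slice at its end if sub is absent."""
    if sub_slice is None:
        sub_slice = slice(0, len(s))
    pos = s.find(sub, sub_slice.start, sub_slice.stop)
    if pos < 0:
        return slice(sub_slice.stop, sub_slice.stop)
    return slice(pos, pos + len(sub))

def slice_partition(s, sep, sub_slice=None):
    """Like str.partition, but returns (prefix, sep, rest) as slice
    objects into s instead of three new strings."""
    if sub_slice is None:
        sub_slice = slice(0, len(s))
    found_slice = find_slice(s, sep, sub_slice)
    return (slice(sub_slice.start, found_slice.start),
            found_slice,
            slice(found_slice.stop, sub_slice.stop))
```

Indexing with the results (s[p], s[f], s[r]) recovers partition's three
strings, copying only when and if the caller chooses to.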



> # Simple string view example
> class strview(object):
>      def __new__(cls, source, start=None, stop=None):
>          self = object.__new__(cls)
>          self.source = "%s" % source
>          self.start = start if start is not None else 0
>          self.stop = stop if stop is not None else len(source)
>          return self
>      def __str__(self):
>          return self.source[self.start:self.stop]
>      def __len__(self):
>          return self.stop - self.start
>      def partition(self, sep):
>          _src = self.source
>          try:
>              startsep = _src.index(sep, self.start, self.stop)
>          except ValueError:
>              # Separator wasn't found!
>              return self, _NULL_STR, _NULL_STR
>          # Return new views of the three string parts
>          endsep = startsep + len(sep)
>          return (strview(_src, self.start, startsep),
>                  strview(_src, startsep, endsep),
>                  strview(_src, endsep, self.stop))
> 
> _NULL_STR = strview('')
> 
> def splitview(s):
>       rest = strview(s)
>       while 1:
>           prefix, found, rest = rest.partition("{")
>           if prefix:
>               yield (None, str(prefix))
>           if not found:
>               break
>           first, found, rest = rest.partition(" ")
>           if not found:
>               break
>           second, found, rest = rest.partition("}")
>           if not found:
>               break
>           yield (str(first), str(second))
> 
>  >>> list(splitview('foo{spam eggs}bar{foo bar}'))
> [(None, 'foo'), ('spam', 'eggs'), (None, 'bar'), ('foo', 'bar')]



From rrr at ronadam.com  Fri Aug 25 21:08:50 2006
From: rrr at ronadam.com (Ron Adam)
Date: Fri, 25 Aug 2006 14:08:50 -0500
Subject: [Python-3000] sort vs order (was: What should the focus for 2.6
	be?)
In-Reply-To: <fb6fbf560608250722g41acf025n9a76fd174de68171@mail.gmail.com>
References: <20060824144524.cz3o2mv4iv40w40k@login.werra.lunarpages.com>
	<fb6fbf560608250722g41acf025n9a76fd174de68171@mail.gmail.com>
Message-ID: <ecni1g$5i8$1@sea.gmane.org>

Jim Jewett wrote:

> The end result is that even if I find a solution that works, I think
> it will be common (and bug-prone) enough that it really ought to be in
> the language, or at least the standard library -- as it is today for
> objects that don't go out of their way to prevent it.

The usual way to handle this in databases is to generate a unique 
id_key when the data is entered.  That also allows for duplicate entries 
such as people with the same name, or multiple items with the same part 
number.




From guido at python.org  Fri Aug 25 21:13:31 2006
From: guido at python.org (Guido van Rossum)
Date: Fri, 25 Aug 2006 12:13:31 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060825184902.1717.697934511.divmod.quotient.31126@ohm>
References: <ca471dc20608251053w75dd6bf5h5e0290524424e6dd@mail.gmail.com>
	<20060825184902.1717.697934511.divmod.quotient.31126@ohm>
Message-ID: <ca471dc20608251213v70f3a1b1y29df7affbf3f9522@mail.gmail.com>

On 8/25/06, Jean-Paul Calderone <exarkun at divmod.com> wrote:
> >>Twisted's core loop uses string views to avoid unnecessary copying.  This
> >>has proven to be a real-world speedup.  This isn't a synthetic benchmark
> >>or a micro-optimization.
> >
> >OK, that's the kind of data I was hoping for; if this was mentioned
> >before I apologize. Did they implement this in C or in Python? Can you
> >point us to the docs for their API?
>
> One instance of this is an implementation detail which doesn't impact any application-level APIs:
>
> http://twistedmatrix.com/trac/browser/trunk/twisted/internet/abstract.py?r=17451#L88

You are referring to the two calls to buffer(), right? It seems a
pretty rare use case (though an important one). I wonder how often
offset != 0 in practice. I'd like the new 3.0 I/O library to provide
better support for writing part of a buffer, e.g. by adding an
optional offset parameter to write().
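
The copy such an offset parameter would avoid can be sketched with
memoryview (the later replacement for buffer()) standing in:

```python
import io

data = b"x" * 100000
offset = 40000

# Writing the tail via a slice first materializes a 60000-byte copy...
a = io.BytesIO()
a.write(data[offset:])

# ...while a memoryview slice hands the same bytes to write() without
# copying them out of the original buffer first.
b = io.BytesIO()
b.write(memoryview(data)[offset:])

assert a.getvalue() == b.getvalue() == data[offset:]
```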

> Another instance of this is implemented in C++:
>
> http://twistedmatrix.com/trac/browser/sandbox/itamar/cppreactor/fusion
>
> but doesn't interact a lot with Python code.  The C++ API uses char* with a length (a natural way to implement string views in C/C++).  The Python API just uses strings, because Twisted has always used str here, and passing in a buffer would break everything expecting something with str methods.

This doesn't seem a particularly strong use case (but I can't say I
understand the code or how it's used).

> >>I don't understand the resistance.  Is it really so earth-shatteringly
> >>surprising that not copying memory unnecessarily is faster than copying
> >>memory unnecessarily?
> >
> >It depends on how much bookkeeping is needed to properly free the
> >underlying buffer when it is no longer referenced, and whether the
> >application repeatedly takes short long-lived slices of long otherwise
> >short-lived buffers. Unless you have a heuristic for deciding to copy
> >at some point, you may waste a lot of space.
>
> Certainly.  The first link above includes an example of such a heuristic.

Because the app is in control, it is easy to avoid the worst-case
behavior of the heuristic.

> >>If the goal is to avoid speeding up Python programs because views are too
> >>complex or unpythonic or whatever, fine.  But there isn't really any
> >>question as to whether or not this is a real optimization.
> >
> >There are many ways to implement views. It has often been proposed to
> >make views an automatic feature of the basic string object. There the
> >optimization in one case has to be weighed against the pessimization
> >in another case (like the bookkeeping overhead everywhere and the
> >worst-case scenario I mentioned above).
>
> I'm happy to see things progress one step at a time.  Having them _at
> all_ (buffer) was a good place to start.

But buffer() is on the kick-list for Py3k right now. Perhaps the new
bytes object will make it possible to write the first example above
differently; bytes will be mutable which changes everything.

> A view which has string methods
> is a nice incremental improvement.  Maybe somewhere down the line there
> can be a single type which magically knows how to behave optimally for all
> programs, but I'm not asking for that yet. ;)

I still expect that a view with string methods will find more abuse
than legitimate use.

> >If views have to be explicitly
> >requested that may not be a problem because the app author will
> >(hopefully) understand the issues. But even if it was just a standard
> >library module, I would worry that many inexperienced programmers
> >would complicate their code by using the string views module without
> >real benefits. Sort of the way some folks have knee-jerk habits to
> >write
> >
> >  def foo(x, None=None):
> >
> >if they use None anywhere in the body of the function. This should be
> >done only as a last resort when real-life measurements have shown that
> >foo() is a performance show-stopper.
>
> I don't think we see people overusing buffer() in ways which damage
> readability now, and buffer is even a builtin.

But it has been riddled by problems in the past so most people know to
steer clear of it.

> Tossing something off
> into a module somewhere shouldn't really be a problem.  To most people
> who don't actually know what they're doing, the idea to optimize code
> by reducing memory copying usually just doesn't come up.

That final remark is a matter of opinion. I've seen too much code that
mindlessly copied idioms that were supposed to magically speed up
certain things to believe it. Often, people who don't know what they
are doing are more worried about speed than people who do, and they
copy all the wrong examples... :-(

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From fredrik at pythonware.com  Fri Aug 25 22:23:18 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Fri, 25 Aug 2006 22:23:18 +0200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060825080148.1AA8.JCARLSON@uci.edu>
References: <20060825131452.1717.999901437.divmod.quotient.30940@ohm>	<ecmu46$u4e$1@sea.gmane.org>
	<20060825080148.1AA8.JCARLSON@uci.edu>
Message-ID: <ecnm7m$irf$1@sea.gmane.org>

Josiah Carlson wrote:

> Aside from the scheduled removal of buffer in 3.x, I see no particular
> issue with offering a bytes view and str view in 3.x via two specific
> bytes and str subtypes.

the fact that it's *impossible* to offer a view subtype that's
compatible with the current PyString C API might be an issue, though.

</F>


From fredrik at pythonware.com  Fri Aug 25 22:27:09 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Fri, 25 Aug 2006 22:27:09 +0200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ca471dc20608251213v70f3a1b1y29df7affbf3f9522@mail.gmail.com>
References: <ca471dc20608251053w75dd6bf5h5e0290524424e6dd@mail.gmail.com>	<20060825184902.1717.697934511.divmod.quotient.31126@ohm>
	<ca471dc20608251213v70f3a1b1y29df7affbf3f9522@mail.gmail.com>
Message-ID: <ecnmer$irf$2@sea.gmane.org>

Guido van Rossum wrote:

> That final remark is a matter of opinion. I've seen too much code that
> mindlessly copied idioms that were supposed to magically speed up
> certain things to believe it. Often, people who don't know what they
> are doing are more worried about speed than people who do, and they
> copy all the wrong examples... :-(

+1.

</F>


From krstic at solarsail.hcs.harvard.edu  Fri Aug 25 22:47:29 2006
From: krstic at solarsail.hcs.harvard.edu (=?UTF-8?B?SXZhbiBLcnN0acSH?=)
Date: Fri, 25 Aug 2006 16:47:29 -0400
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <20060825184902.1717.697934511.divmod.quotient.31126@ohm>
References: <20060825184902.1717.697934511.divmod.quotient.31126@ohm>
Message-ID: <44EF61E1.5050001@solarsail.hcs.harvard.edu>

Jean-Paul Calderone wrote:
> http://twistedmatrix.com/trac/browser/sandbox/itamar/cppreactor/fusion

This is the same Itamar who, in the talk I linked a few days ago
(http://ln-s.net/D+u) extolled buffer as a very real performance
improvement in fast python networking, and asked for broader and more
complete support for buffers, rather than their removal.

A bunch of people, myself included, want to use Python as a persistent
network server. Proper support for reading into already-allocated
memory, and non-copying strings are pretty indispensable for serious
production use.

-- 
Ivan Krstić <krstic at solarsail.hcs.harvard.edu> | GPG: 0x147C722D

From tjreedy at udel.edu  Fri Aug 25 23:00:47 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Fri, 25 Aug 2006 17:00:47 -0400
Subject: [Python-3000] Droping find/rfind?
References: <ca471dc20608251053w75dd6bf5h5e0290524424e6dd@mail.gmail.com><20060825184902.1717.697934511.divmod.quotient.31126@ohm>
	<ca471dc20608251213v70f3a1b1y29df7affbf3f9522@mail.gmail.com>
Message-ID: <ecnodv$pov$1@sea.gmane.org>


"Guido van Rossum" <guido at python.org> wrote in message 
news:ca471dc20608251213v70f3a1b1y29df7affbf3f9522 at mail.gmail.com...
> But buffer() is on the kick-list for Py3k right now. Perhaps the new
> bytes object will make it possible to write the first example above
> differently; bytes will be mutable which changes everything.

I never learned about buffers and buffer() because in various ways they 
have been underdocumented, labeled problematical, and subject to revision 
or removal.

> I still expect that a view with string methods will find more abuse
> than legitimate use.

Perhaps views should first be written and released by advocates as 
3rd-party modules (in C or Python), possibly in more than one competing 
version, to be tested by interested members of the community and subject to 
the usual criteria for inclusion in the standard library or even the core. 
Then we would have some performance and usage data to argue with ;-).

Terry Jan Reedy
 




From tjreedy at udel.edu  Fri Aug 25 23:08:40 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Fri, 25 Aug 2006 17:08:40 -0400
Subject: [Python-3000] sort vs order (was: What should the focus for
	2.6be?)
References: <20060824144524.cz3o2mv4iv40w40k@login.werra.lunarpages.com><fb6fbf560608250722g41acf025n9a76fd174de68171@mail.gmail.com>
	<ecni1g$5i8$1@sea.gmane.org>
Message-ID: <ecnoso$r5c$1@sea.gmane.org>


"Ron Adam" <rrr at ronadam.com> wrote in message 
news:ecni1g$5i8$1 at sea.gmane.org...
> Jim Jewett wrote:
>
>> The end result is that even if I find a solution that works, I think
>> it will be common (and bug-prone) enough that it really ought to be in
>> the language, or at least the standard library -- as it is today for
>> objects that don't go out of their way to prevent it.

id() *is* in builtins.  Now that sort has a key parameter, I think an 
explicit 'key=id' qualifies enough as 'in the language' for something 
used not too often.
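
The suggestion in miniature: id() gives otherwise-unorderable objects a
sort key that is stable within one run.

```python
class Thing:
    pass

things = [Thing() for _ in range(5)]

# Plain Thing instances aren't orderable, but their identities are.
ordered = sorted(things, key=id)

assert [id(t) for t in ordered] == sorted(id(t) for t in things)
assert set(ordered) == set(things)  # a permutation: nothing added or lost
```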

> The usual way to handle this in databases is to generate an unique
> id_key when the data is entered.

Which is what Python does when objects are created.

>  That also allows for duplicate entries
> such as people with the same name, or multiple items with the same part
> number.

Or multiple objects with the same value.

Terry Jan Reedy




From fredrik at pythonware.com  Sat Aug 26 01:16:04 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Sat, 26 Aug 2006 01:16:04 +0200
Subject: [Python-3000] PyString C API
Message-ID: <eco0bn$f7b$1@sea.gmane.org>

> the fact that it's *impossible* to offer a view subtype that's
> compatible with the current PyString C API might be an issue, though.

what's the current thinking wrt. the PyString C API, btw.  has any of the
various bytes/wide string design proposals looked at the C API level ?

</F>




From guido at python.org  Sat Aug 26 01:32:48 2006
From: guido at python.org (Guido van Rossum)
Date: Fri, 25 Aug 2006 16:32:48 -0700
Subject: [Python-3000] PyString C API
In-Reply-To: <eco0bn$f7b$1@sea.gmane.org>
References: <eco0bn$f7b$1@sea.gmane.org>
Message-ID: <ca471dc20608251632q77a7a264y5f37439370b5aa4@mail.gmail.com>

On 8/25/06, Fredrik Lundh <fredrik at pythonware.com> wrote:
> > the fact that it's *impossible* to offer a view subtype that's
> > compatible with the current PyString C API might be an issue, though.
>
> what's the current thinking wrt. the PyString C API, btw.  has any of the
> various bytes/wide string design proposals looked at the C API level ?

No... I was hoping to get to that but ended up spending unanticipated
time on fixing comparisons. Maybe the first step ought to be similar
to what was done for int/long unification -- keep both the PyString_
and PyUnicode_ APIs around but make the PyString_ APIs do whatever
they do on Unicode objects instead. Each use of certain macros will
still have to be patched, obviously; e.g. a common way to create a
string is to call PyString_FromStringAndSize(NULL, nbytes) and then to
call something like memcpy(PyString_AS_STRING(obj), source, nbytes) --
this won't work of course.

There are a bunch of PyBytes_ APIs that can be used in those places
where 8-bit strings are really used to hold binary data, not
characters. These have been modeled on the PyString APIs (even with
AS_STRING and GET_SIZE macros). See Include/bytesobject.h.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jackdied at jackdied.com  Sat Aug 26 02:19:23 2006
From: jackdied at jackdied.com (Jack Diederich)
Date: Fri, 25 Aug 2006 20:19:23 -0400
Subject: [Python-3000] cleaning up *path.py code duplication
Message-ID: <20060826001923.GD24154@performancedrivers.com>

While checking find() uses in the stdlib I noticed that the various
path modules have duplicate code and docstrings for some generic path
manipulations.  Delightfully they even have different implementations
and docstrings for identical functions. splitext() is a great bad
example - os2emxpath.splitext() builds up strings by doing char-by-char
concatenations where everyone else uses find() + slice.

If there are no objections I'll move these into a module named 
genericpath.py and change the others to do

from genericpath import func1, func2, funcN

where applicable.

So, any objections? Should it be a 2.6 backport too?

-Jack

From guido at python.org  Sat Aug 26 02:35:33 2006
From: guido at python.org (Guido van Rossum)
Date: Fri, 25 Aug 2006 17:35:33 -0700
Subject: [Python-3000] cleaning up *path.py code duplication
In-Reply-To: <20060826001923.GD24154@performancedrivers.com>
References: <20060826001923.GD24154@performancedrivers.com>
Message-ID: <ca471dc20608251735qb19ad7bk1a163f77ed41dfe1@mail.gmail.com>

Sounds like a great 2.6 project. Beware of things that are
intentionally different between platforms of course!

--Guido

On 8/25/06, Jack Diederich <jackdied at jackdied.com> wrote:
> While checking find() uses in the stdlib I noticed that the various
> path modules have duplicate code and docstrings for some generic path
> manipulations.  Delightfully they even have different implementations
> and docstrings for identical functions. splitext() is a great bad
> example - os2emxpath.splitext() builds up strings by doing char-by-char
> concatenations where everyone else uses find() + slice.
>
> If there are no objections I'll move these into a module named
> genericpath.py and change the others to do
>
> from genericpath import func1, func2, funcN
>
> where applicable.
>
> So, any objections? Should it be a 2.6 backport too?
>
> -Jack
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From greg.ewing at canterbury.ac.nz  Sat Aug 26 03:32:10 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 26 Aug 2006 13:32:10 +1200
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <1cb725390608250839s78cb4c46s378bb56313c1932a@mail.gmail.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>
	<20060823191222.1A76.JCARLSON@uci.edu>
	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com>
	<eck0b8$mno$1@sea.gmane.org> <44EE4326.1070604@canterbury.ac.nz>
	<1f7befae0608241801y3b285a12wc27cda5d25949fe0@mail.gmail.com>
	<44EE4F15.2070301@canterbury.ac.nz>
	<1cb725390608250839s78cb4c46s378bb56313c1932a@mail.gmail.com>
Message-ID: <44EFA49A.1060201@canterbury.ac.nz>

Paul Prescod wrote:

> I also didn't know about the optional arguments to startswith and wonder 
> if they are much used or just cruft.

Looking through the string methods, it appears that only
a few of them, seemingly chosen arbitrarily, have start
and stop arguments.

Seems to me a string-view object supporting all of the
string methods would be a much better idea than this
haphazard mixture, and would fit in nicely with the
Py3k views philosophy.

--
Greg

From jackdied at jackdied.com  Sat Aug 26 03:52:43 2006
From: jackdied at jackdied.com (Jack Diederich)
Date: Fri, 25 Aug 2006 21:52:43 -0400
Subject: [Python-3000] cleaning up *path.py code duplication
In-Reply-To: <ca471dc20608251735qb19ad7bk1a163f77ed41dfe1@mail.gmail.com>
References: <20060826001923.GD24154@performancedrivers.com>
	<ca471dc20608251735qb19ad7bk1a163f77ed41dfe1@mail.gmail.com>
Message-ID: <20060826015243.GE24154@performancedrivers.com>

Ooph, there is some dissonance in the comments and the code.  Cut-n-paste
errors I suppose.

-- ntpath.py --
def islink(path):
    """Test for symbolic link.  On WindowsNT/95 always returns false"""
    return False

<snip 10 lines>

# This follows symbolic links, so both islink() and isdir() can be true
# for the same path.

def isfile(path):
    """Test whether a path is a regular file"""
-- end excerpt --

I'll try and keep a list so those in the know can do a post mortem on the
comments.  I'm only useful for vetting the *nix versions.

-Jack

On Fri, Aug 25, 2006 at 05:35:33PM -0700, Guido van Rossum wrote:
> Sounds like a great 2.6 project. Beware of things that are
> intentionally different between platforms of course!
> 
> --Guido
> 
> On 8/25/06, Jack Diederich <jackdied at jackdied.com> wrote:
> > While checking find() uses in the stdlib I noticed that the various
> > path modules have duplicate code and docstrings for some generic path
> > manipulations.  Delightfully they even have different implementations
> > and docstrings for identical functions. splitext() is a great bad
> > example - os2emxpath.splitext() builds up strings by doing char-by-char
> > concatenations where everyone else uses find() + slice.
> >
> > If there are no objections I'll move these into a module named
> > genericpath.py and change the others to do
> >
> > from genericpath import func1, func2, funcN
> >
> > where applicable.
> >
> > So, any objections? Should it be a 2.6 backport too?
> >
> > -Jack
> > _______________________________________________
> > Python-3000 mailing list
> > Python-3000 at python.org
> > http://mail.python.org/mailman/listinfo/python-3000
> > Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
> >
> 
> 
> -- 
> --Guido van Rossum (home page: http://www.python.org/~guido/)
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/jack%40performancedrivers.com
> 

From ncoghlan at gmail.com  Sat Aug 26 09:27:46 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 26 Aug 2006 17:27:46 +1000
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44EF61E1.5050001@solarsail.hcs.harvard.edu>
References: <20060825184902.1717.697934511.divmod.quotient.31126@ohm>
	<44EF61E1.5050001@solarsail.hcs.harvard.edu>
Message-ID: <44EFF7F2.7070407@gmail.com>

Ivan Krstić wrote:
> Jean-Paul Calderone wrote:
>> http://twistedmatrix.com/trac/browser/sandbox/itamar/cppreactor/fusion
> 
> This is the same Itamar who, in the talk I linked a few days ago
> (http://ln-s.net/D+u) extolled buffer as a very real performance
> improvement in fast python networking, and asked for broader and more
> complete support for buffers, rather than their removal.
> 
> A bunch of people, myself included, want to use Python as a persistent
> network server. Proper support for reading into already-allocated
> memory, and non-copying strings are pretty indispensable for serious
> production use.

A mutable bytes type with deque-like performance characteristics (i.e. O(1) 
insert/pop at index 0 as well as at the end), as well as the appropriate 
mutating methods (like read_into()) should go a long way toward meeting those 
needs.
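
The "reading into already-allocated memory" part is sketched below with the
tools later Python versions grew for exactly this: a mutable bytearray plus
readinto() reuses one buffer across reads instead of allocating a fresh
string per read.

```python
import io

stream = io.BytesIO(b"abcdefghij")
buf = bytearray(4)            # allocated once, reused for every read

n = stream.readinto(buf)      # fills buf in place, returns bytes read
assert n == 4 and bytes(buf) == b"abcd"

n = stream.readinto(buf)      # same buffer again: no new allocation
assert n == 4 and bytes(buf) == b"efgh"
```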

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Sat Aug 26 10:02:15 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 26 Aug 2006 18:02:15 +1000
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <ecnhgg$3p3$1@sea.gmane.org>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com><44ED85E5.1000005@livinglogic.de>	<44ED9206.1080306@gmail.com><eck69v$b6n$1@sea.gmane.org>	<44EEC7CB.2090908@gmail.com>	<ecmi3j$m4m$1@sea.gmane.org>	<44EEEE2A.9080509@gmail.com>
	<ecnhgg$3p3$1@sea.gmane.org>
Message-ID: <44F00007.3050107@gmail.com>

Ron Adam wrote:
> Nick Coghlan wrote:
>> Fredrik Lundh wrote:
>>> Nick Coghlan wrote:
>>>
>>>>> Nick Coghlan wrote:
>>>>>
>>>>>> With a variety of "view types", that work like the corresponding builtin type,
>>>>>> but reference the original data structure instead of creating copies
>>>>> support for string views would require some serious interpreter surgery, though,
>>>>> and probably break quite a few extensions...
>>>> Why do you say that?
>>> because I happen to know a lot about how Python's string types are
>>> implemented ?
>> I believe you're thinking about something far more sophisticated than what I'm 
>> suggesting. I'm just talking about a Python data type in a standard library 
>> module that trades off slower performance with smaller strings (due to extra 
>> method call overhead) against improved scalability (due to avoidance of 
>> copying strings around).
>>
>>>> make a view of it
>>> so to make a view of a string, you make a view of it ?
>> Yep - by using all those "start" and "stop" optional arguments to builtin 
>> string methods to implement the methods of a string view in pure Python. By 
>> creating the string view all you would really be doing is a partial 
>> application of start and stop arguments on all of the relevant string methods.
>>
>> I've included an example below that just supports __len__, __str__ and 
>> partition(). The source object survives for as long as the view does - the 
>> idea is that the view should only last while you manipulate the string, with 
>> only real strings released outside the function via return statements or yield 
>> expressions.
> 
> 
>    >>>  self.source = "%s" % source
> 
> I think this should be:
> 
>     self.source = source
> 
> Otherwise you are making copies of the source, which is what you
> are trying to avoid.  I'm not sure if Python would reuse the self.source 
> string, but I wouldn't count on it.

CPython 2.5 certainly doesn't reuse the existing string object. Given that 
what I wrote is the way to ensure you have a builtin string type (str or 
unicode) without coercing actual unicode objects to str objects or vice-versa, 
it should probably be subjected to the same optimisation as the str() and 
unicode() constructors (i.e., simply increfing and returning the original 
builtin string).

> It might be nice if slice objects could be used in more ways in python. 
> That may work in most cases where you would want a string view.

That's quite an interesting idea. With that approach, rather than having to 
duplicate 'concrete sequence with copying semantics' and 'sequence view with 
non-copying semantics' everywhere, you could just provide methods on objects 
that returned the appropriate slice objects representing the location of 
relevant sections, rather than copies of the sections themselves.

To make that work effectively, you'd need to implement __nonzero__ on slice 
objects as "((self.stop - self.start) // self.step) > 0" (Either that or 
implement __len__, which would contribute to making slice() look more and more 
like xrange(), as someone else noted recently).
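Since CPython's slice type can't be subclassed, the suggested truthiness test can be sketched today as a standalone helper (hypothetical name, modern Python where range() plays xrange()'s role):

```python
def slice_is_nonempty(s, length):
    """Return True if slice s selects at least one index of a sequence
    of the given length.  Sketch of the suggested __nonzero__ logic."""
    # indices() resolves None and negative fields against a concrete
    # length; range() then gives the element count in O(1).
    start, stop, step = s.indices(length)
    return len(range(start, stop, step)) > 0
```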

Using the same signature as partition:

    def partition_indices(self, sep, start=None, stop=None):
        if start is None: start = 0
        if stop is None: stop = len(self)
        try:
            idxsep = self.index(sep, start, stop)
        except ValueError:
            return slice(start, stop), slice(0), slice(0)
        endsep = idxsep + len(sep)
        return slice(start, idxsep), slice(idxsep, endsep), slice(endsep, stop)

Then partition() itself would be equivalent to:

    def partition(self, sep, start=None, stop=None):
        before, sep, after = self.partition_indices(sep, start, stop)
        return self[before], self[sep], self[after]
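The pair above can be tried out today as module-level functions over plain strings (a sketch; the proposal would of course put them on str itself):

```python
def partition_indices(s, sep, start=None, stop=None):
    # Return three slice objects locating before/sep/after within s,
    # instead of three new string copies.
    if start is None:
        start = 0
    if stop is None:
        stop = len(s)
    try:
        idxsep = s.index(sep, start, stop)
    except ValueError:
        # Separator absent: everything is "before", the rest is empty.
        return slice(start, stop), slice(0), slice(0)
    endsep = idxsep + len(sep)
    return slice(start, idxsep), slice(idxsep, endsep), slice(endsep, stop)

def partition(s, sep, start=None, stop=None):
    # Copying variant built on top of the index-returning one.
    before, found, after = partition_indices(s, sep, start, stop)
    return s[before], s[found], s[after]
```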

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From jcarlson at uci.edu  Sat Aug 26 10:29:01 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sat, 26 Aug 2006 01:29:01 -0700
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44EFF7F2.7070407@gmail.com>
References: <44EF61E1.5050001@solarsail.hcs.harvard.edu>
	<44EFF7F2.7070407@gmail.com>
Message-ID: <20060826012418.1ABA.JCARLSON@uci.edu>


Nick Coghlan <ncoghlan at gmail.com> wrote:
> Ivan Krstić wrote:
> > Jean-Paul Calderone wrote:
> >> http://twistedmatrix.com/trac/browser/sandbox/itamar/cppreactor/fusion
> > 
> > This is the same Itamar who, in the talk I linked a few days ago
> > (http://ln-s.net/D+u) extolled buffer as a very real performance
> > improvement in fast python networking, and asked for broader and more
> > complete support for buffers, rather than their removal.
> > 
> > A bunch of people, myself included, want to use Python as a persistent
> > network server. Proper support for reading into already-allocated
> > memory, and non-copying strings are pretty indispensable for serious
> > production use.
> 
> A mutable bytes type with deque-like performance characteristics (i.e O(1) 
> insert/pop at index 0 as well as at the end), as well as the appropriate 
> mutating methods (like read_into()) should go a long way to meeting those needs.

The implementation of deque and the idea behind bytes are not compatible. 
Everything I've heard about the proposal of bytes is that it is
effectively a C unsigned char[] with some convenience methods, very
similar to a Python array.array("B"), with different methods.  There is
also an implementation in the Py3k branch.

Also, while I would have a use for bytes as currently implemented (with
readinto() ), I would have approximately zero use for a deque-like bytes
object (never mind that due to Python not allowing multi-segment buffers,
etc., it would be functionally impossible to get equivalent time bounds).
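For reference, reading into already-allocated memory along the lines described here looks like this with a preallocated buffer (a modern sketch; io.BytesIO stands in for a socket or file):

```python
import io

# Preallocate a writable buffer once, then read directly into it;
# no intermediate string object is created per read.
buf = bytearray(8)
stream = io.BytesIO(b"spam and eggs")
n = stream.readinto(buf)
chunk = bytes(buf[:n])
```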

 - Josiah


From ncoghlan at iinet.net.au  Sat Aug 26 11:12:27 2006
From: ncoghlan at iinet.net.au (Nick Coghlan)
Date: Sat, 26 Aug 2006 19:12:27 +1000
Subject: [Python-3000] Making more effective use of slice objects in Py3k
Message-ID: <44F0107B.20205@iinet.net.au>

This idea is inspired by the find/rfind string discussion (particularly a 
couple of comments from Jim and Ron), but I think the applicability may prove 
to be wider than just string methods (e.g. I suspect it may prove useful for 
the bytes() type as well).

Copy-on-slice semantics are by far the easiest semantics to deal with in most 
cases, as they result in the fewest nasty surprises. However, they have one 
obvious drawback: performance can suffer badly when dealing with large 
datasets (copying 10 MB chunks of memory around can take a while!).

There are a couple of existing workarounds for this: buffer() objects, and the 
start/stop arguments to a variety of string methods. Neither of these is 
particularly convenient to work with, and buffer() is slated to go away in Py3k.

I think an enriched slicing model that allows sequence views to be expressed 
easily as "this slice of this sequence" would allow this to be dealt with 
cleanly, without requiring every sequence to provide a corresponding "sequence 
view" with non-copying semantics. I think Guido's concern that people will 
reach for string views when they don't need them is also valid (as I believe 
that it is most often inexperience that leads to premature optimization that 
then leads to needless code complexity).

The specific changes I suggest based on the find/rfind discussion are:

   1. make range() (what used to be xrange()) a subclass of slice(), so that 
range objects can be used to index sequences. The only differences between 
range() and slice() would then be that start/stop/step will never be None for 
range instances, and range instances act like an immutable sequence while 
slice instances do not (i.e. range objects would grow an indices() method).

   2. change range() and slice() to accept slice() instances as arguments so 
that range(range(0)) is equivalent to range(0). (range(x) may throw ValueError 
if x.stop is None).

   3. change API's that currently accept start/stop arguments (like string 
methods) to accept a single slice() instance instead (possibly raising 
ValueError if step != 1).

   4. provide an additional string method partition_indices() that returns 3 
range() objects instead of 3 new strings

The new method would have semantics like:

   def partition_indices(self, sep, limits=None):
       if limits is None:
           limits = range(0, len(self))
       else:
           limits = limits.indices(len(self))
       try:
           idxsep = self.index(sep, limits)
       except ValueError:
           return limits, range(0), range(0)
       endsep = idxsep + len(sep)
       return (range(limits.start, idxsep),
               range(idxsep, endsep),
               range(endsep, limits.stop))

With partition() itself being equivalent to:

     def partition(self, sep, subseq=None):
         before, sep, after = self.partition_indices(sep, subseq)
         return self[before], self[sep], self[after]

Finally, an efficient partition based implementation of the example from 
Walter that started the whole discussion about views and the problem with 
excessive copying would look like:

def splitpartition_indices(s):
      rest = range(len(s))
      while 1:
          prefix, lbrace, rest = s.partition_indices("{", rest)
          first, space, rest = s.partition_indices(" ", rest)
          second, rbrace, rest = s.partition_indices("}", rest)
          if prefix:
              yield (None, s[prefix])
          if not (lbrace and space and rbrace):
              break
          yield (s[first], s[second])

(I know the above misses a micro-optimization, in that it calls partition 
again on an empty subsequence, even if space or lbrace are False. I believe 
doing the three partition calls together makes it much easier to read, and 
searching an empty string is pretty quick).

For comparison, here's the normal copying version that has problems scaling to 
large strings:

def splitpartition(s):
      rest = s
      while 1:
          prefix, lbrace, rest = rest.partition_indices("{")
          first, space, rest = rest.partition_indices(" ")
          second, rbrace, rest = rest.partition_indices("}")
          if prefix:
              yield (None, prefix)
          if not (lbrace and space and rbrace):
              break
          yield (first, second)

Should I make a Py3k PEP for this?

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Sat Aug 26 11:40:19 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 26 Aug 2006 19:40:19 +1000
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F0107B.20205@iinet.net.au>
References: <44F0107B.20205@iinet.net.au>
Message-ID: <44F01703.6070200@gmail.com>


Nick Coghlan wrote:

A couple of errors in the sample code.

> The new method would have semantics like:
> 
>    def partition_indices(self, sep, limits=None):
>        if limits is None:
>            limits = range(0, len(self))
>        else:
>            limits = limits.indices(len(self))

Either that line should be:
            limits = range(*limits.indices(len(self)))

Or the definition of indices() would need to be changed to return a range() 
object instead of a 3-tuple.

> For comparison, here's the normal copying version that has problems scaling to 
> large strings:
> 
> def splitpartition(s):
>       rest = s
>       while 1:
>           prefix, lbrace, rest = rest.partition_indices("{")
>           first, space, rest = rest.partition_indices(" ")
>           second, rbrace, rest = rest.partition_indices("}")

Those 3 lines should be:
           prefix, lbrace, rest = rest.partition("{")
           first, space, rest = rest.partition(" ")
           second, rbrace, rest = rest.partition("}")

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From rrr at ronadam.com  Sat Aug 26 13:46:14 2006
From: rrr at ronadam.com (Ron Adam)
Date: Sat, 26 Aug 2006 06:46:14 -0500
Subject: [Python-3000] Droping find/rfind?
In-Reply-To: <44F00007.3050107@gmail.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E90C@au3010avexu1.global.avaya.com>	<20060823191222.1A76.JCARLSON@uci.edu>	<ca471dc20608231939j4205a75dxe0072efc5065cea9@mail.gmail.com><44ED85E5.1000005@livinglogic.de>	<44ED9206.1080306@gmail.com><eck69v$b6n$1@sea.gmane.org>	<44EEC7CB.2090908@gmail.com>	<ecmi3j$m4m$1@sea.gmane.org>	<44EEEE2A.9080509@gmail.com>	<ecnhgg$3p3$1@sea.gmane.org>
	<44F00007.3050107@gmail.com>
Message-ID: <ecpcfk$7dq$1@sea.gmane.org>

Nick Coghlan wrote:
> Ron Adam wrote:
>> Nick Coghlan wrote:

[clipped]

>> It might be nice if slice objects could be used in more ways in python. 
>> That may work in most cases where you would want a string view.
> 
> That's quite an interesting idea. With that approach, rather than having to 
> duplicate 'concrete sequence with copying semantics' and 'sequence view with 
> non-copying semantics' everywhere, you could just provide methods on objects 
> that returned the appropriate slice objects representing the location of 
> relevant sections, rather than copies of the sections themselves.

Yes, and possibly having more methods that accept slice objects could 
make that idea work in a way that would seem more natural.


> To make that work effectively, you'd need to implement __nonzero__ on slice 
> objects as "((self.stop - self.start) // self.step) > 0" (Either that or 
> implement __len__, which would contribute to making slice() look more and more 
> like xrange(), as someone else noted recently).

Since xrange() has the same signature, it might be nice to be able to
use a slice object directly in xrange to get indices to a substring or list.

For that to work, slice.indices would need to not return None, and/or
xrange would need to accept None.  They differ in how they handle
negative indices as well.  So I expect it may be too big of a change.
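The normalisation difference is visible in slice.indices(), which resolves None and negative fields against a concrete length, something [x]range never has to do (modern Python shown):

```python
# A slice may carry None and negative values...
s = slice(None, -1, None)
start, stop, step = s.indices(10)   # clamps against length 10

# ...whereas range/xrange demands concrete, resolved integers:
remaining = list(range(start, stop, step))
```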


> Using the same signature as partition:
> 
>     def partition_indices(self, sep, start=None, stop=None):
>         if start is None: start = 0
>         if stop is None: stop = len(self)
>         try:
>             idxsep = self.index(sep, start, stop)
>         except ValueError:
>             return slice(start, stop), slice(0), slice(0)
>         endsep = idxsep + len(sep)
>         return slice(start, idxsep), slice(idxsep, endsep), slice(endsep, stop)
> 
> Then partition() itself would be equivalent to:
> 
>     def partition(self, sep, start=None, stop=None):
>         before, sep, after = self.partition_indices(sep, start, stop)
>         return self[before], self[sep], self[after]
> 
> Cheers,
> Nick.


Just a little timing for the fun of it. ;-)


2.5c1 (r25c1:51305, Aug 17 2006, 10:41:11) [MSC v.1310 32 bit (Intel)]
splitindex      : 0.02866
splitview       : 0.28021
splitpartition  : 0.34991
splitslice      : 0.07892


This may not be the best use case (if you can call it that).  It does 
show that the slice-as-a-view idea may have some potential.  But 
underneath it's just using index, so a well-written function using index 
will probably always be faster.

Cheers,
    Ron


"""
     Compare different index, string view, and partition methods.
"""

# -------- Split by str.index.
def splitindex(s):
      pos = 0
      while True:
        try:
            posstart = s.index("{", pos)
            posarg = s.index(" ", posstart)
            posend = s.index("}", posarg)
        except ValueError:
            break
        yield None, s[pos:posstart]
        yield s[posstart+1:posarg], s[posarg+1:posend]
        pos = posend+1
      rest = s[pos:]
      if rest:
          yield None, rest


# --------- Simple string view.
class strview(object):
      def __new__(cls, source, start=None, stop=None):
          self = object.__new__(cls)
          self.source = source
          #self.start = start if start is not None else 0
          self.start = start != None and start or 0
          #self.stop = stop if stop is not None else len(source)
          self.stop = stop != None and stop or len(source)
          return self
      def __str__(self):
          return self.source[self.start:self.stop]
      def __len__(self):
          return self.stop - self.start
      def partition(self, sep):
          _src = self.source
          try:
              startsep = _src.index(sep, self.start, self.stop)
          except ValueError:
              # Separator wasn't found!
              return self, _NULL_STR, _NULL_STR
          # Return new views of the three string parts
          endsep = startsep + len(sep)
          return (strview(_src, self.start, startsep),
                  strview(_src, startsep, endsep),
                  strview(_src, endsep, self.stop))

_NULL_STR = strview('')

def splitview(s):
       rest = strview(s)
       while 1:
           prefix, found, rest = rest.partition("{")
           if prefix:
               yield (None, str(prefix))
           if not found:
               break
           first, found, rest = rest.partition(" ")
           if not found:
               break
           second, found, rest = rest.partition("}")
           if not found:
               break
           yield (str(first), str(second))


# -------- Split by str.partition.
def splitpartition(s):
     rest = s
     while 1:
         prefix, found, temp = rest.partition("{")
         first, found, temp = temp.partition(" ")
         second, found, temp = temp.partition("}")
         if not found: break
         yield None, prefix
         yield first, second
         rest = temp
     if rest != '':
         yield None, rest


# -------- Split by partition slices.
import sys

def partslice(s, sep, sub_slice=slice(0, sys.maxint)):
     start, stop = sub_slice.start, sub_slice.stop
     try:
         found = s.index(sep, start, stop)
     except ValueError:
         return sub_slice, slice(stop,stop), slice(stop,stop)
     foundend = found + len(sep)
     return ( slice(start, found),
              slice(found, foundend),
              slice(foundend, stop) )

def splitslice(s):
     rest = slice(0, sys.maxint)
     while 1:
         prefix, found, temp = partslice(s, "{", rest)
         first, found, temp = partslice(s, " ", temp)
         second, found, temp = partslice(s, "}", temp)
         if found.start == found.stop:
             break
         yield None, s[prefix]
         yield s[first], s[second]
         rest = temp
     if rest.start != rest.stop:
         yield None, s[rest]

# -------- Tests.
import time
print sys.version

s = 'foo{spam eggs}bar{ham eggs}fob{beacon eggs}' * 2000 + 'xyz'
r = list(splitindex(s))
functions = [splitindex, splitview, splitpartition, splitslice]
for f in functions:
     start = time.clock()
     result = list(f(s))
     print '%-16s: %7.5f' % (f.__name__, time.clock()-start)
     assert result == r





From qrczak at knm.org.pl  Sat Aug 26 14:41:57 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Sat, 26 Aug 2006 14:41:57 +0200
Subject: [Python-3000] long/int unification
In-Reply-To: <20060824232848.1A9F.JCARLSON@uci.edu> (Josiah Carlson's
	message of "Thu, 24 Aug 2006 23:39:22 -0700")
References: <1156470595.44ee57436b03d@www.domainfactory-webmail.de>
	<ecm342$ab0$1@sea.gmane.org> <20060824232848.1A9F.JCARLSON@uci.edu>
Message-ID: <87u03z1xey.fsf@qrnik.zagroda>

Josiah Carlson <jcarlson at uci.edu> writes:

> Also, depending on the objects, one may consider a few other tagged
> objects, like perhaps None, True, and False

I doubt that it's worth it: they are not dynamically computed anyway,
so there is little gain (only avoiding manipulating their refcounts),
and the loss is a greater number of special cases when accessing
contents of every object.

> or even just use 31/63 bits for the tagged integer value, with a 1
> in the lowest bit signifying it as a tagged integer.

This is exactly what my compiler of my language does.

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From tjreedy at udel.edu  Sat Aug 26 15:26:04 2006
From: tjreedy at udel.edu (Terry Reedy)
Date: Sat, 26 Aug 2006 09:26:04 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
References: <44F0107B.20205@iinet.net.au>
Message-ID: <ecpi5c$l5q$1@sea.gmane.org>


"Nick Coghlan" <ncoghlan at iinet.net.au> wrote in message 
news:44F0107B.20205 at iinet.net.au...

> I think an enriched slicing model that allows sequence views to be 
> expressed
> easily as "this slice of this sequence" would allow this to be dealt with
> cleanly, without requiring every sequence to provide a corresponding 
> "sequence
> view" with non-copying semantics.

I think this is promising.  I like the potential unification.

> Should I make a Py3k PEP for this?

I think so ;-)

tjr




From guido at python.org  Sat Aug 26 18:26:48 2006
From: guido at python.org (Guido van Rossum)
Date: Sat, 26 Aug 2006 09:26:48 -0700
Subject: [Python-3000] long/int unification
In-Reply-To: <ecmf3p$d1i$1@sea.gmane.org>
References: <1156470595.44ee57436b03d@www.domainfactory-webmail.de>
	<ecm342$ab0$1@sea.gmane.org> <20060824232848.1A9F.JCARLSON@uci.edu>
	<ecmf3p$d1i$1@sea.gmane.org>
Message-ID: <ca471dc20608260926g52e6fa8bh9e9c81598ddba74@mail.gmail.com>

On 8/25/06, Fredrik Lundh <fredrik at pythonware.com> wrote:
> Josiah Carlson wrote:
>
> > In the integer case, it reminds me of James Knight's tagged integer
> > patch to 2.3 [1].  If using long exclusively is 50% slower, why not try
> > the improved speed approach?
>
> looks like GvR was -1000 on this idea at the time, though...

I still am, because it requires extra tests for every incref and
decref and also for every use of an object's type pointer. I worry
about the cost of these tests, but I worry much more about the bugs it
will add when people don't tests first. ABC used this approach and we
kept finding bugs due to this problem.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Sat Aug 26 18:30:57 2006
From: guido at python.org (Guido van Rossum)
Date: Sat, 26 Aug 2006 09:30:57 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F0107B.20205@iinet.net.au>
References: <44F0107B.20205@iinet.net.au>
Message-ID: <ca471dc20608260930h528a7f60rc254eb2f75398a57@mail.gmail.com>

Can you explain in a sentence or two how these changes would be
*used*? Your code examples don't speak for themselves (maybe because
It's Saturday morning :-). Short examples of something clumsy and/or
slow that we'd have to write today compared to something fast and
elegant that we could write after the change would be quite helpful.
The exact inheritance relationship between slice and [x]range seems a
fairly uninteresting detail in comparison.

--Guido

On 8/26/06, Nick Coghlan <ncoghlan at iinet.net.au> wrote:
> This idea is inspired by the find/rfind string discussion (particularly a
> couple of comments from Jim and Ron), but I think the applicability may prove
> to be wider than just string methods (e.g. I suspect it may prove useful for
> the bytes() type as well).
>
> Copy-on-slice semantics are by far the easiest semantics to deal with in most
> cases, as they result in the fewest nasty surprises. However, they have one
> obvious drawback: performance can suffer badly when dealing with large
> datasets (copying 10 MB chunks of memory around can take a while!).
>
> There are a couple of existing workarounds for this: buffer() objects, and the
> start/stop arguments to a variety of string methods. Neither of these is
> particularly convenient to work with, and buffer() is slated to go away in Py3k.
>
> I think an enriched slicing model that allows sequence views to be expressed
> easily as "this slice of this sequence" would allow this to be dealt with
> cleanly, without requiring every sequence to provide a corresponding "sequence
> view" with non-copying semantics. I think Guido's concern that people will
> reach for string views when they don't need them is also valid (as I believe
> that it is most often inexperience that leads to premature optimization that
> then leads to needless code complexity).
>
> The specific changes I suggest based on the find/rfind discussion are:
>
>    1. make range() (what used to be xrange()) a subclass of slice(), so that
> range objects can be used to index sequences. The only differences between
> range() and slice() would then be that start/stop/step will never be None for
> range instances, and range instances act like an immutable sequence while
> slice instances do not (i.e. range objects would grow an indices() method).
>
>    2. change range() and slice() to accept slice() instances as arguments so
> that range(range(0)) is equivalent to range(0). (range(x) may throw ValueError
> if x.stop is None).
>
>    3. change API's that currently accept start/stop arguments (like string
> methods) to accept a single slice() instance instead (possibly raising
> ValueError if step != 1).
>
>    4. provide an additional string method partition_indices() that returns 3
> range() objects instead of 3 new strings
>
> The new method would have semantics like:
>
>    def partition_indices(self, sep, limits=None):
>        if limits is None:
>            limits = range(0, len(self))
>        else:
>            limits = limits.indices(len(self))
>        try:
>            idxsep = self.index(sep, limits)
>        except ValueError:
>            return limits, range(0), range(0)
>        endsep = idxsep + len(sep)
>        return (range(limits.start, idxsep),
>                range(idxsep, endsep),
>                range(endsep, limits.stop))
>
> With partition() itself being equivalent to:
>
>      def partition(self, sep, subseq=None):
>          before, sep, after = self.partition_indices(sep, subseq)
>          return self[before], self[sep], self[after]
>
> Finally, an efficient partition based implementation of the example from
> Walter that started the whole discussion about views and the problem with
> excessive copying would look like:
>
> def splitpartition_indices(s):
>       rest = range(len(s))
>       while 1:
>           prefix, lbrace, rest = s.partition_indices("{", rest)
>           first, space, rest = s.partition_indices(" ", rest)
>           second, rbrace, rest = s.partition_indices("}", rest)
>           if prefix:
>               yield (None, s[prefix])
>           if not (lbrace and space and rbrace):
>               break
>           yield (s[first], s[second])
>
> (I know the above misses a micro-optimization, in that it calls partition
> again on an empty subsequence, even if space or lbrace are False. I believe
> doing the three partition calls together makes it much easier to read, and
> searching an empty string is pretty quick).
>
> For comparison, here's the normal copying version that has problems scaling to
> large strings:
>
> def splitpartition(s):
>       rest = s
>       while 1:
>           prefix, lbrace, rest = rest.partition_indices("{")
>           first, space, rest = rest.partition_indices(" ")
>           second, rbrace, rest = rest.partition_indices("}")
>           if prefix:
>               yield (None, prefix)
>           if not (lbrace and space and rbrace):
>               break
>           yield (first, second)
>
> Should I make a Py3k PEP for this?
>
> Cheers,
> Nick.
>
> --
> Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
> ---------------------------------------------------------------
>              http://www.boredomandlaziness.org
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Sat Aug 26 19:00:41 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sat, 26 Aug 2006 10:00:41 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F0107B.20205@iinet.net.au>
References: <44F0107B.20205@iinet.net.au>
Message-ID: <20060826084138.1AC0.JCARLSON@uci.edu>


Nick Coghlan <ncoghlan at iinet.net.au> wrote:
> 
> This idea is inspired by the find/rfind string discussion (particularly a 
> couple of comments from Jim and Ron), but I think the applicability may prove 
> to be wider than just string methods (e.g. I suspect it may prove useful for 
> the bytes() type as well).

A couple comments...

I don't particularly like the idea of using lists (or really iter(list) ),
range, or slice objects as defining what indices remain for a particular
string operation.  It just doesn't seem like the *right* thing to do.

> There are a couple of existing workarounds for this: buffer() objects, and the 
> start/stop arguments to a variety of string methods. Neither of these is 
> particular convenient to work with, and buffer() is slated to go away in Py3k.

Ahh, but string views offer a significantly more reasonable mechanism.

string = stringview(string)

Now, you can do things like parition(), slicing (with step=1), etc., and
all can return further string views.  Users don't need to learn a new
semantic (pass the sequence of indices).  We can toss all of the
optional start, stop arguments to all string functions, and replace them
with either of the following:
    result = stringview(string, start=None, stop=None).method(args)
    
    string = stringview(string)
    result = string[start:stop].method(args)
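A minimal sketch (illustrative API only, modern Python) of a stringview whose step-1 slicing returns further non-copying views, as in the second form above; negative indices and step != 1 are deliberately unsupported:

```python
class stringview(object):
    """Slicing returns another view over the same source string;
    str() materialises a real string.  Hypothetical sketch only."""
    def __init__(self, source, start=0, stop=None):
        if isinstance(source, stringview):
            # Views of views share a single underlying source string.
            base, offset, limit = source.source, source.start, source.stop
        else:
            base, offset, limit = source, 0, len(source)
        if stop is None:
            stop = limit - offset
        self.source = base
        self.start = offset + start
        self.stop = min(offset + stop, limit)

    def __len__(self):
        return max(0, self.stop - self.start)

    def __str__(self):
        return self.source[self.start:self.stop]

    def __getitem__(self, item):
        if isinstance(item, slice):
            start = item.start or 0
            stop = len(self) if item.stop is None else item.stop
            return stringview(self, start, stop)   # no copying
        return self.source[self.start + item]

    def startswith(self, prefix):
        # Forwarded method: internally reuses the start/stop arguments
        # that already exist on the builtin string methods.
        return self.source.startswith(prefix, self.start, self.stop)
```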


Perhaps one of the reasons why I prefer string views over this indices
mechanism is because I'm familiar with buffers, the idea of just having
a pointer into another structure, etc.  It just feels more natural from
my 8 years of C and 6 years of Python.


 - Josiah


From jackdied at jackdied.com  Sun Aug 27 02:24:04 2006
From: jackdied at jackdied.com (Jack Diederich)
Date: Sat, 26 Aug 2006 20:24:04 -0400
Subject: [Python-3000] find -> index patch
In-Reply-To: <eckao9$rq9$1@sea.gmane.org>
References: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>
	<eckao9$rq9$1@sea.gmane.org>
Message-ID: <20060827002404.GG24154@performancedrivers.com>

On Thu, Aug 24, 2006 at 03:48:57PM +0200, Fredrik Lundh wrote:
> Michael Chermside wrote:
> 
> >> WOW, I love partition.  In all the instances that weren't a simple "in"
> >> test I ended up using [r]partition.  In some cases one of the returned
> >> strings gets thrown away but in those cases it is guaranteed to be small.
> >> The new code is usually smaller than the old and generally clearer.
> >
> > Wow. That's just beautiful. This has now convinced me that dumping
> > [r]find() (at least!) and pushing people toward using partition will
> > result in pain in the short term (of course), and beautiful, readable
> > code in the long term.
> 
> note that partition provides an elegant solution to an important *subset* of all
> problems addressed by find/index.
> 
> just like lexical scoping vs. default arguments and map vs. list comprehensions,
> it doesn't address all problems right out of the box, and shouldn't be advertised
> as doing that.
> 

After some benchmarking, find() can't go away without really hurting readline() 
performance.  partition performs as well as find for small lines, but for large 
lines the extra copy to concat the newline separator is a killer (twice as slow 
for 50k char lines).  index has the opposite problem: the overhead of setting up
a try block makes 50 char lines twice as slow, even when the except clause is never 
triggered.
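That trade-off can be reproduced with a rough timeit sketch (the line lengths, repeat counts, and helper names here are arbitrary assumptions, not the original benchmark):

```python
import timeit

line = "x" * 50000 + "\n" + "tail"

def with_find(s):
    # No exception machinery, no extra copy of the head.
    pos = s.find("\n")
    return s[:pos + 1] if pos >= 0 else s

def with_partition(s):
    head, sep, _tail = s.partition("\n")
    return head + sep          # the concat re-copies the large head

def with_index(s):
    # try/except setup costs on every call, even when never triggered.
    try:
        pos = s.index("\n")
    except ValueError:
        return s
    return s[:pos + 1]

for fn in (with_find, with_partition, with_index):
    t = timeit.timeit(lambda: fn(line), number=2000)
    print(fn.__name__, round(t, 4))
```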

A version of partition that returned two arguments instead of three would solve
the problem, but that would just be adding more functions to remove the two finds
or adding behavior flags to partition.  Ick.

Most uses of find are better off using partition but if this one case can't
be beat there must be others too.

-Jack

From jimjjewett at gmail.com  Sun Aug 27 03:59:25 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Sat, 26 Aug 2006 21:59:25 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060826084138.1AC0.JCARLSON@uci.edu>
References: <44F0107B.20205@iinet.net.au>
	<20060826084138.1AC0.JCARLSON@uci.edu>
Message-ID: <fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>

On 8/26/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> Nick Coghlan <ncoghlan at iinet.net.au> wrote:

> > There are a couple of existing workarounds for
> > this: buffer() objects, and the start/stop arguments
> > to a variety of string methods. Neither of these is
> > particular convenient to work with, and buffer() is
> > slated to go away in Py3k.

> Ahh, but string views offer a significantly more
> reasonable mechanism.

As I understand it, Nick is suggesting that slice objects be used as a
sequence (not just string) view.


> string = stringview(string)
> ...  We can toss all of the optional start, stop
> arguments to all string functions, and replace them
> with either of the following:
>     result = stringview(string, start=None, stop=None).method(args)

>     string = stringview(string)
>     result = string[start:stop].method(args)

Under Nick's proposal, I believe we could replace it with just the final line.

    result = string[start:stop].method(args)

though there is a chance that (when you want to avoid copying) he is
suggesting explicit slice objects such as

    view=slice(start, stop)
    result = view(string).method(args)

-jJ

From jimjjewett at gmail.com  Sun Aug 27 04:42:02 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Sat, 26 Aug 2006 22:42:02 -0400
Subject: [Python-3000] path in py3K Re: [Python-checkins] r51624 - in
	python/trunk/Lib: genericpath.py macpath.py ntpath.py
	os2emxpath.py posixpath.py test/test_genericpath.py
Message-ID: <fb6fbf560608261942x37d79be3i197324e35a6849cb@mail.gmail.com>

In Py3K, is it still safe to assume that a list of paths will be
(enough like) ordinary strings?

I ask because of the various Path object discussions; it wasn't clear
that a Path object should be a sequence of (normalized unicode?)
characters (rather than path components), that the path would always
be normalized or absolute, or even that it would implement the LE (or
LT?) comparison operator.

-jJ

On 8/26/06, jack.diederich <python-checkins at python.org> wrote:
> Author: jack.diederich
> Date: Sat Aug 26 20:42:06 2006
> New Revision: 51624

> Added: python/trunk/Lib/genericpath.py

> +# Return the longest prefix of all list elements.
> +def commonprefix(m):
> +    "Given a list of pathnames, returns the longest common leading component"
> +    if not m: return ''
> +    s1 = min(m)
> +    s2 = max(m)
> +    n = min(len(s1), len(s2))
> +    for i in xrange(n):
> +        if s1[i] != s2[i]:
> +            return s1[:i]
> +    return s1[:n]
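
As a usage note on the helper quoted above: it compares character by character,
not path component by path component, which is one reason the "paths as plain
strings" question matters. A quick illustration (same algorithm, with xrange
spelled as range for a modern interpreter):

```python
def commonprefix(m):
    "Given a list of pathnames, returns the longest common leading component"
    # same algorithm as the quoted checkin, with xrange spelled as range
    if not m:
        return ''
    s1 = min(m)  # lexicographically smallest
    s2 = max(m)  # lexicographically largest
    n = min(len(s1), len(s2))
    for i in range(n):
        if s1[i] != s2[i]:
            return s1[:i]
    return s1[:n]

# The result need not be a valid directory prefix:
print(commonprefix(['/usr/lib', '/usr/local']))  # -> '/usr/l'
```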

From guido at python.org  Sun Aug 27 04:51:03 2006
From: guido at python.org (Guido van Rossum)
Date: Sat, 26 Aug 2006 19:51:03 -0700
Subject: [Python-3000] find -> index patch
In-Reply-To: <20060827002404.GG24154@performancedrivers.com>
References: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>
	<eckao9$rq9$1@sea.gmane.org>
	<20060827002404.GG24154@performancedrivers.com>
Message-ID: <ca471dc20608261951g2f2c7fe3p2b2a63eae7b563df@mail.gmail.com>

On 8/26/06, Jack Diederich <jackdied at jackdied.com> wrote:
> After some benchmarking find() can't go away without really hurting readline()
> performance.

Can you elaborate? readline() is typically implemented in C so I'm not
sure I follow.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Sun Aug 27 05:00:05 2006
From: guido at python.org (Guido van Rossum)
Date: Sat, 26 Aug 2006 20:00:05 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>
	<20060826084138.1AC0.JCARLSON@uci.edu>
	<fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
Message-ID: <ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>

On 8/26/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 8/26/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > Nick Coghlan <ncoghlan at iinet.net.au> wrote:
>
> > > There are a couple of existing workarounds for
> > > this: buffer() objects, and the start/stop arguments
> > > to a variety of string methods. Neither of these is
> > > particular convenient to work with, and buffer() is
> > > slated to go away in Py3k.
>
> > Ahh, but string views offer a significantly more
> > reasonable mechanism.
>
> As I understand it, Nick is suggesting that slice objects be used as a
> sequence (not just string) view.

I have a hard time parsing this sentence. A slice is an object with
three immutable attributes -- start, stop, step. How does this double
as a string view?

> > string = stringview(string)
> > ...  We can toss all of the optional start, stop
> > arguments to all string functions, and replace them
> > with either of the following:
> >     result = stringview(string, start=None, stop=None).method(args)
>
> >     string = stringview(string)
> >     result = string[start:stop].method(args)
>
> Under Nick's proposal, I believe we could replace it with just the final line.

I still don't see the transformation of clumsy to elegant. Please give
me a complete, specific example instead of a generic code snippet.
(Also, please don't use 'string' as a variable name. There's a module
by that name that I can't get out of my head.)

Maybe the idea is that instead of

  pos = s.find(t, pos)

we would write

  pos += stringview(s)[pos:].find(t)

???

And how is that easier on the eyes? (And note the need to use +=
because the sliced view renumbers the positions in the original
string.)

>     result = string[start:stop].method(args)
>
> though there is a chance that (when you want to avoid copying) he is
> suggesting explicit slice objects such as
>
>     view=slice(start, stop)
>     result = view(string).method(args)

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Sun Aug 27 05:01:27 2006
From: guido at python.org (Guido van Rossum)
Date: Sat, 26 Aug 2006 20:01:27 -0700
Subject: [Python-3000] path in py3K Re: [Python-checkins] r51624 - in
	python/trunk/Lib: genericpath.py macpath.py ntpath.py
	os2emxpath.py posixpath.py test/test_genericpath.py
In-Reply-To: <fb6fbf560608261942x37d79be3i197324e35a6849cb@mail.gmail.com>
References: <fb6fbf560608261942x37d79be3i197324e35a6849cb@mail.gmail.com>
Message-ID: <ca471dc20608262001s3f1d0b62id94a0e4660839dd1@mail.gmail.com>

It is not my intention to adopt the Path module in Py3k.

On 8/26/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> In Py3K, is it still safe to assume that a list of paths will be
> (enough like) ordinary strings?
>
> I ask because of the various Path object discussions; it wasn't clear
> that a Path object should be a sequence of (normalized unicode?)
> characters (rather than path components), that the path would always
> be normalized or absolute, or even that it would implement the LE (or
> LT?) comparison operator.
>
> -jJ
>
> On 8/26/06, jack.diederich <python-checkins at python.org> wrote:
> > Author: jack.diederich
> > Date: Sat Aug 26 20:42:06 2006
> > New Revision: 51624
>
> > Added: python/trunk/Lib/genericpath.py
>
> > +# Return the longest prefix of all list elements.
> > +def commonprefix(m):
> > +    "Given a list of pathnames, returns the longest common leading component"
> > +    if not m: return ''
> > +    s1 = min(m)
> > +    s2 = max(m)
> > +    n = min(len(s1), len(s2))
> > +    for i in xrange(n):
> > +        if s1[i] != s2[i]:
> > +            return s1[:i]
> > +    return s1[:n]
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jimjjewett at gmail.com  Sun Aug 27 05:30:30 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Sat, 26 Aug 2006 23:30:30 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>
	<20060826084138.1AC0.JCARLSON@uci.edu>
	<fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
	<ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
Message-ID: <fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>

On 8/26/06, Guido van Rossum <guido at python.org> wrote:
> On 8/26/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> > On 8/26/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > > Nick Coghlan <ncoghlan at iinet.net.au> wrote:

> > > > There are a couple of existing workarounds for
> > > > this: buffer() objects, and the start/stop
> > > > arguments to a variety of string methods.

> > > Ahh, but string views offer a significantly more
> > > reasonable mechanism.

> > As I understand it, Nick is suggesting that slice
> > objects be used as a sequence (not just string)
> > view.

> I have a hard time parsing this sentence. A slice is
> an object with three immutable attributes -- start,
> stop, step. How does this double as a string view?

Poor wording on my part; it is (the application of a slice to a
specific sequence) that could act as a copyless view.

For example, you wanted to keep the rarely used optional arguments to
find because of efficiency.

    s.find(prefix, start, stop)

does not copy.  If slices were less eager at copying, this could be
rewritten as

    view=slice(start, stop, 1)
    view(s).find(prefix)

or perhaps even as

    s[start:stop].find(prefix)

I'm not sure these look better, but they are less surprising, because
they don't depend on optional arguments that most people have
forgotten about.


> Maybe the idea is that instead of

>   pos = s.find(t, pos)

> we would write

>   pos += stringview(s)[pos:].find(t)

> ???

With stringviews, you wouldn't need to be reindexing from the start of
the original string.  The idiom would instead be a generalization of
"for line in file:"

    while data:
        chunk, sep, data = data.partition()

but the partition call would not need to copy the entire string; it
could simply return three views.

Yes, this does risk keeping all of data alive because one chunk was
saved.  This might be a reasonable tradeoff to avoid the copying.  If
not, perhaps the gc system could be augmented to shrink bloated views
during idle moments.
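
A minimal sketch of that idiom, assuming a hypothetical stringview type (no
such class exists; this is only meant to show how partition() could return
three windows onto one buffer):

```python
class stringview:
    """Non-copying window onto a string (illustrative sketch only)."""

    def __init__(self, s, start=0, stop=None):
        self.s = s
        self.start = start
        self.stop = len(s) if stop is None else stop

    def __len__(self):
        return max(0, self.stop - self.start)

    def __bool__(self):
        return len(self) > 0

    def __str__(self):
        # the one place a copy happens: converting back to a real string
        return self.s[self.start:self.stop]

    def partition(self, sep):
        # like str.partition, but the three results are views, not copies
        i = self.s.find(sep, self.start, self.stop)
        if i == -1:
            empty = stringview(self.s, self.stop, self.stop)
            return self, empty, empty
        return (stringview(self.s, self.start, i),
                stringview(self.s, i, i + len(sep)),
                stringview(self.s, i + len(sep), self.stop))

data = stringview("a\nbb\nccc")
chunks = []
while data:
    chunk, sep, data = data.partition("\n")
    chunks.append(str(chunk))
print(chunks)  # -> ['a', 'bb', 'ccc']
```

As noted above, every chunk here keeps the whole original string alive, which
is exactly the tradeoff being discussed.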

-jJ

From jackdied at jackdied.com  Sun Aug 27 06:12:27 2006
From: jackdied at jackdied.com (Jack Diederich)
Date: Sun, 27 Aug 2006 00:12:27 -0400
Subject: [Python-3000] find -> index patch
In-Reply-To: <ca471dc20608261951g2f2c7fe3p2b2a63eae7b563df@mail.gmail.com>
References: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>
	<eckao9$rq9$1@sea.gmane.org>
	<20060827002404.GG24154@performancedrivers.com>
	<ca471dc20608261951g2f2c7fe3p2b2a63eae7b563df@mail.gmail.com>
Message-ID: <20060827041227.GJ24154@performancedrivers.com>

On Sat, Aug 26, 2006 at 07:51:03PM -0700, Guido van Rossum wrote:
> On 8/26/06, Jack Diederich <jackdied at jackdied.com> wrote:
> > After some benchmarking find() can't go away without really hurting readline()
> > performance.
> 
> Can you elaborate? readline() is typically implemented in C so I'm not
> sure I follow.
> 

A number of modules in Lib have readline() methods that currently use find():
StringIO, httplib, tarfile, and others.

sprat:~/src/python-head/Lib# grep 'def readline' *.py | wc -l
30

Mainly I wanted to point out that find() solves a class of problems that
can't be solved equally well with partition() (bad for large strings that
want to preserve the separator) or index() (bad for large numbers of small
strings and for frequent misses).  I wanted to reach the conclusion that
find() could be yanked out, but as Fredrik opined, it is still useful for a
subset of problems.

-Jack

From jcarlson at uci.edu  Sun Aug 27 08:08:14 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sat, 26 Aug 2006 23:08:14 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
References: <ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
	<fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
Message-ID: <20060826230223.1AD6.JCARLSON@uci.edu>


"Jim Jewett" <jimjjewett at gmail.com> wrote:
> With stringviews, you wouldn't need to be reindexing from the start of
> the original string.  The idiom would instead be a generalization of
> "for line in file:"
> 
>     while data:
>         chunk, sep, data = data.partition()
> 
> but the partition call would not need to copy the entire string; it
> could simply return three views.

Also, with a little work, having string views be smart about
concatenation (if two views are adjacent to each other, like chunk,sep
or sep,data above, view1+view2 -> view3 on the original string), copies
could further be minimized, and the earlier problem with readline, etc.,
can be avoided.
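
That merging rule might be sketched roughly like this (hypothetical View
class, kept deliberately tiny):

```python
class View:
    """Non-copying (string, start, stop) triple -- illustrative only."""

    def __init__(self, s, start, stop):
        self.s, self.start, self.stop = s, start, stop

    def __add__(self, other):
        # adjacent views on the same string merge into one wider view
        if isinstance(other, View) and self.s is other.s \
                and self.stop == other.start:
            return View(self.s, self.start, other.stop)
        # otherwise fall back to a real, copied string
        return str(self) + str(other)

    def __str__(self):
        return self.s[self.start:self.stop]

s = "hello\nworld"
chunk, sep = View(s, 0, 5), View(s, 5, 6)
line = chunk + sep  # adjacent, so still a View: no copy made
print(type(line).__name__, repr(str(line)))  # -> View 'hello\n'
```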

 - Josiah


From jcarlson at uci.edu  Sun Aug 27 08:23:38 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sat, 26 Aug 2006 23:23:38 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
References: <20060826084138.1AC0.JCARLSON@uci.edu>
	<fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
Message-ID: <20060826230846.1AD9.JCARLSON@uci.edu>


"Jim Jewett" <jimjjewett at gmail.com> wrote:
> 
> On 8/26/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > Nick Coghlan <ncoghlan at iinet.net.au> wrote:
> 
> > > There are a couple of existing workarounds for
> > > this: buffer() objects, and the start/stop arguments
> > > to a variety of string methods. Neither of these is
> > > particular convenient to work with, and buffer() is
> > > slated to go away in Py3k.
> 
> > Ahh, but string views offer a significantly more
> > reasonable mechanism.
> 
> As I understand it, Nick is suggesting that slice objects be used as a
> sequence (not just string) view.

I'm not sure there is a compelling use-case for offering views on
general ordered sequences (lists).  Unicode and bytes strings, sure, but
I don't think I've ever really been hurting for faster/more memory
efficient list slicing...  Maybe I'm strange.

 - Josiah


From ncoghlan at iinet.net.au  Sun Aug 27 16:59:24 2006
From: ncoghlan at iinet.net.au (Nick Coghlan)
Date: Mon, 28 Aug 2006 00:59:24 +1000
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>	
	<20060826084138.1AC0.JCARLSON@uci.edu>	
	<fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>	
	<ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
	<fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
Message-ID: <44F1B34C.4020601@iinet.net.au>

Jim Jewett wrote:
> On 8/26/06, Guido van Rossum <guido at python.org> wrote:
>> On 8/26/06, Jim Jewett <jimjjewett at gmail.com> wrote:
>> > As I understand it, Nick is suggesting that slice
>> > objects be used as a sequence (not just string)
>> > view.
> 
>> I have a hard time parsing this sentence. A slice is
>> an object with three immutable attributes -- start,
>> stop, step. How does this double as a string view?
> 
> Poor wording on my part; it is (the application of a slice to a
> specific sequence) that could act as copyless view.
> 
> For example, you wanted to keep the rarely used optional arguments to
> find because of efficiency.
> 
>    s.find(prefix, start, stop)
> 
> does not copy.  If slices were less eager at copying, this could be
> rewritten as
> 
>    view=slice(start, stop, 1)
>    view(s).find(prefix)
> 
> or perhaps even as
> 
>    s[start:stop].find(prefix)
> 
> I'm not sure these look better, but they are less surprising, because
> they don't depend on optional arguments that most people have
> forgotten about.

Actually, string views have nothing to do with what I'm suggesting (although 
my comments about them in the find/rfind thread were one of the things that 
fed into this message). I'm actually proposing an *alternative* to string 
views, because they have a nasty problem with non-local effects. It is easy to 
pass or return a string view instead of an actual string, and you get 
something that runs with subtly different semantics from what you expect, but 
that isn't likely to trigger an obvious error. It also breaks the persistent 
idiom that "seq[:]" makes a copy (which is true throughout the standard 
library, even if it isn't true for external number-crunching libraries like 
NumPy).

You also potentially end up with *every* sequence type growing an
"x-view" counterpart, which is horrible. OTOH, if we make the standard library
more consistent in always using a slice or range object anytime it wants to 
pass or return (start, stop, step) information, it provides a foundation for 
someone to do their own non-copying versions.

So with my musings, the non-copying index operation in a subsection would 
still use an optional second argument:

    s.find(prefix, slice(start, stop))

Now, the ultimate extension of this idea would be to permit slice literals in 
places other than sequence indexing (similar to how Py3k is likely to permit 
Ellipsis literals outside of subscript expressions). Naturally, parentheses 
may be needed in order to disambiguate colons:

    s.find(prefix, (start:stop))

Contrast this with the copying version:

    s[start:stop].find(prefix)

If (start:stop:step) is equivalent to slice(start, stop, step), then slice 
notation can be used to create ranges: range(start:stop:step)

The idea of making slice objects callable, with the result being a view of the 
original sequence is Jim's, not mine, and I'm not that keen on it (my 
reservations about string views apply to the more general idea of sequence 
views, too).

Cheers,
Nick.

P.S. I *will* be doing a PEP to bring this discussion together, but be warned 
that it may be a week or two before I get to it.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Sun Aug 27 17:28:14 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 28 Aug 2006 01:28:14 +1000
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608260930h528a7f60rc254eb2f75398a57@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>
	<ca471dc20608260930h528a7f60rc254eb2f75398a57@mail.gmail.com>
Message-ID: <44F1BA0E.3040203@gmail.com>

Guido van Rossum wrote:
> Can you explain in a sentence or two how these changes would be
> *used*? Your code examples don't speak for themselves (maybe because
> It's Saturday morning :-). Short examples of something clumsy and/or
> slow that we'd have to write today compared to something fast and
> elegant that we could write after the change woulde be quite helpful.
> The exact inheritance relationship between slice and [x]range seems a
> fairly uninteresting details in comparison.

A more unified model for representing sequence slices makes it practical to 
offer a non-copying string partitioning method like the version of 
partition_indices() in my initial message. With the current mixed model 
(sometimes using xrange(), sometimes using slice(), sometimes using a 3-tuple, 
sometimes using separate start & stop values), there is no point in offering 
such a method, as it would be terribly inconvenient to work with regardless of 
what kind of objects it returned to indicate the 3 portions of the original 
string:

  - 3-tuples and xrange() objects can't be used to slice a sequence
  - 3-tuples and slice() objects can't be usefully tested for truth
  - none of them can be passed as optional string method arguments

I believe the current mixed model is actually an artifact of the transition 
from simple slicing to extended slicing, albeit one that is significantly less 
obvious than the deprecated __*slice__ family of special methods. Old style 
slicing and string methods use separate start and stop values. Extended 
slicing uses slice objects with start,stop,step attributes (which can be 
anything, including None). The indices() method of slice objects uses a 
start,stop,step 3-tuple. Iteration uses either a list of indices (from 
range()) or xrange objects with start,stop,step attributes (which must be 
integers).
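
The mixed model is easy to demonstrate with current spellings (transcribed
into modern Python 3 syntax, where range() plays xrange()'s role and exposes
the integer start/stop/step attributes described):

```python
s = "hello world"

# 1. separate start/stop arguments to string methods
assert s.find("o", 5, 8) == 7

# 2. slice objects, whose attributes may be anything (including None)
sl = slice(4, 8)
assert s[sl] == "o wo"
assert slice(None).start is None

# 3. slice.indices() hands back a bare (start, stop, step) 3-tuple
assert sl.indices(len(s)) == (4, 8, 1)

# 4. range objects with integer start/stop/step for iteration
r = range(4, 8)
assert (r.start, r.stop, r.step) == (4, 8, 1)
assert list(r) == [4, 5, 6, 7]
```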

The basic proposal I am making is to reduce this to exactly two concepts:
   - slice objects, which have arbitrary start, stop, step attributes
   - range objects, which have indices as start, stop, step attributes, behave 
like an immutable sequence, and are a subclass of slice

All other instances in the core and standard library which use a different 
representation of a sequence slice (like the optional arguments to string 
methods, or the result of the indices() method) would change to use one of 
those two types. The methods of the types would be driven by the needs of the 
standard library.

In addition to reducing the number of concepts to be dealt with from 4 to 2, I
believe this would make it much easier to write memory efficient code without 
having to duplicate entire objects with non-copying versions.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Sun Aug 27 17:37:59 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 28 Aug 2006 01:37:59 +1000
Subject: [Python-3000] find -> index patch
In-Reply-To: <20060827041227.GJ24154@performancedrivers.com>
References: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>	<eckao9$rq9$1@sea.gmane.org>	<20060827002404.GG24154@performancedrivers.com>	<ca471dc20608261951g2f2c7fe3p2b2a63eae7b563df@mail.gmail.com>
	<20060827041227.GJ24154@performancedrivers.com>
Message-ID: <44F1BC57.7090004@gmail.com>

Jack Diederich wrote:
> On Sat, Aug 26, 2006 at 07:51:03PM -0700, Guido van Rossum wrote:
>> On 8/26/06, Jack Diederich <jackdied at jackdied.com> wrote:
>>> After some benchmarking find() can't go away without really hurting readline()
>>> performance.
>> Can you elaborate? readline() is typically implemented in C so I'm not
>> sure I follow.
>>
> 
> A number of modules in Lib have readline() methods that currently use find().
> StringIO, httplib, tarfile, and others
> 
> sprat:~/src/python-head/Lib# grep 'def readline' *.py | wc -l
> 30
> 
> Mainly I wanted to point out that find() solves a class of problems that
> can't be solved equally well with partition() (bad for large strings that
> want to preserve the seperator) or index() (bad for large numbers of small 
> strings and for frequent misses).  I wanted to reach the conclusion that 
> find() could be yanked out but as Fredrik opined it is still useful for a 
> subset of problems.

What about a version of partition that returned a 3-tuple of xrange objects 
indicating the indices of the partitions, instead of copies of the partitions? 
That would allow you to use the cleaner idiom without having to suffer the 
copying performance penalty.

Something like:

    line, newline, rest = s.partition_indices('\n', rest.start, rest.stop)
    if newline:
        yield s[line.start:newline.stop]
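
One hypothetical shape for such a method, written here as a free function over
range objects (the sketch relies on range exposing .start and .stop; the name
partition_indices follows the example above and is not a real API):

```python
def partition_indices(s, sep, start=0, stop=None):
    # like str.partition, but returns three range objects over s
    # instead of three copied substrings
    if stop is None:
        stop = len(s)
    i = s.find(sep, start, stop)
    if i == -1:
        return range(start, stop), range(stop, stop), range(stop, stop)
    return (range(start, i),
            range(i, i + len(sep)),
            range(i + len(sep), stop))

def readlines(s):
    # the readline-style loop from the snippet above, minus the copying
    rest = range(0, len(s))
    while rest:  # empty ranges are false
        line, newline, rest = partition_indices(s, "\n", rest.start, rest.stop)
        yield s[line.start:newline.stop]  # one copy per yielded line only

print(list(readlines("a\nbb\nccc")))  # -> ['a\n', 'bb\n', 'ccc']
```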

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From jcarlson at uci.edu  Sun Aug 27 17:45:30 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 27 Aug 2006 08:45:30 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F1B34C.4020601@iinet.net.au>
References: <fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
	<44F1B34C.4020601@iinet.net.au>
Message-ID: <20060827081644.1ADC.JCARLSON@uci.edu>


Nick Coghlan <ncoghlan at iinet.net.au> wrote:
[snip]
> that isn't likely to trigger an obvious error. It also breaks the persistent 
> idiom that "seq[:]" makes a copy (which is true throughout the standard 
> library, even if it isn't true for external number-crunching libraries like 
> NumPy).

The copying is easily fixed.  I'm also not terribly concerned with the
persistence of views, as I expect that most people who bother to use
them (and/or care about the efficiency of str.partition, etc.) will know
what they are getting themselves into.  If they don't, then they will
post on python-[list|dev], and we can give them a link to the string
view documentation, which will explain what views are and how they can
release the references to the original object: ref = str(ref) .


> You also potentially end up with *every* sequence type ending up with a 
> "x-view" counterpart, which is horrible. OTOH, if we make the standard library 
> more consistent in always using a slice or range object anytime it wants to 
> pass or return (start, stop, step) information, it provides a foundation for 
> someone to do their own non-copying versions.

I'm not sure your slippery-slope argument holds.  So far there are only
a few objects for which views have been proposed with any substance:
dictionaries, text and byte strings.

The removal of buffer from 3.0 does leave an opening for other
structures for which views (or even the original buffers) would make
sense, like array and mmap, but those each have implementations that
could effectively mirror the (mutable) byte string view.

As for using slices to define a mechanism for returning view-like
objects (it is effectively a different spelling), I don't particularly
care for passing around slice/xrange objects.

I would also like to mention that there exist external libraries that
offer non-copying "views" to their underlying structures, the 'array
interface' that was proposed in the last few months being a primary
example of a desired standardization of such.


> So with my musings, the non-copying index operation in a subsection would 
> still use an optional second argument:
> 
>     s.find(prefix, slice(start, stop))

This reduces the number of optional arguments by 1, and requires the
somewhat explicit spelling out of the slice creation (which you attempt
to remove via various syntax changes). I'm not sure these are actual
improvements to either the string (or otherwise) API, or to the general
sequence API.


> If (start:stop:step) is equivalent to slice(start, stop, step), then slice 
> notation can be used to create ranges: range(start:stop:step)

That looks like the integer slicing PEP that was rejected.  Also, no one
has been severely restricted by syntax; one could easily write a
specialized object so that "for i in range[start:stop:step]" 'does the
right thing'.

 - Josiah


From guido at python.org  Sun Aug 27 17:50:39 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 27 Aug 2006 08:50:39 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>
	<20060826084138.1AC0.JCARLSON@uci.edu>
	<fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
	<ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
	<fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
Message-ID: <ca471dc20608270850l70279c2bw30a41d82a721f00e@mail.gmail.com>

On 8/26/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> > > As I understand it, Nick is suggesting that slice
> > > objects be used as a sequence (not just string)
> > > view.
>
> > I have a hard time parsing this sentence. A slice is
> > an object with three immutable attributes -- start,
> > stop, step. How does this double as a string view?
>
> Poor wording on my part; it is (the application of a slice to a
> specific sequence) that could act as copyless view.
>
> For example, you wanted to keep the rarely used optional arguments to
> find because of efficiency.

I don't believe they are rarely used. They are (currently) essential
for code that searches a long string for a short substring repeatedly.
If you believe that is a rare use case, why bother coming up with a
whole new language feature to support it?

>     s.find(prefix, start, stop)
>
> does not copy.

That's still really poor wording. If you want to make your case you
should take more time explaining it right.

> If slices were less eager at copying, this could be
> rewritten as
>
>     view=slice(start, stop, 1)
>     view(s).find(prefix)

Now you're postulating that calling a slice will take a slice of an
object? Any object? And how is that supposed to work for arbitrary
objects? I would think that it ought to be a method on the string
object -- surely a view on a string will have to be a different type
of object than a view on a list, and that ought to be different again
from a view on a unicode string. Also you're postulating that the
slice object somehow has the same methods as the thing it slices? How
are you expecting to implement that? (Don't tell me that you haven't
thought about implementation yet. Without an implementation plan there
is no feature.)

> or perhaps even as
>
>     s[start:stop].find(prefix)

That will never fly. NumPy may get away with non-copying slices, but
for built-in objects this would be too big of a departure of current
practice. (If you don't stop about this I'll have to add it to PEP
3099. :-)

> I'm not sure these look better, but they are less surprising, because
> they don't depend on optional arguments that most people have
> forgotten about.

Because they're not that important except to the few people who really
need the optimization. Also they're easily looked up.

> > Maybe the idea is that instead of
>
> >   pos = s.find(t, pos)
>
> > we would write
>
> >   pos += stringview(s)[pos:].find(t)
>
> > ???
>
> With stringviews, you wouldn't need to be reindexing from the start of
> the original string.  The idiom would instead be a generalization of
> "for line in file:"
>
>     while data:
>         chunk, sep, data = data.partition()
>
> but the partition call would not need to copy the entire string; it
> could simply return three views.

That depends. I can imagine situations where the indices are needed
regardless of how you code it.

> Yes, this does risk keeping all of data alive because one chunk was
> saved.  This might be a reasonable tradeoff to avoid the copying.  If
> not, perhaps the gc system could be augmented to shrink bloated views
> during idle moments.

Keep dreaming on. It really seems you have no clue about
implementation issues; you just keep postulating random solutions
whenever you're faced with an objection.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Sun Aug 27 17:55:05 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 27 Aug 2006 08:55:05 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060826230223.1AD6.JCARLSON@uci.edu>
References: <ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
	<fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
	<20060826230223.1AD6.JCARLSON@uci.edu>
Message-ID: <ca471dc20608270855p1c74839nc7d2430ae7cb6479@mail.gmail.com>

On 8/26/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> "Jim Jewett" <jimjjewett at gmail.com> wrote:
> > With stringviews, you wouldn't need to be reindexing from the start of
> > the original string.  The idiom would instead be a generalization of
> > "for line in file:"
> >
> >     while data:
> >         chunk, sep, data = data.partition()
> >
> > but the partition call would not need to copy the entire string; it
> > could simply return three views.
>
> Also, with a little work, having string views be smart about
> concatenation (if two views are adjacent to each other, like chunk,sep
> or sep,data above, view1+view2 -> view3 on the original string), copies
> could further be minimized, and the earlier problem with readline, etc.,
> can be avoided.

But this assumes that string views are 99.999% indiscernible from
regular strings -- if operations can return a copy or a view depending
on how things happen to be laid out in memory, it should be trivial to
write code that doesn't care whether it gets a string or a view.

This works for strings (which are immutable) but these semantics are
unacceptable for mutable objects -- another reason to doubt that it
makes sense to generalize the idea of views to all sequences, or to
involve a change to the slice object in the design.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Sun Aug 27 18:08:09 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 27 Aug 2006 09:08:09 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F1BA0E.3040203@gmail.com>
References: <44F0107B.20205@iinet.net.au>
	<ca471dc20608260930h528a7f60rc254eb2f75398a57@mail.gmail.com>
	<44F1BA0E.3040203@gmail.com>
Message-ID: <ca471dc20608270908h130c9f29jce193dda6430e507@mail.gmail.com>

On 8/27/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
> Guido van Rossum wrote:
> > Can you explain in a sentence or two how these changes would be
> > *used*? Your code examples don't speak for themselves (maybe because
> > it's Saturday morning :-). Short examples of something clumsy and/or
> > slow that we'd have to write today compared to something fast and
> > elegant that we could write after the change would be quite helpful.
> > The exact inheritance relationship between slice and [x]range seems a
> > fairly uninteresting detail in comparison.
>
> A more unified model for representing sequence slices makes it practical to
> offer a non-copying string partitioning method like the version of
> partition_indices() in my initial message.

Which I still don't understand. (Because you give code but no
docstring or rationale, and are assuming some unspecified changes to
other things as well.)

> With the current mixed model
> (sometimes using xrange(), sometimes using slice(), sometimes using a 3-tuple,
> sometimes using separate start & stop values),

I don't recall xrange() being used anywhere except in for-loops. I
don't know of any use of 3-tuples, though the re API uses 2-tuples
consistently.

> there is no point in offering
> such a method, as it would be terribly inconvenient to work with regardless of
> what kind of objects it returned to indicate the 3 portions of the original
> string:
>
>   - 3-tuples and xrange() objects can't be used to slice a sequence
>   - 3-tuples and slice() objects can't be usefully tested for truth
>   - none of them can be passed as optional string method arguments
>
> I believe the current mixed model is actually an artifact of the transition
> from simple slicing to extended slicing,

Really? Extended slicing mostly meant adding a third "step" option to
the slice syntax, which is useful for NumPy but completely pointless
for string searches as we're discussing here. The slice() object was
invented as an API hack so that we didn't have to add new special
methods.

> albeit one that is significantly less
> obvious than the deprecated __*slice__ family of special methods. Old style
> slicing and string methods use separate start and stop values. Extended
> slicing uses slice objects with start,stop,step attributes (which can be
> anything, including None). The indices() method of slice objects uses a
> start,stop,step 3-tuple. Iteration uses either a list of indices (from
> range()) or xrange objects with start,stop,step attributes (which must be
> integers).

It was always my intention to keep slice objects limited to NumPy apps
and the rare application of extended slicing in regular Python.

> The basic proposal I am making is to reduce this to exactly two concepts:
>    - slice objects, which have arbitrary start, stop, step attributes
>    - range objects, which have indices as start, stop, step attributes, behave
> like an immutable sequence, and are a subclass of slice

And you still haven't explained how this is going to make life easier.
I keep asking for concrete examples and you keep answering in
generalities. This is an annoying disconnect.

> All other instances in the core and standard library which use a different
> representation of a sequence slice (like the optional arguments to string
> methods, or the result of the indices() method) would change to use one of
> those two types. The methods of the types would be driven by the needs of the
> standard library.

What's the indices() method?

In many cases it doesn't seem to make a lot of sense to return a slice
object, since it doesn't convey more information than a single index
(given that the string being searched for is known -- we're not
searching regular expressions here but literal substrings).

> In addition to reducing the number of concepts to be dealt with from 4 to 2, I
> believe this would make it much easier to write memory efficient code without
> having to duplicate entire objects with non-copying versions.

Write the PEP and make sure it is plentiful of examples of old and new
ways of doing common string operations.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Sun Aug 27 18:52:50 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 27 Aug 2006 09:52:50 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608270855p1c74839nc7d2430ae7cb6479@mail.gmail.com>
References: <20060826230223.1AD6.JCARLSON@uci.edu>
	<ca471dc20608270855p1c74839nc7d2430ae7cb6479@mail.gmail.com>
Message-ID: <20060827091000.1ADF.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> 
> On 8/26/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> >
> > "Jim Jewett" <jimjjewett at gmail.com> wrote:
> > > With stringviews, you wouldn't need to be reindexing from the start of
> > > the original string.  The idiom would instead be a generalization of
> > > "for line in file:"
> > >
> > >     while data:
> > >         chunk, sep, data = data.partition()
> > >
> > > but the partition call would not need to copy the entire string; it
> > > could simply return three views.
> >
> > Also, with a little work, having string views be smart about
> > concatenation (if two views are adjacent to each other, like chunk,sep
> > or sep,data above, view1+view2 -> view3 on the original string), copies
> > could further be minimized, and the earlier problem with readline, etc.,
> > can be avoided.
> 
> But this assumes that string views are 99.999% indiscernible from
> regular strings -- if operations can return a copy or a view depending
> on how things happen to be laid out in memory, it should be trivial to
> write code that doesn't care whether it gets a string or a view.

That's what I'm working towards.  Let us say for a moment that the only
view that was on the table was the string view:
    view = stringview(st[, start[, stop]])

If st is a string, it produces a view on that string.  If st is a
stringview already, it references the original string (removing tree
persistence[1]).

After a view is created, it can be treated like a string for
(effectively) everything because it has a Py_UNICODE* that has already
been adjusted to handle the offset argument.  Its implementation would
require copying the PyUnicodeObject struct, adding one more field:
    PyUnicodeObject* orig_object;
This would point to the original object for the later Py_DECREF (when
the view is destroyed), view creation (again, we don't want tree
persistence), etc.

We can easily discover the 'start' offset again by comparing the
view->str and the orig_object->str pointers.
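
As a rough Python sketch of the behavior I'm describing (the name
stringview and its attributes are purely illustrative; the real
implementation would be the C struct above):

```python
class stringview:
    # Illustrative only: a view keeps a reference to the original
    # string plus start/stop offsets, and copies no character data.
    def __init__(self, st, start=0, stop=None):
        if isinstance(st, stringview):
            # Re-point at the original string, avoiding view-of-view
            # chains (the "tree persistence" problem).
            start += st.start
            stop = st.stop if stop is None else st.start + stop
            st = st.orig
        self.orig = st
        self.start = start
        self.stop = len(st) if stop is None else stop

    def __len__(self):
        return self.stop - self.start

    def __str__(self):
        # Only here is the underlying data actually copied.
        return self.orig[self.start:self.stop]

    def find(self, sub):
        # find()'s start/stop arguments do the real work; the result
        # is re-based so it looks like an index into the view.
        i = self.orig.find(sub, self.start, self.stop)
        return i if i < 0 else i - self.start
```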


Optimizations like 'adding properly ordered adjacent string views
returns a new view', 'views over fewer than X bytes are string copies',
etc., could be added later with (hopefully) little trouble.


> This works for strings (which are immutable) but these semantics are
> unacceptable for mutable objects -- another reason to doubt that it
> makes sense to generalize the idea of views to all sequences, or to
> involve a change to the slice object in the design.

I think the whole slice object thing is complete nonsense.

On the other hand, I think that just like buffers are verifying the
object that they are buffering every time they are accessed, mutable
bytes string, array, and mmap views could do the same.  After they are
verified, they can generally be used the same, but it may take some
discussion as to whether certain operations are allowed, and/or what
their semantics are. Things like:
    view = arrayview(arr, 1, -1)
    del view[1:-1]
A convenient semantic (from the Python side of things) is to do as
buffer does now and only allow them to be read-only.


I'm also not terribly convinced about general sequence views, but for
objects in which buffer(obj) returns something useful, I can see
specialized views for them making at least some sense.  I am cautious
about pushing for all of them because implementing views for all would
be a pain. Choosing one (like bytes) would take some effort, but could
easily be pushed back to 3.1 or 3.2 and be done by someone who really
wants them.

 - Josiah


[1] When I say "tree persistence", I mean those cases like a -> b -> c,
where view b persists because view a persists, even though b doesn't have
a reference otherwise.  Making both views a and b reference c directly
allows for b to be freed when it is no longer used.


From jack at psynchronous.com  Sun Aug 27 19:05:50 2006
From: jack at psynchronous.com (Jack Diederich)
Date: Sun, 27 Aug 2006 13:05:50 -0400
Subject: [Python-3000] find -> index patch
In-Reply-To: <44F1BC57.7090004@gmail.com>
References: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>
	<eckao9$rq9$1@sea.gmane.org>
	<20060827002404.GG24154@performancedrivers.com>
	<ca471dc20608261951g2f2c7fe3p2b2a63eae7b563df@mail.gmail.com>
	<20060827041227.GJ24154@performancedrivers.com>
	<44F1BC57.7090004@gmail.com>
Message-ID: <20060827170550.GK24154@performancedrivers.com>

On Mon, Aug 28, 2006 at 01:37:59AM +1000, Nick Coghlan wrote:
> Jack Diederich wrote:
> > On Sat, Aug 26, 2006 at 07:51:03PM -0700, Guido van Rossum wrote:
> >> On 8/26/06, Jack Diederich <jackdied at jackdied.com> wrote:
> >>> After some benchmarking find() can't go away without really hurting readline()
> >>> performance.
> >> Can you elaborate? readline() is typically implemented in C so I'm not
> >> sure I follow.
> >>
> > 
> > A number of modules in Lib have readline() methods that currently use find().
> > StringIO, httplib, tarfile, and others
> > 
> > sprat:~/src/python-head/Lib# grep 'def readline' *.py | wc -l
> > 30
> > 
> > Mainly I wanted to point out that find() solves a class of problems that
> > can't be solved equally well with partition() (bad for large strings that
> > want to preserve the separator) or index() (bad for large numbers of small
> > strings and for frequent misses).  I wanted to reach the conclusion that 
> > find() could be yanked out but as Fredrik opined it is still useful for a 
> > subset of problems.
> 
> What about a version of partition that returned a 3-tuple of xrange objects 
> indicating the indices of the partitions, instead of copies of the partitions? 
> That would allow you to use the cleaner idiom without having to suffer the 
> copying performance penalty.
> 
> Something like:
> 
>     line, newline, rest = s.partition_indices('\n', rest.start, rest.stop)
>     if newline:
>         yield s[line.start:newline.stop]
> 

What is with the sudden rush to solve all problems by using slice objects?
I've never used a slice object and I don't care to start now.  The above code
reads just fine as

i = s.find('\n', start, stop)
if i >= 0:
  yield s[start:i]
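
Written out as a complete readline-style generator (just a sketch of the
idiom, not code from any particular module), that is:

```python
def iterlines(s):
    # Yield the lines of s without copying the remainder of the
    # string on every step; find()'s start argument skips ahead.
    start = 0
    while start < len(s):
        i = s.find('\n', start)
        if i < 0:
            yield s[start:]      # last line, no trailing newline
            return
        yield s[start:i]         # line with its newline stripped
        start = i + 1
```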

-Jack

From guido at python.org  Sun Aug 27 23:17:12 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 27 Aug 2006 14:17:12 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060827091000.1ADF.JCARLSON@uci.edu>
References: <20060826230223.1AD6.JCARLSON@uci.edu>
	<ca471dc20608270855p1c74839nc7d2430ae7cb6479@mail.gmail.com>
	<20060827091000.1ADF.JCARLSON@uci.edu>
Message-ID: <ca471dc20608271417w480c90aeg6b39c766a8f94750@mail.gmail.com>

On 8/27/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> [1] When I say "tree persistence", I mean those cases like a -> b -> c,
> where view b persists because view a persists, even though b doesn't have
> a reference otherwise.  Making both views a and b reference c directly
> allows for b to be freed when it is no longer used.

Yeah, but you're still keeping c alive, which is the real memory waste.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Sun Aug 27 23:18:13 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 27 Aug 2006 14:18:13 -0700
Subject: [Python-3000] find -> index patch
In-Reply-To: <20060827170550.GK24154@performancedrivers.com>
References: <20060824054450.x8w46l05kz488004@login.werra.lunarpages.com>
	<eckao9$rq9$1@sea.gmane.org>
	<20060827002404.GG24154@performancedrivers.com>
	<ca471dc20608261951g2f2c7fe3p2b2a63eae7b563df@mail.gmail.com>
	<20060827041227.GJ24154@performancedrivers.com>
	<44F1BC57.7090004@gmail.com>
	<20060827170550.GK24154@performancedrivers.com>
Message-ID: <ca471dc20608271418q24f3262bma07b222813366da6@mail.gmail.com>

On 8/27/06, Jack Diederich <jack at psynchronous.com> wrote:
> What is with the sudden rush to solve all problems by using slice objects?
> I've never used a slice object and I don't care to start now.  The above code
> reads just fine as
>
> i = s.find('\n', start, stop)
> if i >= 0:
>   yield s[start:i]

Hear, hear.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jimjjewett at gmail.com  Mon Aug 28 01:38:08 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Sun, 27 Aug 2006 19:38:08 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608270850l70279c2bw30a41d82a721f00e@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>
	<20060826084138.1AC0.JCARLSON@uci.edu>
	<fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
	<ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
	<fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
	<ca471dc20608270850l70279c2bw30a41d82a721f00e@mail.gmail.com>
Message-ID: <fb6fbf560608271638r1ca2d114yc98a9c4f28036791@mail.gmail.com>

On 8/27/06, Guido van Rossum <guido at python.org> wrote:
> On 8/26/06, Jim Jewett <jimjjewett at gmail.com> wrote:

> > For example, you wanted to keep the rarely used optional arguments to
> > find because of efficiency.

> I don't believe they are rarely used. They are (currently) essential
> for code that searches a long string for a short substring repeatedly.
> If you believe that is a rare use case, why bother coming up with a
> whole new language feature to support it?

I believe that a fair amount of code already does the copying inline;
suppporting it in the runtime means that copying code becomes more
efficient, and shortcutting code becomes less unusual.

> > If slices were less eager at copying, this could be
> > rewritten as

> >     view=slice(start, stop, 1)
> >     view(s).find(prefix)

> Now you're postulating that calling a slice will take a slice of an
> object?

Yes.

> Any object? And how is that supposed to work for arbitrary
> objects?

For non-iterables, it will raise a TypeError.

> I would think that it ought to be a method on the string
> object

Restricting it to a few types including string might make sense.

> Also you're postulating that the slice object somehow has the
> same methods as the thing it slices?

Rather, the value returned by calling the slice on a specific string.
(I tend to think of this as a "slice of" the string, but as you've
pointed out, "slice object" technically refers to the object
specifying how/where to cut.)

> How are you expecting to implement that?

I had expected to implement it as a (string) view, which is why I
don't quite understand the distinction Nick and Josiah are making.

> But this assumes that string views are 99.999% indiscernible from
> regular strings

Yes; instead of assuming that a string's data starts n bytes after the
object's own pointer, it will instead be located at a (possibly zero)
offset.  No visible difference to python code; the difference between
-> and . for C code.  (And this indirection is already used by unicode
objects.)

> That will never fly. NumPy may get away with non-copying slices, but
> for built-in objects this would be too big of a departure of current
> practice. (If you don't stop about this I'll have to add it to PEP
> 3099. :-)

That's unfortunate, but if you're sure, maybe it should go in PEP 3099.

> > Yes, this does risk keeping all of data alive because one chunk was
> > saved.  This might be a reasonable tradeoff to avoid the copying.  If
> > not, perhaps the gc system could be augmented to shrink bloated views
> > during idle moments.

> Keep dreaming on. It really seems you have no clue about
> implementation issues; you just keep postulating random solutions
> whenever you're faced with an objection.

I had thought the problem was more about whether or not it was a good
idea; the tradeoff might be OK, or at least less bad than the
complication of fixing it.

As one implementation of fixing it: in today's garbage collector
(function collect in
http://svn.python.org/view/python/trunk/Modules/gcmodule.c?rev=46244&view=markup),
surviving objects are moved to the next generation with
gc_list_merge(young, old); before merging, the young list could
be traversed, and any object whose type has a __condense__ method
would get it called.  The strview type's __condense__ method would be
the C equivalent of

    if len(self.src) <= 200:
        return  # Src object too small to be worth recovering
    if (len(self) * refcounts(self.src)) >= len(self.src):
        return  # Src object used enough to be worth keeping
    self.src = str(self)  # Create a new data buffer, with no extra chars.

(Sent in Python because the commented C was several times as long,
even before checking it with a compiler.)  As to whether a __condense__
method is a good idea, whether it should really be tied that closely
to garbage collection, whether it should be limited to C
implementations ... that I'm not so sure of.

-jJ

From tdelaney at avaya.com  Mon Aug 28 01:52:08 2006
From: tdelaney at avaya.com (Delaney, Timothy (Tim))
Date: Mon, 28 Aug 2006 09:52:08 +1000
Subject: [Python-3000] Making more effective use of slice objects in Py3k
Message-ID: <2773CAC687FD5F4689F526998C7E4E5F0743D0@au3010avexu1.global.avaya.com>

Jim Jewett wrote:

>     s[start:stop].find(prefix)

No matter what, I really think the obj[start:stop:step] syntax needs to
be consistent in its behaviour - either returning a copy or a view - and
that that behaviour be to return a copy. I'm not at all in favour of
sometimes getting a copy, and sometimes getting a view.

As a bit of an out-there and very premature suggestion ... <wink>

For when/*if* views ever become considered to be a good thing for
builtin classes, etc, may I suggest that the following syntax be
reserved for view creation:

    obj{start:stop:step} 

mapping to something like:

    def __view__(self, slice)

So if you really want a string view, use:

    s{1:2}

instead of:

    s[1:2]

I don't *think* the syntax is currently legal, and I don't think it
could ever be ambiguous - anyone think of a case where it could be?

Tim Delaney

From jimjjewett at gmail.com  Mon Aug 28 02:00:08 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Sun, 27 Aug 2006 20:00:08 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5F0743D0@au3010avexu1.global.avaya.com>
References: <2773CAC687FD5F4689F526998C7E4E5F0743D0@au3010avexu1.global.avaya.com>
Message-ID: <fb6fbf560608271700h1a7fcff0g1f8d4be98b2efa29@mail.gmail.com>

On 8/27/06, Delaney, Timothy (Tim) <tdelaney at avaya.com> wrote:
> Jim Jewett wrote:

> >     s[start:stop].find(prefix)

> No matter what, I really think the obj[start:stop:step]
> syntax needs to be consistent in its behaviour - either
> returning a copy or a view -

Does it still matter if we're looking only at immutable sequences, such as text?

-jJ

From tdelaney at avaya.com  Mon Aug 28 02:24:41 2006
From: tdelaney at avaya.com (Delaney, Timothy (Tim))
Date: Mon, 28 Aug 2006 10:24:41 +1000
Subject: [Python-3000] Making more effective use of slice objects in Py3k
Message-ID: <2773CAC687FD5F4689F526998C7E4E5F0743D1@au3010avexu1.global.avaya.com>

Jim Jewett wrote:

> On 8/27/06, Delaney, Timothy (Tim) <tdelaney at avaya.com> wrote:
>> Jim Jewett wrote:
> 
>>>     s[start:stop].find(prefix)
> 
>> No matter what, I really think the obj[start:stop:step]
>> syntax needs to be consistent in its behaviour - either
>> returning a copy or a view -
> 
> Does it still matter if we're looking only at immutable sequences,
> such as text? 

Actually, yes. I think it should be an explicit operation to say "I'm
taking a small view of this large string, which will result in the large
string existing until the view goes away".

Currently the way to do that is to have a method. I'm simply proposing
that we reserve syntax that is currently not used to prevent it from
being used for another, less appropriate usage. It may never be used at
all.

Tim Delaney

From guido at python.org  Mon Aug 28 03:58:52 2006
From: guido at python.org (Guido van Rossum)
Date: Sun, 27 Aug 2006 18:58:52 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608271638r1ca2d114yc98a9c4f28036791@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>
	<20060826084138.1AC0.JCARLSON@uci.edu>
	<fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
	<ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
	<fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
	<ca471dc20608270850l70279c2bw30a41d82a721f00e@mail.gmail.com>
	<fb6fbf560608271638r1ca2d114yc98a9c4f28036791@mail.gmail.com>
Message-ID: <ca471dc20608271858t505089bci99f48ef21d99291b@mail.gmail.com>

On 8/27/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 8/27/06, Guido van Rossum <guido at python.org> wrote:
> > On 8/26/06, Jim Jewett <jimjjewett at gmail.com> wrote:
>
> > > For example, you wanted to keep the rarely used optional arguments to
> > > find because of efficiency.
>
> > I don't believe they are rarely used. They are (currently) essential
> > for code that searches a long string for a short substring repeatedly.
> > If you believe that is a rare use case, why bother coming up with a
> > whole new language feature to support it?
>
> I believe that a fair amount of code already does the copying inline;
> suppporting it in the runtime means that copying code becomes more
> efficient, and shortcutting code becomes less unusual.

We're not making progress here. Your beliefs against my beliefs isn't
helpful. Do you have proof that there is code out there that's
inefficient and for which it would *matter* if it became faster?

> > > If slices were less eager at copying, this could be
> > > rewritten as
>
> > >     view=slice(start, stop, 1)
> > >     view(s).find(prefix)
>
> > Now you're postulating that calling a slice will take a slice of an
> > object?
>
> Yes.

I'd rather see an explicit method call. Using "call" as an operation
means no other operation can use the same syntax (on the same objects,
of course); you have to be very sure that there won't be another use
of "call" that would be more useful.

> > Any object? And how is that supposed to work for arbitrary
> > objects?
>
> For non-iterables, it will raise a TypeError.

Duh. I meant for other iterables, like tuples and lists. I'm asking if
you expect that asking for a view on a previously unknown sequence
should return a view on that sequence that behaves just like the
underlying object, and how you are thinking of pulling off that feat.
My claim is that you can't. You need full cooperation of the
underlying object to support views. You could attempt to automatically
provide wrappers for all methods, but since you don't know which of
the parameters or return values represent indices and which don't, you
can't do anything useful. Suppose I have a list [1, 2, 3, 1, 2, 3].
Suppose you don't have built-in knowledge of a list (otherwise I'll
substitute some other object that you don't have built-in knowledge
of). Now suppose you have a view v on the last half of that list, and
you ask for v.count(1). This of course should return 1. But how to do
this unless you know how the count() method is implemented on the
underlying object type?
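
To make the failure mode concrete, here is a deliberately naive generic
view (a sketch, not a proposal) that blindly forwards method calls to
the underlying sequence:

```python
class naiveview:
    # Forwards every unknown attribute to the underlying sequence,
    # with no knowledge of which arguments or results are indices.
    def __init__(self, seq, start, stop):
        self.seq, self.start, self.stop = seq, start, stop

    def __getattr__(self, name):
        return getattr(self.seq, name)   # blind forwarding

data = [1, 2, 3, 1, 2, 3]
v = naiveview(data, 3, 6)   # intended as a view on the last half
print(v.count(1))           # prints 2; the right answer for the view is 1
```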

> > I would think that it ought to be a method on the string
> > object
>
> Restricting it to a few types including string might make sense.

Yes please. Without that your proposal is dead in the water.

(With it likely too, but for different reasons.)

> > Also you're postulating that the slice object somehow has the
> > same methods as the thing it slices?
>
> Rather, the value returned by calling the slice on a specific string.
> (I tend to think of this as a "slice of" the string, but as you've
> pointed out, "slice object" technically refers to the object
> specifying how/where to cut.)

And remember, calling buffer() on a unicode object is not a useful
operation unless you're interested in the underlying bytes.

> > How are you expecting to implement that?
>
> I had expected to implement it as a (string) view, which is why I
> don't quite understand the distinction Nick and Josiah are making.

Well maybe you don't quite understand your own proposal either. :-)

> > But this assumes that string views are 99.999% indiscernible from
> > regular strings
>
> Yes; instead of assuming that a string's data starts n bytes after the
> object's own pointer, it will instead be located at a (possibly zero)
> offset.  No visible difference to python code; the difference between
> -> and . for C code.  (And this indirection is already used by unicode
> objects.)

Only because their original draft design had a kind of view. I expect
they had good reasons to rip out that part...

> > That will never fly. NumPy may get away with non-copying slices, but
> > for built-in objects this would be too big of a departure of current
> > practice. (If you don't stop about this I'll have to add it to PEP
> > 3099. :-)
>
> That's unfortunate, but if you're sure, maybe it should go in PEP 3099.

Ask any Python developer. Slices of mutable objects make copies except in NumPy.

> > > Yes, this does risk keeping all of data alive because one chunk was
> > > saved.  This might be a reasonable tradeoff to avoid the copying.  If
> > > not, perhaps the gc system could be augmented to shrink bloated views
> > > during idle moments.
>
> > Keep dreaming on. it really seems you have no clue about
> > implementation issues; you just keep postulating random solutions
> > whenever you're faced with an objection.
>
> I had thought the problem was more about whether or not it was a good
> idea; the tradeoff might be OK, or at least less bad than the
> complication of fixing it.

It's only a good idea if it works. Details matter.

> As one implementation of fixing it: in today's garbage collector
> (function collect in
> http://svn.python.org/view/python/trunk/Modules/gcmodule.c?rev=46244&view=markup),
> surviving objects are moved to the next generation with
> gc_list_merge(young, old); before merging, the young list could
> be traversed, and any object whose type has a __condense__ method
> would get it called.  The strview type's __condense__ method would be
> the C equivalent of
>
>     if len(self.src) <= 200:
>         return  # Src object too small to be worth recovering
>     if (len(self) * refcounts(self.src)) >= len(self.src):
>         return  # Src object used enough to be worth keeping
>     self.src = str(self)  # Create a new data buffer, with no extra chars.
>
> (Sent in Python because the commented C was several times as long,
> even before checking it with a compiler.)  As to whether a __condense__
> method is a good idea, whether it should really be tied that closely
> to garbage collection, whether it should be limited to C
> implementations ... that I'm not so sure of.

It's up to you to show that this doesn't completely kill performance.
It would take a lot of measurements.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Mon Aug 28 04:20:42 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 27 Aug 2006 19:20:42 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5F0743D1@au3010avexu1.global.avaya.com>
References: <2773CAC687FD5F4689F526998C7E4E5F0743D1@au3010avexu1.global.avaya.com>
Message-ID: <20060827191547.1AEB.JCARLSON@uci.edu>


"Delaney, Timothy (Tim)" <tdelaney at avaya.com> wrote:
> 
> Jim Jewett wrote:
> 
> > On 8/27/06, Delaney, Timothy (Tim) <tdelaney at avaya.com> wrote:
> >> Jim Jewett wrote:
> > 
> >>>     s[start:stop].find(prefix)
> > 
> >> No matter what, I really think the obj[start:stop:step]
> >> syntax needs to be consistent in its behaviour - either
> >> returning a copy or a view -
> > 
> > Does it still matter if we're looking only at immutable sequences,
> > such as text? 
> 
> Actually, yes. I think it should be an explicit operation to say "I'm
> taking a small view of this large string, which will result in the large
> string existing until the view goes away".
> 
> Currently the way to do that is to have a method. I'm simply proposing
> that we reserve syntax that is currently not used to prevent it from
> being used for another, less appropriate usage. It may never be used at
> all.

In what I have been attempting to propose, no text methods would ever
return a view.  If one wants a view of text, one needs to manually
construct the view via 'view = textview(st, start, stop)' or some
equivalent spelling.  After that, any operation on a view returns a view
(with a few exceptions, like steps != 1).

The seemingly proposed textobj(start:stop) returning a view is not
terribly intuitive, as () and [] aren't different enough from each
other to avoid confusing someone initially.  Never mind that it would be a
syntax addition for the equivalent of a small subset of operations on
currently existing objects.


 - Josiah


From greg.ewing at canterbury.ac.nz  Mon Aug 28 04:20:18 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 28 Aug 2006 14:20:18 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608271638r1ca2d114yc98a9c4f28036791@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>
	<20060826084138.1AC0.JCARLSON@uci.edu>
	<fb6fbf560608261859r1ecac1a8ye23008534b952c05@mail.gmail.com>
	<ca471dc20608262000j6d41d26dwd765e88feee5dacd@mail.gmail.com>
	<fb6fbf560608262030m2286a273nbd126a98b63103d3@mail.gmail.com>
	<ca471dc20608270850l70279c2bw30a41d82a721f00e@mail.gmail.com>
	<fb6fbf560608271638r1ca2d114yc98a9c4f28036791@mail.gmail.com>
Message-ID: <44F252E2.4080700@canterbury.ac.nz>

Jim Jewett wrote:
> On 8/27/06, Guido van Rossum <guido at python.org> wrote:

> > Any object? And how is that supposed to work for arbitrary
> > objects?
> 
> For non-iterables, it will raise a TypeError.

I think the question was what benefit would there be
in a general slice-view object which knew nothing about
the internal structure of the thing it's viewing. The
benefits of the string views we're talking about hinge
on the fact that they're special-purpose and know how
to get directly at the bytes of the underlying string.

--
Greg

From tdelaney at avaya.com  Mon Aug 28 04:26:40 2006
From: tdelaney at avaya.com (Delaney, Timothy (Tim))
Date: Mon, 28 Aug 2006 12:26:40 +1000
Subject: [Python-3000] Making more effective use of slice objects in Py3k
Message-ID: <2773CAC687FD5F4689F526998C7E4E5FF1E921@au3010avexu1.global.avaya.com>

Josiah Carlson wrote:

> The seemingly proposed textobj(start:stop) returning a view is not
> terribly intuitive, as () and [] aren't so terribly different from
> each other to not confuse someone initially.

Nor {} as I proposed for that matter ;)

Tim Delaney

From greg.ewing at canterbury.ac.nz  Mon Aug 28 04:32:54 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 28 Aug 2006 14:32:54 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060827191547.1AEB.JCARLSON@uci.edu>
References: <2773CAC687FD5F4689F526998C7E4E5F0743D1@au3010avexu1.global.avaya.com>
	<20060827191547.1AEB.JCARLSON@uci.edu>
Message-ID: <44F255D6.2060002@canterbury.ac.nz>

Josiah Carlson wrote:
> If one wants a view of text, one needs to manually
> construct the view via 'view = textview(st, start, stop)' or some
> equivalent spelling.  After that, any operations on a view returns views

Given Guido's sensitivity about potential misuses of
views, it might be better if operations on views
*didn't* return views, so that you would have to be
explicit about creating views at all stages.

--
Greg

From jcarlson at uci.edu  Mon Aug 28 04:43:36 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 27 Aug 2006 19:43:36 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608271417w480c90aeg6b39c766a8f94750@mail.gmail.com>
References: <20060827091000.1ADF.JCARLSON@uci.edu>
	<ca471dc20608271417w480c90aeg6b39c766a8f94750@mail.gmail.com>
Message-ID: <20060827184941.1AE8.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> 
> On 8/27/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > [1] When I say "tree persistance", I mean those cases like a -> b -> c,
> > where view b persist because view a persists, even though b doesn't have
> > a reference otherwise.  Making both views a and b reference c directly
> > allows for b to be freed when it is no longer used.
> 
> Yeah, but you're still keeping c alive, which is the real memory waste.

It depends on the application.


1. Let us say I was parsing XML.  Rather than allocating a bunch of small
strings for the various tags, attributes, and data, I could instead
allocate a bunch of string views with pointers into the one larger XML
string.

Because all of the views are the same size, we can use a free list and
optimize allocation, deallocation, etc.  Small strings, on the other
hand, can't have such optimizations, and we would end up fragmenting
memory over a long series of XML parsings (possibly leading to an
eventual MemoryError).

Even better, if the underlying parsing mechanism expects to receive a
string, and we pass it a string view instead, then with the proper
string+view implementation, it wouldn't ever need to know that it is
working on views; it would just work, and we would receive the parsing
with views instead of sliced strings.
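As a rough illustration of what such a view could look like (purely a sketch; no stringview type exists today, and all names below are invented), a fixed-size object recording (source, start, stop) instead of copying:

```python
# Hypothetical sketch of a string view: three fields instead of a copy.
class StringView(object):
    __slots__ = ('source', 'start', 'stop')  # fixed size, free-list friendly

    def __init__(self, source, start=0, stop=None):
        self.source = source
        self.start = start
        self.stop = len(source) if stop is None else stop

    def __len__(self):
        return self.stop - self.start

    def __str__(self):
        # Copying happens only when a real string is explicitly requested.
        return self.source[self.start:self.stop]

    def find(self, sub):
        # Delegates to str.find with the view's bounds; no slicing needed.
        i = self.source.find(sub, self.start, self.stop)
        return i if i == -1 else i - self.start

xml = '<a href="x">text</a>'
tag = StringView(xml, 1, 2)   # views the tag name without slicing the XML
```

Every view is the same size regardless of how much text it covers, which is what makes the free-list argument above work.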


2. Another example is the parsing of email or any other [header, blank
line, body] structured data (and even mime-like headers).  Say you have
read in a single email: you can have a view (or views) of the various
headers, the multipart body, etc., and wouldn't need to copy
anything. Never mind that one could easily handle the insertion of
headers, body portions, etc., all without slicing the original (possibly
large) email, allowing for the easy manipulation of data with little
memory overhead.

Heck, one could even read in an entire mbox-formatted file, pull out all
of the original emails, rearrange them (re-sort the folder by sent
date/received time), and write them back to disk, again without ever
slicing up the original mailbox file, resulting in roughly 1/2 the
memory overhead of an equivalent operation using string slicing.


3. In the 2.x byte string case (str, not unicode), we have seen with the
various str.find() to str.partition() conversions that chopping up data
isn't uncommon, and that generally most pieces are used, meaning that
memory equivalent to the original string is going to persist anyway.

Also, I would just like to state that I am not advocating the automatic
creation of views depending on string operations; one should always
construct the views explicitly, with something like view = stringview(st).
Then the operations on the view should return further views and perhaps
occasionally strings, but operations on strings should never return
views.


---
Speaking of the 2.x byte strings and using str.partition() in 3.x: if
2.x strings are going away in 3.x, shouldn't we be transitioning
everything to using either bytes or unicode?  Initial translation of the
standard library to use partition/index seems like a huge time
investment, unless it is planned on being backported to the trunk for
2.6.

Which reminds me, on August 28, 2005, Raymond sent me an initial patch
for a find -> partition patch for the full 2.5 standard library at the
time.  I can provide everyone with that patch along with my comments,
which may or may not be enough to transition most of the standard
library today.


 - Josiah


From jcarlson at uci.edu  Mon Aug 28 04:45:25 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Sun, 27 Aug 2006 19:45:25 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5FF1E921@au3010avexu1.global.avaya.com>
References: <2773CAC687FD5F4689F526998C7E4E5FF1E921@au3010avexu1.global.avaya.com>
Message-ID: <20060827194428.1AEE.JCARLSON@uci.edu>


"Delaney, Timothy (Tim)" <tdelaney at avaya.com> wrote:
> 
> Josiah Carlson wrote:
> 
> > The seemingly proposed textobj(start:stop) returning a view is not
> > terribly intuitive, as () and [] aren't so terribly different from
> > each other to not confuse someone initially.
> 
> Nor {} as I proposed for that matter ;)

I can't really see the difference between () and {} when they are on
their own with the font I'm using for email.  Yeah, that's not good
either.

 - Josiah


From jcarlson at uci.edu  Mon Aug 28 10:00:55 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Mon, 28 Aug 2006 01:00:55 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F255D6.2060002@canterbury.ac.nz>
References: <20060827191547.1AEB.JCARLSON@uci.edu>
	<44F255D6.2060002@canterbury.ac.nz>
Message-ID: <20060827214348.1AF4.JCARLSON@uci.edu>


Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Josiah Carlson wrote:
> > If one wants a view of text, one needs to manually
> > construct the view via 'view = textview(st, start, stop)' or some
> > equivalent spelling.  After that, any operations on a view returns views
> 
> Given Guido's sensitivity about potential misuses of
> views, it might be better if operations on views
> *didn't* return views, so that you would have to be
> explicit about creating views at all stages.

If every operation on a view returned a string copy, then what would be
the point of the view in the first place?  An alias for Python 2.x
buffer()?  No, that would be silly.

As I see it, the point of string/text views is:
1. Remove all start, stop optional arguments from all string methods,
replacing them with view slicing, resulting in generally improved call
performance by the second or third operation on the original string.
2. Reduce memory use and fragmentation of common operations (like...
while rest: prev, found, rest = rest.partition(sep) ) by performing
those operations on views.
3. Reduce execution time of slicing or slicing-like operations by
performing them on views (prev, found, rest = rest.partition(sep)).

Note that with 2 and 3, it doesn't matter how much or little you 'slice'
from the view, the slicing and/or creation of new views referencing the
original string is a constant time operation every time.

By making view.oper() always return strings instead of views, it makes
#1 the only reason for views, even though #2 and #3 are also important
and valid motivators.

I would also like to point out that it would make the oft-cited
partition example "while rest: first, found, rest = rest.partition(sep)"
run in linear rather than quadratic time, where users will be pleasantly
surprised by the improvement in speed (or the lack of a reduction in
speed).
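A toy version of that partition loop (hypothetical View type, invented for illustration): because partition() returns further views, each iteration does O(1) bookkeeping against the original string rather than O(n) slicing:

```python
class View(object):
    """Toy string view whose partition() returns views, not copies."""
    def __init__(self, s, start=0, stop=None):
        self.s = s
        self.start = start
        self.stop = len(s) if stop is None else stop

    def __len__(self):
        return self.stop - self.start

    def __str__(self):
        return self.s[self.start:self.stop]  # copy only on demand

    def partition(self, sep):
        # str.find honours the view's bounds, so no intermediate copies.
        i = self.s.find(sep, self.start, self.stop)
        if i == -1:
            return self, '', View(self.s, self.stop, self.stop)
        return (View(self.s, self.start, i), sep,
                View(self.s, i + len(sep), self.stop))

rest = View('a,b,c')
parts = []
while len(rest):
    first, found, rest = rest.partition(',')
    parts.append(str(first))
```

Here parts ends up as ['a', 'b', 'c'], with every intermediate rest being just a pair of integers into the one original string.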


 - Josiah


From p.f.moore at gmail.com  Mon Aug 28 11:08:31 2006
From: p.f.moore at gmail.com (Paul Moore)
Date: Mon, 28 Aug 2006 10:08:31 +0100
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <2773CAC687FD5F4689F526998C7E4E5F0743D0@au3010avexu1.global.avaya.com>
References: <2773CAC687FD5F4689F526998C7E4E5F0743D0@au3010avexu1.global.avaya.com>
Message-ID: <79990c6b0608280208l41c2ae9bm1c76ee3bf06c99a7@mail.gmail.com>

On 8/28/06, Delaney, Timothy (Tim) <tdelaney at avaya.com> wrote:
> For when/*if* views ever become considered to be a good thing for
> builtin classes, etc, may I suggest that the following syntax be
> reserved for view creation:
>
>     obj{start:stop:step}
>
> mapping to something like:
>
>     def __view__(self, slice)
>
> So if you really want a string view, use:
>
>     s{1:2}
>
> instead of:
>
>     s[1:2]
>
> I don't *think* the syntax is currently legal, and I don't think it
> could ever be ambiguous - anyone think of a case where it could be?

OTOH, it is very subtle. I had to lean closer to the monitor before I
could even see the distinction you were making! (OK, some of that is
due to less-than-ideal fonts plus failing eyesight, but the point
remains...)

Paul.

From brian at sweetapp.com  Mon Aug 28 11:35:39 2006
From: brian at sweetapp.com (Brian Quinlan)
Date: Mon, 28 Aug 2006 11:35:39 +0200
Subject: [Python-3000] Warning about future-unsafe usage patterns in Python
 2.x e.g. dict.keys().sort()
In-Reply-To: <20060827214348.1AF4.JCARLSON@uci.edu>
References: <20060827191547.1AEB.JCARLSON@uci.edu>	<44F255D6.2060002@canterbury.ac.nz>
	<20060827214348.1AF4.JCARLSON@uci.edu>
Message-ID: <44F2B8EB.6040704@sweetapp.com>

It is my understanding that, in Python 3000, certain functions and 
methods that currently return lists will return some sort of view type 
(e.g. dict.values()) or an iterator (e.g. zip). So certain usage 
patterns will no longer be supported e.g. d.keys().sort().

The attached patch, which is a diff against the subversion "trunk" of 
Python 2.x, tries to warn the user about these kinds of future-unsafe
usage patterns. It works by storing the type that the list will become 
in the future, at creation time, and checking to see if called list 
functions will be supported by that type in the future.

Currently the patch is very incomplete and the idea itself may be
flawed. But I thought it was interesting to run it against my own code to
see what potential problems the code has. Example:

...
Type "help", "copyright", "credits" or "license" for more information.
 >>> d = {"apple" : "sweet", "orange" : "tangy"}
 >>> "juicy" in d.values()
False
 >>> d.keys().sort()
__main__:1: DeprecationWarning: dictionary view will not support sort
 >>> "a" in zip([1,2,3,4], "abcd")
__main__:1: DeprecationWarning: iterator will not support contains
False
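The patch does this in C inside listobject.c; a rough pure-Python analogue of the same idea (class name and helper invented here for illustration) might look like:

```python
import warnings

class FutureViewList(list):
    """List that warns about methods a Py3k dict view won't support."""
    _unsupported = ('sort', 'reverse', 'append', 'insert')

    def __getattribute__(self, name):
        if name in FutureViewList._unsupported:
            warnings.warn('dictionary view will not support %s' % name,
                          DeprecationWarning, stacklevel=2)
        return list.__getattribute__(self, name)

def keys_2to3(d):
    # Stand-in for what dict.keys() would return under the patch.
    return FutureViewList(d)

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter('always')
    keys_2to3({'apple': 'sweet', 'orange': 'tangy'}).sort()
```

caught then holds a single DeprecationWarning mentioning sort, mirroring the interpreter session above.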

Cheers,
Brian
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: warn_list_usage.diff
Url: http://mail.python.org/pipermail/python-3000/attachments/20060828/8f08a2a7/attachment-0001.diff 

From g.brandl at gmx.net  Mon Aug 28 12:22:11 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Mon, 28 Aug 2006 12:22:11 +0200
Subject: [Python-3000] Set literals
Message-ID: <ecug4k$cg8$1@sea.gmane.org>

At python.org/sf/1547796, there is a preliminary patch for Py3k set literals
as specified in PEP 3100.

Set comprehensions are not implemented.

have fun,
Georg


From rrr at ronadam.com  Mon Aug 28 13:14:14 2006
From: rrr at ronadam.com (Ron Adam)
Date: Mon, 28 Aug 2006 06:14:14 -0500
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F0107B.20205@iinet.net.au>
References: <44F0107B.20205@iinet.net.au>
Message-ID: <ecujbm$n90$1@sea.gmane.org>

Nick Coghlan wrote:
> This idea is inspired by the find/rfind string discussion (particularly a 
> couple of comments from Jim and Ron), but I think the applicability may prove 
> to be wider than just string methods (e.g. I suspect it may prove useful for 
> the bytes() type as well).

If I'm following the ideas here (which were based, only in part, on my
suggestion), it's not a major feature request, but rather a combination
of various small changes, each of which may have some benefits of its
own. The proposal is more in line with cleaning things up so they can
(if one desires) be made to work together more easily.  But that needn't
be the main reason for doing it.

I also recognize that python has many very specific functions and 
modules, many of which are highly optimized.  Most of the major problems 
have already been solved in that way, so it is really hard to find 
things that make a big difference.  But I don't think that means we 
shouldn't work on making small improvements to things where they are 
possible, even if it's only to make it a bit easier to remember and/or 
learn.


> I think an enriched slicing model that allows sequence views to be expressed 
> easily as "this slice of this sequence" would allow this to be dealt with 
> cleanly, without requiring every sequence to provide a corresponding "sequence 
> view" with non-copying semantics. I think Guido's concern that people will 
> reach for string views when they don't need them is also valid (as I believe 
> that it is most often inexperience that leads to premature optimization that 
> then leads to needless code complexity).

I agree with both of these, but maybe we should concentrate on the
individual changes and not a big picture to justify a group of changes.
The individual changes or enhancements need to stand on their own.

So in that light, the following individual *separate* items are what I
would focus on for now. (Not string views or slice partition functions;
let those come later if they prove useful.)


> The specific changes I suggest based on the find/rfind discussion are:
> 
>    1. make range() (what used to be xrange()) a subclass of slice(), so that 
> range objects can be used to index sequences. The only differences between 
> range() and slice() would then be that start/stop/step will never be None for 
> range instances, and range instances act like an immutable sequence while 
> slice instances do not (i.e. range objects would grow an indices() method).


1. Remove None stored as indices in slice objects. Depending on the step
value, any Nones can be converted to 0 or -1 immediately; the step
should never be None or zero.

Once the slice is created the Nones are not needed, and valid index values
can be determined. This moves the checks forward from slice object use
time to slice object creation time.

If a slice object is reused, then there might be some (micro) 
performance benefits if it is defined outside a loop and then used 
multiple times inside a loop.

Also, the indices can be read and used directly via slice.start, etc.,
without having to check for None or invalid indices, if someone wants to
do that.


>    2. change range() and slice() to accept slice() instances as arguments so 
> that range(range(0)) is equivalent to range(0). (range(x) may throw ValueError 
> if x.stop is None).

2. Enable slices and ranges to be converted back and forth.

This works now.

 >>> xrange(*slice(1,-1,1).indices(10))
xrange(1, 9)


There is no way to get the indices from an xrange object. They are not
available via attributes or methods (that I know of), but they can be
gotten by parsing the __repr__ string.

So this doesn't work.

     slice(*xrange(1,10,1).indices())   # no indices method

While I don't have any real specific use case for this item, it may have
some educational or introspective value, i.e. something to teach the
relationships between the two.  An xrange() object can also be defined
outside a loop and then used multiple times in an inner loop.
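For illustration, here is what that repr-parsing workaround could look like (shown with 3.x-style range objects so it runs today; in 2.x one would feed it an xrange the same way — a deliberately fragile sketch, not a recommendation):

```python
import re

def range_to_slice(r):
    # xrange in 2.x exposes no start/stop/step attributes, so the only
    # way back to a slice is to pick the numbers out of the repr.
    nums = [int(n) for n in re.findall(r'-?\d+', repr(r))]
    if len(nums) == 1:          # repr hid the default start and step
        nums = [0, nums[0], 1]
    elif len(nums) == 2:        # repr hid the default step
        nums.append(1)
    return slice(*nums)
```

range_to_slice(range(1, 10, 2)) gives slice(1, 10, 2); the very existence of this hack is an argument for exposing the indices as attributes.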



3. Continue to make xrange() and slice() a bit more alike in how they
work and the values they return, but keep them separate and don't
subclass range from slice.  Each has a definitely different purpose;
although they are related in some ways, they shouldn't try to 'be' the
other, I think.

The following examples show some inconsistencies in how they work, or
where they could be more alike.  For example, viewing an xrange vs. a
slice object returns differing representations depending on the values
of the indices.  These are just minor (barely) annoyances, and there
isn't anything actually wrong, but they could be improved a bit, I think.


# slice always shows all three values if viewed. (This is ok)
 >>> slice(10)
slice(None, 10, None)    # None stored as indices.
 >>> slice(0, 10, 1)
slice(0, 10, 1)

# - xrange only shows values different from the defaults.
 >>> xrange(10)
xrange(10)
 >>> xrange(1, 10)
xrange(1, 10)
 >>> xrange(0, 10, 1)
xrange(10)              # hides 0 and 1

# - The xrange stop value is always rounded up to start plus an
# even multiple of the step.
 >>> xrange(1, 10, 2)
xrange(1, 11, 2)        # 11! why not 10 here?
 >>> xrange(0, 10, 3)
xrange(0, 12, 3)        # and 12 instead of 10 here?


# slice accepts anything!
 >>> slice(1, 10, 0)         # zero for step
slice(1, 10, 0)
 >>> slice(list, int, dict)
slice(<type 'list'>, <type 'int'>, <type 'dict'>)

# xrange rejects any invalid indices.
 >>> xrange(None, 10, None)           # None not an integer.
Traceback (most recent call last):
   File "<stdin>", line 1, in ?
TypeError: an integer is required

 >>> xrange(1, 10, 0)
Traceback (most recent call last):
   File "<stdin>", line 1, in ?
ValueError: xrange() arg 3 must not be zero




4. Allow slice objects to be subclassed. That would allow for
experimentation, and for programmers to modify slice in ways they may
find useful for their own applications.  Most likely it would be a way
to group together methods that all use the same start, stop and/or step
indices.  Could it then be possible to apply those via the slice
operation at once?


5. Find a way to avoid slice wrap-arounds.  These happen when iterating
past zero in either direction.  It usually requires a different approach
and/or check to avoid going past the zero/-1 boundary.

One thought I've had on this is to allow only positive integers, along
with a symbol to indicate that an index is to be counted from the far end.
Then an exception could be raised if a negative index is used.

Possibly something like:
    [i:\j]     # '\' indicates j is to be counted from the far end.

The line-continuation backslash could be special-cased for use with
slices, I think.  But some other symbol might be better.
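Without touching the syntax, the same idea can be sketched with an explicit marker object (both names below are invented): negative indices raise, and counting from the far end must be asked for explicitly:

```python
class FromEnd(object):
    """Marker: an index counted from the far end of the sequence."""
    def __init__(self, n):
        self.n = n

def strict_slice(seq, start, stop):
    # Resolve markers, and refuse negative indices instead of wrapping.
    def resolve(i):
        if isinstance(i, FromEnd):
            return len(seq) - i.n
        if i < 0:
            raise IndexError('negative index not allowed: %d' % i)
        return i
    return seq[resolve(start):resolve(stop)]
```

strict_slice('abcdef', 1, FromEnd(1)) returns 'bcde', while strict_slice('abcdef', -1, 3) raises instead of silently wrapping past zero.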



I think this group of separate items taken together will do what the 
title in this thread suggests.  But each of these is a separate item in 
itself as well and has its own reasons why it could be helpful.



Regarding the other items...

The above changes possibly make some (or most) of the other suggestions
possible and/or easier to implement.  So then a programmer can roll
their own string views or slice partition functions in a clean way if
they want to.  That's the point of "making more effective use of
slice objects": it's not a specific idea, but a generality that may come
about by doing these other smaller things first.  And doing them as a
group is probably a good way to address these things.

I hope this clarifies at least my viewpoint, if not Nick's. But I'll keep
an open mind and see what he has to offer in his PEP.

Cheers,
    Ron


From ncoghlan at gmail.com  Mon Aug 28 13:40:41 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 28 Aug 2006 21:40:41 +1000
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608270908h130c9f29jce193dda6430e507@mail.gmail.com>
References: <44F0107B.20205@iinet.net.au>	
	<ca471dc20608260930h528a7f60rc254eb2f75398a57@mail.gmail.com>	
	<44F1BA0E.3040203@gmail.com>
	<ca471dc20608270908h130c9f29jce193dda6430e507@mail.gmail.com>
Message-ID: <44F2D639.1080808@gmail.com>

Guido van Rossum wrote:
> On 8/27/06, Nick Coghlan <ncoghlan at gmail.com> wrote:
>> I believe the current mixed model is actually an artifact of the 
>> transition
>> from simple slicing to extended slicing,
> 
> Really? Extended slicing mostly meant adding a third "step" option to
> the slice syntax, which is useful for NumPy but completely pointless
> for string searches as we're discussing here. The slice() object was
> invented as an API hack so that we didn't have to add new special
> methods.

This is exactly what I'm talking about - I believe the reason you don't see it
as an oddity is that you were used to the "start+stop" idiom from before
slice() was added. For me, having only started to seriously use Python after
the __*slice__ family of methods had already been deprecated, slice() objects
are the basic idiom, with any occurrences of "start+stop" being artifacts of
the old slicing model.

For someone picking up the language after slice() has been added, it's like 
"we've gone to all the effort of defining a type just for sequence slices, but 
we're only going to use it in this one little corner of the language".

>> All other instances in the core and standard library which use a 
>> different
>> representation of a sequence slice (like the optional arguments to string
>> methods, or the result of the indices() method) would change to use 
>> one of
>> those two types. The methods of the types would be driven by the needs 
>> of the
>> standard library.
> 
> What's the indices() method?

An existing method on slice objects that accepts a sequence length and returns 
the appropriate (start, stop, step) 3-tuple. Very handy for implementing 
__getitem__ methods properly.
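For example, a minimal __getitem__ built on slice.indices() (the Evens class is invented purely to demonstrate the pattern):

```python
class Evens(object):
    """Acts like the sequence of even numbers below a limit."""
    def __init__(self, limit):
        self._len = (limit + 1) // 2  # how many evens in [0, limit)

    def __len__(self):
        return self._len

    def __getitem__(self, index):
        if isinstance(index, slice):
            # indices() clamps the bounds and fills in the defaults
            # against our length, so no None-checking is needed here.
            start, stop, step = index.indices(self._len)
            return [2 * i for i in range(start, stop, step)]
        if index < 0:
            index += self._len
        if not 0 <= index < self._len:
            raise IndexError(index)
        return 2 * index
```

Evens(10)[1:4] yields [2, 4, 6], and oversized bounds like Evens(10)[:100] are clamped for free by indices().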

> Write the PEP and make sure it is plentiful of examples of old and new
> ways of doing common string operations.

Indeed!

Cheers,
Nick.


-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From edcjones at comcast.net  Mon Aug 28 16:21:10 2006
From: edcjones at comcast.net (Edward C. Jones)
Date: Mon, 28 Aug 2006 10:21:10 -0400
Subject: [Python-3000] Warning about future-unsafe usage patterns in
 Python 2.x e.g. dict.keys().sort()
In-Reply-To: <mailman.36113.1156757744.27774.python-3000@python.org>
References: <mailman.36113.1156757744.27774.python-3000@python.org>
Message-ID: <44F2FBD6.6040205@comcast.net>


Brian Quinlan said:
> It is my understanding that, in Python 3000, certain functions and 
> methods that currently return lists will return some sort of view type 
> (e.g. dict.values()) or an iterator (e.g. zip). So certain usage 
> patterns will no longer be supported e.g. d.keys().sort().

I use this idiom fairly often:

d = dict()
...
thekeys = d.keys()
thekeys.sort()
for key in thekeys:
     ...

What should I use in Python 3.0?

From fdrake at acm.org  Mon Aug 28 16:45:23 2006
From: fdrake at acm.org (Fred L. Drake, Jr.)
Date: Mon, 28 Aug 2006 10:45:23 -0400
Subject: [Python-3000] Warning about future-unsafe usage patterns in
	Python 2.x e.g. dict.keys().sort()
In-Reply-To: <44F2FBD6.6040205@comcast.net>
References: <mailman.36113.1156757744.27774.python-3000@python.org>
	<44F2FBD6.6040205@comcast.net>
Message-ID: <200608281045.24215.fdrake@acm.org>

On Monday 28 August 2006 10:21, Edward C. Jones wrote:
 > d = dict()
 > ...
 > thekeys = d.keys()
 > thekeys.sort()
 > for key in thekeys:
 >      ...
 >
 > What should I use in Python 3.0?

d = dict()
...
for key in sorted(d.keys()):
    ...


  -Fred

-- 
Fred L. Drake, Jr.   <fdrake at acm.org>

From ronaldoussoren at mac.com  Mon Aug 28 16:46:53 2006
From: ronaldoussoren at mac.com (Ronald Oussoren)
Date: Mon, 28 Aug 2006 16:46:53 +0200
Subject: [Python-3000] Warning about future-unsafe usage patterns in
	Python 2.x e.g. dict.keys().sort()
In-Reply-To: <44F2FBD6.6040205@comcast.net>
References: <mailman.36113.1156757744.27774.python-3000@python.org>
	<44F2FBD6.6040205@comcast.net>
Message-ID: <6667A80E-E767-4408-8B24-AF9AF3F2DAB0@mac.com>


On 28-aug-2006, at 16:21, Edward C. Jones wrote:

>
> Brian Quinlan said:
>> It is my understanding that, in Python 3000, certain functions and
>> methods that currently return lists will return some sort of view  
>> type
>> (e.g. dict.values()) or an iterator (e.g. zip). So certain usage
>> patterns will no longer be supported e.g. d.keys().sort().
>
> I use this idiom fairly often:
>
> d = dict()
> ...
> thekeys = d.keys()
> thekeys.sort()
> for key in thekeys:
>      ...
>
> What should I use in Python 3.0?

for key in sorted(d.keys()):
     ...

This works in python 2.4 as well.

Ronald


From david.nospam.hopwood at blueyonder.co.uk  Mon Aug 28 17:33:31 2006
From: david.nospam.hopwood at blueyonder.co.uk (David Hopwood)
Date: Mon, 28 Aug 2006 16:33:31 +0100
Subject: [Python-3000] Warning about future-unsafe usage patterns in
 Python 2.x e.g. dict.keys().sort()
In-Reply-To: <44F2B8EB.6040704@sweetapp.com>
References: <20060827191547.1AEB.JCARLSON@uci.edu>	<44F255D6.2060002@canterbury.ac.nz>	<20060827214348.1AF4.JCARLSON@uci.edu>
	<44F2B8EB.6040704@sweetapp.com>
Message-ID: <44F30CCB.8080705@blueyonder.co.uk>

Brian Quinlan wrote:
> It is my understanding that, in Python 3000, certain functions and
> methods that currently return lists will return some sort of view type
> (e.g. dict.values()) or an iterator (e.g. zip). So certain usage
> patterns will no longer be supported e.g. d.keys().sort().
> 
> The attached patch, which is a diff against the subversion "trunk" of
> Python 2.x, tries to warn the user about these kind of future-unsafe
> usage patterns. It works by storing the type that the list will become
> in the future, at creation time, and checking to see if called list
> functions will be supported by that type in the future.

+1 on the idea of the patch.

Some nitpicking:

> +#define PY_REMAIN_LIST     0x01  /* List will remain a list in Py2K */

"in Py3K".

> +		/* XXX This should be PyExc_PendingDeprecationWarning */
> +		if (PyErr_WarnEx(PyExc_DeprecationWarning, message, 1) < 0)
> +			return -1;

Why isn't it PyExc_PendingDeprecationWarning?

> +#define WARN_LIST_USAGE(self, supported_types, operation) \
> +	if (warn_future_usage((PyListObject *) self, \
> +                           supported_types, operation) < 0) \
> +		return NULL;
> +
> +#define WARN_LIST_USAGE_INT(self, supported_types, operation) \
> +	if (warn_future_usage((PyListObject *) self, \
> +						   supported_types, operation) < 0) \
> +		return -1;

These are macros that hide control flow. In this case I don't think that the
difference in verbosity between, say,

    if (warn_future_usage(a, PY_REMAIN_LIST | PY_BECOME_DICTVIEW, "len") < 0)
        return -1;

and

    WARN_LIST_USAGE_INT(a, PY_REMAIN_LIST | PY_BECOME_DICTVIEW, "len");

is sufficient to justify hiding the return in a macro.

(The cast to PyListObject * is not needed: you have the same cast within
warn_future_usage, so its 'self' argument could just as well be declared
as PyObject *.)


The 'operation' string is sometimes a gerund ("slicing", etc.) and sometimes
the name of a method. This should be more consistent.

> +	WARN_LIST_USAGE(a, PY_REMAIN_LIST, "repitition");

"repetition"

-- 
David Hopwood <david.nospam.hopwood at blueyonder.co.uk>



From guido at python.org  Mon Aug 28 18:22:52 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 28 Aug 2006 09:22:52 -0700
Subject: [Python-3000] Warning about future-unsafe usage patterns in
	Python 2.x e.g. dict.keys().sort()
In-Reply-To: <44F2B8EB.6040704@sweetapp.com>
References: <20060827191547.1AEB.JCARLSON@uci.edu>
	<44F255D6.2060002@canterbury.ac.nz>
	<20060827214348.1AF4.JCARLSON@uci.edu> <44F2B8EB.6040704@sweetapp.com>
Message-ID: <ca471dc20608280922k182b75d5xf271fc96f8c96c10@mail.gmail.com>

Not much time to review the patch, but +1 on this -- I've described
this a few times in my Py3k talk, glad that some code is forthcoming
now!

--Guido

On 8/28/06, Brian Quinlan <brian at sweetapp.com> wrote:
> It is my understanding that, in Python 3000, certain functions and
> methods that currently return lists will return some sort of view type
> (e.g. dict.values()) or an iterator (e.g. zip). So certain usage
> patterns will no longer be supported e.g. d.keys().sort().
>
> The attached patch, which is a diff against the subversion "trunk" of
> Python 2.x, tries to warn the user about these kind of future-unsafe
> usage patterns. It works by storing the type that the list will become
> in the future, at creation time, and checking to see if called list
> functions will be supported by that type in the future.
>
> Currently the patch if very incomplete and the idea itself may be
> flawed. But I thought it was interesting to run against my own code to
> see what potential problems it has. Example:
>
> ...
> Type "help", "copyright", "credits" or "license" for more information.
>  >>> d = {"apple" : "sweet", "orange" : "tangy"}
>  >>> "juicy" in d.values()
> False
>  >>> d.keys().sort()
> __main__:1: DeprecationWarning: dictionary view will not support sort
>  >>> "a" in zip([1,2,3,4], "abcd")
> __main__:1: DeprecationWarning: iterator will not support contains
> False
>
> Cheers,
> Brian
>
>
> Index: Python/bltinmodule.c
> ===================================================================
> --- Python/bltinmodule.c        (revision 51629)
> +++ Python/bltinmodule.c        (working copy)
> @@ -1570,7 +1570,7 @@
>                 goto Fail;
>         }
>
> -       v = PyList_New(n);
> +       v = PyList_NewFutureType(n, PY_BECOME_ITER);
>         if (v == NULL)
>                 goto Fail;
>
> @@ -1678,7 +1678,7 @@
>                                 "range() result has too many items");
>                 return NULL;
>         }
> -       v = PyList_New(n);
> +       v = PyList_NewFutureType(n, PY_BECOME_ITER);
>         if (v == NULL)
>                 return NULL;
>         for (i = 0; i < n; i++) {
> @@ -2120,7 +2120,7 @@
>         Py_ssize_t len;    /* guess at result length */
>
>         if (itemsize == 0)
> -               return PyList_New(0);
> +               return PyList_NewFutureType(0, PY_BECOME_ITER);
>
>         /* args must be a tuple */
>         assert(PyTuple_Check(args));
> @@ -2148,7 +2148,7 @@
>         /* allocate result list */
>         if (len < 0)
>                 len = 10;       /* arbitrary */
> -       if ((ret = PyList_New(len)) == NULL)
> +       if ((ret = PyList_NewFutureType(len, PY_BECOME_ITER)) == NULL)
>                 return NULL;
>
>         /* obtain iterators */
> Index: Include/listobject.h
> ===================================================================
> --- Include/listobject.h        (revision 51629)
> +++ Include/listobject.h        (working copy)
> @@ -19,6 +19,12 @@
>  extern "C" {
>  #endif
>
> +/* Constants representing the types that may be used instead of a list
> +   in Python 3000 */
> +#define PY_REMAIN_LIST     0x01  /* List will remain a list in Py2K */
> +#define PY_BECOME_DICTVIEW 0x02  /* List will become a "view" on a dict */
> +#define PY_BECOME_ITER     0x04  /* List will become an iterator */
> +
>  typedef struct {
>      PyObject_VAR_HEAD
>      /* Vector of pointers to list elements.  list[0] is ob_item[0], etc. */
> @@ -36,6 +42,7 @@
>       * the list is not yet visible outside the function that builds it.
>       */
>      Py_ssize_t allocated;
> +    int future_type; /* The type the object will have in Py3K */
>  } PyListObject;
>
>  PyAPI_DATA(PyTypeObject) PyList_Type;
> @@ -44,6 +51,7 @@
>  #define PyList_CheckExact(op) ((op)->ob_type == &PyList_Type)
>
>  PyAPI_FUNC(PyObject *) PyList_New(Py_ssize_t size);
> +PyAPI_FUNC(PyObject *) PyList_NewFutureType(Py_ssize_t size, int future_type);
>  PyAPI_FUNC(Py_ssize_t) PyList_Size(PyObject *);
>  PyAPI_FUNC(PyObject *) PyList_GetItem(PyObject *, Py_ssize_t);
>  PyAPI_FUNC(int) PyList_SetItem(PyObject *, Py_ssize_t, PyObject *);
> @@ -57,6 +65,9 @@
>  PyAPI_FUNC(PyObject *) _PyList_Extend(PyListObject *, PyObject *);
>
>  /* Macro, trading safety for speed */
> +/* XXX These functions do not (yet) trigger future usage warnings.
> +   So e.g. range(100)[0] will slip through
> +*/
>  #define PyList_GET_ITEM(op, i) (((PyListObject *)(op))->ob_item[i])
>  #define PyList_SET_ITEM(op, i, v) (((PyListObject *)(op))->ob_item[i] = (v))
>  #define PyList_GET_SIZE(op)    (((PyListObject *)(op))->ob_size)
> Index: Objects/dictobject.c
> ===================================================================
> --- Objects/dictobject.c        (revision 51629)
> +++ Objects/dictobject.c        (working copy)
> @@ -1003,7 +1003,7 @@
>
>    again:
>         n = mp->ma_used;
> -       v = PyList_New(n);
> +       v = PyList_NewFutureType(n, PY_BECOME_DICTVIEW);
>         if (v == NULL)
>                 return NULL;
>         if (n != mp->ma_used) {
> @@ -1037,7 +1037,7 @@
>
>    again:
>         n = mp->ma_used;
> -       v = PyList_New(n);
> +       v = PyList_NewFutureType(n, PY_BECOME_DICTVIEW);
>         if (v == NULL)
>                 return NULL;
>         if (n != mp->ma_used) {
> @@ -1076,7 +1076,7 @@
>          */
>    again:
>         n = mp->ma_used;
> -       v = PyList_New(n);
> +       v = PyList_NewFutureType(n, PY_BECOME_DICTVIEW);
>         if (v == NULL)
>                 return NULL;
>         for (i = 0; i < n; i++) {
> Index: Objects/listobject.c
> ===================================================================
> --- Objects/listobject.c        (revision 51629)
> +++ Objects/listobject.c        (working copy)
> @@ -8,6 +8,49 @@
>  #include <sys/types.h>         /* For size_t */
>  #endif
>
> +static int warn_future_usage(PyListObject *self,
> +                                                int supported_types, char *operation)
> +{
> +       char message[256];
> +
> +       if ((((PyListObject *) self)->future_type & supported_types) == 0)
> +       {
> +               switch (self->future_type) {
> +                       case PY_BECOME_DICTVIEW:
> +                               PyOS_snprintf(message, sizeof(message),
> +                                       "dictionary view will not support %s",
> +                                       operation);
> +                               break;
> +                       case PY_BECOME_ITER:
> +                               PyOS_snprintf(message, sizeof(message),
> +                                       "iterator will not support %s",
> +                                       operation);
> +                               break;
> +                       default: /* This shouldn't happen */
> +                               PyErr_BadInternalCall();
> +                               return -1;
> +               }
> +
> +               /* XXX This should be PyExc_PendingDeprecationWarning */
> +               if (PyErr_WarnEx(PyExc_DeprecationWarning, message, 1) < 0)
> +                       return -1;
> +       }
> +
> +       return 0;
> +}
> +
> +#define WARN_LIST_USAGE(self, supported_types, operation) \
> +       if (warn_future_usage((PyListObject *) self, \
> +                           supported_types, operation) < 0) \
> +               return NULL;
> +
> +#define WARN_LIST_USAGE_INT(self, supported_types, operation) \
> +       if (warn_future_usage((PyListObject *) self, \
> +                                                  supported_types, operation) < 0) \
> +               return -1;
> +
> +#define PyList_Check(op) PyObject_TypeCheck(op, &PyList_Type)
> +
>  /* Ensure ob_item has room for at least newsize elements, and set
>   * ob_size to newsize.  If newsize > ob_size on entry, the content
>   * of the new slots at exit is undefined heap trash; it's the caller's
> @@ -116,10 +159,29 @@
>         }
>         op->ob_size = size;
>         op->allocated = size;
> +       op->future_type = PY_REMAIN_LIST;
>         _PyObject_GC_TRACK(op);
>         return (PyObject *) op;
>  }
>
> +PyObject *
> +PyList_NewFutureType(Py_ssize_t size, int future_type)
> +{
> +       PyListObject *op = (PyListObject *) PyList_New(size);
> +       if (op == NULL)
> +               return NULL;
> +       else {
> +               if (future_type == 0)
> +               {
> +                       Py_DECREF(op);
> +                       PyErr_BadInternalCall();
> +                       return NULL;
> +               }
> +               op->future_type = future_type;
> +               return (PyObject *) op;
> +       }
> +}
> +
>  Py_ssize_t
>  PyList_Size(PyObject *op)
>  {
> @@ -369,6 +431,7 @@
>  static Py_ssize_t
>  list_length(PyListObject *a)
>  {
> +       WARN_LIST_USAGE_INT(a, PY_REMAIN_LIST | PY_BECOME_DICTVIEW, "len");
>         return a->ob_size;
>  }
>
> @@ -378,6 +441,7 @@
>         Py_ssize_t i;
>         int cmp;
>
> +       WARN_LIST_USAGE_INT(a, PY_REMAIN_LIST | PY_BECOME_DICTVIEW, "contains");
>         for (i = 0, cmp = 0 ; cmp == 0 && i < a->ob_size; ++i)
>                 cmp = PyObject_RichCompareBool(el, PyList_GET_ITEM(a, i),
>                                                    Py_EQ);
> @@ -387,6 +451,7 @@
>  static PyObject *
>  list_item(PyListObject *a, Py_ssize_t i)
>  {
> +       WARN_LIST_USAGE(a, PY_REMAIN_LIST, "item indexing");
>         if (i < 0 || i >= a->ob_size) {
>                 if (indexerr == NULL)
>                         indexerr = PyString_FromString(
> @@ -404,6 +469,8 @@
>         PyListObject *np;
>         PyObject **src, **dest;
>         Py_ssize_t i, len;
> +
> +       WARN_LIST_USAGE(a, PY_REMAIN_LIST, "slicing");
>         if (ilow < 0)
>                 ilow = 0;
>         else if (ilow > a->ob_size)
> @@ -444,6 +511,9 @@
>         Py_ssize_t i;
>         PyObject **src, **dest;
>         PyListObject *np;
> +
> +       WARN_LIST_USAGE(a, PY_REMAIN_LIST, "concatenation");
> +
>         if (!PyList_Check(bb)) {
>                 PyErr_Format(PyExc_TypeError,
>                           "can only concatenate list (not \"%.200s\") to list",
> @@ -484,6 +554,8 @@
>         PyListObject *np;
>         PyObject **p, **items;
>         PyObject *elem;
> +
> +       WARN_LIST_USAGE(a, PY_REMAIN_LIST, "repetition");
>         if (n < 0)
>                 n = 0;
>         size = a->ob_size * n;
> @@ -521,6 +593,8 @@
>  {
>         Py_ssize_t i;
>         PyObject **item = a->ob_item;
> +
> +       WARN_LIST_USAGE_INT(a, PY_REMAIN_LIST, "clear");
>         if (item != NULL) {
>                 /* Because XDECREF can recursively invoke operations on
>                    this list, we make it empty first. */
> @@ -565,6 +639,9 @@
>         Py_ssize_t k;
>         size_t s;
>         int result = -1;        /* guilty until proved innocent */
> +
> +       WARN_LIST_USAGE_INT(a, PY_REMAIN_LIST, "slicing");
> +
>  #define b ((PyListObject *)v)
>         if (v == NULL)
>                 n = 0;
> @@ -658,9 +735,9 @@
>  {
>         PyObject **items;
>         Py_ssize_t size, i, j, p;
> +       size = PyList_GET_SIZE(self);
>
> -
> -       size = PyList_GET_SIZE(self);
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "repeat");
>         if (size == 0) {
>                 Py_INCREF(self);
>                 return (PyObject *)self;
> @@ -692,6 +769,8 @@
>  list_ass_item(PyListObject *a, Py_ssize_t i, PyObject *v)
>  {
>         PyObject *old_value;
> +
> +       WARN_LIST_USAGE_INT(a, PY_REMAIN_LIST, "item assignment");
>         if (i < 0 || i >= a->ob_size) {
>                 PyErr_SetString(PyExc_IndexError,
>                                 "list assignment index out of range");
> @@ -711,6 +790,8 @@
>  {
>         Py_ssize_t i;
>         PyObject *v;
> +
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "insert");
>         if (!PyArg_ParseTuple(args, "nO:insert", &i, &v))
>                 return NULL;
>         if (ins1(self, i, v) == 0)
> @@ -721,6 +802,7 @@
>  static PyObject *
>  listappend(PyListObject *self, PyObject *v)
>  {
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "append");
>         if (app1(self, v) == 0)
>                 Py_RETURN_NONE;
>         return NULL;
> @@ -736,6 +818,7 @@
>         Py_ssize_t i;
>         PyObject *(*iternext)(PyObject *);
>
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "extend");
>         /* Special cases:
>            1) lists and tuples which can use PySequence_Fast ops
>            2) extending self to self requires making a copy first
> @@ -851,6 +934,7 @@
>  {
>         PyObject *result;
>
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "concatenation");
>         result = listextend(self, other);
>         if (result == NULL)
>                 return result;
> @@ -866,6 +950,7 @@
>         PyObject *v, *arg = NULL;
>         int status;
>
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "pop");
>         if (!PyArg_UnpackTuple(args, "pop", 0, 1, &arg))
>                 return NULL;
>         if (arg != NULL) {
> @@ -1995,6 +2080,8 @@
>         PyObject *key, *value, *kvpair;
>         static char *kwlist[] = {"cmp", "key", "reverse", 0};
>
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "sort");
> +
>         assert(self != NULL);
>         assert (PyList_Check(self));
>         if (args != NULL) {
> @@ -2163,6 +2250,7 @@
>  static PyObject *
>  listreverse(PyListObject *self)
>  {
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "reverse");
>         if (self->ob_size > 1)
>                 reverse_slice(self->ob_item, self->ob_item + self->ob_size);
>         Py_RETURN_NONE;
> @@ -2213,6 +2301,7 @@
>         Py_ssize_t i, start=0, stop=self->ob_size;
>         PyObject *v;
>
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "index");
>         if (!PyArg_ParseTuple(args, "O|O&O&:index", &v,
>                                     _PyEval_SliceIndex, &start,
>                                     _PyEval_SliceIndex, &stop))
> @@ -2244,6 +2333,7 @@
>         Py_ssize_t count = 0;
>         Py_ssize_t i;
>
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "count");
>         for (i = 0; i < self->ob_size; i++) {
>                 int cmp = PyObject_RichCompareBool(self->ob_item[i], v, Py_EQ);
>                 if (cmp > 0)
> @@ -2259,6 +2349,7 @@
>  {
>         Py_ssize_t i;
>
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "remove");
>         for (i = 0; i < self->ob_size; i++) {
>                 int cmp = PyObject_RichCompareBool(self->ob_item[i], v, Py_EQ);
>                 if (cmp > 0) {
> @@ -2372,6 +2463,7 @@
>                self->allocated == 0 || self->allocated == -1);
>
>         /* Empty previous contents */
> +       self->future_type = PY_REMAIN_LIST;
>         if (self->ob_item != NULL) {
>                 (void)list_clear(self);
>         }
> @@ -2456,6 +2548,8 @@
>  static PyObject *
>  list_subscript(PyListObject* self, PyObject* item)
>  {
> +       WARN_LIST_USAGE(self, PY_REMAIN_LIST, "__getitem__");
> +
>         if (PyIndex_Check(item)) {
>                 Py_ssize_t i;
>                 i = PyNumber_AsSsize_t(item, PyExc_IndexError);
> @@ -2505,6 +2599,7 @@
>  static int
>  list_ass_subscript(PyListObject* self, PyObject* item, PyObject* value)
>  {
> +       WARN_LIST_USAGE_INT(self, PY_REMAIN_LIST, "item assignment");
>         if (PyIndex_Check(item)) {
>                 Py_ssize_t i = PyNumber_AsSsize_t(item, PyExc_IndexError);
>                 if (i == -1 && PyErr_Occurred())
> @@ -2874,6 +2969,7 @@
>  {
>         listreviterobject *it;
>
> +       WARN_LIST_USAGE(seq, PY_REMAIN_LIST, "reversed");
>         it = PyObject_GC_New(listreviterobject, &PyListRevIter_Type);
>         if (it == NULL)
>                 return NULL;
>
>
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>
>
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)
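
[Editorial note: the quoted patch threads a future_type field through
PyListObject and emits a warning whenever a list is used in a way its Py3k
replacement will not support. A rough Python-level sketch of that behaviour
follows; the class name and warning text are illustrative, not from the
patch.]

```python
import warnings

class FutureTypeList(list):
    """Hypothetical stand-in for the patch's future_type mechanism:
    a list flagged PY_BECOME_ITER that warns on operations an
    iterator will not support."""
    def __getitem__(self, i):
        warnings.warn("iterator will not support item indexing",
                      DeprecationWarning, stacklevel=2)
        return list.__getitem__(self, i)

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    r = FutureTypeList(range(5))   # think: the list returned by range()
    first = r[0]                   # range(100)[0]-style usage -> warning

assert first == 0
assert any(issubclass(w.category, DeprecationWarning) for w in caught)
```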

From guido at python.org  Mon Aug 28 18:42:06 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 28 Aug 2006 09:42:06 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060827184941.1AE8.JCARLSON@uci.edu>
References: <20060827091000.1ADF.JCARLSON@uci.edu>
	<ca471dc20608271417w480c90aeg6b39c766a8f94750@mail.gmail.com>
	<20060827184941.1AE8.JCARLSON@uci.edu>
Message-ID: <ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>

Josiah (and other supporters of string views),

You seem to be utterly convinced of the superior performance of your
proposal without having done any measurements.

You appear to have a rather naive view on what makes code execute fast
or slow (e.g. you don't seem to appreciate the savings due to a string
object header and its data being consecutive in memory).

Unless you have serious benchmark data (for realistic Python code) I
can't continue to participate in this discussion, where you have said
nothing new in many posts.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From fredrik at pythonware.com  Mon Aug 28 18:48:52 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Mon, 28 Aug 2006 18:48:52 +0200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>
References: <20060827091000.1ADF.JCARLSON@uci.edu>	<ca471dc20608271417w480c90aeg6b39c766a8f94750@mail.gmail.com>	<20060827184941.1AE8.JCARLSON@uci.edu>
	<ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>
Message-ID: <ecv6pj$1mn$1@sea.gmane.org>

Guido van Rossum wrote:

> (e.g. you don't seem to appreciate the savings due to a string
> object header and its data being consecutive in memory).

footnote: note that the Unicode string type still doesn't do that (my 
original implementation *did* support string views, and nobody's ever 
gotten around to fully rip it out), so if anyone wants to benchmark 
things related to this specific feature, comparing unicode strings with 
8-bit strings could be somewhat useful.

</F>


From guido at python.org  Mon Aug 28 18:52:23 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 28 Aug 2006 09:52:23 -0700
Subject: [Python-3000] Set literals
In-Reply-To: <ecug4k$cg8$1@sea.gmane.org>
References: <ecug4k$cg8$1@sea.gmane.org>
Message-ID: <ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>

On 8/28/06, Georg Brandl <g.brandl at gmx.net> wrote:
> At python.org/sf/1547796, there is a preliminary patch for Py3k set literals
> as specified in PEP 3100.

Very cool! This is now checked in.

Georg, can you do something about repr() of an empty set? This
currently produces "{}" while it should produce "set()".
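
[Editorial sketch: `{}` is taken by the empty dict, which is why the empty
set cannot keep brace notation; later Python releases behave exactly as
requested here.]

```python
# {} is taken by the empty dict, so the empty set has no literal form
# and its repr must round-trip through the constructor instead.
assert repr({}) == "{}"                # the empty *dict*
assert repr(set()) == "set()"
assert eval(repr(set())) == set()      # round-trips
assert repr({1}) == "{1}"              # non-empty sets do use braces
```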

> Set comprehensions are not implemented.

ETA?

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From g.brandl at gmx.net  Mon Aug 28 19:44:52 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Mon, 28 Aug 2006 19:44:52 +0200
Subject: [Python-3000] Set literals
In-Reply-To: <ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>
References: <ecug4k$cg8$1@sea.gmane.org>
	<ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>
Message-ID: <ecva2k$dep$1@sea.gmane.org>

Guido van Rossum wrote:
> On 8/28/06, Georg Brandl <g.brandl at gmx.net> wrote:
>> At python.org/sf/1547796, there is a preliminary patch for Py3k set literals
>> as specified in PEP 3100.
> 
> Very cool! This is now checked in.

Wow, that's fast...

> Georg, can you do something about repr() of an empty set? This
> currently produces "{}" while it should produce "set()".

Right, forgot about that case. I'll correct that now.
(Grr, I even mindlessly changed the unittest that would have caught it)

In the meantime, I played around with the peepholer and tried to copy
the "for x in tuple_or_list" optimization for sets. Results are in SF
patch #1548082.

>> Set comprehensions are not implemented.
> 
> ETA?

There are some points I'd like to have clarified first:

* would it be wise to have some general listcomp <-> genexp
   cleanup first? This starts with the grammar, which currently is slightly
   different (see Grammar:79), and it looks like there's quite a lot of
   (almost) duplicated code in ast.c and compile.c too.

* list comprehensions are special-cased because of the LIST_APPEND opcode.
   If there isn't going to be a special-cased SET_ADD, it's probably the
   easiest thing to transform {x for x in a} into set(x for x in a) in the
   AST step, with "set" of course always being the builtin set.
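
[Editorial sketch, in present-day Python rather than the Py3k branch: the
proposed transform is observable today, where the two spellings build the
same set even though they remain distinct AST node types.]

```python
import ast

a = [1, 2, 2, 3]
# The proposed desugaring: both spellings build the same set...
assert {x for x in a} == set(x for x in a) == {1, 2, 3}

# ...even though they are distinct nodes at the AST level.
assert isinstance(ast.parse("{x for x in a}", mode="eval").body, ast.SetComp)
assert isinstance(ast.parse("set(x for x in a)", mode="eval").body, ast.Call)
```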

Georg


From guido at python.org  Mon Aug 28 20:55:30 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 28 Aug 2006 11:55:30 -0700
Subject: [Python-3000] Set literals
In-Reply-To: <ecva2k$dep$1@sea.gmane.org>
References: <ecug4k$cg8$1@sea.gmane.org>
	<ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>
	<ecva2k$dep$1@sea.gmane.org>
Message-ID: <ca471dc20608281155q444e70e8v3d076399ba0d919d@mail.gmail.com>

On 8/28/06, Georg Brandl <g.brandl at gmx.net> wrote:
> Guido van Rossum wrote:
> > On 8/28/06, Georg Brandl <g.brandl at gmx.net> wrote:
> >> At python.org/sf/1547796, there is a preliminary patch for Py3k set literals
> >> as specified in PEP 3100.
> >
> > Very cool! This is now checked in.
>
> Wow, that's fast...

Well it passed all unit tests and the rules for the py3k branch are a
bit looser than for the head... :)

> > Georg, can you do something about repr() of an empty set? This
> > currently produces "{}" while it should produce "set()".
>
> Right, forgot about that case. I'll correct that now.
> (Grr, I even mindlessly changed the unittest that would have caught it)

Checkin?

> In the meantime, I played around with the peepholer and tried to copy
> the "for x in tuple_or_list" optimization for sets. Results are in SF
> patch #1548082.
>
> >> Set comprehensions are not implemented.
> >
> > ETA?
>
> There are some points I'd like to have clarified first:
>
> * would it be wise to have some general listcomp <-> genexp
>    cleanup first? This starts with the grammar, which currently is slightly
>    different (see Grammar:79), and it looks like there's quite a lot of
>    (almost) duplicated code in ast.c and compile.c too.

I expect this cleanup to be quite a bit of work since the semantics are
seriously different. ([...] uses the surrounding scope for the loop
control variables.)

However you might be able to just cleanup the grammar so they are
identical, that would be simpler I suspect.

> * list comprehensions are special-cased because of the LIST_APPEND opcode.
>    If there isn't going to be a special-cased SET_ADD, it's probably the
>    easiest thing to transform {x for x in a} into set(x for x in a) in the
>    AST step, with "set" of course always being the builtin set.

Right. That might actually become a prototype for how to do the list
translation as well.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Mon Aug 28 21:49:39 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Mon, 28 Aug 2006 12:49:39 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>
References: <20060827184941.1AE8.JCARLSON@uci.edu>
	<ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>
Message-ID: <20060828120741.1AF7.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> 
> Josiah (and other supporters of string views),
> 
> You seem to be utterly convinced of the superior performance of your
> proposal without having done any measurements.
> 
> You appear to have a rather naive view on what makes code execute fast
> or slow (e.g. you don't seem to appreciate the savings due to a string
> object header and its data being consecutive in memory).
> 
> Unless you have serious benchmark data (for realistic Python code) I
> can't continue to participate in this discussion, where you have said
> nothing new in many posts.

Put up or shut up, eh?

I have written a simple extension module using Pyrex (my manual C
extension writing is awful).  Here are some sample interactions showing
that string views are indeed quite fast.  In all of these examples, a
naive implementation using only stringview.partition() was able to beat
Python 2.5 str.partition, str.split, and re.finditer.

Attached you will find the implementation of stringview I used, along
with sufficient build scripts to get it working using Python 2.3 and
Pyrex 0.9.3 .  Aside from replacing int usage with Py_ssize_t for 2.5,
and *nix users performing a dos2unix call, it should work without change
with the most recent Python and Pyrex versions.

 - Josiah


Using 2.3 :
    >>> x = stringview(40000*' ')
    >>> if 1:
    ...     t = time.time()
    ...     while x:
    ...             _1, _2, x = x.partition(' ')
    ...     print time.time()-t
    ... 
    0.18700003624
    >>> 

Compared with Python 2.5 beta 2
    >>> x = 40000*' '
    >>> if 1:
    ...     t = time.time()
    ...     while x:
    ...             _1, _2, x = x.partition(' ')
    ...     print time.time()-t
    ...
    0.625
    >>> 

But that's about as bad for Python 2.5 as it can get.  What about
something else?  Like a mail file?  In my 21.5 meg archive of py3k,
which contains 3456 messages, I wanted to discover all messages.

Python 2.3.5 (#62, Feb  8 2005, 16:23:02) [MSC v.1200 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> from stringview import *
>>> rest = stringview(open('mail', 'rb').read())
>>> import time
>>> if 1:
...     x = []
...     t = time.time()
...     while rest:
...         cur, found, rest = rest.partition('\r\n.\r\n')
...         x.append(cur)
...     print time.time()-t, len(x)
...
0.0780000686646 3456
>>> 

What about Python 2.5 using split?  That should be fast...

Python 2.5b2 (r25b2:50512, Jul 11 2006, 10:16:14) [MSC v.1310 32 bit (Intel)] on
 win32
Type "help", "copyright", "credits" or "license" for more information.
>>> rest = open('mail', 'rb').read()
>>> import time
>>> if 1:
...     t = time.time()
...     x = rest.split('\r\n.\r\n')
...     print time.time()-t, len(x)
...
0.109999895096 3457
>>> 

Hrm...what about using re?
>>> import re
>>> pat = re.compile('\r\n\.\r\n')
>>> rest = open('mail', 'rb').read()
>>> import time
>>> if 1:
...     x = []
...     t = time.time()
...     for i in pat.finditer(rest):
...         x.append(i)
...     print time.time()-t, len(x)
...
0.125 3456
>>>

Even that's not as good as Python 2.3 + string views.
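
[Editorial sketch: the first comparison above can be reproduced without
string views at all. The partition loop is quadratic because each call
copies the remaining tail, while split() makes a single pass; sizes here
are arbitrary.]

```python
import time

s = 10000 * ' '

# Quadratic: every partition() call copies the remaining tail.
t0 = time.perf_counter()
x, n = s, 0
while x:
    _head, _sep, x = x.partition(' ')
    n += 1
quadratic = time.perf_counter() - t0

# Linear: split() makes a single pass over the string.
t0 = time.perf_counter()
parts = s.split(' ')
linear = time.perf_counter() - t0

assert n == 10000               # one piece per separator consumed
assert len(parts) == 10001      # split yields 10001 empty strings
```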

-------------- next part --------------
A non-text attachment was scrubbed...
Name: stringview_build.py
Type: application/octet-stream
Size: 654 bytes
Desc: not available
Url : http://mail.python.org/pipermail/python-3000/attachments/20060828/916e8238/attachment.obj 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: stringview.pyx
Type: application/octet-stream
Size: 2639 bytes
Desc: not available
Url : http://mail.python.org/pipermail/python-3000/attachments/20060828/916e8238/attachment-0001.obj 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: stringview_helper.h
Type: application/octet-stream
Size: 1656 bytes
Desc: not available
Url : http://mail.python.org/pipermail/python-3000/attachments/20060828/916e8238/attachment-0002.obj 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: _setup.py
Type: application/octet-stream
Size: 255 bytes
Desc: not available
Url : http://mail.python.org/pipermail/python-3000/attachments/20060828/916e8238/attachment-0003.obj 

From g.brandl at gmx.net  Mon Aug 28 21:52:53 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Mon, 28 Aug 2006 21:52:53 +0200
Subject: [Python-3000] Set literals
In-Reply-To: <ca471dc20608281155q444e70e8v3d076399ba0d919d@mail.gmail.com>
References: <ecug4k$cg8$1@sea.gmane.org>	<ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>	<ecva2k$dep$1@sea.gmane.org>
	<ca471dc20608281155q444e70e8v3d076399ba0d919d@mail.gmail.com>
Message-ID: <ecvhil$7mg$1@sea.gmane.org>

Guido van Rossum wrote:

>> > Georg, can you do something about repr() of an empty set? This
>> > currently produces "{}" while it should produce "set()".
>>
>> Right, forgot about that case. I'll correct that now.
>> (Grr, I even mindlessly changed the unittest that would have caught it)
> 
> Checkin?

Done. It now also renders repr(frozenset()) as "frozenset()", which should
cause no problems though.

>> In the meantime, I played around with the peepholer and tried to copy
>> the "for x in tuple_or_list" optimization for sets. Results are in SF
>> patch #1548082.
>>
>> >> Set comprehensions are not implemented.
>> >
>> > ETA?
>>
>> There are some points I'd like to have clarified first:
>>
>> * would it be wise to have some general listcomp <-> genexp
>>    cleanup first? This starts with the grammar, which currently is slightly
>>    different (see Grammar:79), and it looks like there's quite a lot of
>>    (almost) duplicated code in ast.c and compile.c too.
> 
> I expect this cleanup to be quite a bit of work since the semantics are
> seriously different. ([...] uses the surrounding scope for the loop
> control variables.)

I didn't say that I wanted to champion that cleanup ;)

> However you might be able to just cleanup the grammar so they are
> identical, that would be simpler I suspect.

Looking at the grammar, there's only testlist_safe left to kill, in
favor of or_test like in generator expressions. The old_ rules are still
needed.

Hm. Is the precedence in

x = lambda: 1 if 0 else 2

really obvious?
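
[Editorial sketch: it binds the conditional inside the lambda body, i.e.
`lambda: (1 if 0 else 2)`; the other grouping needs explicit parentheses.]

```python
f = lambda: 1 if 0 else 2
assert f() == 2                  # parsed as lambda: (1 if 0 else 2)

g = (lambda: 1) if 0 else 2      # the other grouping needs parentheses
assert g == 2
```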


>> * list comprehensions are special-cased because of the LIST_APPEND opcode.
>>    If there isn't going to be a special-cased SET_ADD, it's probably the
>>    easiest thing to transform {x for x in a} into set(x for x in a) in the
>>    AST step, with "set" of course always being the builtin set.
> 
> Right. That might actually become a prototype for how to do the list
> translation as well.

Would this need a new opcode, or should generators be special-cased by
BUILD_SET?

Which doesn't seem like a good idea because it means that
     {(x for x in iterable)} == {x for x in iterable}
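
[Editorial sketch: the two spellings indeed must stay distinct, as later
Python confirms; a parenthesized generator expression inside braces is a
one-element set literal, not a comprehension.]

```python
import types

iterable = [1, 2, 3]
s1 = {(x for x in iterable)}     # a set containing ONE generator object
s2 = {x for x in iterable}       # the comprehension: the set {1, 2, 3}

assert len(s1) == 1
assert isinstance(next(iter(s1)), types.GeneratorType)
assert s2 == {1, 2, 3}
```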

Georg


From guido at python.org  Mon Aug 28 22:07:55 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 28 Aug 2006 13:07:55 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060828120741.1AF7.JCARLSON@uci.edu>
References: <20060827184941.1AE8.JCARLSON@uci.edu>
	<ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>
	<20060828120741.1AF7.JCARLSON@uci.edu>
Message-ID: <ca471dc20608281307uf5e1995vfa65538e156f0c0a@mail.gmail.com>

Those are all microbenchmarks. It's easy to prove the superiority of
an approach that way. But what about realistic applications? What if
your views don't end up saving memory or time for an application, but
still cost in terms of added complexity in all string operations?

Anyway, let me begin with your  microbenchmark.

The first one pits a linear algorithm against a quadratic algorithm
with the expected result.

The second one is more interesting; your version doesn't copy while
the split() version copies, and that gives your version the expected
speedup. I never doubted this.

But your code has a worst-case problem: if you take a single short
view of a really long string and then drop the long string, the view
keeps it around. Something like this:

rest = ... # your mailbox file
results = []
for i in range(1000):
  x = rest + "." # Just to force a copy
  results.append(x.partition("\r\n.\r\n")[0])  # Save the *first* message over and over

Now watch the memory growth with your version vs. with standard partition.

Now fix this in your code and re-run your benchmark.
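
[Editorial sketch: Python's later memoryview type exhibits exactly this
worst case, using modern syntax and an arbitrary buffer size: a tiny slice
keeps the whole underlying buffer alive until it is released.]

```python
big = bytes(10_000_000)          # a 10 MB buffer
view = memoryview(big)[:5]       # keep only a 5-byte slice of it
del big                          # the name goes away...

pinned = len(view.obj)           # ...but the view still pins the buffer
assert len(view) == 5
assert pinned == 10_000_000

view.release()                   # the fix: explicitly drop the buffer
```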

Then I come with another worst-case scenario, etc.

Then I ask you to make it so that string views are 99.999%
indistinguishable from strings -- they have all the same methods, are
usable everywhere else, etc.

--Guido

On 8/28/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> "Guido van Rossum" <guido at python.org> wrote:
> >
> > Josiah (and other supporters of string views),
> >
> > You seem to be utterly convinced of the superior performance of your
> > proposal without having done any measurements.
> >
> > You appear to have a rather naive view on what makes code execute fast
> > or slow (e.g. you don't seem to appreciate the savings due to a string
> > object header and its data being consecutive in memory).
> >
> > Unless you have serious benchmark data (for realistic Python code) I
> > can't continue to participate in this discussion, where you have said
> > nothing new in many posts.
>
> Put up or shut up, eh?
>
> I have written a simple extension module using Pyrex (my manual C
> extension writing is awful).  Here are some sample interactions showing
> that string views are indeed quite fast.  In all of these examples, a
> naive implementation using only stringview.partition() was able to beat
> Python 2.5 str.partition, str.split, and re.finditer.
>
> Attached you will find the implementation of stringview I used, along
> with sufficient build scripts to get it working using Python 2.3 and
> Pyrex 0.9.3 .  Aside from replacing int usage with Py_ssize_t for 2.5,
> and *nix users performing a dos2unix call, it should work without change
> with the most recent Python and Pyrex versions.
>
>  - Josiah
>
>
> Using 2.3 :
>     >>> x = stringview(40000*' ')
>     >>> if 1:
>     ...     t = time.time()
>     ...     while x:
>     ...             _1, _2, x = x.partition(' ')
>     ...     print time.time()-t
>     ...
>     0.18700003624
>     >>>
>
> Compared with Python 2.5 beta 2
>     >>> x = 40000*' '
>     >>> if 1:
>     ...     t = time.time()
>     ...     while x:
>     ...             _1, _2, x = x.partition(' ')
>     ...     print time.time()-t
>     ...
>     0.625
>     >>>
>
> But that's about as bad for Python 2.5 as it can get.  What about
> something else?  Like a mail file?  In my 21.5 meg archive of py3k,
> which contains 3456 messages, I wanted to discover all messages.
>
> Python 2.3.5 (#62, Feb  8 2005, 16:23:02) [MSC v.1200 32 bit (Intel)] on win32
> Type "help", "copyright", "credits" or "license" for more information.
> >>> from stringview import *
> >>> rest = stringview(open('mail', 'rb').read())
> >>> import time
> >>> if 1:
> ...     x = []
> ...     t = time.time()
> ...     while rest:
> ...         cur, found, rest = rest.partition('\r\n.\r\n')
> ...         x.append(cur)
> ...     print time.time()-t, len(x)
> ...
> 0.0780000686646 3456
> >>>
>
> What about Python 2.5 using split?  That should be fast...
>
> Python 2.5b2 (r25b2:50512, Jul 11 2006, 10:16:14) [MSC v.1310 32 bit (Intel)] on
>  win32
> Type "help", "copyright", "credits" or "license" for more information.
> >>> rest = open('mail', 'rb').read()
> >>> import time
> >>> if 1:
> ...     t = time.time()
> ...     x = rest.split('\r\n.\r\n')
> ...     print time.time()-t, len(x)
> ...
> 0.109999895096 3457
> >>>
>
> Hrm...what about using re?
> >>> import re
> >>> pat = re.compile('\r\n\.\r\n')
> >>> rest = open('mail', 'rb').read()
> >>> import time
> >>> if 1:
> ...     x = []
> ...     t = time.time()
> ...     for i in pat.finditer(rest):
> ...         x.append(i)
> ...     print time.time()-t, len(x)
> ...
> 0.125 3456
> >>>
>
> Even that's not as good as Python 2.3 + string views.
>
>
>
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From rhettinger at ewtllc.com  Mon Aug 28 22:08:48 2006
From: rhettinger at ewtllc.com (Raymond Hettinger)
Date: Mon, 28 Aug 2006 13:08:48 -0700
Subject: [Python-3000] Set literals
In-Reply-To: <ecva2k$dep$1@sea.gmane.org>
References: <ecug4k$cg8$1@sea.gmane.org>	<ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>
	<ecva2k$dep$1@sea.gmane.org>
Message-ID: <44F34D50.2080805@ewtllc.com>

Georg Brandl wrote:

>In the meantime, I played around with the peepholer and tried to copy
>the "for x in tuple_or_list" optimization for sets. Results are in SF
>patch #1548082.
>
>  
>
Did you mean "if x in tuple_or_list"?   IIRC, there was some reason that 
mutable lists were not supposed to be made into constants in for-loops.





>* list comprehensions are special-cased because of the LIST_APPEND opcode.
>   If there isn't going to be a special-cased SET_ADD, it's probably the
>   easiest thing to transform {x for x in a} into set(x for x in a) in the
>   AST step, with "set" of course always being the builtin set.
>
>  
>

Set comprehensions and list comprehensions are fundamentally the same 
and therefore should have identical implementations. 

While transformation to a generator expression may seem like a good idea 
now, I expect that you'll observe a two-fold performance hit and end up 
abandoning that approach in favor of the current LIST_APPEND approach.

So it would probably be best to start by teaching the compiler to hide 
the loop variable in a LIST_APPEND approach to list comprehensions and 
then duplicate that approach for set comprehensions.
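[In today's Python, where set comprehensions eventually landed with their own SET_ADD opcode, the equivalence that the proposed AST transformation relies on can be checked directly. A small illustration of the semantics being discussed, not of the compiler internals:]

```python
a = [1, 2, 2, 3]

# The transformation under discussion: a set comprehension behaves like
# feeding a generator expression to the builtin set().
comp = {x * x for x in a}
via_genexp = set(x * x for x in a)

# Duplicates collapse either way; the results are identical.
assert comp == via_genexp == {1, 4, 9}
```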


Raymond



From guido at python.org  Mon Aug 28 22:14:17 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 28 Aug 2006 13:14:17 -0700
Subject: [Python-3000] Set literals
In-Reply-To: <ecvhil$7mg$1@sea.gmane.org>
References: <ecug4k$cg8$1@sea.gmane.org>
	<ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>
	<ecva2k$dep$1@sea.gmane.org>
	<ca471dc20608281155q444e70e8v3d076399ba0d919d@mail.gmail.com>
	<ecvhil$7mg$1@sea.gmane.org>
Message-ID: <ca471dc20608281314y530b88d3gacfa6d02102cea45@mail.gmail.com>

On 8/28/06, Georg Brandl <g.brandl at gmx.net> wrote:
> Guido van Rossum wrote:
>
> >> > Georg, can you do something about repr() of an empty set? This
> >> > currently produces "{}" while it should produce "set()".
> >>
> >> Right, forgot about that case. I'll correct that now.
> >> (Grr, I even mindlessly changed the unittest that would have caught it)
> >
> > Checkin?
>
> Done. It now also renders repr(frozenset()) as "frozenset()", which should
> cause no problems though.

Thanks -- looks good!

> >> In the meantime, I played around with the peepholer and tried to copy
> >> the "for x in tuple_or_list" optimization for sets. Results are in SF
> >> patch #1548082.
> >>
> >> >> Set comprehensions are not implemented.
> >> >
> >> > ETA?
> >>
> >> There are some points I'd like to have clarified first:
> >>
> >> * would it be wise to have some general listcomp <-> genexp
> >>    cleanup first? This starts with the grammar, which currently is slightly
> >>    different (see Grammar:79), and it looks like there's quite a lot of
> >>    (almost) duplicated code in ast.c and compile.c too.
> >
> > I expect this cleanup to be quite a bit of work since the semantics are
> > seriously different. ([...] uses the surrounding scope for the loop
> > control variables.)
>
> I didn't say that I wanted to champion that cleanup ;)

That's fine!

> > However you might be able to just cleanup the grammar so they are
> > identical, that would be simpler I suspect.
>
> Looking at the grammar, there's only testlist_safe left to kill, in
> favor of or_test like in generator expressions. The old_ rules are still
> needed.

Hm, it's been so long... Why?

> Hm. Is the precedence in
>
> x = lambda: 1 if 0 else 2
>
> really obvious?

Yes if you think about how you would use it. Conditionally returning a
lambda or something else is kind of rare. A lambda using a condition
is kind of useful. :-)
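[A quick check of the precedence in question, on a modern interpreter: the conditional expression binds inside the lambda body, i.e. it parses as lambda: (1 if 0 else 2).]

```python
# The expression from the thread: the conditional is the lambda's body.
x = lambda: 1 if 0 else 2
assert x() == 2          # 0 is false, so calling the lambda returns 2

# The other reading requires explicit parentheses:
y = (lambda: 1) if 0 else 2
assert y == 2            # here y is the plain integer 2, not a lambda
```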

> >> * list comprehensions are special-cased because of the LIST_APPEND opcode.
> >>    If there isn't going to be a special-cased SET_ADD, it's probably the
> >>    easiest thing to transform {x for x in a} into set(x for x in a) in the
> >>    AST step, with "set" of course always being the builtin set.
> >
> > Right. That might actually become a prototype for how to do the list
> > translation as well.
>
> Would this need a new opcode, or should generators be special-cased by
> BUILD_SET?

Can't remember what BUILD_SET is.

> Which doesn't seem like a good idea because it means that
>      {(x for x in iterable)} == {x for x in iterable}

That should definitely not happen!

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From g.brandl at gmx.net  Mon Aug 28 22:32:53 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Mon, 28 Aug 2006 22:32:53 +0200
Subject: [Python-3000] Set literals
In-Reply-To: <ca471dc20608281314y530b88d3gacfa6d02102cea45@mail.gmail.com>
References: <ecug4k$cg8$1@sea.gmane.org>	<ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>	<ecva2k$dep$1@sea.gmane.org>	<ca471dc20608281155q444e70e8v3d076399ba0d919d@mail.gmail.com>	<ecvhil$7mg$1@sea.gmane.org>
	<ca471dc20608281314y530b88d3gacfa6d02102cea45@mail.gmail.com>
Message-ID: <ecvjtk$fvi$1@sea.gmane.org>

Guido van Rossum wrote:

>> > However you might be able to just cleanup the grammar so they are
>> > identical, that would be simpler I suspect.
>>
>> Looking at the grammar, there's only testlist_safe left to kill, in
>> favor of or_test like in generator expressions. The old_ rules are still
>> needed.
> 
> Hm, it's been so long... Why?

In listcomps/genexps, old_test and old_lambdef do not allow conditional
expressions in order to avoid confusion with the loop's "if".
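[The grammar restriction Georg describes survives in today's Python, where the filter position takes a disjunction rather than a full conditional expression. A small illustration:]

```python
# A conditional expression is legal as the *item* of a comprehension:
items = [x if x % 2 else -x for x in range(5)]
assert items == [0, 1, -2, 3, -4]

# ...but the filter clause after "for ... in ..." deliberately rejects a
# bare conditional expression, avoiding ambiguity with the loop's "if":
#
#     [x for x in range(5) if x % 2 else 0]    # SyntaxError
#
# Wrapping it in parentheses makes it an ordinary expression again:
filtered = [x for x in range(5) if (x if x > 2 else 0)]
assert filtered == [3, 4]
```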

>> Hm. Is the precedence in
>>
>> x = lambda: 1 if 0 else 2
>>
>> really obvious?
> 
> Yes if you think about how you would use it. Conditionally returning a
> lambda or something else is kind of rare. A lambda using a condition
> is kind of useful. :-)

Okay, that makes sense.

>> >> * list comprehensions are special-cased because of the LIST_APPEND opcode.
>> >>    If there isn't going to be a special-cased SET_ADD, it's probably the
>> >>    easiest thing to transform {x for x in a} into set(x for x in a) in the
>> >>    AST step, with "set" of course always being the builtin set.
>> >
>> > Right. That might actually become a prototype for how to do the list
>> > translation as well.
>>
>> Would this need a new opcode, or should generators be special-cased by
>> BUILD_SET?
> 
> Can't remember what BUILD_SET is.

Sorry... it's the newly introduced opcode that creates a new set.

Georg


From g.brandl at gmx.net  Mon Aug 28 22:42:33 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Mon, 28 Aug 2006 22:42:33 +0200
Subject: [Python-3000] Set literals
In-Reply-To: <44F34D50.2080805@ewtllc.com>
References: <ecug4k$cg8$1@sea.gmane.org>	<ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>	<ecva2k$dep$1@sea.gmane.org>
	<44F34D50.2080805@ewtllc.com>
Message-ID: <ecvkfo$hth$1@sea.gmane.org>

Raymond Hettinger wrote:
> Georg Brandl wrote:
> 
>>In the meantime, I played around with the peepholer and tried to copy
>>the "for x in tuple_or_list" optimization for sets. Results are in SF
>>patch #1548082.
>>
> Did you mean "if x in tuple_or_list"?   IIRC, there was some reason that 
> mutable lists were not supposed to be made into constants in for-loops.

Yep, I meant the "if" case.

>>* list comprehensions are special-cased because of the LIST_APPEND opcode.
>>   If there isn't going to be a special-cased SET_ADD, it's probably the
>>   easiest thing to transform {x for x in a} into set(x for x in a) in the
>>   AST step, with "set" of course always being the builtin set.
> 
> Set comprehensions and list comprehensions are fundamentally the same 
> and therefore should have identical implementations. 
> 
> While transformation to a generator expression may seem like a good idea 
> now, I expect that you'll observe a two-fold performance hit and end up 
> abandoning that approach in favor of the current LIST_APPEND approach.

Of course, the LIST_APPEND approach mustn't be thrown out.

> So it would probably be best to start by teaching the compiler to hide 
> the loop variable in a LIST_APPEND approach to list comprehensions and 
> then duplicate that approach for set comprehensions.

Okay, I'll look in that direction. But first I'll try to remove duplication
in ast.c, which should be possible since the syntax of listcomps, genexps
and setcomps will be the same in Py3k.

Georg


From greg.ewing at canterbury.ac.nz  Tue Aug 29 03:10:07 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 29 Aug 2006 13:10:07 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060827214348.1AF4.JCARLSON@uci.edu>
References: <20060827191547.1AEB.JCARLSON@uci.edu>
	<44F255D6.2060002@canterbury.ac.nz>
	<20060827214348.1AF4.JCARLSON@uci.edu>
Message-ID: <44F393EF.6070304@canterbury.ac.nz>

Josiah Carlson wrote:

> If every operation on a view returned a string copy, then what would be
> the point of the view in the first place?

String views would have all the same methods as a real
string, so you could find(), index(), etc. while operating
efficiently on the original data. To my mind this is
preferable to having little-used optional arguments on
an easily-forgotten subset of the string methods: you
only have to remember one thing (how to create a view)
rather than a bunch of random things.
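[The shape Greg describes can be sketched in a few lines of modern Python. StringView here is a hypothetical illustration, not the Pyrex implementation Josiah mentions elsewhere in the thread: the view records (base, start, stop) and delegates searches to the underlying string without copying.]

```python
class StringView:
    """Illustrative pure-Python string view: no data is copied until
    str() is called on the view."""
    __slots__ = ('base', 'start', 'stop')

    def __init__(self, base, start=0, stop=None):
        self.base = base
        self.start = start
        self.stop = len(base) if stop is None else stop

    def __len__(self):
        return self.stop - self.start

    def __str__(self):
        return self.base[self.start:self.stop]   # copies only on demand

    def find(self, sub):
        # str.find's start/stop arguments do the real work, zero-copy.
        i = self.base.find(sub, self.start, self.stop)
        return i if i < 0 else i - self.start

    def partition(self, sep):
        i = self.base.find(sep, self.start, self.stop)
        if i < 0:
            empty = StringView(self.base, self.stop, self.stop)
            return self, empty, empty
        return (StringView(self.base, self.start, i),
                StringView(self.base, i, i + len(sep)),
                StringView(self.base, i + len(sep), self.stop))
```

Every method returns another view over the same base string, so repeated partition() calls (as in Josiah's benchmark) never copy the tail.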

For some things, such as partition(), it might be worth
having a variant that returned views instead of new strings.
But it would be named differently, so you'd still know
whether you were getting a view or not.

On the other hand, this would introduce another random
set of things to remember, i.e. which methods have
view-returning variants. Although maybe it would be
easier to remember them, being different methods rather
than optional arguments to existing methods. Their
existence would show up more clearly under introspection,
for example.

I'm not personally advocating one approach or the other
here -- just pointing out an alternative that might be
more acceptable to the BDFL.

--
Greg

From greg.ewing at canterbury.ac.nz  Tue Aug 29 03:20:58 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 29 Aug 2006 13:20:58 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ecujbm$n90$1@sea.gmane.org>
References: <44F0107B.20205@iinet.net.au> <ecujbm$n90$1@sea.gmane.org>
Message-ID: <44F3967A.7010504@canterbury.ac.nz>

Ron Adam wrote:

> 1. Remove None stored as indices in slice objects. Depending on the step 
> value, Any Nones can be converted to 0 or -1 immediately,

But None isn't the same as -1 in a slice. None means the end
of the sequence, whereas -1 means one less than the end.

I'm also not all that happy about forcing slice indices to
be ints. Traditionally they are, but someone might want to
define a class that uses them in a more general way.

> Once the slice is created the Nones are not needed, valid index values 
> can be determined.

I don't understand what you mean here. Slice objects themselves
know nothing about what object they're going to be used to
slice, so there's no way they can determine "valid index
values" (or even *types* -- see above).

--
Greg

From greg.ewing at canterbury.ac.nz  Tue Aug 29 03:29:38 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 29 Aug 2006 13:29:38 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>
References: <20060827091000.1ADF.JCARLSON@uci.edu>
	<ca471dc20608271417w480c90aeg6b39c766a8f94750@mail.gmail.com>
	<20060827184941.1AE8.JCARLSON@uci.edu>
	<ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>
Message-ID: <44F39882.7090501@canterbury.ac.nz>

Guido van Rossum wrote:

> You seem to be utterly convinced of the superior performance of your
> proposal without having done any measurements.

For my part, superior performance isn't the main
reason for considering string views. Rather it's
the simplification that would result from replacing
the current ad-hoc set of optional start-stop
arguments with a single easy-to-remember idiom.

What are your thoughts on that aspect?

--
Greg

From barry at python.org  Tue Aug 29 04:10:14 2006
From: barry at python.org (Barry Warsaw)
Date: Mon, 28 Aug 2006 22:10:14 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F3967A.7010504@canterbury.ac.nz>
References: <44F0107B.20205@iinet.net.au> <ecujbm$n90$1@sea.gmane.org>
	<44F3967A.7010504@canterbury.ac.nz>
Message-ID: <0B0F9D04-7A68-454C-91F1-E011B862F92A@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Aug 28, 2006, at 9:20 PM, Greg Ewing wrote:

> I'm also not all that happy about forcing slice indices to
> be ints. Traditionally they are, but someone might want to
> define a class that uses them in a more general way.

In fact, we do.  Our application is simulated execution of source  
code, so there are cases where we have multiple values due to  
indeterminate conditionals.  For example, we might know that the  
variable "x" has a value between 1 and 5, and we might know that "z"  
is a string with the value "hello there world".  We want to be able  
to index z with Range(1,5) or slice it with say Range(1,5):Range(3,7).  
Our "z" value is represented by a string-like object that  
presents much of the standard Python string API, so it knows what to  
do with wacky slices with non-integer indices.  We'd definitely want  
to preserve the ability to do that.

- -Barry

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (Darwin)

iQCVAwUBRPOiDnEjvBPtnXfVAQJAWwQAnna3MD7qKDY0SFYyTmN/Dnoy3nBrsP/l
kemAn8Rqdj/3EL/iJuesI8N81BtH6CUp3BR0XzCUpKnsTCcyZxjo9M9d96aF18Jm
A8K/QKfRfRRNUe0FuSOwiizRjw8m1yP9k8GNqkOI5IO2B5qt6R8dvyvmAdigWIsg
tVFftyC+1Dw=
=HZRO
-----END PGP SIGNATURE-----

From guido at python.org  Tue Aug 29 04:24:59 2006
From: guido at python.org (Guido van Rossum)
Date: Mon, 28 Aug 2006 19:24:59 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F39882.7090501@canterbury.ac.nz>
References: <20060827091000.1ADF.JCARLSON@uci.edu>
	<ca471dc20608271417w480c90aeg6b39c766a8f94750@mail.gmail.com>
	<20060827184941.1AE8.JCARLSON@uci.edu>
	<ca471dc20608280942s35ca3c8byca725a16484a7e2c@mail.gmail.com>
	<44F39882.7090501@canterbury.ac.nz>
Message-ID: <ca471dc20608281924k2904dbf2j8183f673fcc5de1e@mail.gmail.com>

On 8/28/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Guido van Rossum wrote:
>
> > You seem to be utterly convinced of the superior performance of your
> > proposal without having done any measurements.
>
> For my part, superior performance isn't the main
> reason for considering string views. Rather it's
> the simplification that would result from replacing
> the current ad-hoc set of optional start-stop
> arguments with a single easy-to-remember idiom.
>
> What are your thoughts on that aspect?

A few days ago I posted a bit of code using start-stop arguments and
the same code written using string views. I didn't think the latter
looked better. The start-stop arguments are far from arbitrary. They
are only ad-hoc in the sense that they haven't been added to every API
-- only where they're needed occasionally for performance.
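[The start-stop arguments Guido refers to, illustrated on a current interpreter: str.find accepts optional bounds, so the search can skip the copying slice entirely.]

```python
s = "x" * 1_000_000 + "needle"

# Slicing first copies roughly a megabyte before searching:
i_copy = s[500_000:].find("needle")

# The optional start argument searches in place, with no copy:
i_inplace = s.find("needle", 500_000)

# Same answer either way, modulo the offset:
assert i_inplace - 500_000 == i_copy == 500_000
```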

I still fear that a meme will develop that will encourage the use of
views in many cases where they aren't needed; newbies are more prone
to premature optimization than experienced developers, for whom this
feature is intended, and newbies will more likely copy sections of
code without understanding when/why various complexifications are
necessary.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Tue Aug 29 07:17:11 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Mon, 28 Aug 2006 22:17:11 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608281307uf5e1995vfa65538e156f0c0a@mail.gmail.com>
References: <20060828120741.1AF7.JCARLSON@uci.edu>
	<ca471dc20608281307uf5e1995vfa65538e156f0c0a@mail.gmail.com>
Message-ID: <20060828132232.1AFD.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> Those are all microbenchmarks. It's easy to prove the superiority of
> an approach that way. But what about realistic applications? What if
> your views don't end up saving memory or time for an application, but
> still cost in terms of added complexity in all string operations?

At no point has anyone claimed that every operation on views will always
be faster than on strings.  Nor has anyone claimed that it will always
reduce memory consumption.  However, for a not insignificant number of
operations, views can be faster, offer better memory use, etc.


I agree with Jean-Paul Calderone:

"If the goal is to avoid speeding up Python programs because views are
too complex or unpythonic or whatever, fine.  But there isn't really any
question as to whether or not this is a real optimization."

"I don't think we see people overusing buffer() in ways which damage
readability now, and buffer is even a builtin.  Tossing something off
into a module somewhere shouldn't really be a problem.  To most people
who don't actually know what they're doing, the idea to optimize code
by reducing memory copying usually just doesn't come up."


While there are examples where views can be slower, this is no different
than the cases where deque is slower than list; sometimes some data
structures are more applicable to the problem than others.  As we have
given users the choice to use a structure that has been optimized for
certain behaviors (set and deque being primary examples), this is just
another structure that offers improved performance for some operations.

> Then I ask you to make it so that string views are 99.999%
> indistinguishable from strings -- they have all the same methods, are
> usable everywhere else, etc.

For reference, I'm about 2 hours into it (including re-reading the
documentation for Pyrex), and I've got [r]partition, [r]find, [r]index,
[r|l]strip. I don't see significant difficulty implementing all other
methods on views.

Astute readers of the original implementation will note that I never
check that the argument being passed in is a string; I use the buffer
interface, so anything offering the buffer interface can be seen as a
read-only view with string methods attached.  Expect a full
implementation later this week.


 - Josiah


From jcarlson at uci.edu  Tue Aug 29 07:31:37 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Mon, 28 Aug 2006 22:31:37 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F393EF.6070304@canterbury.ac.nz>
References: <20060827214348.1AF4.JCARLSON@uci.edu>
	<44F393EF.6070304@canterbury.ac.nz>
Message-ID: <20060828213428.1B00.JCARLSON@uci.edu>


Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Josiah Carlson wrote:
> 
> > If every operation on a view returned a string copy, then what would be
> > the point of the view in the first place?
> 
> String views would have all the same methods as a real
> string, so you could find(), index(), etc. while operating
> efficiently on the original data. To my mind this is
> preferable to having little-used optional arguments on
> an easily-forgotten subset of the string methods: you
> only have to remember one thing (how to create a view)
> rather than a bunch of random things.

Indeed, and all of those are preserved if views always returned views,
strings always returned strings, and one used the standard constructors
for both to convert between them; e.g. str(view) -> str and view(str) ->
view.  Rather than guessing which would be the correct type to return
(during the implementation of views), always return a view when operating
on views; it's a constant-time operation per view returned, and if the
user really wanted a string, they can always call str() on the returned
values.


> For some things, such as partition(), it might be worth
> having a variant that returned views instead of new strings.
> But it would be named differently, so you'd still know
> whether you were getting a view or not.

But wouldn't it be confusing if some methods on views returned views,
while others returned strings?  Wouldn't it make more sense if methods
on an object, generally, returned instances of the same type (when it
made sense)? This seems to be the case with almost every other object
available in the Python standard library, with the notable exceptions of
buffer and mmap.

The slicing operations on mmaps make sense, as only recently did mmaps
gain the ability to map partial files not starting from the beginning,
but I'm not sure how well the operating system would handle overlapping
mmaps in the same process (especially during a larger mmap free; that
could bork the heap address space).

For buffer?  I don't know.  Buffer lacks basically every operation that
I use on a string, so I have had little use for it except as a way of
virtually slicing mmaps (for operations where I don't want to pass an
offset argument) and handling socket writing of large blocks of data
that it doesn't make sense to pre-slice*.


> I'm not personally advocating one approach or the other
> here -- just pointing out an alternative that might be
> more acceptable to the BDFL.

Thank you for the input (and thank you for Pyrex, it's making writing
the view object quite easy),

 - Josiah


* Arguably it never makes sense to pre-slice; connection speeds can vary
so significantly that choosing a slice too small results in poor speeds
and high numbers of system calls, and slices that are too large
result in further slicing.  Buffers or their equivalents win by a large
margin.  One trick is to slice the buffer (turning it into a string)
when over half of the original string has been written.  This results in
using at most 2x the minimum amount of memory necessary, while also
guaranteeing that you will only ever slice as much as the minimum
pre-slicing operation would necessitate.
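[Josiah's half-then-slice trick predates memoryview, which now makes the zero-copy version of this write loop trivial. A hypothetical sketch with a stand-in socket object:]

```python
def send_all(sock, data):
    """Send all of `data` without per-send payload copies: memoryview
    slicing is O(1), narrowing the window instead of building a new
    bytes object (the modern analogue of the buffer() trick here)."""
    view = memoryview(data)
    while view:                     # empty view is falsy (len == 0)
        sent = sock.send(view)
        view = view[sent:]

# Demo with a stand-in socket that accepts at most 3 bytes per call:
class ChunkySock:
    def __init__(self):
        self.received = []
    def send(self, view):
        n = min(3, len(view))
        self.received.append(bytes(view[:n]))
        return n

sock = ChunkySock()
send_all(sock, b"hello world")
```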


From rrr at ronadam.com  Tue Aug 29 07:47:24 2006
From: rrr at ronadam.com (Ron Adam)
Date: Tue, 29 Aug 2006 00:47:24 -0500
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F3967A.7010504@canterbury.ac.nz>
References: <44F0107B.20205@iinet.net.au> <ecujbm$n90$1@sea.gmane.org>
	<44F3967A.7010504@canterbury.ac.nz>
Message-ID: <ed0kit$2dp$1@sea.gmane.org>

Greg Ewing wrote:
> Ron Adam wrote:
> 
>> 1. Remove None stored as indices in slice objects. Depending on the step 
>> value, Any Nones can be converted to 0 or -1 immediately,
> 
> But None isn't the same as -1 in a slice. None means the end
> of the sequence, whereas -1 means one less than the end.

Yes, you are correct, that's one of those things I get caught on when I 
haven't had enough sleep. ;-)

 >>> 'abcdefg'[-1]
'g'

 >>> 'abcdefg'[0:-1]
'abcdef'

And in addition to that... 0 is not the beginning if the step is -1.

 >>> 'abcdefg'[-1:0:-1]
'gfedcb'

So None for the start index can be 0 or -1.  But for the end index it 
can't be determined.

In the first case above, the stop index would need to be one greater 
than -1 which is 0, and that causes a problem.

In the second case above, the stop index would need to be one less than 
0, then that would again cause a problem.
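[Spelled out on a current interpreter (the behavior is unchanged since 2.x): None in the stop position reaches past index 0 in a way no small integer stop can.]

```python
s = 'abcdefg'

assert s[::-1]   == 'gfedcba'   # None stop: runs through index 0
assert s[:0:-1]  == 'gfedcb'    # stop=0 stops *before* index 0
assert s[:-1:-1] == ''          # stop=-1 is index len-1: empty result
```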



> I'm also not all that happy about forcing slice indices to
> be ints. Traditionally they are, but someone might want to
> define a class that uses them in a more general way.

Hmmmm, thanks for pointing this out. It sounds interesting and is 
something I hadn't thought about.  In most cases I've seen only integers 
and None are ever used.  And I'm used to seeing an exception if anything 
else is used.

 >>> 'abc'[1.0]
Traceback (most recent call last):
   File "<stdin>", line 1, in ?
TypeError: string indices must be integers

That is a string method that is generating the exception then and not 
the slice object?

But then what about the slice.indices() method?  It does generate 
exceptions.

 >>> slc = slice(1.0)
 >>> slc.indices(10)
Traceback (most recent call last):
   File "<stdin>", line 1, in ?
TypeError: slice indices must be integers


>> Once the slice is created the Nones are not needed, valid index values 
>> can be determined.
> 
> I don't understand what you mean here. Slice objects themselves
> know nothing about what object they're going to be used to
> slice, so there's no way they can determine "valid index
> values" (or even *types* -- see above).

Ok, I hadn't considered the possibility of methods being defined to read 
the slice object.  Do you know where I could find an example of that?



Cheers,
    Ron





From g.brandl at gmx.net  Tue Aug 29 10:35:43 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Tue, 29 Aug 2006 10:35:43 +0200
Subject: [Python-3000] Set literals
In-Reply-To: <ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>
References: <ecug4k$cg8$1@sea.gmane.org>
	<ca471dc20608280952g33233b37k5a044767e1ed0640@mail.gmail.com>
Message-ID: <ed0u8v$sd5$1@sea.gmane.org>

Guido van Rossum wrote:
> On 8/28/06, Georg Brandl <g.brandl at gmx.net> wrote:
>> At python.org/sf/1547796, there is a preliminary patch for Py3k set literals
>> as specified in PEP 3100.
> 
> Very cool! This is now checked in.
> 
> Georg, can you do something about repr() of an empty set? This
> currently produces "{}" while it should produce "set()".
> 
>> Set comprehensions are not implemented.
> 
> ETA?

See patch #1548388.

Cheers,
Georg


From greg.ewing at canterbury.ac.nz  Tue Aug 29 11:14:18 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 29 Aug 2006 21:14:18 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ed0kit$2dp$1@sea.gmane.org>
References: <44F0107B.20205@iinet.net.au> <ecujbm$n90$1@sea.gmane.org>
	<44F3967A.7010504@canterbury.ac.nz> <ed0kit$2dp$1@sea.gmane.org>
Message-ID: <44F4056A.6000009@canterbury.ac.nz>

Ron Adam wrote:

> And in addition to that... 0 is not the beginning if the step is -1.

Negative steps are downright confusing however you
think about them. :-)

> In most cases I've seen only integers 
> and None are ever used.

Numeric uses various strange things as array indexes, such
as Ellipsis and NewAxis. I don't think it uses them as parts
of slices, but I wouldn't be surprised if they came up with
some such usage one day.

>  >>> 'abc'[1.0]
> Traceback (most recent call last):
>    File "<stdin>", line 1, in ?
> TypeError: string indices must be integers
> 
> That is a string method that is generating the exception then and not 
> the slice object?

Yes, I expect so. From experimenting, it seems you can
pass anything you want to slice():

Python 2.3 (#1, Aug  5 2003, 15:52:30)
[GCC 3.1 20020420 (prerelease)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
 >>> slice(42.3, "banana", {})
slice(42.299999999999997, 'banana', {})

> But then what about the slice.indices() method?  It does generate 
> exceptions.
> 
>  >>> slc = slice(1.0)
>  >>> slc.indices(10)
> Traceback (most recent call last):
>    File "<stdin>", line 1, in ?
> TypeError: slice indices must be integers

That particular method seems to require ints, yes. But
a slice-using object can extract the start, stop and step
and do whatever it wants with them.

> Ok, I hadn't considered the possibility of methods being defined to read 
> the slice object.  Do you know where I could find an example of that?

--
Greg

From rrr at ronadam.com  Tue Aug 29 13:42:16 2006
From: rrr at ronadam.com (Ron Adam)
Date: Tue, 29 Aug 2006 06:42:16 -0500
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F4056A.6000009@canterbury.ac.nz>
References: <44F0107B.20205@iinet.net.au>
	<ecujbm$n90$1@sea.gmane.org>	<44F3967A.7010504@canterbury.ac.nz>
	<ed0kit$2dp$1@sea.gmane.org> <44F4056A.6000009@canterbury.ac.nz>
Message-ID: <ed19cb$273$1@sea.gmane.org>

Greg Ewing wrote:
> Ron Adam wrote:
> 
>> And in addition to that... 0 is not the beginning if the step is -1.
> 
> Negative steps are downright confusing however you
> think about them. :-)

Yes, and it seems to me it could be easier.  Of course that would mean 
changing something, and any solution so far is in some way not perfect, 
depending on how you look at it.

>> In most cases I've seen only integers 
>> and None are ever used.
> 
> Numeric uses various strange things as array indexes, such
> as Ellipsis and NewAxis. I don't think it uses them as parts
> of slices, but I wouldn't be surprised if they came up with
> some such usage one day.
 >
>>  >>> 'abc'[1.0]
>> Traceback (most recent call last):
>>    File "<stdin>", line 1, in ?
>> TypeError: string indices must be integers
>>
>> That is a string method that is generating the exception then and not 
>> the slice object?
> 
> Yes, I expect so. From experimenting, it seems you can
> pass anything you want to slice():

Hmm..., after playing around with it, list and string methods probably 
call the slice's indices() method from within __getitem__.

So it's the slice's indices() method that is producing the exceptions in 
both cases.  If other objects allow something besides integers then they 
are probably accessing the stop, start, and step indices directly and 
are not going through slice.indices() to get at them.

The way it seems to work is approximately ...

    s[i:j:k] -> s.__getitem__(x = slice(i,j,k)) # via the SLICE byte code

    i, j, k = x.indices(len(self))    # by s.__getitem__()

The indices method does a type check and fixes the values depending on 
what the length is.

>> But then what about the slice.indices() method?  It does generate 
>> exceptions.
>>
>>  >>> slc = slice(1.0)
>>  >>> slc.indices(10)
>> Traceback (most recent call last):
>>    File "<stdin>", line 1, in ?
>> TypeError: slice indices must be integers
> 
> That particular method seems to require ints, yes. But
> a slice-using object can extract the start, stop and step
> and do whatever it wants with them.

If you could subclass slice, then it would be possible to replace the 
indices method and turn off the int check and/or put in your own value 
check.  But you wouldn't be able to use the i:j:k syntax.  (Maybe a good 
thing.)

     sequence[myslice_object]    # would work.


You would still need to produce int-like values if you use it with 
builtin types.   But not with any of your own objects if you have 
supplied your own __getitem__ method.
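[A sketch of such an object (hypothetical class name; compare Barry's Range example earlier in the thread): the consumer's __getitem__, not slice() itself, decides which component types are acceptable, so the i:j:k syntax works with non-integer components.]

```python
class Recorder:
    # __getitem__ receives the raw slice object; nothing in the
    # language forces its components to be integers.
    def __getitem__(self, item):
        if isinstance(item, slice):
            return ('slice', item.start, item.stop, item.step)
        return ('index', item)

r = Recorder()
assert r['lo':'hi':2] == ('slice', 'lo', 'hi', 2)
assert r[4] == ('index', 4)
```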

Cheers,
    Ron













From guido at python.org  Tue Aug 29 17:42:18 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 08:42:18 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060828132232.1AFD.JCARLSON@uci.edu>
References: <20060828120741.1AF7.JCARLSON@uci.edu>
	<ca471dc20608281307uf5e1995vfa65538e156f0c0a@mail.gmail.com>
	<20060828132232.1AFD.JCARLSON@uci.edu>
Message-ID: <ca471dc20608290842p4798f63ar4e1fc51bbb43c9a9@mail.gmail.com>

On 8/28/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> "Guido van Rossum" <guido at python.org> wrote:
> > Those are all microbenchmarks. It's easy to prove the superiority of
> > an approach that way. But what about realistic applications? What if
> > your views don't end up saving memory or time for an application, but
> > still cost in terms of added complexity in all string operations?
>
> At no point has anyone claimed that every operation on views will always
> be faster than on strings.  Nor has anyone claimed that it will always
> reduce memory consumption.  However, for a not insignificant number of
> operations, views can be faster, offer better memory use, etc.
>
>
> I agree with Jean-Paul Calderone:
>
> "If the goal is to avoid speeding up Python programs because views are
> too complex or unpythonic or whatever, fine.  But there isn't really any
> question as to whether or not this is a real optimization."

And without qualification that is as false as anything you've said.

> "I don't think we see people overusing buffer() in ways which damage
> readability now, and buffer is even a builtin.  Tossing something off
> into a module somewhere shouldn't really be a problem.  To most people
> who don't actually know what they're doing, the idea to optimize code
> by reducing memory copying usually just doesn't come up."

Another "yes they do -- no they don't" argument. As I've said
repeatedly before, optimizations are likely to be copied without being
understood by newbies. The buffer() built-in has such a poor
reputation and API that it doesn't get much play; but a new "views"
feature that will magically make all your string processing go faster
surely will.

> While there are examples where views can be slower, this is no different
> than the cases where deque is slower than list; sometimes some data
> structures are more applicable to the problem than others.  As we have
> given users the choice to use a structure that has been optimized for
> certain behaviors (set and deque being primary examples), this is just
> another structure that offers improved performance for some operations.

As long as it is very carefully presented as such I have much less of
a problem with it.

Earlier proposals were implying that all string ops should return
views whenever possible. That, I believe, is never going to fly, and
that's where my main objection lies.

Having views in a library module alleviates many of my objections.
While I still worry that it will be overused, deque doesn't seem to be
overused, so perhaps I should relax.

> > Then I ask you to make it so that string views are 99.999%
> > indistinguishable from strings -- they have all the same methods, are
> > usable everywhere else, etc.
>
> For reference, I'm about 2 hours into it (including re-reading the
> documentation for Pyrex), and I've got [r]partition, [r]find, [r]index,
> [r|l]strip. I don't see significant difficulty implementing all other
> methods on views.
>
> Astute readers of the original implementation will note that I never
> check that the argument being passed in is a string; I use the buffer
> interface, so anything offering the buffer interface can be seen as a
> read-only view with string methods attached.  Expect a full
> implementation later this week.

Good luck!

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Tue Aug 29 18:24:29 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 29 Aug 2006 09:24:29 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608290842p4798f63ar4e1fc51bbb43c9a9@mail.gmail.com>
References: <20060828132232.1AFD.JCARLSON@uci.edu>
	<ca471dc20608290842p4798f63ar4e1fc51bbb43c9a9@mail.gmail.com>
Message-ID: <20060829091403.1B09.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> On 8/28/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > While there are examples where views can be slower, this is no different
> > than the cases where deque is slower than list; sometimes some data
> > structures are more applicable to the problem than others.  As we have
> > given users the choice to use a structure that has been optimized for
> > certain behaviors (set and deque being primary examples), this is just
> > another structure that offers improved performance for some operations.
> 
> As long as it is very carefully presented as such I have much less of
> a problem with it.
> 
> Earlier proposals were implying that all string ops should return
> views whenever possible. That, I believe, is never going to fly, and
> that's where my main objection lies.

String operations always returning views would be arguably insane.  I
hope no one was recommending it (I certainly wasn't, but if my words
were confusing on that part, I apologize); strings are strings, and
views should only be constructed explicitly.

After you have a view, I'm of the opinion that view operations should
return views, except in the case where you explicitly ask for a string
via str(view).


> Having views in a library module alleviates many of my objections.
> While I still worry that it will be overused, deque doesn't seem to be
> overused, so perhaps I should relax.

While it would be interesting (as a social experiment) for views to be
in the __builtins__ module (to test abuse theories), it is probably much
better for it to sit in the collections module.


> > > Then I ask you to make it so that string views are 99.999%
> > > indistinguishable from strings -- they have all the same methods, are
> > > usable everywhere else, etc.
> >
> > For reference, I'm about 2 hours into it (including re-reading the
> > documentation for Pyrex), and I've got [r]partition, [r]find, [r]index,
> > [r|l]strip. I don't see significant difficulty implementing all other
> > methods on views.
> >
> > Astute readers of the original implementation will note that I never
> > check that the argument being passed in is a string; I use the buffer
> > interface, so anything offering the buffer interface can be seen as a
> > read-only view with string methods attached.  Expect a full
> > implementation later this week.
> 
> Good luck!

Thank you!
 - Josiah


From fredrik at pythonware.com  Tue Aug 29 18:32:59 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Tue, 29 Aug 2006 18:32:59 +0200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060827184941.1AE8.JCARLSON@uci.edu>
References: <20060827091000.1ADF.JCARLSON@uci.edu>	<ca471dc20608271417w480c90aeg6b39c766a8f94750@mail.gmail.com>
	<20060827184941.1AE8.JCARLSON@uci.edu>
Message-ID: <ed1q7r$v4s$2@sea.gmane.org>

Josiah Carlson wrote:

> 1. Let us say I was parsing XML.  Rather than allocating a bunch of small
> strings for the various tags, attributes, and data, I could instead
> allocate a bunch of string views with pointers into the one larger XML
> string.

when did you last write an XML parser ?

</F>


From jcarlson at uci.edu  Tue Aug 29 19:30:59 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 29 Aug 2006 10:30:59 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ed1q7r$v4s$2@sea.gmane.org>
References: <20060827184941.1AE8.JCARLSON@uci.edu> <ed1q7r$v4s$2@sea.gmane.org>
Message-ID: <20060829102307.1B0F.JCARLSON@uci.edu>


Fredrik Lundh <fredrik at pythonware.com> wrote:
> Josiah Carlson wrote:
> 
> > 1. Let us say I was parsing XML.  Rather than allocating a bunch of small
> > strings for the various tags, attributes, and data, I could instead
> > allocate a bunch of string views with pointers into the one larger XML
> > string.
> 
> when did you last write an XML parser ?

Comparing what I have written as an XML parser to xml.dom, xml.sax,
ElementTree, or others, is a bit like comparing a go-kart with an
automobile.  That is to say, it's been a few years; it was to scratch
an itch for a particular application, and no other XML parser that I
knew of existed at the time for it.

Presumably by your question, you think that the particular example I've
offered is bollocks.  Sounds reasonable, I withdraw it.

 - Josiah


From guido at python.org  Tue Aug 29 19:31:49 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 10:31:49 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060829091403.1B09.JCARLSON@uci.edu>
References: <20060828132232.1AFD.JCARLSON@uci.edu>
	<ca471dc20608290842p4798f63ar4e1fc51bbb43c9a9@mail.gmail.com>
	<20060829091403.1B09.JCARLSON@uci.edu>
Message-ID: <ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>

On 8/29/06, Josiah Carlson <jcarlson at uci.edu> wrote:
>
> "Guido van Rossum" <guido at python.org> wrote:
> > On 8/28/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > > While there are examples where views can be slower, this is no different
> > > than the cases where deque is slower than list; sometimes some data
> > > structures are more applicable to the problem than others.  As we have
> > > given users the choice to use a structure that has been optimized for
> > > certain behaviors (set and deque being primary examples), this is just
> > > another structure that offers improved performance for some operations.
> >
> > As long as it is very carefully presented as such I have much less of
> > a problem with it.
> >
> > Earlier proposals were implying that all string ops should return
> > views whenever possible. That, I believe, is never going to fly, and
> > that's where my main objection lies.
>
> String operations always returning views would be arguably insane.  I
> hope no one was recommending it (I certainly wasn't, but if my words
> were confusing on that part, I apologize); strings are strings, and
> views should only be constructed explicitly.

I don't know about you, but others have definitely been arguing for
that passionately in the past.

> After you have a view, I'm of the opinion that view operations should
> return views, except in the case where you explicitly ask for a string
> via str(view).

I think it's a mixed bag, and depends on the semantics of the operation.

For operations that are guaranteed to return a substring (like slicing
or partition() -- are there even others?) I think views should return
views (on the original buffer, never views on views).

For operations that may be forced to return a new string (e.g.
concatenation) I think the return value should always be a new string,
even if it could be optimized. So for example if v is a view and s is
a string, v+s should always return a new string, even if s is empty.
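Those two rules are easy to prototype. The StrView below is a hypothetical illustration of the semantics being described here, not Josiah's actual implementation: substring-producing operations return views on the original string (never views on views), while concatenation always returns a plain str:

```python
class StrView:
    """Hypothetical read-only view on a str (illustration only)."""

    def __init__(self, s, start=0, stop=None):
        self._s = s                      # the original string, shared
        self._start = start
        self._stop = len(s) if stop is None else stop

    def __len__(self):
        return self._stop - self._start

    def __str__(self):
        # The only place a new string is materialized from the view.
        return self._s[self._start:self._stop]

    def __getitem__(self, item):
        if isinstance(item, slice):
            i, j, step = item.indices(len(self))
            if step != 1:
                raise ValueError("only contiguous slices stay views")
            # A view on the *original* buffer, never a view on a view.
            return StrView(self._s, self._start + i, self._start + j)
        if item < 0:
            item += len(self)
        return self._s[self._start + item]

    def __add__(self, other):
        # Concatenation may not be a substring, so always return str.
        return str(self) + str(other)

v = StrView("hello world")
w = v[6:]                                # a view, zero copies
print(str(w), isinstance(w, StrView))    # world True
print(w + "!")                           # world! -- a plain str
```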

BTW beware that in py3k, strings (which will always be unicode
strings) won't support the buffer API -- bytes objects will. Would you
want views on strings or on bytes or on both?

> > Having views in a library module alleviates many of my objections.
> > While I still worry that it will be overused, deque doesn't seem to be
> > overused, so perhaps I should relax.
>
> While it would be interesting (as a social experiment) for views to be
> in the __builtins__ module (to test abuse theories), it is probably much
> better for it to sit in the collections module.

I'm still very strong on having only a small number of data types
truly built-in; too much choice is much more likely to encourage the
wrong choice, or reduced maintainability.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From fredrik at pythonware.com  Tue Aug 29 19:44:27 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Tue, 29 Aug 2006 19:44:27 +0200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060829102307.1B0F.JCARLSON@uci.edu>
References: <20060827184941.1AE8.JCARLSON@uci.edu> <ed1q7r$v4s$2@sea.gmane.org>
	<20060829102307.1B0F.JCARLSON@uci.edu>
Message-ID: <ed1uds$iog$1@sea.gmane.org>

Josiah Carlson wrote:

>> when did you last write an XML parser ?
> 
> Comparing what I have written as an XML parser to xml.dom, xml.sax,
> ElementTree, or others, is a bit like comparing a go-kart with an
> automobile.  That is to say, it's been a few years, and it was to
> scratch an itch for a particular application, and no other xml parser
> existed at the time for my particular application, that I knew of.
> 
> Presumably by your question, you think that the particular example I've
> offered is bollocks.

not necessarily, but there are lots of issues involved when doing 
high-performance XML stuff, and I'm not sure views would help quite as 
much as one might think.

(writing and tuning cET was a great way to learn that not everything 
that you think you know about C performance applies to C code running 
inside the Python interpreter...)

</F>


From jcarlson at uci.edu  Tue Aug 29 21:04:35 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 29 Aug 2006 12:04:35 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>
References: <20060829091403.1B09.JCARLSON@uci.edu>
	<ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>
Message-ID: <20060829111904.1B12.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> On 8/29/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > String operations always returning views would be arguably insane.  I
> > hope no one was recommending it (I certainly wasn't, but if my words
> > were confusing on that part, I apologize); strings are strings, and
> > views should only be constructed explicitly.
> 
> I don't know about you, but others have definitely been arguing for
> that passionately in the past.
> 
> > After you have a view, I'm of the opinion that view operations should
> > return views, except in the case where you explicitly ask for a string
> > via str(view).
> 
> I think it's a mixed bag, and depends on the semantics of the operation.
> 
> For operations that are guaranteed to return a substring (like slicing
> or partition() -- are there even others?) I think views should return
> views (on the original buffer, never views on views).

I agree.

> For operations that may be forced to return a new string (e.g.
> concatenation) I think the return value should always be a new string,
> even if it could be optimized. So for example if v is a view and s is
> a string, v+s should always return a new string, even if s is empty.

I'm on the fence about this.  On the one hand, I understand the
desirability of being able to get the underlying string object without
difficulty.  On the other hand, its performance characteristics could be
confusing to users of Python who may have come to expect that "st+''" is
a constant time operation, regardless of the length of st.

In the non-null string addition case, I agree that it could make some sense
to return the string (considering you will need to copy it anyway), but
if one returned a view on that string, it would be more consistent with
other methods, and getting the string back via str(view) would offer
equivalent functionality.  It would also require the user to be explicit
about what they really want; though there is the argument that if I'm
passing a string as an operand to addition with a view, I actually want
a string, so give me one.

I'm going to implement it as returning a view, but leave commented
sections for some of them to return a string.


> BTW beware that in py3k, strings (which will always be unicode
> strings) won't support the buffer API -- bytes objects will. Would you
> want views on strings or on bytes or on both?

That's tricky.  Views on bytes will come for free, like array, mmap, and
anything else that supports the buffer protocol. It requires the removal
of the __hash__ method for mutables, but that is certainly expected.
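For what it's worth, this "views for free via the buffer protocol" idea, including the loss of hashability for views over mutable buffers, is essentially the role memoryview ended up filling in modern CPython. A sketch:

```python
# memoryview: a zero-copy view over any object supporting the buffer
# protocol; slices of a view share storage with the original buffer.
data = bytearray(b"abcdef")
mv = memoryview(data)
sub = mv[2:4]                 # no copy is made here

data[2] = ord("X")            # mutate the underlying buffer...
print(bytes(sub))             # b'Xd'  ...and the view sees it

# A view over a mutable buffer cannot be hashed:
try:
    hash(mv)
except ValueError as exc:
    print("unhashable:", exc)
```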

Right now, a large portion of standard library code use strings and
string methods to handle parsing, etc.  Removing immutable byte strings
from 3.x seems likely to result in a huge amount of rewriting necessary
to utilize either bytes or text (something I have mentioned before).  I
believe that with views on bytes (and/or sufficient bytes methods), the
vast majority would likely result in the use of bytes.
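A sketch of the kind of byte-level protocol handling meant here, assuming bytes carry str-style methods such as partition() and strip() (which is in fact how today's Python 3 turned out):

```python
# Parsing a protocol header entirely in the bytes domain, without
# decoding to text first (assumes str-style methods on bytes).
header = b"Content-Length: 42\r\n"
name, sep, value = header.partition(b":")

print(name)                   # b'Content-Length'
print(int(value.strip()))     # 42
```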

Having a text view for such situations that works with the same kinds of
semantics as the bytes view would be nice from a purity/convenience
standpoint, and only needing to handle a single data type (text) could
make its implementation easier.  I don't have any short-term plans of
writing text views, but it may be somewhat easier to do after I'm done
with string/byte views.

 - Josiah


From tomerfiliba at gmail.com  Tue Aug 29 21:43:57 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Tue, 29 Aug 2006 21:43:57 +0200
Subject: [Python-3000] regex literals?
Message-ID: <1d85506f0608291243g2cdfd6f6reb0eb82a5c73fab@mail.gmail.com>

i can't say i'm too fond of this, but i thought of bringing this up.
most scripting languages (perl, ruby, and boo, to name some) have
regular expressions as language literals. since such languages are
heavily used for string manipulation, it might seem like a good idea
to add them at the syntax level:

e"[A-Za-z_][A-Za-z_0-9]*"

i thought of prefixing "e" for "regular *e*xpression". could also be
"p" for pattern. it's very simple -- regex literal strings are just
passed to re.compile() upon creation, i.e.:
a = e"[A-Z]"

is the same as
a = re.compile("[A-Z]")

what is it good for?

if e"[A-Z]".match("Q"):
    print "success"

since strings (as well as regex strings) are immutable, the compiler can
re.compile them at compile time, as an optimization.

again, i can't say i like regex literals, and i don't think it would be
a productivity boost (although you would no longer need to import re
and re.compile() your patterns)... but i wanted to bring it to your
consideration.
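For reference, a sketch of what the hypothetical e"..." literal would buy, next to what re already provides: explicit precompilation via re.compile(), plus an internal cache that makes the module-level functions cheap on repeated use:

```python
import re

IDENT = re.compile(r"[A-Za-z_][A-Za-z_0-9]*")   # explicit precompilation

if IDENT.match("Q"):
    print("success")

# Equivalent one-shot form; re.match() consults an internal pattern
# cache, so repeated calls do not recompile the pattern each time.
if re.match(r"[A-Za-z_][A-Za-z_0-9]*", "Q"):
    print("success")
```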


-tomer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060829/f6a1c658/attachment.htm 

From guido at python.org  Tue Aug 29 21:46:09 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 12:46:09 -0700
Subject: [Python-3000] regex literals?
In-Reply-To: <1d85506f0608291243g2cdfd6f6reb0eb82a5c73fab@mail.gmail.com>
References: <1d85506f0608291243g2cdfd6f6reb0eb82a5c73fab@mail.gmail.com>
Message-ID: <ca471dc20608291246j307f31fak2db5da7781020962@mail.gmail.com>

Do I even have to say -1?

Regular expressions shouldn't become the front and central of Python's
text processing tools.

--Guido

On 8/29/06, tomer filiba <tomerfiliba at gmail.com> wrote:
> i can't say i'm too fond of this, but i thought of bringing this up. most
> scripting
> languages (perl, ruby, and boo, to name some) have regular expressions as
> language literals. since such languages are heavily used for string
> manipulation, it might seem like a good idea to add them at the syntax
> level:
>
> e"[A-Za-z_][A-Za-z_0-9]*"
>
> i thought of prefixing "e" for "regular *e*xpression". could also be "p" for
> pattern.
> it's very simple -- regex literal strings are just passed to re.compile(),
> upon
> creation, i.e.:
> a = e"[A-Z]"
>
> is the same as
> a = re.compile("[A-Z]")
>
> what is it good for?
>
>  if e"[A-Z]".match("Q"):
>     print "success"
>
> since strings (as well as regex strings) are immutable, the compiler can
> re.compile them at compile time, as an optimization.
>
> again, i can't say i like regex literals, and i don't think it would be a
> productivity boost (although you would no longer need to import re and
> re.compile() your patterns)... but i wanted to bring it to your
> consideration.
>
>
> -tomer
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe:
> http://mail.python.org/mailman/options/python-3000/guido%40python.org
>
>
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Tue Aug 29 21:55:21 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 12:55:21 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060829111904.1B12.JCARLSON@uci.edu>
References: <20060829091403.1B09.JCARLSON@uci.edu>
	<ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>
	<20060829111904.1B12.JCARLSON@uci.edu>
Message-ID: <ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>

On 8/29/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> "Guido van Rossum" <guido at python.org> wrote:
> > For operations that may be forced to return a new string (e.g.
> > concatenation) I think the return value should always be a new string,
> > even if it could be optimized. So for example if v is a view and s is
> > a string, v+s should always return a new string, even if s is empty.
>
> I'm on the fence about this.  On the one hand, I understand the
> desireability of being able to get the underlying string object without
> difficulty.  On the other hand, its performance characteristics could be
> confusing to users of Python who may have come to expect that "st+''" is
> a constant time operation, regardless of the length of st.

Well views aren't strings. And s+t (for s and t strings) normally
takes O(len(s)+len(t)) time.

The type consistency and predictability is more important to me.

I didn't mean to recommend v+"" as the best way to turn a view v into
a string; that would be str(v).

> The non-null string addition case, I agree that it could make some sense
> to return the string (considering you will need to copy it anyways), but
> if one returned a view on that string, it would be more consistant with
> other methods, and getting the string back via str(view) would offer
> equivalent functionality.  It would also require the user to be explicit
> about what they really want; though there is the argument that if I'm
> passing a string as an operand to addition with a view, I actually want
> a string, so give me one.

I strongly believe you're mistaken here. I don't think users will have
any trouble with the concept "operations that don't (necessarily)
return a substring will return a new string".

> I'm going to implement it as returning a view, but leave commented
> sections for some of them to return a string.
>
> > BTW beware that in py3k, strings (which will always be unicode
> > strings) won't support the buffer API -- bytes objects will. Would you
> > want views on strings or on bytes or on both?
>
> That's tricky.  Views on bytes will come for free, like array, mmap, and
> anything else that supports the buffer protocol. It requires the removal
> of the __hash__ method for mutables, but that is certainly expected.

The question is, how useful is the buffer protocol going to be? We
don't know yet.

> Right now, a large portion of standard library code use strings and
> string methods to handle parsing, etc.  Removing immutable byte strings
> from 3.x seems likely to result in a huge amount of rewriting necessary
> to utilize either bytes or text (something I have mentioned before).  I
> believe that with views on bytes (and/or sufficient bytes methods), the
> vast majority would likely result in the use of bytes.

Um, unless you consider decoding a GIF file "parsing", parsing would
seem to naturally fall in the realm of text (characters), not bytes.

> Having a text view for such situations that works with the same kinds of
> semantics as the bytes view would be nice from a purity/convenience
> standpoint, and only needing to handle a single data type (text) could
> make its implementation easier.  I don't have any short-term plans of
> writing text views, but it may be somewhat easier to do after I'm done
> with string/byte views.

Unifying the semantics between byte views and text views will be
difficult since bytes are mutable.

I recommend that you have a good look at the bytes implementation in
the p3yk branch.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jimjjewett at gmail.com  Tue Aug 29 23:01:08 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Tue, 29 Aug 2006 17:01:08 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
References: <20060829091403.1B09.JCARLSON@uci.edu>
	<ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>
	<20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
Message-ID: <fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>

On 8/29/06, Guido van Rossum <guido at python.org> wrote:
> On 8/29/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > "Guido van Rossum" <guido at python.org> wrote:

> The type consistency and predictability is more important to me.

Why is it essential that string views be a different type, rather than
an internal implementation detail, like long vs int?  Today's strings
can already return a new object or an existing one which happens to be
equal.

Is this just a matter of efficiency, or are you making a fundamental
distinction?
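The observation above can be seen directly in CPython, where a full slice of a str hands back the existing object (an implementation detail, not a language guarantee):

```python
s = "hello world"
t = s[:]            # CPython returns the very same object here
print(t is s)       # True (a CPython implementation detail)

u = s[0:5]          # a genuinely new string object
print(u, u is s)    # hello False
```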

-jJ

From jcarlson at uci.edu  Tue Aug 29 23:27:19 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 29 Aug 2006 14:27:19 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
References: <20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
Message-ID: <20060829132924.1B15.JCARLSON@uci.edu>


"Guido van Rossum" <guido at python.org> wrote:
> On 8/29/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > "Guido van Rossum" <guido at python.org> wrote:
> > > For operations that may be forced to return a new string (e.g.
> > > concatenation) I think the return value should always be a new string,
> > > even if it could be optimized. So for example if v is a view and s is
> > > a string, v+s should always return a new string, even if s is empty.
> >
> > I'm on the fence about this.  On the one hand, I understand the
> > desirability of being able to get the underlying string object without
> > difficulty.  On the other hand, its performance characteristics could be
> > confusing to users of Python who may have come to expect that "st+''" is
> > a constant time operation, regardless of the length of st.
> 
> Well views aren't strings. And s+t (for s and t strings) normally
> takes O(len(s)+len(t)) time.

Right, but my hope is for users who want to use views to start using
them without being surprised by what they get back.  You have
previously stated that changing return types based on a flag variable is
a horrible idea.  I agree, as providing a flag variable to change return
types is surprising.  This is changing return types based on variable
type, which could be argued as an implicit flag variable, and perhaps
subject to the same surprising behavior == bad criteria that has stopped
other such suggestions in the past.


> The type consistency and predictability is more important to me.

Is view + <anything that supports the buffer protocol> -> view not
consistent or predictable?


> > The non-null string addition case, I agree that it could make some sense
> > to return the string (considering you will need to copy it anyways), but
> > if one returned a view on that string, it would be more consistent with
> > other methods, and getting the string back via str(view) would offer
> > equivalent functionality.  It would also require the user to be explicit
> > about what they really want; though there is the argument that if I'm
> > passing a string as an operand to addition with a view, I actually want
> > a string, so give me one.
> 
> I strongly believe you're mistaken here. I don't think users will have
> any trouble with the concept "operations that don't (necessarily)
> return a substring will return a new string".

I could certainly be, but offering both isn't difficult.


> > I'm going to implement it as returning a view, but leave commented
> > sections for some of them to return a string.
> >
> > > BTW beware that in py3k, strings (which will always be unicode
> > > strings) won't support the buffer API -- bytes objects will. Would you
> > > want views on strings or on bytes or on both?
> >
> > That's tricky.  Views on bytes will come for free, like array, mmap, and
> > anything else that supports the buffer protocol. It requires the removal
> > of the __hash__ method for mutables, but that is certainly expected.
> 
> The question is, how useful is the buffer protocol going to be? We
> don't know yet.

Pretty useful, apparently: bytes supports decoding to unicode through
its own buffer interface; or really, it uses the decode machinery that
takes a char* and a length.

On the other hand, CharBuffer (as opposed to ReadBuffer and
WriteBuffer[1]) isn't really usable, as the reader has no idea about the
*size* and *type* of the characters it is getting back (8, 16, or 32 bit
integers or characters, even 16, 32, or 64 bit floats, etc.). Maybe
fixing CharBuffer, or creating a different interface (deprecating
CharBuffer) would make sense, and would offer the numarray folks their
'array interface'.


> > Right now, a large portion of standard library code use strings and
> > string methods to handle parsing, etc.  Removing immutable byte strings
> > from 3.x seems likely to result in a huge amount of rewriting necessary
> > to utilize either bytes or text (something I have mentioned before).  I
> > believe that with views on bytes (and/or sufficient bytes methods), the
> > vast majority would likely result in the use of bytes.
> 
> Um, unless you consider decoding a GIF file "parsing", parsing would
> seem to naturally fall in the realm of text (characters), not bytes.

I'm using my own definition of parsing again, I apologize.  What I meant
by parsing is anything that currently performs processing of Python 2.x
strings to determine what it is supposed to do: HTTP header processing
(sending and receiving), email processing, socket protocols in smtplib,
poplib, asynchat, etc.  All of these currently use Python 2.x strings.
They will need to be transitioned to 3.x if 2.x byte strings are removed,
and that transition will be quite a bit of work, regardless of whether
bytes get some string methods, or we wrap bytes to provide string
methods, but significantly more if neither is done.


> > Having a text view for such situations that works with the same kinds of
> > semantics as the bytes view would be nice from a purity/convenience
> > standpoint, and only needing to handle a single data type (text) could
> > make its implementation easier.  I don't have any short-term plans of
> > writing text views, but it may be somewhat easier to do after I'm done
> > with string/byte views.
> 
> Unifying the semantics between byte views and text views will be
> difficult since bytes are mutable.

The only significant nit is that the location of the underlying buffer
pointer changes with byte views.  This is already handled in a generally
satisfactory way in 2.x buffers.


> I recommend that you have a good look at the bytes implementation in
> the p3yk branch.

It is implemented the way I would have expected.

 - Josiah

[1] http://www.python.org/doc/current/api/abstract-buffer.html


From barry at python.org  Tue Aug 29 23:37:16 2006
From: barry at python.org (Barry Warsaw)
Date: Tue, 29 Aug 2006 17:37:16 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
References: <20060829091403.1B09.JCARLSON@uci.edu>
	<ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>
	<20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
Message-ID: <F7FFF385-231A-46CB-B431-3BE7592C8600@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Aug 29, 2006, at 5:01 PM, Jim Jewett wrote:

> Why is it essential that string views be a different type, rather than
> an internal implementation detail, like long vs int?  Today's strings
> can already return a new object or an existing one which happens to be
> equal.
>
> Is this just a matter of efficiency, or are you making a fundamental
> distinction?

This is a good question.  I haven't been following this thread in  
detail, but ISTM that users shouldn't care and that the object itself  
should do whatever makes the most sense for the most general  
audience.  I'm eager to never have to worry about 8-bit strings vs.  
unicode strings, how they mix and match, and all the nasty corners  
when they interact.  I'd hate to trade that for the worry about  
whether I have a string or a string-view.

- -Barry

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (Darwin)

iQCVAwUBRPSzjXEjvBPtnXfVAQJ3WAQAuLgT0yOfIo7gNcg7BS0hvKMb33e9Pbdi
IQdlP0seSt6Q0GXMnCk2DPJdXHAap2co/RnqRXuavqAcJScYBwM626tHppjrgoDV
fcQ6FBn1oshsOSChKIT1tVqiudPiEStWaks6d/xg4yP1EAOEbqEhaGoR3FM7e+Vh
h/d6rtxYaXk=
=XQCo
-----END PGP SIGNATURE-----

From guido at python.org  Tue Aug 29 23:42:43 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 14:42:43 -0700
Subject: [Python-3000] Small Py3k task: fix modulefinder.py
Message-ID: <ca471dc20608291442p3d92790ema7aa35f85d38156a@mail.gmail.com>

Is anyone familiar enough with modulefinder.py to fix its breakage in
Py3k? It chokes in a nasty way (exceeding the recursion limit) on the
relative import syntax. I suspect this is also a problem for 2.5, when
people use that syntax; hence the cross-post. There's no unittest for
modulefinder.py, but I believe py2exe depends on it (and of course
freeze.py, but who uses that still?)

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Tue Aug 29 23:51:17 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 14:51:17 -0700
Subject: [Python-3000] Premature optimization and all that
Message-ID: <ca471dc20608291451n68f85451hd418f2b76a47e25c@mail.gmail.com>

Over lunch with Neal we came upon the topic of optimization and Python 3000.

It is our strong opinion that in this stage of the Py3k project we
should focus on getting the new language spec and implementation
feature-complete, without worrying much about optimizations.

We're doing major feature-level surgery, e.g. int/long unification,
str/unicode unification, range/xrange unification, keys() views, and
many others. Keeping everything working is hard work in and of itself;
having to keep it as fast as it was through all the transformations
just makes it that much harder.

If Python 3.0 alpha 1 is twice as slow as 2.5, that's okay with me; we
will have another year to do performance measurements and add new
optimizations in the ramp-up for 3.0 final. Even if 3.0 final is a bit
slower than 2.5 it doesn't bother me too much; we can continue the
tweaks during the 3.1 and 3.2 development cycle.

Note: I'm not advocating wholesale proactive *removal* of
optimizations. However, I'm allowing new features to slow down
performance temporarily while we get all the features in place. I
expect that the optimization possibilities and needs will be different
than for 2.x, since some of the fundamental data types will be so
different.

In particular, I hope that Martin's int/long unification code can land
ASAP; it's much better to have this feature landed in the p3yk branch,
where everyone can bang on it easily, and learn how this affects user
code, even if it makes everything twice as slow. This seems much
preferable over having it languish in a separate branch until it's
perfect.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Tue Aug 29 23:58:41 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Tue, 29 Aug 2006 14:58:41 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <F7FFF385-231A-46CB-B431-3BE7592C8600@python.org>
References: <fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
	<F7FFF385-231A-46CB-B431-3BE7592C8600@python.org>
Message-ID: <20060829145412.1B18.JCARLSON@uci.edu>


Barry Warsaw <barry at python.org> wrote:
> 
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> On Aug 29, 2006, at 5:01 PM, Jim Jewett wrote:
> 
> > Why is it essential that string views be a different type, rather than
> > an internal implementation detail, like long vs int?  Today's strings
> > can already return a new object or an existing one which happens to be
> > equal.
> >
> > Is this just a matter of efficiency, or are you making a fundamental
> > distinction?
> 
> This is a good question.  I haven't been following this thread in  
> detail, but ISTM that users shouldn't care and that the object itself  
> should do whatever makes the most sense for the most general  
> audience.  I'm eager to never have to worry about 8-bit strings vs.  
> unicode strings, how they mix and match, and all the nasty corners  
> when they interact.  I'd hate to trade that for the worry about  
> whether I have a string or a string-view.

If views are not automatically returned for methods on strings, then you
won't have to worry about views unless you explicitly construct them.

Also, you won't ever have a string-view in py3k, it will be a bytes-view,
and if you want to do something like bts.find(sub), bts.index(sub), or
bts.partition(sub), you are going to need the bytes-view, as bytes don't
offer those methods natively.

 - Josiah


From guido at python.org  Wed Aug 30 00:04:09 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 15:04:09 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
References: <20060829091403.1B09.JCARLSON@uci.edu>
	<ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>
	<20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
Message-ID: <ca471dc20608291504n4ce0fcd0q20f90ced72d2fb77@mail.gmail.com>

On 8/29/06, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 8/29/06, Guido van Rossum <guido at python.org> wrote:
> > On 8/29/06, Josiah Carlson <jcarlson at uci.edu> wrote:
> > > "Guido van Rossum" <guido at python.org> wrote:
>
> > The type consistency and predictability is more important to me.
>
> Why is it essential that string views be a different type, rather than
> an internal implementation detail, like long vs int?  Today's strings
> can already return a new object or an existing one which happens to be
> equal.
>
> Is this just a matter of efficiency, or are you making a fundamental
> distinction?

Sigh. Josiah just said he wouldn't dream of proposing that all string
ops should return string views. You're not helping by questioning even
that.

The short answer is, if you don't have control over when a view on an
existing string is returned and when a copy, there are easy to see
worst-case behaviors that are worse than the problem they are trying
to fix. For example, you'd get a whole series of problems like this
one:

res = []
for i in range(1000):
  s = " "*1000000    # a new 1MB string
  res.append(s[:1])  # a one-character string that is a view on s,
                     # and hence keeps s alive

If s[:1] were to return a view on s unconditionally, the above loop
would accumulate roughly 1 GB in wasted space.

To fix this you'll have to add heuristics and all sorts of other
things and that will complicate the string implementation and hence
slow it down.
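[Editor's note: the retention problem is easy to reproduce with a toy view
type. `StrView` below is hypothetical, just enough to show that even a
one-character view pins its base string in memory (sizes checked with
CPython's sys.getsizeof):]

```python
import sys

class StrView:
    """Hypothetical string view: holds a reference to the base string."""
    def __init__(self, base, start, stop):
        self.base, self.start, self.stop = base, start, stop
    def __str__(self):
        # materializing a real string copy is the only way to drop the base
        return self.base[self.start:self.stop]

res = []
for i in range(3):
    s = " " * 1000000             # a new 1MB string each iteration
    res.append(StrView(s, 0, 1))  # a one-character view...
# ...but every view still pins its 1MB base string:
assert all(sys.getsizeof(v.base) > 1000000 for v in res)
```

Appending str(view) instead would copy the one character and let each 1MB
string be collected, which is the explicit-copy behavior argued for above.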

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From greg.ewing at canterbury.ac.nz  Wed Aug 30 02:35:17 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 30 Aug 2006 12:35:17 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608290842p4798f63ar4e1fc51bbb43c9a9@mail.gmail.com>
References: <20060828120741.1AF7.JCARLSON@uci.edu>
	<ca471dc20608281307uf5e1995vfa65538e156f0c0a@mail.gmail.com>
	<20060828132232.1AFD.JCARLSON@uci.edu>
	<ca471dc20608290842p4798f63ar4e1fc51bbb43c9a9@mail.gmail.com>
Message-ID: <44F4DD45.6060809@canterbury.ac.nz>

Guido van Rossum wrote:

> Having views in a library module alleviates many of my objections.
> While I still worry that it will be overused, deque doesn't seem to be
> overused, so perhaps I should relax.

Another thought is that there will already be ways
in which Py3k views could lead to inefficiencies if
they're not used carefully. A keys() view of a dict,
for example, will keep the values of the dict alive
as well as the keys, unlike the existing keys()
method.
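[Editor's note: in CPython, where refcounting frees objects as soon as the
last reference disappears, the effect is directly observable; this sketch
assumes the keys() view holds a reference to the dict itself:]

```python
import weakref

class Value:
    pass  # a stand-in for a large value object

d = {"k": Value()}
ref = weakref.ref(d["k"])
view = d.keys()     # the view references the dict, not just the keys
del d
assert ref() is not None  # value survives: view -> dict -> value
del view
assert ref() is None      # dropping the view releases the whole dict
```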

--
Greg

From guido at python.org  Wed Aug 30 02:59:00 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 17:59:00 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F4DD45.6060809@canterbury.ac.nz>
References: <20060828120741.1AF7.JCARLSON@uci.edu>
	<ca471dc20608281307uf5e1995vfa65538e156f0c0a@mail.gmail.com>
	<20060828132232.1AFD.JCARLSON@uci.edu>
	<ca471dc20608290842p4798f63ar4e1fc51bbb43c9a9@mail.gmail.com>
	<44F4DD45.6060809@canterbury.ac.nz>
Message-ID: <ca471dc20608291759v42de405excac9bc64be87bf8e@mail.gmail.com>

On 8/29/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Guido van Rossum wrote:
>
> > Having views in a library module alleviates many of my objections.
> > While I still worry that it will be overused, deque doesn't seem to be
> > overused, so perhaps I should relax.
>
> Another thought is that there will already be ways
> in which Py3k views could lead to inefficiencies if
> they're not used carefully. A keys() view of a dict,
> for example, will keep the values of the dict alive
> as well as the keys, unlike the existing keys()
> method.

Right; but I don't expect that such a keys() view will typically have
a lifetime longer than the dict. For substrings OTOH that's quite
common (parsing etc.).

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From greg.ewing at canterbury.ac.nz  Wed Aug 30 03:37:06 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 30 Aug 2006 13:37:06 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060829145412.1B18.JCARLSON@uci.edu>
References: <fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
	<F7FFF385-231A-46CB-B431-3BE7592C8600@python.org>
	<20060829145412.1B18.JCARLSON@uci.edu>
Message-ID: <44F4EBC2.8020401@canterbury.ac.nz>

Josiah Carlson wrote:

> If views are not automatically returned for methods on strings, then you
> won't have to worry about views unless you explicitly construct them.

Although you might have to worry about someone else
handing you a view when you weren't expecting it. Minimising
the chance of that is a reason for operations on views
not to return further views by default.

--
Greg

From greg.ewing at canterbury.ac.nz  Wed Aug 30 03:44:57 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 30 Aug 2006 13:44:57 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060829132924.1B15.JCARLSON@uci.edu>
References: <20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<20060829132924.1B15.JCARLSON@uci.edu>
Message-ID: <44F4ED99.2060408@canterbury.ac.nz>

Josiah Carlson wrote:
> This is changing return types based on variable type,

How do you make that out? It seems the opposite to me --
Guido is saying that the return type of s+t should *not*
depend on whether s or t happens to be a view rather than
a real string.

--
Greg

From greg.ewing at canterbury.ac.nz  Wed Aug 30 03:45:26 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 30 Aug 2006 13:45:26 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
References: <20060829091403.1B09.JCARLSON@uci.edu>
	<ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>
	<20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
Message-ID: <44F4EDB6.1000303@canterbury.ac.nz>

Jim Jewett wrote:

> Why is it essential that string views be a different type, rather than
> an internal implementation detail, like long vs int?

We're talking about a more abstract notion of "type" here.
Strings and views are different things with different
performance characteristics, so it's important to know
which one you're getting, whether they're implemented
as different type()s or not.

--
Greg

From greg.ewing at canterbury.ac.nz  Wed Aug 30 03:46:17 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 30 Aug 2006 13:46:17 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060829111904.1B12.JCARLSON@uci.edu>
References: <20060829091403.1B09.JCARLSON@uci.edu>
	<ca471dc20608291031l61efbe6aq50d106f4395ffdaa@mail.gmail.com>
	<20060829111904.1B12.JCARLSON@uci.edu>
Message-ID: <44F4EDE9.1060700@canterbury.ac.nz>

Josiah Carlson wrote:
> On the other hand, its performance characteristics could be
> confusing to users of Python who may have come to expect that "st+''" is
> a constant time operation, regardless of the length of st.

Even if that's always true, I'm not sure it's really a
useful thing to know. How often do you write a string
concatenation expecting that one of the operands will
almost always be empty? I can count the number of times
I've done that on the fingers of one elbow.

--
Greg

From aahz at pythoncraft.com  Wed Aug 30 04:16:25 2006
From: aahz at pythoncraft.com (Aahz)
Date: Tue, 29 Aug 2006 19:16:25 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608291759v42de405excac9bc64be87bf8e@mail.gmail.com>
References: <20060828120741.1AF7.JCARLSON@uci.edu>
	<ca471dc20608281307uf5e1995vfa65538e156f0c0a@mail.gmail.com>
	<20060828132232.1AFD.JCARLSON@uci.edu>
	<ca471dc20608290842p4798f63ar4e1fc51bbb43c9a9@mail.gmail.com>
	<44F4DD45.6060809@canterbury.ac.nz>
	<ca471dc20608291759v42de405excac9bc64be87bf8e@mail.gmail.com>
Message-ID: <20060830021625.GA19157@panix.com>

On Tue, Aug 29, 2006, Guido van Rossum wrote:
> On 8/29/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
>> Guido van Rossum wrote:
>>>
>>> Having views in a library module alleviates many of my objections.
>>> While I still worry that it will be overused, deque doesn't seem to
>>> be overused, so perhaps I should relax.
>>
>> Another thought is that there will already be ways in which Py3k
>> views could lead to inefficiencies if they're not used carefully. A
>> keys() view of a dict, for example, will keep the values of the dict
>> alive as well as the keys, unlike the existing keys() method.
>
> Right; but I don't expect that such a keys() view will typically have
> a lifetime longer than the dict. 

That's true only for newer code that correctly uses sets instead of
dicts -- but we've had this argument before.
-- 
Aahz (aahz at pythoncraft.com)           <*>         http://www.pythoncraft.com/

"Debugging is twice as hard as writing the code in the first place.
Therefore, if you write the code as cleverly as possible, you are, by
definition, not smart enough to debug it."  --Brian W. Kernighan

From guido at python.org  Wed Aug 30 05:16:26 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 20:16:26 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F4EBC2.8020401@canterbury.ac.nz>
References: <fb6fbf560608291401o76e8d3cbub022e7b954cb38a7@mail.gmail.com>
	<F7FFF385-231A-46CB-B431-3BE7592C8600@python.org>
	<20060829145412.1B18.JCARLSON@uci.edu>
	<44F4EBC2.8020401@canterbury.ac.nz>
Message-ID: <ca471dc20608292016u747a2535l31fedfd7567e0e72@mail.gmail.com>

On 8/29/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Josiah Carlson wrote:
>
> > If views are not automatically returned for methods on strings, then you
> > won't have to worry about views unless you explicitly construct them.
>
> Although you might have to worry about someone else
> handing you a view when you weren't expecting it. Minimising
> the chance of that is a reason for operations on views
> not to return further views by default.

In support of Josiah here: I think that's the caller's responsibility then.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Wed Aug 30 05:18:06 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 20:18:06 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F4ED99.2060408@canterbury.ac.nz>
References: <20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<20060829132924.1B15.JCARLSON@uci.edu>
	<44F4ED99.2060408@canterbury.ac.nz>
Message-ID: <ca471dc20608292018s1310eef6k11509048af229be1@mail.gmail.com>

On 8/29/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Josiah Carlson wrote:
> > This is changing return types based on variable type,
>
> How do you make that out? It seems the opposite to me --
> Guido is saying that the return type of s+t should *not*
> depend on whether s or t happens to be a view rather than
> a real string.

No, I never meant to say that. There's nothing wrong with the type of
x+y depending on the types of x and y. I meant that s+v, v+s and v+w
(s being a string, v and w being views) should all return strings
because -- in general -- they cannot always be views, and I don't want
the return type to depend on the *value* of the inputs.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From talin at acm.org  Wed Aug 30 06:00:49 2006
From: talin at acm.org (Talin)
Date: Tue, 29 Aug 2006 21:00:49 -0700
Subject: [Python-3000] Comment on iostack library
Message-ID: <44F50D71.5030402@acm.org>

I've been thinking more about the iostack proposal. Right now, a typical 
file handle consists of 3 "layers" - one representing the backing store 
(file, memory, network, etc.), one for adding buffering, and one 
representing the program-level API for reading strings, bytes, decoded 
text, etc.

I wonder if it wouldn't be better to cut that down to two. Specifically, 
I would like to suggest eliminating the buffering layer.

My reasoning is fairly straightforward: Most file system handles, 
network handles and other operating system handles already support 
buffering, and they do a far better job of it than we can. The handles 
that don't support buffering are memory streams - which don't need 
buffering anyway.

Of course, it would make sense for Python to provide its own buffering 
implementation if we were going to always use the lowest-level i/o API 
provided by the operating system, but I can't see why we would want to 
do that. The OS knows how to allocate an optimal buffer, using 
information such as the block size of the filesystem, whereas trying to 
achieve this same level of functionality in the Python standard library 
would be needlessly complex IMHO.

-- Talin

From guido at python.org  Wed Aug 30 06:24:02 2006
From: guido at python.org (Guido van Rossum)
Date: Tue, 29 Aug 2006 21:24:02 -0700
Subject: [Python-3000] Comment on iostack library
In-Reply-To: <44F50D71.5030402@acm.org>
References: <44F50D71.5030402@acm.org>
Message-ID: <ca471dc20608292124w644bfee5gc2221bcf6304228f@mail.gmail.com>

On 8/29/06, Talin <talin at acm.org> wrote:
> I've been thinking more about the iostack proposal. Right now, a typical
> file handle consists of 3 "layers" - one representing the backing store
> (file, memory, network, etc.), one for adding buffering, and one
> representing the program-level API for reading strings, bytes, decoded
> text, etc.
>
> I wonder if it wouldn't be better to cut that down to two. Specifically,
> I would like to suggest eliminating the buffering layer.
>
> My reasoning is fairly straightforward: Most file system handles,
> network handles and other operating system handles already support
> buffering, and they do a far better job of it than we can. The handles
> that don't support buffering are memory streams - which don't need
> buffering anyway.
>
> Of course, it would make sense for Python to provide its own buffering
> implementation if we were going to always use the lowest-level i/o API
> provided by the operating system, but I can't see why we would want to
> do that. The OS knows how to allocate an optimal buffer, using
> information such as the block size of the filesystem, whereas trying to
> achieve this same level of functionality in the Python standard library
> would be needlessly complex IMHO.

I'm not sure I follow.

We *definitely* don't want to use stdio -- it's not part of the OS
anyway, and has some annoying quirks like not giving you any insight
in how it is using the buffer, nor changing the buffer size on the
fly, and crashing when you switch read and write calls.

So given that, how would you implement readline()? Reading one byte at
a time until you've got the \n is definitely way too slow given the
constant overhead of system calls.

Regarding optimal buffer size, I've never seen a program for which 8K
wasn't optimal. Larger buffers simply don't pay off.
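[Editor's note: a minimal sketch of what the buffering layer buys
readline(): amortize read() calls on the raw object over a block, then scan
the buffer for the newline. Class and method names are illustrative, not a
proposed API:]

```python
import io

class LineReader:
    """Buffered readline over any raw object exposing read(n) -> bytes."""
    def __init__(self, raw, bufsize=8192):
        self.raw, self.bufsize, self.buf = raw, bufsize, b""

    def readline(self):
        while b"\n" not in self.buf:
            chunk = self.raw.read(self.bufsize)  # one call per block
            if not chunk:                        # EOF: flush what is left
                line, self.buf = self.buf, b""
                return line
            self.buf += chunk
        line, nl, self.buf = self.buf.partition(b"\n")
        return line + nl

r = LineReader(io.BytesIO(b"spam\neggs\nham"))
assert [r.readline() for _ in range(4)] == [b"spam\n", b"eggs\n", b"ham", b""]
```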

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From talin at acm.org  Wed Aug 30 07:26:59 2006
From: talin at acm.org (Talin)
Date: Tue, 29 Aug 2006 22:26:59 -0700
Subject: [Python-3000] Comment on iostack library
In-Reply-To: <ca471dc20608292124w644bfee5gc2221bcf6304228f@mail.gmail.com>
References: <44F50D71.5030402@acm.org>
	<ca471dc20608292124w644bfee5gc2221bcf6304228f@mail.gmail.com>
Message-ID: <44F521A3.1040304@acm.org>

Guido van Rossum wrote:
> On 8/29/06, Talin <talin at acm.org> wrote:
>> I've been thinking more about the iostack proposal. Right now, a typical
>> file handle consists of 3 "layers" - one representing the backing store
>> (file, memory, network, etc.), one for adding buffering, and one
>> representing the program-level API for reading strings, bytes, decoded
>> text, etc.
>>
>> I wonder if it wouldn't be better to cut that down to two. Specifically,
>> I would like to suggest eliminating the buffering layer.
>>
>> My reasoning is fairly straightforward: Most file system handles,
>> network handles and other operating system handles already support
>> buffering, and they do a far better job of it than we can. The handles
>> that don't support buffering are memory streams - which don't need
>> buffering anyway.
>>
>> Of course, it would make sense for Python to provide its own buffering
>> implementation if we were going to always use the lowest-level i/o API
>> provided by the operating system, but I can't see why we would want to
>> do that. The OS knows how to allocate an optimal buffer, using
>> information such as the block size of the filesystem, whereas trying to
>> achieve this same level of functionality in the Python standard library
>> would be needlessly complex IMHO.
> 
> I'm not sure I follow.
> 
> We *definitely* don't want to use stdio -- it's not part of the OS
> anyway, and has some annoying quirks like not giving you any insight
> in how it is using the buffer, nor changing the buffer size on the
> fly, and crashing when you switch read and write calls.
> 
> So given that, how would you implement readline()? Reading one byte at
> a time until you've got the \n is definitely way too slow given the
> constant overhead of system calls.
> 
> Regarding optimal buffer size, I've never seen a program for which 8K
> wasn't optimal. Larger buffers simply don't pay off.

Well, as far as readline goes: In order to split the text into lines, 
you have to decode the text first anyway, which is a layer 3 operation. 
You can't just read bytes until you get a \n, because the file you are 
reading might be encoded in UCS2 or something. So for example, in a 
big-endian UCS2 encoding, newline would be encoded as 0x00 0x0A, whereas 
in a little-endian UCS2 encoding, it would be 0x0A 0x00. Merely stopping 
at the 0x0A byte is incorrect: you've only read half the character.
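[Editor's note: the point is easy to check with Python's UTF-16 codec, used
here as a stand-in for UCS2. The 0x0A byte lands at different offsets and
alignments depending on byte order, so a byte-level scan can stop
mid-character:]

```python
text = "ab\ncd"
be = text.encode("utf-16-be")
le = text.encode("utf-16-le")
assert be == b"\x00a\x00b\x00\n\x00c\x00d"
assert le == b"a\x00b\x00\n\x00c\x00d\x00"
# 0x0A sits at an odd offset in big-endian output:
assert be.index(b"\n") == 5   # second byte of the two-byte character
assert le.index(b"\n") == 4   # first byte of the two-byte character
```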

You're correct that reading by line does require a buffer if you want to 
do it efficiently. However, in a world of character encodings, the 
readline buffer has to be implemented at a higher level in the IO stack, 
at the same level which understands text encodings. There may be a 
different set of buffers at the lower level to minimize the number of 
disk i/o operations, but they can't really be the same buffer -- either 
that, or the text encoding layer will need to have fairly incestuous 
knowledge of what's going on at the lower layers so that it can peek 
inside its buffers.

It seems to me that no matter how you slice it, you can't have an 
abstract "buffering" layer that is independent of both the layer beneath 
and the layer above. Both the text decoding layer and the disk i/o layer 
need to have fairly intimate knowledge of their buffers if you want 
maximum efficiency. (I'm not opposed to a custom implementation of 
buffering in the level 1 file object itself, although I suspect in most 
cases you'd be better off using what the OS or its standard libs provide.)

As far as stdio not giving you hints as to how it is using the buffer, I 
am not sure what you mean...what kind of information would a custom 
buffer implementation give you that stdio would not? If its early 
detection of \n is what you are thinking of, I've already shown that 
won't work unless you are assuming an 8-bit encoding.

-- Talin

From ronaldoussoren at mac.com  Wed Aug 30 07:47:16 2006
From: ronaldoussoren at mac.com (Ronald Oussoren)
Date: Wed, 30 Aug 2006 07:47:16 +0200
Subject: [Python-3000] Comment on iostack library
In-Reply-To: <44F521A3.1040304@acm.org>
References: <44F50D71.5030402@acm.org>
	<ca471dc20608292124w644bfee5gc2221bcf6304228f@mail.gmail.com>
	<44F521A3.1040304@acm.org>
Message-ID: <79F38D6C-F609-4B58-9C43-6FF0C2BEECE5@mac.com>


On 30-aug-2006, at 7:26, Talin wrote:

> Guido van Rossum wrote:
>>
>> Regarding optimal buffer size, I've never seen a program for which 8K
>> wasn't optimal. Larger buffers simply don't pay off.

Larger buffers can be useful when doing binary I/O through stdio (at  
least on linux). I've recently had a program that had significant  
speedup when I used a 128K buffer.

>
> Well, as far as readline goes: In order to split the text into lines,
> you have to decode the text first anyway, which is a layer 3  
> operation.


And buffering is a layer 2 operation. Function calls are significantly  
cheaper than system calls. You don't want to do a system call for  
every character read, but might get away with doing a function call  
per character.

Ronald


From fredrik at pythonware.com  Wed Aug 30 10:38:25 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Wed, 30 Aug 2006 10:38:25 +0200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
References: <20060827184941.1AE8.JCARLSON@uci.edu>
	<ed1q7r$v4s$2@sea.gmane.org><20060829102307.1B0F.JCARLSON@uci.edu>
	<ed1uds$iog$1@sea.gmane.org>
Message-ID: <ed3iq2$9iv$1@sea.gmane.org>

Fredrik Lundh wrote:

> not necessarily, but there are lots of issues involved when doing
> high-performance XML stuff, and I'm not sure views would help quite as
> much as one might think.
>
> (writing and tuning cET was a great way to learn that not everything
> that you think you know about C performance applies to C code running
> inside the Python interpreter...)

and also based on the cET (and NFS) experiences, it wouldn't surprise me
if a naive 32-bit text string implementation will, on average, slow things down
*more* than any string view implementation can speed things up again...

(in other words, I'm convinced that we need a polymorphic string type.  I'm not
so sure we need views, but if we have the former, we can use that mechanism to
support the latter)

</F> 




From qrczak at knm.org.pl  Wed Aug 30 11:20:56 2006
From: qrczak at knm.org.pl (Marcin 'Qrczak' Kowalczyk)
Date: Wed, 30 Aug 2006 11:20:56 +0200
Subject: [Python-3000] Comment on iostack library
In-Reply-To: <44F521A3.1040304@acm.org> (talin@acm.org's message of "Tue, 29
	Aug 2006 22:26:59 -0700")
References: <44F50D71.5030402@acm.org>
	<ca471dc20608292124w644bfee5gc2221bcf6304228f@mail.gmail.com>
	<44F521A3.1040304@acm.org>
Message-ID: <87wt8q1sw7.fsf@qrnik.zagroda>

Talin <talin at acm.org> writes:

> It seems to me that no matter how you slice it, you can't have an
> abstract "buffering" layer that is independent of both the layer
> beneath and the layer above.

I think buffering makes sense as the topmost layer, and typically only
there.

Encoding conversion and newline conversion should be performed a block
at a time, below buffering, so not only I/O syscalls, but also
invocations of the recoding machinery are amortized by buffering.

Buffering comes in separate byte and character flavors.

Placing buffering below that makes sense only in cases we want to
decode as little bytes as possible at a time (accepting the slowdown
of encoding one character at a time, but avoiding a syscall per
character). I'm not sure whether this is ever necessary. Finding
the end of HTTP headers can be done before conversion to text.

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/

From ncoghlan at gmail.com  Wed Aug 30 11:46:57 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 30 Aug 2006 19:46:57 +1000
Subject: [Python-3000] Premature optimization and all that
In-Reply-To: <ca471dc20608291451n68f85451hd418f2b76a47e25c@mail.gmail.com>
References: <ca471dc20608291451n68f85451hd418f2b76a47e25c@mail.gmail.com>
Message-ID: <44F55E91.4020000@gmail.com>

Guido van Rossum wrote:
> Over lunch with Neal we came upon the topic of optimization and Python 3000.
> 
> It is our strong opinion that in this stage of the Py3k project we
> should focus on getting the new language spec and implementation
> feature-complete, without worrying much about optimizations.

+1 here - this sounds like an excellent plan to me.

Step 1: Make it work
Step 2: Make it work fast

I've made life difficult for myself a few times by trying to do step 2 without 
doing step 1 first :)

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From ncoghlan at gmail.com  Wed Aug 30 12:06:48 2006
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 30 Aug 2006 20:06:48 +1000
Subject: [Python-3000] Comment on iostack library
In-Reply-To: <44F521A3.1040304@acm.org>
References: <44F50D71.5030402@acm.org>	<ca471dc20608292124w644bfee5gc2221bcf6304228f@mail.gmail.com>
	<44F521A3.1040304@acm.org>
Message-ID: <44F56338.5070802@gmail.com>

Talin wrote:
> It seems to me that no matter how you slice it, you can't have an 
> abstract "buffering" layer that is independent of both the layer beneath 
> and the layer above. Both the text decoding layer and the disk i/o layer 
> need to have fairly intimate knowledge of their buffers if you want 
> maximum efficiency. (I'm not opposed to a custom implementation of 
> buffering in the level 1 file object itself, although I suspect in most 
> cases you'd be better off using what the OS or its standard libs provide.)

You'd insert a buffering layer at the appropriate point for whatever you're 
trying to do. The advantage of pulling the buffering out into a separate layer 
is that it can be reused with different byte sources & sinks by supplying the 
appropriate configuration parameters, instead of having to reimplement it for 
each different source/sink.

Applications generally won't be expected to construct these IO stacks 
manually. File IO stacks, for example, will most likely still be created by a 
call to the open() builtin (although the default mode may change to be binary 
if no text encoding is specified).

Here's a list of the IO stacks I believe will be commonly used:

Unbuffered byte IO stack:
   - byte stream API
   - byte source/sink

Block buffered byte IO stack:
   - byte stream API
   - block buffering layer
   - byte source/sink

Character buffered text IO stack:
   - text stream API
   - text codec layer
   - byte source/sink
(effectively unbuffered for single byte encodings like ASCII)

Block buffered text IO stack:
   - text stream API
   - text codec layer
   - block buffering
   - byte source/sink

Line buffered text IO stack:
   - text stream API
   - line buffering
   - text codec layer
   - block buffering
   - byte source/sink
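The layering above can be sketched as composable wrapper classes. All class and method names below are invented for illustration; they are not the eventual Py3k API, just a minimal sketch of how a buffering layer and a text codec layer could stack on top of a raw byte source:

```python
import codecs
import io

class BlockBuffered:
    """Hypothetical block buffering layer: batches reads from a raw byte source."""
    def __init__(self, raw, bufsize=8192):
        self.raw = raw            # underlying byte source/sink
        self.bufsize = bufsize
        self.buf = b""

    def read(self, n):
        # Refill the buffer with large raw reads instead of many small ones.
        while len(self.buf) < n:
            chunk = self.raw.read(self.bufsize)
            if not chunk:
                break
            self.buf += chunk
        result, self.buf = self.buf[:n], self.buf[n:]
        return result

class TextCodec:
    """Hypothetical text codec layer: decodes bytes read from the layer below."""
    def __init__(self, below, encoding="utf-8"):
        self.below = below
        self.decoder = codecs.getincrementaldecoder(encoding)()
        self.pending = ""         # decoded characters not yet handed out

    def read(self, n):
        while len(self.pending) < n:
            data = self.below.read(64)
            if not data:
                break
            self.pending += self.decoder.decode(data)
        result, self.pending = self.pending[:n], self.pending[n:]
        return result

# Block buffered text IO stack: text codec over block buffering over raw bytes.
stack = TextCodec(BlockBuffered(io.BytesIO("héllo wörld".encode("utf-8"))))
```

The same `TextCodec` could sit directly on the byte source for the unbuffered variants, which is the reuse argument: one buffering implementation, inserted wherever it pays off.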

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia
---------------------------------------------------------------
             http://www.boredomandlaziness.org

From guido at python.org  Wed Aug 30 16:22:11 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Aug 2006 07:22:11 -0700
Subject: [Python-3000] Comment on iostack library
In-Reply-To: <44F521A3.1040304@acm.org>
References: <44F50D71.5030402@acm.org>
	<ca471dc20608292124w644bfee5gc2221bcf6304228f@mail.gmail.com>
	<44F521A3.1040304@acm.org>
Message-ID: <ca471dc20608300722mcae971ct2eb2f64fffca2603@mail.gmail.com>

On 8/29/06, Talin <talin at acm.org> wrote:
> Guido van Rossum wrote:
> > I'm not sure I follow.
> >
> > We *definitely* don't want to use stdio -- it's not part of the OS
> > anyway, and has some annoying quirks like not giving you any insight
> > in how it is using the buffer, nor changing the buffer size on the
> > fly, and crashing when you switch read and write calls.
> >
> > So given that, how would you implement readline()? Reading one byte at
> > a time until you've got the \n is definitely way too slow given the
> > constant overhead of system calls.
> >
> > Regarding optimal buffer size, I've never seen a program for which 8K
> > wasn't optimal. Larger buffers simply don't pay off.
>
> Well, as far as readline goes: In order to split the text into lines,
> you have to decode the text first anyway, which is a layer 3 operation.

OK, I see some of your point. This may explain why in Java the
buffering layer seems to be sitting on top of the encoding/decoding.

Still, for binary file I/O, we'll need a buffering layer on top of the
raw I/O operations. Lots of file formats are read/written in small
chunks but it would be very expensive to turn each small chunk into a
system call.
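The readline() point can be made concrete with a minimal sketch (class name hypothetical): the buffer amortizes raw reads and scans for the newline in memory, so the cost is one large read per buffer-full rather than one system call per byte.

```python
import io

class BufferedReader:
    """Minimal sketch: readline() over a raw byte source, without stdio."""
    def __init__(self, raw, bufsize=8192):
        self.raw = raw
        self.bufsize = bufsize
        self.buf = b""

    def readline(self):
        # Scan the in-memory buffer; refill with large raw reads as needed.
        while True:
            i = self.buf.find(b"\n")
            if i >= 0:
                line, self.buf = self.buf[:i + 1], self.buf[i + 1:]
                return line
            chunk = self.raw.read(self.bufsize)
            if not chunk:          # EOF: return whatever is left over
                line, self.buf = self.buf, b""
                return line
            self.buf += chunk

r = BufferedReader(io.BytesIO(b"one\ntwo\nthree"))
```

Unlike stdio, a layer like this can expose exactly what is sitting in `self.buf`, which addresses the introspection complaint below.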

> As far as stdio not giving you hints as to how it is using the buffer, I
> am not sure what you mean...what kind of information would a custom
> buffer implementation give you that stdio would not?

The specific problem with stdio is that you can't tell if anything is
in the buffer or not. This can make it difficult to do non-blocking
I/O on a socket through stdio (e.g. when using the makefile() option
of Python sockets). Another is that a read after a write is undefined
in the C std and can give segfaults on some platforms, so Python has
to keep track of the "state" of the I/O buffer.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul at prescod.net  Wed Aug 30 17:16:39 2006
From: paul at prescod.net (Paul Prescod)
Date: Wed, 30 Aug 2006 08:16:39 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608292018s1310eef6k11509048af229be1@mail.gmail.com>
References: <20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<20060829132924.1B15.JCARLSON@uci.edu>
	<44F4ED99.2060408@canterbury.ac.nz>
	<ca471dc20608292018s1310eef6k11509048af229be1@mail.gmail.com>
Message-ID: <1cb725390608300816h2400b0f6s9e5a71656d38673e@mail.gmail.com>

I don't understand. If the difference between a string and a string view is
a difference of VALUES, not TYPES, then the return type is varying based
upon the difference of input types (which you say is okay). Conversely, if
the strings and string views only vary in their values (share a type) then
the return code is only varying in its value (which EVERYBODY thinks is
okay).

Or maybe we're dealing with a third (new?) situation in which the
performance characteristics of a return value are being dictated by the
performance characteristics of the inputs rather than being predictable on
the basis of the types or values.

On 8/29/06, Guido van Rossum <guido at python.org> wrote:
>
> On 8/29/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> > Josiah Carlson wrote:
> > > This is changing return types based on variable type,
> >
> > How do you make that out? It seems the opposite to me --
> > Guido is saying that the return type of s+t should *not*
> > depend on whether s or t happens to be a view rather than
> > a real string.
>
> No, I never meant to say that. There's nothing wrong with the type of
> x+y depending on the types of x and y. I meant that s+v, v+s and v+w
> (s being a string, v and w being views) should all return strings
> because -- in general -- they cannot always be views, and I don't want
> the return type to depend on the *value* of the inputs.
>
> --
> --Guido van Rossum (home page: http://www.python.org/~guido/)
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe:
> http://mail.python.org/mailman/options/python-3000/paul%40prescod.net
>


From guido at python.org  Wed Aug 30 17:31:07 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Aug 2006 08:31:07 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <1cb725390608300816h2400b0f6s9e5a71656d38673e@mail.gmail.com>
References: <20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<20060829132924.1B15.JCARLSON@uci.edu>
	<44F4ED99.2060408@canterbury.ac.nz>
	<ca471dc20608292018s1310eef6k11509048af229be1@mail.gmail.com>
	<1cb725390608300816h2400b0f6s9e5a71656d38673e@mail.gmail.com>
Message-ID: <ca471dc20608300831s422b737ei5d35fad380f7f072@mail.gmail.com>

The difference between a string and a view is one of TYPE. (Because
they can have such different performance and memory usage
characteristics, it's not right to treat them as the same type.)

You seem to be misunderstanding what I said. I want the return type
only to depend on the input types. This means that all string and view
concatenations must return strings, not views, because we can always
create a new string, but we cannot always create a new view
representing the concatenation (unless views were to support disjoint
sections, which leads to insanity and the complexity and slowness of
ABC's B-tree string implementation).

Assuming v and w are views: Just like v.lower() must sometimes create
a new string, which implies it must always return a string, v+w must
sometimes create a new string, so it must always return a string.
(It's okay to return an existing string if one with the appropriate
value happens to be lying around nearby; but it's not okay to return
one of the input views, because they're not strings.)
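A toy illustration of the rule (the class below is invented for the example, not a proposed API): a view may share storage with its underlying string, but any operation that can require new storage always returns a real str, regardless of the input values.

```python
class strview:
    """Toy read-only view of a slice of an existing str."""
    def __init__(self, s, start, stop):
        self.s, self.start, self.stop = s, start, stop

    def __str__(self):
        return self.s[self.start:self.stop]

    def __add__(self, other):
        # Concatenation cannot in general be represented as a view of
        # the original storage, so it always returns a real str.
        return str(self) + str(other)

    def lower(self):
        # Likewise: lowering may change characters, so return a str.
        return str(self).lower()

v = strview("hello world", 0, 5)   # view of "hello"
w = strview("hello world", 6, 11)  # view of "world"
```

The return type of `v + w` here depends only on the types of the operands, never on whether the two views happen to be adjacent.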

Hope this clarifies things,

--Guido

On 8/30/06, Paul Prescod <paul at prescod.net> wrote:
> I don't understand. If the difference between a string and a string view is
> a difference of VALUES, not TYPES, then the return type is varying based
> upon the difference of input types (which you say is okay). Conversely, if
> the strings and string views only vary in their values (share a type) then
> the return code is only varying in its value (which EVERYBODY thinks is
> okay).
>
> Or maybe we're dealing with a third (new?) situation in which the
> performance characteristics of a return value is being dictated by the
> performance characteristics of the inputs rather than being predictable on
> the basis of the types or values.
>
>
> On 8/29/06, Guido van Rossum <guido at python.org> wrote:
> >
>  On 8/29/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> > Josiah Carlson wrote:
> > > This is changing return types based on variable type,
> >
>  > How do you make that out? It seems the opposite to me --
> > Guido is saying that the return type of s+t should *not*
> > depend on whether s or t happens to be a view rather than
> > a real string.
>
>  No, I never meant to say that. There's nothing wrong with the type of
> x+y depending on the types of x and y. I meant that s+v, v+s and v+w
> (s being a string, v and w being views) should all return strings
> because -- in general -- they cannot always be views, and I don't want
> the return type to depend on the *value* of the inputs.
>
> --
> --Guido van Rossum (home page: http://www.python.org/~guido/)
> _______________________________________________
>
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
>  Unsubscribe:
> http://mail.python.org/mailman/options/python-3000/paul%40prescod.net
>
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From paul at prescod.net  Wed Aug 30 18:04:47 2006
From: paul at prescod.net (Paul Prescod)
Date: Wed, 30 Aug 2006 09:04:47 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ca471dc20608300831s422b737ei5d35fad380f7f072@mail.gmail.com>
References: <20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<20060829132924.1B15.JCARLSON@uci.edu>
	<44F4ED99.2060408@canterbury.ac.nz>
	<ca471dc20608292018s1310eef6k11509048af229be1@mail.gmail.com>
	<1cb725390608300816h2400b0f6s9e5a71656d38673e@mail.gmail.com>
	<ca471dc20608300831s422b737ei5d35fad380f7f072@mail.gmail.com>
Message-ID: <1cb725390608300904i735df3fcu73d86a1cba83263f@mail.gmail.com>

Yes, thanks for the clarification. From a type theory point of view there is
nothing stopping string + view returning a view always (even if it is a view
of a new string) but that would have very poor performance characteristics.

On 8/30/06, Guido van Rossum <guido at python.org> wrote:
>
> The difference between a string and a view is one of TYPE. (Because
> they can have such different performance and memory usage
> characteristics, it's not right to treat them as the same type.)
>
> You seem to be misunderstanding what I said. I want the return type
> only to depend on the input types. This means that all string and view
> concatenations must return strings, not views, because we can always
> create a new string, but we cannot always create a new view
> representing the concatenation (unless views were to support disjoint
> sections, which leads to insanity and the complexity and slowness of
> ABC's B-tree string implementation).
>
> Assuming v and w are views: Just like v.lower() must sometimes create
> a new string, which implies it must always return a string, v+w must
> sometimes create a new string, so it must always return a string.
> (It's okay to return an existing string if one with the appropriate
> value happens to be lying around nearby; but it's not okay to return
> one of the input views, because they're not strings.)
>
> Hope this clarifies things,
>
> --Guido
>
> On 8/30/06, Paul Prescod <paul at prescod.net> wrote:
> > I don't understand. If the difference between a string and a string view
> is
> > a difference of VALUES, not TYPES, then the return type is varying based
> > upon the difference of input types (which you say is okay). Conversely,
> if
> > the strings and string views only vary in their values (share a type)
> then
> > the return code is only varying in its value (which EVERYBODY thinks is
> > okay).
> >
> > Or maybe we're dealing with a third (new?) situation in which the
> > performance characteristics of a return value is being dictated by the
> > performance characteristics of the inputs rather than being predictable
> on
> > the basis of the types or values.
> >
> >
> > On 8/29/06, Guido van Rossum <guido at python.org> wrote:
> > >
> >  On 8/29/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> > > Josiah Carlson wrote:
> > > > This is changing return types based on variable type,
> > >
> >  > How do you make that out? It seems the opposite to me --
> > > Guido is saying that the return type of s+t should *not*
> > > depend on whether s or t happens to be a view rather than
> > > a real string.
> >
> >  No, I never meant to say that. There's nothing wrong with the type of
> > x+y depending on the types of x and y. I meant that s+v, v+s and v+w
> > (s being a string, v and w being views) should all return strings
> > because -- in general -- they cannot always be views, and I don't want
> > the return type to depend on the *value* of the inputs.
> >
> > --
> > --Guido van Rossum (home page: http://www.python.org/~guido/)
> > _______________________________________________
> >
> > Python-3000 mailing list
> > Python-3000 at python.org
> > http://mail.python.org/mailman/listinfo/python-3000
> >  Unsubscribe:
> > http://mail.python.org/mailman/options/python-3000/paul%40prescod.net
> >
> >
>
>
> --
> --Guido van Rossum (home page: http://www.python.org/~guido/)
>

From guido at python.org  Wed Aug 30 18:54:38 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Aug 2006 09:54:38 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <1cb725390608300904i735df3fcu73d86a1cba83263f@mail.gmail.com>
References: <20060829111904.1B12.JCARLSON@uci.edu>
	<ca471dc20608291255h3972ed51h20df1ae63ca97df@mail.gmail.com>
	<20060829132924.1B15.JCARLSON@uci.edu>
	<44F4ED99.2060408@canterbury.ac.nz>
	<ca471dc20608292018s1310eef6k11509048af229be1@mail.gmail.com>
	<1cb725390608300816h2400b0f6s9e5a71656d38673e@mail.gmail.com>
	<ca471dc20608300831s422b737ei5d35fad380f7f072@mail.gmail.com>
	<1cb725390608300904i735df3fcu73d86a1cba83263f@mail.gmail.com>
Message-ID: <ca471dc20608300954w41ec2b99k92b048e833e4a4c6@mail.gmail.com>

I'd phrase it differently -- that would be plain silly. :-)

On 8/30/06, Paul Prescod <paul at prescod.net> wrote:
> Yes, thanks for the clarification. From a type theory point of view there is
> nothing stopping string + view returning a view always (even if it is a view
> of a new string) but that would have very poor performance characteristics.
>
>
> On 8/30/06, Guido van Rossum <guido at python.org> wrote:
> > The difference between a string and a view is one of TYPE. (Because
> > they can have such different performance and memory usage
> > characteristics, it's not right to treat them as the same type.)
> >
> > You seem to be misunderstanding what I said. I want the return type
> > only to depend on the input types. This means that all string and view
> > concatenations must return strings, not views, because we can always
> > create a new string, but we cannot always create a new view
> > representing the concatenation (unless views were to support disjoint
> > sections, which leads to insanity and the complexity and slowness of
> > ABC's B-tree string implementation).
> >
> > Assuming v and w are views: Just like v.lower() must sometimes create
> > a new string, which implies it must always return a string, v+w must
> > sometimes create a new string, so it must always return a string.
> > (It's okay to return an existing string if one with the appropriate
> > value happens to be lying around nearby; but it's not okay to return
> > one of the input views, because they're not strings.)
> >
> > Hope this clarifies things,
> >
> > --Guido
> >
> > On 8/30/06, Paul Prescod <paul at prescod.net> wrote:
> > > I don't understand. If the difference between a string and a string view
> is
> > > a difference of VALUES, not TYPES, then the return type is varying based
> > > upon the difference of input types (which you say is okay). Conversely,
> if
> > > the strings and string views only vary in their values (share a type)
> then
> > > the return code is only varying in its value (which EVERYBODY thinks is
> > > okay).
> > >
> > > Or maybe we're dealing with a third (new?) situation in which the
> > > performance characteristics of a return value is being dictated by the
> > > performance characteristics of the inputs rather than being predictable
> on
> > > the basis of the types or values.
> > >
> > >
> > > On 8/29/06, Guido van Rossum <guido at python.org > wrote:
> > > >
> > >  On 8/29/06, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> > > > Josiah Carlson wrote:
> > > > > This is changing return types based on variable type,
> > > >
> > >  > How do you make that out? It seems the opposite to me --
> > > > Guido is saying that the return type of s+t should *not*
> > > > depend on whether s or t happens to be a view rather than
> > > > a real string.
> > >
> > >  No, I never meant to say that. There's nothing wrong with the type of
> > > x+y depending on the types of x and y. I meant that s+v, v+s and v+w
> > > (s being a string, v and w being views) should all return strings
> > > because -- in general -- they cannot always be views, and I don't want
> > > the return type to depend on the *value* of the inputs.
> > >
> > > --
> > > --Guido van Rossum (home page: http://www.python.org/~guido/)
> > > _______________________________________________
> > >
> > > Python-3000 mailing list
> > > Python-3000 at python.org
> > > http://mail.python.org/mailman/listinfo/python-3000
> > >  Unsubscribe:
> > >
> http://mail.python.org/mailman/options/python-3000/paul%40prescod.net
> > >
> > >
> >
> >
> > --
> > --Guido van Rossum (home page: http://www.python.org/~guido/)
> >
>
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From jcarlson at uci.edu  Wed Aug 30 20:25:58 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 30 Aug 2006 11:25:58 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <1cb725390608300904i735df3fcu73d86a1cba83263f@mail.gmail.com>
References: <ca471dc20608300831s422b737ei5d35fad380f7f072@mail.gmail.com>
	<1cb725390608300904i735df3fcu73d86a1cba83263f@mail.gmail.com>
Message-ID: <20060830091620.1B30.JCARLSON@uci.edu>


"Paul Prescod" <paul at prescod.net> wrote:
> Yes, thanks for the clarification. From a type theory point of view there is
> nothing stopping string + view returning a view always (even if it is a view
> of a new string) but that would have very poor performance characteristics.

It depends.  Assume single-segment views (that's what I've been
implementing).  If you have two non-adjacent views, or a view+string
(for non-empty strings), etc., you need to take the time to construct
the new string, that's a given.  But once you have a string, you could
return either the string, or you could return a full view of the string.
The performance differences are fairly insignificant (I was not able to
measure any).
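A sketch of the single-segment idea (all names hypothetical): two adjacent views over the same underlying string can be merged without copying, while any other combination forces construction of a new string, which can then be returned either directly or wrapped in a full view.

```python
class View:
    """Toy single-segment view: (string, start, stop)."""
    def __init__(self, s, start, stop):
        self.s, self.start, self.stop = s, start, stop

    def materialize(self):
        return self.s[self.start:self.stop]

def concat(a, b):
    # Adjacent views over the same underlying string: reuse the storage.
    if a.s is b.s and a.stop == b.start:
        return View(a.s, a.start, b.stop)
    # Otherwise a new string must be built; return a full view of it.
    joined = a.materialize() + b.materialize()
    return View(joined, 0, len(joined))

s = "hello world"
left, right = View(s, 0, 5), View(s, 5, 11)
```

Only the adjacent case avoids the copy; non-adjacent views or view+string still pay for constructing the new string, as described above.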

Up until this morning I was planning on writing everything such that
constructive manipulation (upper(), __add__, etc.) returned views of
strings. While I still feel it would be more consistent to always return
views, returning strings does let the user know that "this operation may
take a while" by virtue of returning a string.


 - Josiah


From steven.bethard at gmail.com  Wed Aug 30 23:40:55 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Wed, 30 Aug 2006 15:40:55 -0600
Subject: [Python-3000] have zip() raise exception for sequences of different
	lengths
Message-ID: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>

A couple Python-3000 threads [1] [2] have indicated that the most
natural use of zip() is with sequences of the same lengths.  I feel
the same way, and run into this all the time.  Because the error would
otherwise pass silently, I usually end up adding checks before each
use of zip() to raise an exception if I accidentally pass in sequences
of different lengths.

Any chance that zip() in Python 3000 could automatically raise an
exception if the sequence lengths are different?  If there's really a
need for a zip that just truncates, maybe that could be moved to
itertools?  I think the equal-length scenario is dramatically more
common, and keeping that error from passing silently would be a good
thing IMHO.

[1] http://mail.python.org/pipermail/python-3000/2006-March/000160.html
[2] http://mail.python.org/pipermail/python-3000/2006-August/003094.html

Steve
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From rhettinger at ewtllc.com  Wed Aug 30 23:52:54 2006
From: rhettinger at ewtllc.com (Raymond Hettinger)
Date: Wed, 30 Aug 2006 14:52:54 -0700
Subject: [Python-3000] have zip() raise exception for sequences of
 different lengths
In-Reply-To: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
Message-ID: <44F608B6.5010209@ewtllc.com>

Steven Bethard wrote:

>A couple Python-3000 threads [1] [2] have indicated that the most
>natural use of zip() is with sequences of the same lengths.  I feel
>the same way, and run into this all the time.  Because the error would
>otherwise pass silently, I usually end up adding checks before each
>use of zip() to raise an exception if I accidentally pass in sequences
>of different lengths.
>
>Any chance that zip() in Python 3000 could automatically raise an
>exception if the sequence lengths are different?  If there's really a
>need for a zip that just truncates, maybe that could be moved to
>itertools?  I think the equal-length scenario is dramatically more
>common, and keeping that error from passing silently would be a good
>thing IMHO.
>
>  
>
-1
 I think this would cause much more harm than good and wreck an 
otherwise easy-to-understand tool.


Raymond




From guido at python.org  Wed Aug 30 23:57:48 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Aug 2006 14:57:48 -0700
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <44F608B6.5010209@ewtllc.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
Message-ID: <ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>

> Steven Bethard wrote:
> >A couple Python-3000 threads [1] [2] have indicated that the most
> >natural use of zip() is with sequences of the same lengths.  I feel
> >the same way, and run into this all the time.  Because the error would
> >otherwise pass silently, I usually end up adding checks before each
> >use of zip() to raise an exception if I accidentally pass in sequences
> >of different lengths.
> >
> >Any chance that zip() in Python 3000 could automatically raise an
> >exception if the sequence lengths are different?  If there's really a
> >need for a zip that just truncates, maybe that could be moved to
> >itertools?  I think the equal-length scenario is dramatically more
> >common, and keeping that error from passing silently would be a good
> >thing IMHO.

[Raymond]
> -1
>  I think this would cause much more harm than good and wreck an
> otherwise easy-to-understand tool.

Perhaps a compromise could be to add a keyword parameter to request
such an exception? (We could even add three options: truncate, pad,
error, with truncate being the default, and pad being the old map()
and filter() behavior.)

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From barry at python.org  Thu Aug 31 00:21:34 2006
From: barry at python.org (Barry Warsaw)
Date: Wed, 30 Aug 2006 18:21:34 -0400
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
Message-ID: <305688A8-0CFA-4F80-80EA-E3D2343D7226@python.org>


On Aug 30, 2006, at 5:57 PM, Guido van Rossum wrote:

> Perhaps a compromise could be to add a keyword parameter to request
> such an exception? (We could even add three options: truncate, pad,
> error, with truncate being the default, and pad being the old map()
> and filter() behavior.)

Caveat: I don't even know if /I/ like this, but I'll spit it out  
anyway in case it spurs an actual good idea from someone else. :)

What about a keyword argument called 'filler' which can be an n-sized  
sequence or a callable.  If it's a sequence, then when zip arguments  
are exhausted, you pull values for that item from the appropriate  
element of the sequence.  If it's a callable, you call it with the  
items you have and None's for the exhausted ones.  Whatever filler()  
returns, zip returns.  filler() could then splice in whatever values  
it wants.  Yeah 'None' for the missing ones can be ambiguous but oh  
well.

You raise a ValueError if filler is a sequence of size that doesn't  
match the number of zip arguments or if filler() doesn't return an  
appropriately sized sequence.

yeah-okay-dumb-5-minute-idea-ly y'rs,
- -Barry

P.S. OTOH, zip's current semantics never bothered me much in practice.


From rhettinger at ewtllc.com  Thu Aug 31 00:41:08 2006
From: rhettinger at ewtllc.com (Raymond Hettinger)
Date: Wed, 30 Aug 2006 15:41:08 -0700
Subject: [Python-3000] have zip() raise exception for sequences of
 different lengths
In-Reply-To: <ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>	
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
Message-ID: <44F61404.8010002@ewtllc.com>


>
> Perhaps a compromise could be to add a keyword parameter to request
> such an exception? (We could even add three options: truncate, pad,
> error, with truncate being the default, and pad being the old map()
> and filter() behavior.)


FWIW, I intend to add an itertool called izip_longest() which allows a 
pad value to be specified.  In deciding to accept that feature request, 
I put a great deal of thought and research into the idea.  Along the 
way, I looked at other languages and found both truncating and padding 
versions of zip but did not find any version that raised an exception.  
IMO, such a provision would foul the waters and complicate the use of an 
otherwise simple function.
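For reference, the padding behavior described here corresponds to what itertools.zip_longest does in modern Python (spelled izip_longest in Python 2):

```python
from itertools import zip_longest  # izip_longest in Python 2

# Exhausted inputs are padded with fillvalue instead of truncating.
pairs = list(zip_longest([1, 2, 3], "ab", fillvalue="-"))
```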

Until now, there have been zero requests for zip() to have exception 
raising behavior.

For Python 3k, I recommend:
* simply replacing zip() with itertools.izip()
* keeping the zip_longest() in a separate module
* punting on an exception raising version

The first covers 99% of use cases.
The second covers a handful of situations that are otherwise difficult 
to deal with.
The third is a YAGNI.


Raymond

 



From steven.bethard at gmail.com  Thu Aug 31 01:33:27 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Wed, 30 Aug 2006 17:33:27 -0600
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <44F61404.8010002@ewtllc.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
	<44F61404.8010002@ewtllc.com>
Message-ID: <d11dcfba0608301633i4ef09c89ha4e2b15b7f39dd81@mail.gmail.com>

On 8/30/06, Raymond Hettinger <rhettinger at ewtllc.com> wrote:
> Until now, there have been zero requests for zip() to have exception
> raising behavior.
>
> For Python 3k, I recommend:
> * simply replacing zip() with itertools.izip()
> * keeping the zip_longest() in a separate module
> * punting on an exception raising version
>
> The first covers 99% of use cases.

I guess it depends what you mean by "covers".  If you mean "produces
the correct output for correct input" then yes, it does, but so would
the exception raising one.  I contend that it often does the wrong
thing for incorrect input by silently truncating. To try to give a
fair evaluation of this contention, I looked at some stdlib examples
and tried to classify them:

Examples where different lengths should be an error:

compiler/pycodegen.py:        for i, for_ in
zip(range(len(node.quals)), node.quals):
dis.py:    for byte_incr, line_incr in zip(byte_increments, line_increments):
email/Header.py:        return zip(chunks, [charset]*len(chunks))
filecmp.py:        a = dict(izip(imap(os.path.normcase,
self.left_list), self.left_list))
idlelib/keybindingDialog.py:        for modifier, variable in
zip(self.modifiers, self.modifier_vars):

Examples where truncation is needed:

csv.py:        d = dict(zip(self.fieldnames, row))
idlelib/EditorWindow.py:            for i, file in zip(count(), rf_list):

A couple of the examples (pycodegen.py, EditorWindow.py) are really
just performing a poor-man's enumerate(), but with a cursory glance it
still looks to me like there are more cases in the stdlib where it is
a programming error to have lists of different sizes.

If changing zip()'s behavior to match the most common use case is
totally out, the stdlib code at least argues for adding something like
itertools.izip_exact().

Steve
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From steven.bethard at gmail.com  Thu Aug 31 01:56:32 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Wed, 30 Aug 2006 17:56:32 -0600
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <44F608B6.5010209@ewtllc.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
Message-ID: <d11dcfba0608301656kb599177t548e25e098de3c47@mail.gmail.com>

On 8/30/06, Raymond Hettinger <rhettinger at ewtllc.com> wrote:
> Steven Bethard wrote:
>
> >A couple Python-3000 threads [1] [2] have indicated that the most
> >natural use of zip() is with sequences of the same lengths.  I feel
> >the same way, and run into this all the time.  Because the error would
> >otherwise pass silently, I usually end up adding checks before each
> >use of zip() to raise an exception if I accidentally pass in sequences
> >of different lengths.
> >
> >Any chance that zip() in Python 3000 could automatically raise an
> >exception if the sequence lengths are different?  If there's really a
> >need for a zip that just truncates, maybe that could be moved to
> >itertools?  I think the equal-length scenario is dramatically more
> >common, and keeping that error from passing silently would be a good
> >thing IMHO.
>
> -1
>  I think this would cause much more harm than good and wreck an
> otherwise easy-to-understand tool.

Current documentation:

zip([iterable, ...])
    This function returns a list of tuples, where the i-th tuple
contains the i-th element from each of the argument sequences or
iterables. The returned list is truncated in length to the length of
the shortest argument sequence...

Proposed change:

zip([iterable, ...])
    This function returns a list of tuples, where the i-th tuple
contains the i-th element from each of the argument sequences or
iterables. It is an error if the argument sequences are of different
lengths...

That seems pretty comparable in complexity to me.  Could you explain
how this makes zip() harder to understand?

Steve
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From rhettinger at ewtllc.com  Thu Aug 31 01:58:04 2006
From: rhettinger at ewtllc.com (Raymond Hettinger)
Date: Wed, 30 Aug 2006 16:58:04 -0700
Subject: [Python-3000] have zip() raise exception for sequences
 of	different lengths
In-Reply-To: <d11dcfba0608301633i4ef09c89ha4e2b15b7f39dd81@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>	<44F608B6.5010209@ewtllc.com>	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>	<44F61404.8010002@ewtllc.com>
	<d11dcfba0608301633i4ef09c89ha4e2b15b7f39dd81@mail.gmail.com>
Message-ID: <44F6260C.1040502@ewtllc.com>


>If changing zip()'s behavior to match the most common use case is
>totally out, the stdlib code at least argues for adding something like
>itertools.izip_exact().
>  
>

I'm open to that.

For the time being, let's do this.  Add itertools.izip_longest() in 
Py2.5 and include a recipe for izip_exact() and see if anyone cares 
enough to ever use it.  The new any() and all() functions started out as 
recipes and graduated when their popularity was shown.  If izip_exact() 
proves its worth, then I would be happy to add it as a tool.


Raymond


From greg.ewing at canterbury.ac.nz  Thu Aug 31 01:59:07 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Thu, 31 Aug 2006 11:59:07 +1200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060830091620.1B30.JCARLSON@uci.edu>
References: <ca471dc20608300831s422b737ei5d35fad380f7f072@mail.gmail.com>
	<1cb725390608300904i735df3fcu73d86a1cba83263f@mail.gmail.com>
	<20060830091620.1B30.JCARLSON@uci.edu>
Message-ID: <44F6264B.4000005@canterbury.ac.nz>

Josiah Carlson wrote:

> Up until this morning I was planning on writing everything such that
> constructive manipulation (upper(), __add__, etc.) returned views of
> strings.

I was about to say that this would be completely pointless,
when I realised the point is so that further operations on
these results would return views of them. In Josiah's
views-always-return-views world, that would actually make
sense -- but only if we really wanted such a world.

To my mind, the use of views is to temporarily call out
a part of a string for the purpose of applying some
other operation to it. Views will therefore be
short-lived objects that you won't want to keep and
pass around. I suspect that, if views are the default
result of anything done to a view, one will almost
always be doing a str() on the result to turn it back
into a non-view. If that's the case, then returning
views would be the wrong default.

--
Greg

From rhettinger at ewtllc.com  Thu Aug 31 02:03:17 2006
From: rhettinger at ewtllc.com (Raymond Hettinger)
Date: Wed, 30 Aug 2006 17:03:17 -0700
Subject: [Python-3000] have zip() raise exception for sequences
 of	different lengths
In-Reply-To: <d11dcfba0608301656kb599177t548e25e098de3c47@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>	<44F608B6.5010209@ewtllc.com>
	<d11dcfba0608301656kb599177t548e25e098de3c47@mail.gmail.com>
Message-ID: <44F62745.60006@ewtllc.com>


>Proposed change:
>
>zip(  	[iterable, ...])
>    This function returns a list of tuples, where the i-th tuple
>contains the i-th element from each of the argument sequences or
>iterables. It is an error if the argument sequences are of different
>lengths...
>
>That seems pretty comparable in complexity to me.  Could you explain
>how this makes zip() harder to understand?
>  
>

It's a PITA because it precludes all of the use cases where the inputs 
ARE intentionally of different lengths (like when one argument supplies 
an infinite iterator):

   for lineno, ts, line in zip(count(1), timestamp(), sys.stdin):
       print 'Line %d, Time %s:  %s' % (lineno, ts, line)


Raymond


From greg.ewing at canterbury.ac.nz  Thu Aug 31 02:06:56 2006
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Thu, 31 Aug 2006 12:06:56 +1200
Subject: [Python-3000] have zip() raise exception for sequences of
 different lengths
In-Reply-To: <ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
Message-ID: <44F62820.3090206@canterbury.ac.nz>

Guido van Rossum wrote:

> Perhaps a compromise could be to add a keyword parameter to request
> such an exception?

But who is going to bother using such a keyword,
when it's not necessary for correct operation of
the program in the absence of bugs?

> (We could even add three options: truncate, pad,
> error, with truncate being the default, and pad being the old map()
> and filter() behavior.)

This seems to fall foul of the no-constant-parameters
guideline.

--
Greg

From rrr at ronadam.com  Thu Aug 31 03:26:55 2006
From: rrr at ronadam.com (Ron Adam)
Date: Wed, 30 Aug 2006 20:26:55 -0500
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
Message-ID: <ed5e2k$r7f$1@sea.gmane.org>

Guido van Rossum wrote:

> Perhaps a compromise could be to add a keyword parameter to request
> such an exception? (We could even add three options: truncate, pad,
> error, with truncate being the default, and pad being the old map()
> and filter() behavior.)

Maybe it can be done with just two optional keywords.


If 'match' is True, raise an error if iterables are mismatched.

if a 'pad' is specified then pad, else truncate.

The current truncating behavior would be the default.


    Ron
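
[A minimal sketch of how Ron's two keywords could behave, written as a
wrapper in today's Python.  The function name `zip_flex` and the
`_TRUNCATE` sentinel are invented for illustration; only the `match`
and `pad` semantics come from the proposal above.]

```python
_TRUNCATE = object()  # sentinel meaning "no pad value was given"

def zip_flex(*iterables, match=False, pad=_TRUNCATE):
    # Default: truncate at the shortest input, like the built-in zip().
    # match=True: raise if the inputs have different lengths.
    # pad=<value>: pad shorter inputs out to the longest.
    iterators = [iter(it) for it in iterables]
    while iterators:
        row, exhausted = [], 0
        for it in iterators:
            try:
                row.append(next(it))
            except StopIteration:
                exhausted += 1
                row.append(pad)
        if exhausted == len(iterators):
            return          # all inputs ended together
        if exhausted:
            if match:
                raise ValueError("iterables have different lengths")
            if pad is _TRUNCATE:
                return      # current truncating behaviour
        yield tuple(row)
```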









From jcarlson at uci.edu  Thu Aug 31 04:20:06 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 30 Aug 2006 19:20:06 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F6264B.4000005@canterbury.ac.nz>
References: <20060830091620.1B30.JCARLSON@uci.edu>
	<44F6264B.4000005@canterbury.ac.nz>
Message-ID: <20060830185158.1B3F.JCARLSON@uci.edu>


Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Josiah Carlson wrote:
> 
> > Up until this morning I was planning on writing everything such that
> > constructive manipulation (upper(), __add__, etc.) returned views of
> > strings.
> 
> I was about to say that this would be completely pointless,
> when I realised the point is so that further operations on
> these results would return views of them. In Josiah's
> views-always-return-views world, that would actually make
> sense -- but only if we really wanted such a world.

Code wise, it could easily be a keyword argument on construction.


> To my mind, the use of views is to temporarily call out
> a part of a string for the purpose of applying some
> other operation to it. Views will therefore be
> short-lived objects that you won't want to keep and
> pass around. I suspect that, if views are the default
> result of anything done to a view, one will almost
> always be doing a str() on the result to turn it back
> into a non-view. If that's the case, then returning
> views would be the wrong default.

If views are always returned, then we can perform some optimizations
(adjacent view concatenation, etc.), which may reduce running time,
memory use, etc.  If the user *needs* a string to be returned, they can
always perform str(view).  But remember, since 2.x strings are going
away in 3.x, then it would really be bytes(view).  I've looked through
the methods available to them, and I'm happy that views are gaining
traction, if only so that I can get view(bytes).partition() .

If we always return strings (or bytes in 3.x), then all of those
optimizations are lost.  I'm writing them with optimizations, but they
can certainly be removed later.

Oh, and I've only got about 15 of the 60+ methods left to implement.


 - Josiah


From talin at acm.org  Thu Aug 31 04:35:48 2006
From: talin at acm.org (Talin)
Date: Wed, 30 Aug 2006 19:35:48 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060830185158.1B3F.JCARLSON@uci.edu>
References: <20060830091620.1B30.JCARLSON@uci.edu>	<44F6264B.4000005@canterbury.ac.nz>
	<20060830185158.1B3F.JCARLSON@uci.edu>
Message-ID: <44F64B04.9080200@acm.org>

Josiah Carlson wrote:

> If views are always returned, then we can perform some optimizations
> (adjacent view concatenation, etc.), which may reduce running time,
> memory use, etc.  If the user *needs* a string to be returned, they can
> always perform str(view).  But remember, since 2.x strings are going
> away in 3.x, then it would really be bytes(view).  I've looked through
> the methods available to them, and I'm happy that views are gaining
> traction, if only so that I can get view(bytes).partition() .

I know this was shot down before, but I would still like to see a 
"characters" type - that is, a mutable sequence of wide characters, much 
like the Java StringBuffer class - to go along with "bytes". From my 
perspective, it makes perfect sense to have an "array of character" type 
as well as an "array of byte" type, and since the "array of byte" is 
simply called "bytes", then by extension the "array of character" type 
would be called "characters".

Of course, both the 'array' and 'list' types already give you that, but 
"characters" would have additional string-like methods. (However since 
it is mutable, it would not be capable of producing views.)

The 'characters' data type would be particularly optimized for 
character-at-a-time operations, i.e. building up a string one character 
at a time. An example use would be processing escape sequences in 
strings, where you are transforming the escaped string into its 
non-escaped equivalent.

-- Talin

From guido at python.org  Thu Aug 31 05:01:04 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Aug 2006 20:01:04 -0700
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <ed5e2k$r7f$1@sea.gmane.org>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
	<ed5e2k$r7f$1@sea.gmane.org>
Message-ID: <ca471dc20608302001x6e32c7bal23650fafb8224ebc@mail.gmail.com>

Actually given Raymond's preferences I take it back

On 8/30/06, Ron Adam <rrr at ronadam.com> wrote:
> Guido van Rossum wrote:
>
> > Perhaps a compromise could be to add a keyword parameter to request
> > such an exception? (We could even add three options: truncate, pad,
> > error, with truncate being the default, and pad being the old map()
> > and filter() behavior.)
>
> Maybe it can be done with just two optional keywords.
>
>
> If 'match' is True, raise an error if iterables are mismatched.
>
> if a 'pad' is specified then pad, else truncate.
>
> The current truncating behavior would be the default.
>
>
>     Ron
>
>
>
>
>
>
>
>
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
>


-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From guido at python.org  Thu Aug 31 05:05:26 2006
From: guido at python.org (Guido van Rossum)
Date: Wed, 30 Aug 2006 20:05:26 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F64B04.9080200@acm.org>
References: <20060830091620.1B30.JCARLSON@uci.edu>
	<44F6264B.4000005@canterbury.ac.nz>
	<20060830185158.1B3F.JCARLSON@uci.edu> <44F64B04.9080200@acm.org>
Message-ID: <ca471dc20608302005g68e44a05p8c0b041926590786@mail.gmail.com>

On 8/30/06, Talin <talin at acm.org> wrote:
> I know this was shot down before, but I would still like to see a
> "characters" type - that is, a mutable sequence of wide characters, much
> like the Java StringBuffer class - to go along with "bytes". From my
> perspective, it makes perfect sense to have an "array of character" type
> as well as an "array of byte" type, and since the "array of byte" is
> simply called "bytes", then by extension the "array of character" type
> would be called "characters".
>
> Of course, both the 'array' and 'list' types already give you that, but
> "characters" would have additional string-like methods. (However since
> it is mutable, it would not be capable of producing views.)
>
> The 'characters' data type would be particularly optimized for
> character-at-a-time operations, i.e. building up a string one character
> at a time. An example use would be processing escape sequences in
> strings, where you are transforming the escaped string into its
> non-escaped equivalent.

The array module was always usable for this purpose (even for Unicode
characters) but it doesn't seem to have gotten any traction. So it
sounds like a YAGNI to me.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From steven.bethard at gmail.com  Thu Aug 31 05:32:14 2006
From: steven.bethard at gmail.com (Steven Bethard)
Date: Wed, 30 Aug 2006 21:32:14 -0600
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <44F6260C.1040502@ewtllc.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
	<44F61404.8010002@ewtllc.com>
	<d11dcfba0608301633i4ef09c89ha4e2b15b7f39dd81@mail.gmail.com>
	<44F6260C.1040502@ewtllc.com>
Message-ID: <d11dcfba0608302032q93fa6a1h99cbd52b7e6cdfa0@mail.gmail.com>

On 8/30/06, Raymond Hettinger <rhettinger at ewtllc.com> wrote:
> >If changing zip()'s behavior to match the most common use case is
> >totally out, the stdlib code at least argues for adding something like
> >itertools.izip_exact().
>
> I open to that.
>
> For this time being, let's do this.  Add itertools.izip_longest() in
> Py2.5 and include a recipe for izip_exact() and see if anyone cares
> enough to ever use it.  The new any() and all() functions started out as
> recipes and graduated when their popularity was shown.  If izip_exact()
> proves its worth, then I would be happy to add it as a tool.

Fair enough.  Michael Chermside provided a recipe here:

http://mail.python.org/pipermail/python-3000/2006-March/000160.html

Maybe there's a cleaner way to write this, but I couldn't spot one off-hand.
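
[For readers without the archive handy, one possible shape of such a
recipe — a sketch only, not necessarily Chermside's version:]

```python
def izip_exact(*iterables):
    # Like zip(), but raise ValueError if the inputs do not all
    # run out on the same iteration.
    sentinel = object()
    iterators = [iter(it) for it in iterables]
    while iterators:
        row = [next(it, sentinel) for it in iterators]
        if all(item is sentinel for item in row):
            return                      # all exhausted together
        if any(item is sentinel for item in row):
            raise ValueError("iterables have different lengths")
        yield tuple(row)
```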

STeVe
-- 
I'm not *in*-sane. Indeed, I am so far *out* of sane that you appear a
tiny blip on the distant coast of sanity.
        --- Bucky Katt, Get Fuzzy

From jcarlson at uci.edu  Thu Aug 31 05:41:24 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 30 Aug 2006 20:41:24 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F64B04.9080200@acm.org>
References: <20060830185158.1B3F.JCARLSON@uci.edu> <44F64B04.9080200@acm.org>
Message-ID: <20060830203044.1B42.JCARLSON@uci.edu>


Talin <talin at acm.org> wrote:
> I know this was shot down before, but I would still like to see a 
> "characters" type - that is, a mutable sequence of wide characters, much 
> like the Java StringBuffer class - to go along with "bytes". From my 
> perspective, it makes perfect sense to have an "array of character" type 
> as well as an "array of byte" type, and since the "array of byte" is 
> simply called "bytes", then by extension the "array of character" type 
> would be called "characters".

If the buffer API offered information about the size of each element,
similar to what the proposed 'array API' offers, this would just
be one of the supportable cases.  Views could offer the ability to
specify the size of each element during construction (8, 16, or 32 bits),
but variant methods for handling everything would need to be constructed.

> Of course, both the 'array' and 'list' types already give you that, but 
> "characters" would have additional string-like methods. (However since 
> it is mutable, it would not be capable of producing views.)

The view object I have now supports mutable and resizable objects (like
bytes and array).

> The 'characters' data type would be particularly optimized for 
> character-at-a-time operations, i.e. building up a string one character 
> at a time. An example use would be processing escape sequences in 
> strings, where you are transforming the escaped string into its 
> non-escaped equivalent.

That is already possible with array.array('H', ...) or array.array('L', ...),
depending on the unicode width of your platform.  Array performs a more
conservative reallocation strategy (1/16 rather than 1/8), but it seems
to work well enough.  Combine array with wide character support in views,
and we could very well have the functionality that you desire.
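
[For Talin's escape-processing example, the array-based approach looks
roughly like this.  The helper name `unescape` and the single `\n`
escape are illustrative only; 'H' holds 16-bit code points, as Josiah
suggests above.]

```python
from array import array

def unescape(s):
    # Build the result one character at a time in a mutable array of
    # 16-bit code points ('H'), then join back into a string.
    buf = array('H')
    i = 0
    while i < len(s):
        if s[i] == '\\' and i + 1 < len(s) and s[i + 1] == 'n':
            buf.append(ord('\n'))
            i += 2
        else:
            buf.append(ord(s[i]))
            i += 1
    return ''.join(map(chr, buf))
```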

 - Josiah


From bob at redivi.com  Thu Aug 31 05:56:03 2006
From: bob at redivi.com (Bob Ippolito)
Date: Wed, 30 Aug 2006 20:56:03 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ed3iq2$9iv$1@sea.gmane.org>
References: <20060827184941.1AE8.JCARLSON@uci.edu> <ed1q7r$v4s$2@sea.gmane.org>
	<20060829102307.1B0F.JCARLSON@uci.edu> <ed1uds$iog$1@sea.gmane.org>
	<ed3iq2$9iv$1@sea.gmane.org>
Message-ID: <6a36e7290608302056v4b0e68abrfe0c5b1fc927ff@mail.gmail.com>

On 8/30/06, Fredrik Lundh <fredrik at pythonware.com> wrote:
> Fredrik Lundh wrote:
>
> > not necessarily, but there are lots of issues involved when doing
> > high-performance XML stuff, and I'm not sure views would help quite as
> > much as one might think.
> >
> > (writing and tuning cET was a great way to learn that not everything
> > that you think you know about C performance applies to C code running
> > inside the Python interpreter...)
>
> and also based on the cET (and NFS) experiences, it wouldn't surprise me
> if a naive 32-bit text string implementation will, on average, slow things down
> *more* than any string view implementation can speed things up again...
>
> (in other words, I'm convinced that we need a polymorphic string type.  I'm not
> so sure we need views, but if we have the former, we can use that mechanism to
> support the latter)

+1 for polymorphic strings.

This would give us the best of both worlds: compact representations
for ASCII and Latin-1, full 32-bit text when needed, and the
possibility to implement further optimizations when necessary. It
could add a bit of complexity and/or a massive speed penalty
(depending on how naive the implementation is) around character
operations though.

For implementation ideas, Apple's CoreFoundation has a mature
implementation of polymorphic strings in C (which is the basis for
their NSString type in Objective-C), and there's a cross-platform
subset of it available as CF-Lite:
http://developer.apple.com/opensource/cflite.html

-bob

From jackdied at jackdied.com  Thu Aug 31 06:00:41 2006
From: jackdied at jackdied.com (Jack Diederich)
Date: Thu, 31 Aug 2006 00:00:41 -0400
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <ca471dc20608302001x6e32c7bal23650fafb8224ebc@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
	<ed5e2k$r7f$1@sea.gmane.org>
	<ca471dc20608302001x6e32c7bal23650fafb8224ebc@mail.gmail.com>
Message-ID: <20060831040041.GF6257@performancedrivers.com>

No need to take it back; as a long-time python-* list reader I only took
your initial post as thinking out loud.

List readers can spot similar threads in the future by looking for these
three indicators:

1) Behavioral function arguments are discouraged and mostly on your say-so.

2) You didn't top post, so it wasn't a pronouncement.

3) Long time readers were sure enough of #1 and #2 that no one added a 
   "GOOD GOD NO" reply

top-posting-ly,

-Jack

On Wed, Aug 30, 2006 at 08:01:04PM -0700, Guido van Rossum wrote:
> Actually given Raymond's preferences I take it back
> 
> On 8/30/06, Ron Adam <rrr at ronadam.com> wrote:
> > Guido van Rossum wrote:
> >
> > > Perhaps a compromise could be to add a keyword parameter to request
> > > such an exception? (We could even add three options: truncate, pad,
> > > error, with truncate being the default, and pad being the old map()
> > > and filter() behavior.)
> >
> > Maybe it can be done with just two optional keywords.
> >
> >
> > If 'match' is True, raise an error if iterables are mismatched.
> >
> > if a 'pad' is specified then pad, else truncate.
> >
> > The current truncating behavior would be the default.
> >
> >
> >     Ron
> >
> >
> >
> >
> >
> >
> >
> >
> > _______________________________________________
> > Python-3000 mailing list
> > Python-3000 at python.org
> > http://mail.python.org/mailman/listinfo/python-3000
> > Unsubscribe: http://mail.python.org/mailman/options/python-3000/guido%40python.org
> >
> 
> 
> -- 
> --Guido van Rossum (home page: http://www.python.org/~guido/)
> _______________________________________________
> Python-3000 mailing list
> Python-3000 at python.org
> http://mail.python.org/mailman/listinfo/python-3000
> Unsubscribe: http://mail.python.org/mailman/options/python-3000/jack%40performancedrivers.com
> 

From rrr at ronadam.com  Thu Aug 31 06:27:27 2006
From: rrr at ronadam.com (Ron Adam)
Date: Wed, 30 Aug 2006 23:27:27 -0500
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060830185158.1B3F.JCARLSON@uci.edu>
References: <20060830091620.1B30.JCARLSON@uci.edu>	<44F6264B.4000005@canterbury.ac.nz>
	<20060830185158.1B3F.JCARLSON@uci.edu>
Message-ID: <ed5ol4$ktv$1@sea.gmane.org>

Josiah Carlson wrote:

> If views are always returned, then we can perform some optimizations
> (adjacent view concatenation, etc.), which may reduce running time,
> memory use, etc.

Given an empty string and a view to it, how much memory do you think a 
view object will take in comparison to the string object?

Wouldn't there be a minimum size of a string where it would be better to 
just copy the string?










From jack at psynchronous.com  Thu Aug 31 06:43:54 2006
From: jack at psynchronous.com (Jack Diederich)
Date: Thu, 31 Aug 2006 00:43:54 -0400
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <6a36e7290608302056v4b0e68abrfe0c5b1fc927ff@mail.gmail.com>
References: <20060827184941.1AE8.JCARLSON@uci.edu> <ed1q7r$v4s$2@sea.gmane.org>
	<20060829102307.1B0F.JCARLSON@uci.edu>
	<ed1uds$iog$1@sea.gmane.org> <ed3iq2$9iv$1@sea.gmane.org>
	<6a36e7290608302056v4b0e68abrfe0c5b1fc927ff@mail.gmail.com>
Message-ID: <20060831044354.GH6257@performancedrivers.com>

On Wed, Aug 30, 2006 at 08:56:03PM -0700, Bob Ippolito wrote:
> On 8/30/06, Fredrik Lundh <fredrik at pythonware.com> wrote:
> > Fredrik Lundh wrote:
> >
> > > not necessarily, but there are lots of issues involved when doing
> > > high-performance XML stuff, and I'm not sure views would help quite as
> > > much as one might think.
> > >
> > > (writing and tuning cET was a great way to learn that not everything
> > > that you think you know about C performance applies to C code running
> > > inside the Python interpreter...)
> >
> > and also based on the cET (and NFS) experiences, it wouldn't surprise me
> > if a naive 32-bit text string implementation will, on average, slow things down
> > *more* than any string view implementation can speed things up again...
> >
> > (in other words, I'm convinced that we need a polymorphic string type.  I'm not
> > so sure we need views, but if we have the former, we can use that mechanism to
> > support the latter)
> 
> +1 for polymorphic strings.
> 
> This would give us the best of both worlds: compact representations
> for ASCII and Latin-1, full 32-bit text when needed, and the
> possibility to implement further optimizations when necessary. It
> could add a bit of complexity and/or a massive speed penalty
> (depending on how naive the implementation is) around character
> operations though.
> 
> For implementation ideas, Apple's CoreFoundation has a mature
> implementation of polymorphic strings in C (which is the basis for
> their NSString type in Objective-C), and there's a cross-platform
> subset of it available as CF-Lite:
> http://developer.apple.com/opensource/cflite.html
> 

Having watched Fredrik casually double the speed of many str and unicode 
operations in a week, I'm easily +1 on whatever he says.  Bob's support 
makes that a +2; he struck me as quite sane too.

That said, can you guys expand on what polymorphic[1] means here in particular?
Python-wise I can only think of the str/unicode/buffer split.  If the 
fraternity of strings doesn't include views (which I haven't needed either),
what are you considering for the other kinds?

-Jack

[1] My ten pound Webster's says
    "An organism having more than one adult form, as the different castes 
    in social ants" which is close enough to what I think the comp sci
    definition is.

From jcarlson at uci.edu  Thu Aug 31 07:23:05 2006
From: jcarlson at uci.edu (Josiah Carlson)
Date: Wed, 30 Aug 2006 22:23:05 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <ed5ol4$ktv$1@sea.gmane.org>
References: <20060830185158.1B3F.JCARLSON@uci.edu> <ed5ol4$ktv$1@sea.gmane.org>
Message-ID: <20060830220511.1B45.JCARLSON@uci.edu>


Ron Adam <rrr at ronadam.com> wrote:
> 
> Josiah Carlson wrote:
> 
> > If views are always returned, then we can perform some optimizations
> > (adjacent view concatenation, etc.), which may reduce running time,
> > memory use, etc.
> 
> Given a empty string and a view to it, how much memory do you think a 
> view object will take in comparison to the string object?

On 32 bit platforms, the current implementation uses 8 more bytes than a
Python 2.4 buffer, or 44 bytes rather than 36.  The base string object
takes up at least 24 bytes (for strings of length 2-4, all length 1 and
0 strings are interned).

> Wouldn't there be a minimum size of a string where it would be better to 
> just copy the string?

What do you mean by "better"?  If your question is: at what size would
returning a Python 2.x string be more space efficient than the current
view implementation, that would be a string of up to 24 bytes.

However, as I said before, with views we can do adjacent view
concatenation...

    x,y,z = view.partition(a)
    left_with_sep = x+y
    right_with_sep = y+z

If we returned views from view addition, then both of the additions
above would be constant time operations.  But if we returned strings
from view additions, the above two additions would run in O(n) time
together.

If we were really crazy, we could even handle non-adjacent view
concatenation by checking the readonly flag, and examining data to the
right of the current view.  But even I'm not that crazy.
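
[The constant-time adjacent concatenation can be illustrated with a toy
view class.  This is a sketch, not Josiah's implementation, and
`partition_view` is an invented helper standing in for
`view.partition()`.]

```python
class View:
    # Toy read-only string view: (base, start, stop).  Adjacent views
    # over the same base concatenate in O(1) by widening the bounds.
    def __init__(self, base, start=0, stop=None):
        self.base = base
        self.start = start
        self.stop = len(base) if stop is None else stop

    def __add__(self, other):
        if self.base is other.base and self.stop == other.start:
            return View(self.base, self.start, other.stop)  # O(1), no copy
        return View(str(self) + str(other))                 # O(n) fallback

    def __str__(self):
        return self.base[self.start:self.stop]

def partition_view(v, sep):
    # Stand-in for view.partition(sep): returns three views that all
    # share the same base string.
    i = str(v).find(sep)
    if i < 0:
        return v, View('', 0, 0), View('', 0, 0)
    a = v.start + i
    return (View(v.base, v.start, a),
            View(v.base, a, a + len(sep)),
            View(v.base, a + len(sep), v.stop))
```

With views returned from addition, `x+y` and `y+z` below merely widen
the bounds instead of copying characters.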


 - Josiah


From talin at acm.org  Thu Aug 31 07:36:43 2006
From: talin at acm.org (Talin)
Date: Wed, 30 Aug 2006 22:36:43 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060830203044.1B42.JCARLSON@uci.edu>
References: <20060830185158.1B3F.JCARLSON@uci.edu> <44F64B04.9080200@acm.org>
	<20060830203044.1B42.JCARLSON@uci.edu>
Message-ID: <44F6756B.2080606@acm.org>

Josiah Carlson wrote:
> Talin <talin at acm.org> wrote:
>> The 'characters' data type would be particularly optimized for 
>> character-at-a-time operations, i.e. building up a string one character 
>> at a time. An example use would be processing escape sequences in 
>> strings, where you are transforming the escaped string into its 
>> non-escaped equivalent.
> 
> That is already possible with array.array('H', ...) or array.array('L', ...),
> depending on the unicode width of your platform.  Array performs a more
> conservative reallocation strategy (1/16 rather than 1/8), but it seems
> to work well enough.  Combine array with wide character support in views,
> and we could very well have the functionality that you desire.

Well, one of the things I wanted to be able to do is:

    'characters += str'

Or more precisely:

    token_buf = characters()
    token_buf += "example"
    token_buf += "\n"
    print token_buf
    >>> "example\n"

Now, an ordinary list would concatenate the string *object* onto the end 
of the list; whereas the character array would concatenate the string 
characters to the end of the character array. Also note that the __str__ 
method of the character array returns a vanilla string object of its 
contents.

(What I am describing here is exactly the behavior of Java StringBuffer.)
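
[In Python terms, the behaviour Talin describes could be sketched like
this.  The class name and API follow his proposal; no such type exists
in the stdlib.]

```python
class Characters:
    # Mutable character buffer in the spirit of Java's StringBuffer.
    def __init__(self, initial=''):
        self._chars = list(initial)

    def __iadd__(self, text):
        # += splices in the *characters* of the string, not the
        # string object itself (contrast with list.append).
        self._chars.extend(text)
        return self

    def __str__(self):
        # Return a vanilla immutable string of the contents.
        return ''.join(self._chars)
```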

-- Talin

From paul at prescod.net  Thu Aug 31 10:05:18 2006
From: paul at prescod.net (Paul Prescod)
Date: Thu, 31 Aug 2006 01:05:18 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060831044354.GH6257@performancedrivers.com>
References: <20060827184941.1AE8.JCARLSON@uci.edu> <ed1q7r$v4s$2@sea.gmane.org>
	<20060829102307.1B0F.JCARLSON@uci.edu> <ed1uds$iog$1@sea.gmane.org>
	<ed3iq2$9iv$1@sea.gmane.org>
	<6a36e7290608302056v4b0e68abrfe0c5b1fc927ff@mail.gmail.com>
	<20060831044354.GH6257@performancedrivers.com>
Message-ID: <1cb725390608310105j2f8ee298p3a44d91fc91140ad@mail.gmail.com>

On 8/30/06, Jack Diederich <jack at psynchronous.com> wrote:
>
> On Wed, Aug 30, 2006 at 08:56:03PM -0700, Bob Ippolito wrote:
> > > and also based on the cET (and NFS) experiences, it wouldn't surprise
> me
> > > if a naive 32-bit text string implementation will, on average, slow
> things down
> > > *more* than any string view implementation can speed things up
> again...
> > >
> > > (in other words, I'm convinced that we need a polymorphic string
> type.  I'm not
> > > so sure we need views, but if we have the former, we can use that
> mechanism to
> > > support the latter)
> >
> > +1 for polymorphic strings.
> >
> > This would give us the best of both worlds: compact representations
> > for ASCII and Latin-1, full 32-bit text when needed, and the
> > possibility to implement further optimizations when necessary. It
> > could add a bit of complexity and/or a massive speed penalty
> > (depending on how naive the implementation is) around character
> > operations though.
>
> Having watched Fredrik casually double the speed of many str and unicode
> operations in a week I'm easily +1 on whatever he says.  Bob's support
> makes that a +2, he struck me as quite sane too.
>
> That said can you guys expand on what polymorphic[1] means here in
> particular?


I think that Bob alluded to it. They are talking about a string that uses 1
byte-per-character for ASCII text, perhaps two bytes-per-character for a mix
of Greek and Russian text and four bytes-per-character for certain Chinese
or Japanese strings. From the Python programmers' point of view it should be
an invisible optimization.

 Paul Prescod
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060831/0f61bc29/attachment-0001.htm 

From fredrik at pythonware.com  Thu Aug 31 10:21:00 2006
From: fredrik at pythonware.com (Fredrik Lundh)
Date: Thu, 31 Aug 2006 10:21:00 +0200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060831044354.GH6257@performancedrivers.com>
References: <20060827184941.1AE8.JCARLSON@uci.edu>
	<ed1q7r$v4s$2@sea.gmane.org>	<20060829102307.1B0F.JCARLSON@uci.edu>	<ed1uds$iog$1@sea.gmane.org>
	<ed3iq2$9iv$1@sea.gmane.org>	<6a36e7290608302056v4b0e68abrfe0c5b1fc927ff@mail.gmail.com>
	<20060831044354.GH6257@performancedrivers.com>
Message-ID: <ed665c$nvs$1@sea.gmane.org>

Jack Diederich wrote:

> That said can you guys expand on what polymorphic[1] means here in particular?
> Python wise I can only think of the str/unicode/buffer split.  If the 
> fraternity of strings doesn't include views (which I haven't needed either)
> what are you considering for the other kinds?

the idea is to allow a given string object to use different kinds of 
storage depending on what data it contains, and how it's being used.

off the top of my head, I'd imagine using at least:

     wide unicode (32-bit)
     8-bit ascii/iso-8859-1
     utf-8

and possibly also one or more of

     narrow unicode (16-bit)
     8-bit encoded (arbitrary 8-bit encodings)
     utf-16
     selected asian encodings

all these look and behave the same at the Python level, as well as when 
using "high-level" C APIs.  ob_type may differ (also during an object's 
lifetime), but type(s) is always the same.

this approach gives you lots of advantages:

- lots of operations can be carried out without having to convert the 
  data (all the formats listed above support forward iteration, and 
  most text-level operations).

- you'll save tons of memory in applications that use text mostly in a 
  few character sets (and less memory means more speed).

- adding (or removing) specific string implementations becomes trivial, 
  both for the core developers and extension writers.

etc.

the main disadvantage is that it becomes a bit more difficult to deal 
with strings at the C level (but properly dealing with both 8-bit and 
Unicode strings is already a pain in the ass, and I'm not sure this has 
to be any harder, just slightly different).
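
as a rough illustration of the storage-selection idea (a toy sketch, not 
an actual design; the function name is made up), the choice of 
representation could hinge on the widest code point in the string:

```python
def narrowest_storage(text):
    # Pick the narrowest fixed-width representation that can hold
    # every code point in the string; a real polymorphic string
    # object would switch between these transparently.
    max_cp = max(map(ord, text)) if text else 0
    if max_cp < 0x100:
        return ("latin-1", 1)   # covers ascii/iso-8859-1
    elif max_cp < 0x10000:
        return ("ucs-2", 2)     # basic multilingual plane
    else:
        return ("ucs-4", 4)     # full 32-bit unicode
```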

for some details on apple's implementation (thanks bob!), see:

https://developer.apple.com/documentation/CoreFoundation/Conceptual/CFStrings/Concepts/StringStorage.html

</F>


From jimjjewett at gmail.com  Thu Aug 31 17:38:59 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Thu, 31 Aug 2006 11:38:59 -0400
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <305688A8-0CFA-4F80-80EA-E3D2343D7226@python.org>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
	<305688A8-0CFA-4F80-80EA-E3D2343D7226@python.org>
Message-ID: <fb6fbf560608310838h24cd17aem187a8326398d7cc2@mail.gmail.com>

On 8/30/06, Barry Warsaw <barry at python.org> wrote:
> On Aug 30, 2006, at 5:57 PM, Guido van Rossum wrote:

> > Perhaps a compromise could be to add a keyword parameter to request
> > such an exception? (We could even add three options: truncate, pad,
> > error, with truncate being the default, and pad being the old map()
> > and filter() behavior.)

> What about a keyword argument called 'filler' which can be an n-sized
> sequence or a callable.

How about a keyword-only argument called finish which is a callable to
deal with the problem?  When any sequence is exhausted, its position
is filled with StopIteration, and then finish(result) is returned.

For example,

    >>> g=zip("abc", (1,2))

The third call to g.next() will return the result of
    finish('c', StopIteration)

def finish_truncate(*args):
    # The default, like today
    raise StopIteration

def finish_error(*args):
    if all(v is StopIteration for v in args):
        raise StopIteration
    # wrap args in a 1-tuple so % formats the whole tuple as one value
    raise ValueError("Mismatched sequence lengths: %r" % (args,))

def finish_padNone(*args):
    if all(v is StopIteration for v in args):
        raise StopIteration
    return tuple((v if v is not StopIteration else None) for v in args)
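
for concreteness, the whole thing could be sketched as a generator in
modern-Python terms (the name zip_finish is invented; the built-in zip()
obviously takes no such argument today):

```python
def zip_finish(*iterables, finish=None):
    # When any iterator is exhausted, its slot is filled with
    # StopIteration and finish(*row) supplies (or refuses) the final
    # value; finish=None reproduces today's truncating zip().
    iters = [iter(it) for it in iterables]
    if not iters:
        return
    while True:
        row, exhausted = [], 0
        for it in iters:
            try:
                row.append(next(it))
            except StopIteration:
                row.append(StopIteration)
                exhausted += 1
        if exhausted == 0:
            yield tuple(row)
        elif finish is None:
            return
        else:
            try:
                yield finish(*row)
            except StopIteration:
                return
```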

-jJ

From barry at python.org  Thu Aug 31 17:44:53 2006
From: barry at python.org (Barry Warsaw)
Date: Thu, 31 Aug 2006 11:44:53 -0400
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <fb6fbf560608310838h24cd17aem187a8326398d7cc2@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
	<305688A8-0CFA-4F80-80EA-E3D2343D7226@python.org>
	<fb6fbf560608310838h24cd17aem187a8326398d7cc2@mail.gmail.com>
Message-ID: <FAD7156D-88B3-4B56-B3F7-E6EB0A1EFD40@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Aug 31, 2006, at 11:38 AM, Jim Jewett wrote:

> On 8/30/06, Barry Warsaw <barry at python.org> wrote:
>
>> What about a keyword argument called 'filler' which can be an n-sized
>> sequence or a callable.
>
> How about a keyword-only argument called finish which is a callable to
> deal with the problem?  When any sequence is exhausted, its position
> is filled with StopIteration, and then finish(result) is returned.

Nice!
- -Barry

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (Darwin)

iQCVAwUBRPcD+3EjvBPtnXfVAQKviQP/fEcBu7t2iXEfBom3flvDgcoauJp+/XSS
s2zdIivkQAZgs8kmbtYpk0R4KPyIUhyjHahzcxvUKKXGakfpIl73FBGSK+XfG/iq
IqQ33dW4Gl6YBt9HpOLVd0NP1RWUGl+QNegLP2ihgLoRFi0QK8fBj0FPoxHdHrfu
rIGXwJe6Qlg=
=0PRM
-----END PGP SIGNATURE-----

From rhettinger at ewtllc.com  Thu Aug 31 18:12:44 2006
From: rhettinger at ewtllc.com (Raymond Hettinger)
Date: Thu, 31 Aug 2006 09:12:44 -0700
Subject: [Python-3000] have zip() raise exception for sequences
 of	different lengths
In-Reply-To: <fb6fbf560608310838h24cd17aem187a8326398d7cc2@mail.gmail.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>	<44F608B6.5010209@ewtllc.com>	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>	<305688A8-0CFA-4F80-80EA-E3D2343D7226@python.org>
	<fb6fbf560608310838h24cd17aem187a8326398d7cc2@mail.gmail.com>
Message-ID: <44F70A7C.602@ewtllc.com>


>How about a keyword-only argument called finish which is a callable to
>deal with the problem?  When any sequence is exhausted, its position
>is filled with StopIteration, and then finish(result) is returned.
>
>  
>

How about we resist the urge to complicate the snot out of a basic 
looping construct.  Hypergeneralization is more of a sin than premature 
optimization.

It is important that zip() be left as dirt simple as possible.  In the 
tutorial (section 5.6), we're able to use short, simple examples to 
teach all of the fundamental looping techniques to total beginners in a 
way that lets them save their brain power for learning exceptions, 
classes, generators, packages, and whatnot.

Creative talent is being wasted here just to solve a non-problem.   
Please keep Py3k on track for cruft removal. We're seeing way too much 
discussion on random, screwball proposals rather than focusing on what 
really matters:  Keeping the tried and true while removing stuff we've 
always wanted to take away.



Raymond

From guido at python.org  Thu Aug 31 18:29:32 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 31 Aug 2006 09:29:32 -0700
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <44F70A7C.602@ewtllc.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>
	<44F608B6.5010209@ewtllc.com>
	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>
	<305688A8-0CFA-4F80-80EA-E3D2343D7226@python.org>
	<fb6fbf560608310838h24cd17aem187a8326398d7cc2@mail.gmail.com>
	<44F70A7C.602@ewtllc.com>
Message-ID: <ca471dc20608310929u15a2e328i97724625913e7f22@mail.gmail.com>

On 8/31/06, Raymond Hettinger <rhettinger at ewtllc.com> wrote:
>
> >How about a keyword-only argument called finish which is a callable to
> >deal with the problem?  When any sequence is exhausted, its position
> >is filled with StopIteration, and then finish(result) is returned.
>
> How about we resist the urge to complicate the snot out of a basic
> looping construct.  Hypergeneralization is more of a sin than premature
> optimization.

Hear, hear! Hypergeneralization adds features you can never get rid of
even though they may only be useful for <1% of the population. At
least unnecessary optimizations can be rolled back safely.

> It is important that zip() be left as dirt simple as possible.  In the
> tutorial (section 5.6), we're able to use short, simple examples to
> teach all of the fundamental looping techniques to total beginners in a
> way that lets them save their brain power for learning exceptions,
> classes, generators, packages, and whatnot.
>
> Creative talent is being wasted here just to solve a non-problem.
> Please keep Py3k on track for cruft removal. We're seeing way too much
> discussion on random, screwball proposals rather than focusing on what
> really matters:  Keeping the tried and true while removing stuff we've
> always wanted to take away.

Amen.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From g.brandl at gmx.net  Thu Aug 31 19:34:59 2006
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 31 Aug 2006 19:34:59 +0200
Subject: [Python-3000] have zip() raise exception for sequences of
	different lengths
In-Reply-To: <44F70A7C.602@ewtllc.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>	<44F608B6.5010209@ewtllc.com>	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>	<305688A8-0CFA-4F80-80EA-E3D2343D7226@python.org>	<fb6fbf560608310838h24cd17aem187a8326398d7cc2@mail.gmail.com>
	<44F70A7C.602@ewtllc.com>
Message-ID: <ed76k3$dsi$1@sea.gmane.org>

Raymond Hettinger wrote:
>>How about a keyword-only argument called finish which is a callable to
>>deal with the problem?  When any sequence is exhausted, its position
>>is filled with StopIteration, and then finish(result) is returned.
>>
>>  
>>
> 
> How about we resist the urge to complicate the snot out of a basic 
> looping construct.  Hypergeneralization is more of a sin than premature 
> optimization.
> 
> It is important that zip() be left as dirt simple as possible.

Added to PEP 3099.

Georg


From ironfroggy at gmail.com  Thu Aug 31 19:42:57 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Thu, 31 Aug 2006 13:42:57 -0400
Subject: [Python-3000] Exception Expressions
Message-ID: <76fd5acf0608311042k231fb36w1bf5d1e7e4eebe0c@mail.gmail.com>

I thought I felt in the mood for some abuse today, so I'm proposing
something sure to give me plenty of crap, but maybe someone will enjoy
the idea, anyway. This is a step beyond the recently added conditional
expressions. I actually made this up as a joke, explaining at which
point we would have gone too far with branching logic in an
expression. After making the joke, I was sad to realize I didn't mind
the idea and thought I'd see if anyone else doesn't mind it either.

    expr1 except expr2 if exc_type

For example, given a list, letters, of ['a', 'b', 'c'], we would be
able to do the following:

    print letters[7] except "N/A" if IndexError

This would translate to something along the lines of:

    try:
        _tmp = letters[7]
    except IndexError:
        _tmp = "N/A"
    print _tmp

Obviously, the except in an expression has to take precedence over if
expressions, otherwise it would evaluate '"N/A" if IndexError" first.
The syntax can be extended in some ways, to allow for handling
multiple exception types for one result or different results for
different exception types:

    foo() except "Bar or Baz!?" if BarError, BazError
    foo() except "Bar!" if BarError, "Baz!" if BazError

Other example use cases:

    # Fallback on an alternative path
    open(filename) except open(filename2) if IOError

    # Handle divide-by-zero
    while expr != "quit":
        print eval(expr) except "Can not divide by zero!" if ZeroDivisionError
        expr = raw_input()

    # Use a cache when an external resource times out
    db.get(key) except cache.get(key) if TimeoutError

Only very basic exception handling would be useful with this syntax,
so nothing would ever get out of hand, unless someone didn't care
about their code looking good and keeping reasonable line lengths, in
which case their code probably wouldn't look great to begin with.
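
Today the same pattern can only be spelled with a small helper function;
a sketch (the name trap is made up, not an existing builtin):

```python
def trap(thunk, fallback, exc_types=Exception):
    # Evaluate thunk(); on one of the named exceptions, substitute
    # the fallback value instead of propagating.
    try:
        return thunk()
    except exc_types:
        return fallback

letters = ['a', 'b', 'c']
# equivalent to the proposed: letters[7] except "N/A" if IndexError
trap(lambda: letters[7], "N/A", IndexError)   # -> "N/A"
```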

If there is any positive response I'll write up a PEP.

From brett at python.org  Thu Aug 31 20:20:20 2006
From: brett at python.org (Brett Cannon)
Date: Thu, 31 Aug 2006 11:20:20 -0700
Subject: [Python-3000] Exception Expressions
In-Reply-To: <76fd5acf0608311042k231fb36w1bf5d1e7e4eebe0c@mail.gmail.com>
References: <76fd5acf0608311042k231fb36w1bf5d1e7e4eebe0c@mail.gmail.com>
Message-ID: <bbaeab100608311120v67b23b79p15c2d46fe86cbed9@mail.gmail.com>

On 8/31/06, Calvin Spealman <ironfroggy at gmail.com> wrote:
>
> I thought I felt in the mood for some abuse today, so I'm proposing
> something sure to give me plenty of crap, but maybe someone will enjoy
> the idea, anyway.


Never hurts too much to try, huh?  =)  Plus it gives me a break from my
work.

This is a step beyond the recently added conditional
> expressions. I actually made this up as a joke, explaining at which
> point we would have gone too far with branching logic in an
> expression. After making the joke, I was sad to realize I didn't mind
> the idea and thought I'd see if anyone else doesn't mind it either.
>
>     expr1 except expr2 if exc_type
>
> For example, given a list, letters, of ['a', 'b', 'c'], we would be
> able to do the following:
>
>     print letters[7] except "N/A" if IndexError


So this feels like the Perl idiom of using die: ``open(file) or die`` (or
something like that; I have never been a Perl guy so I could be off).

This would translate to something along the lines of:
>
>     try:
>         _tmp = letters[7]
>     except IndexError:
>         _tmp = "N/A"
>     print _tmp
>
> Obviously, the except in an expression has to take precedence over if
> expressions, otherwise it would evaluate '"N/A" if IndexError" first.
> The syntax can be extended in some ways, to allow for handling
> multiple exception types for one result or different results for
> different exception types:
>
>     foo() except "Bar or Baz!?" if BarError, BazError
>     foo() except "Bar!" if BarError, "Baz!" if BazError
>
> Other example use cases:
>
>     # Fallback on an alternative path
>     open(filename) except open(filename2) if IOError
>
>     # Handle divide-by-zero
>     while expr != "quit":
>         print eval(expr) except "Can not divide by zero!" if
> ZeroDivisionError
>         expr = raw_input()
>
>     # Use a cache when an external resource timesout
>     db.get(key) except cache.get(key) if TimeoutError
>
> Only very basic exception handling would be useful with this syntax,
> so nothing would ever get out of hand, unless someone wasn't caring
> about their code looking good and keeping good line lengths, so their
> code probably wouldn't look great to begin with.



The problem I have with this whole proposal is that catching exceptions
should be very obvious in the source code.  This proposal does not help with
that ideal.  So I am -1 on the whole idea.

-Brett
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/python-3000/attachments/20060831/602811a4/attachment.html 

From jimjjewett at gmail.com  Thu Aug 31 20:21:32 2006
From: jimjjewett at gmail.com (Jim Jewett)
Date: Thu, 31 Aug 2006 14:21:32 -0400
Subject: [Python-3000] Exception Expressions
In-Reply-To: <76fd5acf0608311042k231fb36w1bf5d1e7e4eebe0c@mail.gmail.com>
References: <76fd5acf0608311042k231fb36w1bf5d1e7e4eebe0c@mail.gmail.com>
Message-ID: <fb6fbf560608311121u6c05a6e4o475af21d6bc1e326@mail.gmail.com>

>     expr1 except expr2 if exc_type
...
>     print letters[7] except "N/A" if IndexError

I sort of like it, though I'm more worried than you about ugly code.

There have been many times when I wanted it so that I could use a list
comprehension (or generator comprehension) instead of a function or
block.

The bad news is that I seem to be an anti-channeller, so my interest
is perhaps not a *good* sign.

-jJ

From barry at python.org  Thu Aug 31 20:28:14 2006
From: barry at python.org (Barry Warsaw)
Date: Thu, 31 Aug 2006 14:28:14 -0400
Subject: [Python-3000] have zip() raise exception for sequences
	of	different lengths
In-Reply-To: <44F70A7C.602@ewtllc.com>
References: <d11dcfba0608301440u34f00311x714d3c1fe94f699a@mail.gmail.com>	<44F608B6.5010209@ewtllc.com>	<ca471dc20608301457l63a906f6occ5c7a00721de7cd@mail.gmail.com>	<305688A8-0CFA-4F80-80EA-E3D2343D7226@python.org>
	<fb6fbf560608310838h24cd17aem187a8326398d7cc2@mail.gmail.com>
	<44F70A7C.602@ewtllc.com>
Message-ID: <01A13D75-12A7-4590-A4A1-F0488D4C105C@python.org>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Aug 31, 2006, at 12:12 PM, Raymond Hettinger wrote:

> It is important that zip() be left as dirt simple as possible.  In  
> the tutorial (section 5.6), we're able to use short, simple  
> examples to teach all of the fundamental looping techniques to  
> total beginners in a way that lets them save their brain power for  
> learning exceptions, classes, generators, packages, and whatnot.

Without addressing zip() in particular (as I said before, its current  
API is just fine to me), and while agreeing with the general  
principle of keeping things as simple as they can be, I don't believe  
you have to teach all the ins-and-outs of a particular function,  
class, or module as soon as it's introduced in the tutorial.  It's  
perfectly fine to keep the intro examples short and sweet with a  
footnote saying "go here for more advanced usage".  There's a ton of  
stuff in Python that total beginners just don't need to know right away.

- -Barry

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (Darwin)

iQCVAwUBRPcqQ3EjvBPtnXfVAQJxSAP/Yk2Dqh88iHThSKoqHHr9rURGbO2UWPvt
R4xAFr4QMy4L8GtzLaG3l/RyeG59UwELgZCzRefw/aDuMotLrjrx4KvSb+FIgWmA
r/lwWnF34xWH+oSwD459WotkRIJxVnwCAUOJtiCGYqSKfSEf0z5OwDJfGCRCb6Iv
8RRqoeBlVVQ=
=iT7K
-----END PGP SIGNATURE-----

From talin at acm.org  Thu Aug 31 20:46:13 2006
From: talin at acm.org (Talin)
Date: Thu, 31 Aug 2006 11:46:13 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <20060831044354.GH6257@performancedrivers.com>
References: <20060827184941.1AE8.JCARLSON@uci.edu>
	<ed1q7r$v4s$2@sea.gmane.org>	<20060829102307.1B0F.JCARLSON@uci.edu>	<ed1uds$iog$1@sea.gmane.org>
	<ed3iq2$9iv$1@sea.gmane.org>	<6a36e7290608302056v4b0e68abrfe0c5b1fc927ff@mail.gmail.com>
	<20060831044354.GH6257@performancedrivers.com>
Message-ID: <44F72E75.2050204@acm.org>

Jack Diederich wrote:
>>> (in other words, I'm convinced that we need a polymorphic string type.  I'm not
>>> so sure we need views, but if we have the former, we can use that mechanism to
>>> support the latter)
>> +1 for polymorphic strings.
>>
>> This would give us the best of both worlds: compact representations
>> for ASCII and Latin-1, full 32-bit text when needed, and the
>> possibility to implement further optimizations when necessary. It
>> could add a bit of complexity and/or a massive speed penalty
>> (depending on how naive the implementation is) around character
>> operations though.
>>
>> For implementation ideas, Apple's CoreFoundation has a mature
>> implementation of polymorphic strings in C (which is the basis for
>> their NSString type in Objective-C), and there's a cross-platform
>> subset of it available as CF-Lite:
>> http://developer.apple.com/opensource/cflite.html
>>
> 
> Having watched Fredrik casually double the speed of many str and unicode 
> operations in a week I'm easily +1 on whatever he says.  Bob's support 
> makes that a +2, he struck me as quite sane too.

One way to handle this efficiently would be to only support the 
encodings which have a constant character size: ASCII, Latin-1, UCS-2 
and UTF-32. In other words, if the content of your text is plain ASCII, 
use an 8-bit-per-character string; if the content is limited to the 
Unicode BMP (Basic Multilingual Plane), use UCS-2; and if you are using 
Unicode supplementary characters, use UTF-32.

(The difference between UCS-2 and UTF-16 is that UCS-2 is always 2 bytes 
per character, and doesn't support the supplemental characters above 
0xffff, whereas UTF-16 characters can be either 2 or 4 bytes.)

By avoiding UTF-8, UTF-16 and other variable-character-length formats, 
you can always ensure that character index operations are done in 
constant time. Index operations would simply require scaling the index 
by the character size, rather than having to scan through the string and 
count characters.

The drawback of this method is that you may be forced to transform the 
entire string into a wider encoding if you add a single character that 
won't fit into the current encoding.
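
The forced widening might look like this in a toy model (illustrative 
only; names are invented and a real implementation would do this in C):

```python
def widen(buf, old_width, new_width):
    # Re-encode a fixed-width buffer of code points at a wider
    # bytes-per-character size -- the whole-string copy that adding
    # a single out-of-range character would force.
    out = bytearray()
    for i in range(0, len(buf), old_width):
        cp = int.from_bytes(buf[i:i + old_width], "little")
        out += cp.to_bytes(new_width, "little")
    return bytes(out)
```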

(Another option is to simply make all strings UTF-32 -- which is not 
that unreasonable, considering that text strings normally make up only a 
small fraction of a program's memory footprint. I am sure that there are 
applications that don't conform to this generalization, however. )

-- Talin

From guido at python.org  Thu Aug 31 20:55:15 2006
From: guido at python.org (Guido van Rossum)
Date: Thu, 31 Aug 2006 11:55:15 -0700
Subject: [Python-3000] Making more effective use of slice objects in Py3k
In-Reply-To: <44F72E75.2050204@acm.org>
References: <20060827184941.1AE8.JCARLSON@uci.edu> <ed1q7r$v4s$2@sea.gmane.org>
	<20060829102307.1B0F.JCARLSON@uci.edu> <ed1uds$iog$1@sea.gmane.org>
	<ed3iq2$9iv$1@sea.gmane.org>
	<6a36e7290608302056v4b0e68abrfe0c5b1fc927ff@mail.gmail.com>
	<20060831044354.GH6257@performancedrivers.com>
	<44F72E75.2050204@acm.org>
Message-ID: <ca471dc20608311155i89d671dtdf99907674cbf87d@mail.gmail.com>

On 8/31/06, Talin <talin at acm.org> wrote:
> One way to handle this efficiently would be to only support the
> encodings which have a constant character size: ASCII, Latin-1, UCS-2
> and UTF-32. In other words, if the content of your text is plain ASCII,
> use an 8-bit-per-character string; If the content is limited to the
> Unicode BMP (Basic Multilingual Plane), use UCS-2; and if you are using
> Unicode supplementary characters, use UTF-32.
>
> (The difference between UCS-2 and UTF-16 is that UCS-2 is always 2 bytes
> per character, and doesn't support the supplemental characters above
> 0xffff, whereas UTF-16 characters can be either 2 or 4 bytes.)

I think we should also support UTF-16, since Java and .NET (and
Win32?) appear to be using it effectively; making surrogate handling an
application issue doesn't seem *too* big a burden for many apps.

> By avoiding UTF-8, UTF-16 and other variable-character-length formats,
> you can always insure that character index operations are done in
> constant time. Index operations would simply require scaling the index
> by the character size, rather than having to scan through the string and
> count characters.
>
> The drawback of this method is that you may be forced to transform the
> entire string into a wider encoding if you add a single character that
> won't fit into the current encoding.

A way to handle UTF-8 strings and other variable-length encodings
would be to maintain a small cache of index positions with the string
object.
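
The index-cache idea could be sketched like so (a toy sketch only; the 
class and attribute names are invented, not anyone's proposal):

```python
class UTF8String:
    # Keep a byte offset for every CHUNK-th character so indexing into
    # a utf-8 buffer scans at most CHUNK characters instead of the
    # whole string.
    CHUNK = 64

    def __init__(self, data):
        self._data = data          # bytes, assumed to be valid utf-8
        self._offsets = [0]        # byte offsets of chars 0, CHUNK, 2*CHUNK, ...
        pos = chars = 0
        while pos < len(data):
            lead = data[pos]
            # the utf-8 sequence length is determined by the lead byte
            step = 1 if lead < 0x80 else 2 if lead < 0xE0 else 3 if lead < 0xF0 else 4
            pos += step
            chars += 1
            if chars % self.CHUNK == 0:
                self._offsets.append(pos)
        self._length = chars

    def __len__(self):
        return self._length

    def __getitem__(self, i):
        if not 0 <= i < self._length:
            raise IndexError(i)
        pos = self._offsets[i // self.CHUNK]
        for _ in range(i % self.CHUNK):            # at most CHUNK - 1 steps
            pos += 1
            while self._data[pos] & 0xC0 == 0x80:  # skip continuation bytes
                pos += 1
        end = pos + 1
        while end < len(self._data) and self._data[end] & 0xC0 == 0x80:
            end += 1
        return self._data[pos:end].decode("utf-8")
```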

> (Another option is to simply make all strings UTF-32 -- which is not
> that unreasonable, considering that text strings normally make up only a
> small fraction of a program's memory footprint. I am sure that there are
> applications that don't conform to this generalization, however. )

Here you are effectively voting against polymorphic strings. I believe
Fredrik has good reasons to doubt this assertion.

-- 
--Guido van Rossum (home page: http://www.python.org/~guido/)

From tjreedy at udel.edu  Thu Aug 31 22:58:55 2006
From: tjreedy at udel.edu (tjreedy)
Date: Thu, 31 Aug 2006 22:58:55 +0200
Subject: [Python-3000] Making more effective use of slice objects in Py3k
References: <20060827184941.1AE8.JCARLSON@uci.edu>
	<ed1q7r$v4s$2@sea.gmane.org><20060829102307.1B0F.JCARLSON@uci.edu>
	<ed1uds$iog$1@sea.gmane.org><ed3iq2$9iv$1@sea.gmane.org>
	<6a36e7290608302056v4b0e68abrfe0c5b1fc927ff@mail.gmail.com>
Message-ID: <ed7iii$psn$1@sea.gmane.org>


"Bob Ippolito" <bob at redivi.com> wrote in message 
news:6a36e7290608302056v4b0e68abrfe0c5b1fc927ff at mail.gmail.com...
> +1 for polymorphic strings.

A strong +1 here also.
>
> This would give us the best of both worlds: compact representations
> for ASCII and Latin-1, full 32 bit text when needed, and the
> possibility to implement further optimizations when necessary.

As I understand current plans, Python 3 will have a polymorphic integer type 
that handles details of switching between the two current implementations, 
one for efficiency, and one for generality, behind the scenes.

I think it would be a great selling point for people to adopt Python 3 if it 
also handled the even worse nastiness of text forms behind the scenes, and 
kept the efficiency of special-case uses (such as all-ASCII strings) while 
making the transition to generality more seamless than it is now.

These two similar features would be enough, to me, to make Py3 more than 
just 2.x with cruft removed.

Terry J. Reedy

From rhettinger at ewtllc.com  Thu Aug 31 23:29:36 2006
From: rhettinger at ewtllc.com (Raymond Hettinger)
Date: Thu, 31 Aug 2006 14:29:36 -0700
Subject: [Python-3000] Exception Expressions
In-Reply-To: <fb6fbf560608311121u6c05a6e4o475af21d6bc1e326@mail.gmail.com>
References: <76fd5acf0608311042k231fb36w1bf5d1e7e4eebe0c@mail.gmail.com>
	<fb6fbf560608311121u6c05a6e4o475af21d6bc1e326@mail.gmail.com>
Message-ID: <44F754C0.8080404@ewtllc.com>


>The bad news is that I seem to be an anti-channeller, so my interest
>is perhaps not a *good* sign.
>
>  
>
QOTW

From tomerfiliba at gmail.com  Thu Aug 31 23:43:44 2006
From: tomerfiliba at gmail.com (tomer filiba)
Date: Thu, 31 Aug 2006 23:43:44 +0200
Subject: [Python-3000] Comment on iostack library
Message-ID: <1d85506f0608311443s108822c1n31682ba765b2f3e0@mail.gmail.com>

i haven't been online for the last couple of days, so i'll unify
my replies into one post.

[Talin]
> Right now, a typical
> file handle consists of 3 "layers" - one representing the backing store
> (file, memory, network, etc.), one for adding buffering, and one
> representing the program-level API for reading strings, bytes, decoded
> text, etc.

yes, and it's also good you noted *typical*. the design is to
allow a virtually unlimited number of such layers, stacked one
after the other, giving you a very fine level of control without
having to write a single line of "procedural" or tailored code.
you just mix in what you want.

[Talin]
> I wonder if it wouldn't be better to cut that down to two. Specifically,
> I would like to suggest eliminating the buffering layer.
> My reasoning is fairly straightforward: Most file system handles,
> network handles and other operating system handles already support
> buffering, and they do a far better job of it than we can.

indeed, but as guido said (and i believe it also says so at my
wiki page), stdio cannot be trusted, let alone the way different
OSes implement things. buffering, for one, is a horrible issue.
i remember an old C program i wrote that worked fine on
windows, but not on linux, because i didn't print a newline and
stdout was line-buffered... i couldn't see the output, and it was
a nightmare to debug.

[Talin]
> Well, as far as readline goes: In order to split the text into lines,
> you have to decode the text first anyway, which is a layer 3 operation.
> You can't just read bytes until you get a \n, because the file you are
> reading might be encoded in UCS2 or something.

well, the LineBufferedLayer can be "configured" to split on any
"marker", i.e.: LineBufferedLayer(stream, marker = "\x00\x0a")
and of course layer 3, which creates layer 2, can set this marker
to any byte sequence. note it's a *byte* sequence, not chars,
since this passes down to layer 1 transparently.

i.e.

delimiters = {"utf8" : "\x0a", "utf16" : "\x00\x0a"}

def textfile(filename, mode, encoding = None):
    f = FileStream(filename, mode)
    f = LineBufferingLayer(f, delimiters[encoding])
    f = TextInterface(f, encoding)
    return f

[Talin]
> It seems to me that no matter how you slice it, you can't have an
> abstract "buffering" layer that is independent of both the layer beneath
> and the layer above.

but that's the whole idea! buffering is a complicated task that must
*not* be rewritten for every type of underlying storage. if one wanted
to write or read lines over a socket, one shouldn't need to
reimplement file-like line buffering, as done by socket.py.

i want to be able to read lines directly from any stream: socket, file,
or memory. how i choose to implement my HTTP parser is my only
concern, i don't want to be limited by the kind of stream my parser
would work over.

[Nick]
> You'd insert a buffering layer at the appropriate point for whatever you're
> trying to do. The advantage of pulling the buffering out into a separate layer
> is that it can be reused with different byte sources & sinks by supplying the
> appropriate configuration parameters, instead of having to reimplement it for
> each different source/sink.

indeed

[Marcin]
> I think buffering makes sense as the topmost layer, and typically only
> there.
> Encoding conversion and newline conversion should be performed a block
> at a time, below buffering, so not only I/O syscalls, but also
> invocations of the recoding machinery are amortized by buffering.

you have a good point, which i also stumbled upon when implementing
the TextInterface. but how would you suggest to solve it?

write()ing is always simpler, because you already have the entire
buffer, which you can encode as a chunk.

when read()ing, you can decode() the entire pre-read buffer first,
but then you have a "tail" of undecodable data (an incomplete
character or record), which would be quite nasty to handle.

besides, encoding suffers from many issues. suppose you have a
damaged UTF8 file, which you read char-by-char. when you reach the
damaged part, you'll never be able to "skip" it, as the code will just
keep read()ing bytes, hoping to make a character out of them, until it
reaches EOF, i.e.:

def read_char(self):
    buf = ""
    while not self._stream.eof:
        buf += self._stream.read(1)
        try:
            return buf.decode("utf8")
        except ValueError:
            # corrupt input never decodes, so this loops to EOF
            # and silently falls through, returning None
            pass

which leads me to the following thought: maybe we should have
an "enhanced" encoding library for py3k, which would report
*incomplete* data differently from *invalid* data. today it's just a
ValueError: suppose decode() would raise IncompleteDataError
when the given data is not sufficient to be decoded successfully,
and ValueError when the data is just corrupted.

that could aid iostack greatly.
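
for what it's worth, the split can be sketched today with an incremental
decoder, which buffers incomplete input but raises on corrupt input
(this is the existing codecs API, not the IncompleteDataError proposed
above):

```python
import codecs

dec = codecs.getincrementaldecoder("utf8")()
# a truncated euro sign: incomplete, so the decoder waits silently
assert dec.decode(b"\xe2\x82") == ""
# the final byte arrives and the character is emitted
assert dec.decode(b"\xac") == "\u20ac"
# genuinely corrupt input raises instead of waiting
try:
    dec.decode(b"\xff", final=True)
except UnicodeDecodeError:
    pass
```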



-tomer

From ironfroggy at gmail.com  Thu Aug 31 23:50:02 2006
From: ironfroggy at gmail.com (Calvin Spealman)
Date: Thu, 31 Aug 2006 17:50:02 -0400
Subject: [Python-3000] Exception Expressions
In-Reply-To: <bbaeab100608311120v67b23b79p15c2d46fe86cbed9@mail.gmail.com>
References: <76fd5acf0608311042k231fb36w1bf5d1e7e4eebe0c@mail.gmail.com>
	<bbaeab100608311120v67b23b79p15c2d46fe86cbed9@mail.gmail.com>
Message-ID: <76fd5acf0608311450r6fbddd44n28ab6f83741b8699@mail.gmail.com>

On 8/31/06, Brett Cannon <brett at python.org> wrote:
> So this feels like the Perl idiom of using die: ``open(file) or die`` (or
> something like that; I have never been a Perl guy so I could be off).
>
> > ...
>
> The problem I have with this whole proposal is that catching exceptions
> should be very obvious in the source code.  This proposal does not help with
> that ideal.  So I am -1 on the whole idea.
>
> -Brett

"Ouch" on associating my idea with Perl!

Although I agree that it is good to be obvious about exceptions, there
are some cases when they are simply less than exceptional. For
example, you can do d.get(key, default) if you know something is a
dictionary, but for general mappings you can't rely on that, and may
often use exceptions as a kind of logic control. No, that doesn't square
with the purity of exceptions, but sometimes practicality and
real-world usage trump theory.

Since it only allows a single expression, it shouldn't be able to get
ugly. Also, I hate to admit it, but it could allow a bare 'expr1 except
expr2', something more like the 'or die' paradigm.