From mark at qtrac.eu  Wed Sep  8 18:50:29 2010
From: mark at qtrac.eu (Mark Summerfield)
Date: Wed, 8 Sep 2010 17:50:29 +0100
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
Message-ID: <20100908175029.6617ae3b@dino>

Hi,

I can't see a _nice_ way of splitting a with statement over multiple
lines:

class FakeContext:
    def __init__(self, name):
        self.name = name
    def __enter__(self):
        print("enter", self.name)
    def __exit__(self, *args):
        print("exit", self.name)

with FakeContext("a") as a, FakeContext("b") as b:
    pass # works fine


with FakeContext("a") as a,
     FakeContext("b") as b:
    pass # syntax error


with (FakeContext("a") as a,
      FakeContext("b") as b):
    pass # syntax error

The use case where this mattered to me was this:

    with open(args.actual, encoding="utf-8") as afh,
         open(args.expected, encoding="utf-8") as efh:
        actual = [line.rstrip("\n\r") for line in afh.readlines()]
        expected = [line.rstrip("\n\r") for line in efh.readlines()]

Naturally, I could split the line in an ugly place:

    with open(args.actual, encoding="utf-8") as afh, open(args.expected,
	    encoding="utf-8") as efh:

but it seems a shame to do so. Or am I missing something?

I'm using Python 3.1.2.

-- 
Mark Summerfield, Qtrac Ltd, www.qtrac.eu
    C++, Python, Qt, PyQt - training and consultancy
        "Rapid GUI Programming with Python and Qt" - ISBN 0132354187
            http://www.qtrac.eu/pyqtbook.html


From nathan at cmu.edu  Wed Sep  8 19:00:25 2010
From: nathan at cmu.edu (Nathan Schneider)
Date: Wed, 8 Sep 2010 13:00:25 -0400
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <20100908175029.6617ae3b@dino>
References: <20100908175029.6617ae3b@dino>
Message-ID: <AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>

Mark,

I have approached these cases by using the backslash line-continuation operator:

with FakeContext("a") as a, \
   FakeContext("b") as b:
   pass

Nathan

On Wed, Sep 8, 2010 at 12:50 PM, Mark Summerfield <mark at qtrac.eu> wrote:
> Hi,
>
> I can't see a _nice_ way of splitting a with statement over multiple
> lines:
>
> class FakeContext:
>     def __init__(self, name):
>         self.name = name
>     def __enter__(self):
>         print("enter", self.name)
>     def __exit__(self, *args):
>         print("exit", self.name)
>
> with FakeContext("a") as a, FakeContext("b") as b:
>     pass # works fine
>
>
> with FakeContext("a") as a,
>      FakeContext("b") as b:
>     pass # syntax error
>
>
> with (FakeContext("a") as a,
>       FakeContext("b") as b):
>     pass # syntax error
>
> The use case where this mattered to me was this:
>
>     with open(args.actual, encoding="utf-8") as afh,
>          open(args.expected, encoding="utf-8") as efh:
>         actual = [line.rstrip("\n\r") for line in afh.readlines()]
>         expected = [line.rstrip("\n\r") for line in efh.readlines()]
>
> Naturally, I could split the line in an ugly place:
>
>     with open(args.actual, encoding="utf-8") as afh, open(args.expected,
>             encoding="utf-8") as efh:
>
> but it seems a shame to do so. Or am I missing something?
>
> I'm using Python 3.1.2.
>
> --
> Mark Summerfield, Qtrac Ltd, www.qtrac.eu
>     C++, Python, Qt, PyQt - training and consultancy
>         "Rapid GUI Programming with Python and Qt" - ISBN 0132354187
>             http://www.qtrac.eu/pyqtbook.html
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>


From mwm-keyword-python.b4bdba at mired.org  Wed Sep  8 19:04:00 2010
From: mwm-keyword-python.b4bdba at mired.org (Mike Meyer)
Date: Wed, 8 Sep 2010 13:04:00 -0400
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <20100908175029.6617ae3b@dino>
References: <20100908175029.6617ae3b@dino>
Message-ID: <20100908130400.75ec0a60@bhuda.mired.org>

On Wed, 8 Sep 2010 17:50:29 +0100
Mark Summerfield <mark at qtrac.eu> wrote:

> Hi,
> 
> I can't see a _nice_ way of splitting a with statement over multiple
> lines:
> 
> class FakeContext:
>     def __init__(self, name):
>         self.name = name
>     def __enter__(self):
>         print("enter", self.name)
>     def __exit__(self, *args):
>         print("exit", self.name)
> 
> with FakeContext("a") as a, FakeContext("b") as b:
>     pass # works fine
> 
> 
> with FakeContext("a") as a,
>      FakeContext("b") as b:
>     pass # syntax error
> 
> 
> with (FakeContext("a") as a,
>       FakeContext("b") as b):
>     pass # syntax error

How about:

with FakeContext("a") as a:
 with FakeContext("B") as b:

If the double-indent bothers you, using two two-space indents might be
acceptable in this case.

       <mike
-- 
Mike Meyer <mwm at mired.org>		http://www.mired.org/consulting.html
Independent Network/Unix/Perforce consultant, email for more information.

O< ascii ribbon campaign - stop html mail - www.asciiribbon.org


From g.brandl at gmx.net  Wed Sep  8 20:07:56 2010
From: g.brandl at gmx.net (Georg Brandl)
Date: Wed, 08 Sep 2010 20:07:56 +0200
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <20100908175029.6617ae3b@dino>
References: <20100908175029.6617ae3b@dino>
Message-ID: <i68jkk$dnf$1@dough.gmane.org>

On 08.09.2010 18:50, Mark Summerfield wrote:
> Hi,
> 
> I can't see a _nice_ way of splitting a with statement over multiple
> lines:
> 
> class FakeContext:
>     def __init__(self, name):
>         self.name = name
>     def __enter__(self):
>         print("enter", self.name)
>     def __exit__(self, *args):
>         print("exit", self.name)
> 
> with FakeContext("a") as a, FakeContext("b") as b:
>     pass # works fine
> 
> 
> with FakeContext("a") as a,
>      FakeContext("b") as b:
>     pass # syntax error
> 
> 
> with (FakeContext("a") as a,
>       FakeContext("b") as b):
>     pass # syntax error

In addition to the backslash hint already given, I'd like to explain why
this version isn't allowed: the parser couldn't distinguish between a
multi-context with and an expression in parentheses.

(The case of import, where parens can be used around the import list,
is different: no arbitrary expression is allowed there.)
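
For example (a sketch of the ambiguity; FakeContext as defined earlier
in the thread):

# Today this parses as "with <tuple>:" and fails only at runtime,
# because a tuple is not a context manager.  Allowing "as" bindings
# inside the parens would give the parser no way to tell the two
# readings apart from the opening tokens.
with (FakeContext("a"), FakeContext("b")):
    pass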

Georg

-- 
Thus spake the Lord: Thou shalt indent with four spaces. No more, no less.
Four shall be the number of spaces thou shalt indent, and the number of thy
indenting shall be four. Eight shalt thou not indent, nor either indent thou
two, excepting that thou then proceed to four. Tabs are right out.



From ncoghlan at gmail.com  Wed Sep  8 23:30:26 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 9 Sep 2010 07:30:26 +1000
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <i68jkk$dnf$1@dough.gmane.org>
References: <20100908175029.6617ae3b@dino>
	<i68jkk$dnf$1@dough.gmane.org>
Message-ID: <AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>

On Thu, Sep 9, 2010 at 4:07 AM, Georg Brandl <g.brandl at gmx.net> wrote:
> In addition to the backslash hint already given, I'd like to explain why
> this version isn't allowed: the parser couldn't distinguish between a
> multi-context with and an expression in parentheses.
>
> (The case of import, where parens can be used around the import list,
> is different: no arbitrary expression is allowed there.)

I've sometimes wondered if we should consider the idea of making line
continuation implicit between keywords and their associated colons.
I've never seriously investigated the implications for the parser,
though.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From python at mrabarnett.plus.com  Thu Sep  9 00:17:11 2010
From: python at mrabarnett.plus.com (MRAB)
Date: Wed, 08 Sep 2010 23:17:11 +0100
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>
References: <20100908175029.6617ae3b@dino>	<i68jkk$dnf$1@dough.gmane.org>
	<AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>
Message-ID: <4C880B67.5070607@mrabarnett.plus.com>

On 08/09/2010 22:30, Nick Coghlan wrote:
> On Thu, Sep 9, 2010 at 4:07 AM, Georg Brandl<g.brandl at gmx.net>  wrote:
>> In addition to the backslash hint already given, I'd like to explain why
>> this version isn't allowed: the parser couldn't distinguish between a
>> multi-context with and an expression in parentheses.
>>
>> (The case of import, where parens can be used around the import list,
>> is different: no arbitrary expression is allowed there.)
>
> I've sometimes wondered if we should consider the idea of making line
> continuation implicit between keywords and their associated colons.
> I've never seriously investigated the implications for the parser,
> though.
>
If a colon was omitted by mistake, how much later would the parser
report a syntax error?
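
(A small runnable illustration of the concern: today the parser flags a
missing colon on the very line where it occurs.)

# Current behaviour: the SyntaxError points at line 1, right where the
# colon is missing.  With implicit continuation up to the colon, the
# parser would have to keep reading and could report the error much
# later, far from the real mistake.
src = "if x == 1\n    print('one')\n"
try:
    compile(src, "<demo>", "exec")
except SyntaxError as e:
    print(e.lineno, e.msg)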


From greg.ewing at canterbury.ac.nz  Thu Sep  9 01:19:47 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Thu, 09 Sep 2010 11:19:47 +1200
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <4C880B67.5070607@mrabarnett.plus.com>
References: <20100908175029.6617ae3b@dino> <i68jkk$dnf$1@dough.gmane.org>
	<AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>
	<4C880B67.5070607@mrabarnett.plus.com>
Message-ID: <4C881A13.4060709@canterbury.ac.nz>

MRAB wrote:
> On 08/09/2010 22:30, Nick Coghlan wrote:
> 
>> I've sometimes wondered if we should consider the idea of making line
>> continuation implicit between keywords and their associated colons.
>>
> If a colon was omitted by mistake, how much later would the parser
> report a syntax error?

It might be best to allow this only if the continuation
lines are indented at least as far as the starting line.

-- 
Greg


From mikegraham at gmail.com  Thu Sep  9 01:47:50 2010
From: mikegraham at gmail.com (Mike Graham)
Date: Wed, 8 Sep 2010 19:47:50 -0400
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>
References: <20100908175029.6617ae3b@dino> <i68jkk$dnf$1@dough.gmane.org>
	<AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>
Message-ID: <AANLkTikVrUkzP-=77-86ggbK6DoXTSMZEFg7+Eokooj8@mail.gmail.com>

On Wed, Sep 8, 2010 at 5:30 PM, Nick Coghlan <ncoghlan at gmail.com> wrote:
> I've sometimes wondered if we should consider the idea of making line
> continuation implicit between keywords and their associated colons.

This would also have the nice aesthetic quality of making colons serve
a purpose.


From greg at krypto.org  Thu Sep  9 07:05:35 2010
From: greg at krypto.org (Gregory P. Smith)
Date: Wed, 8 Sep 2010 22:05:35 -0700
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>
References: <20100908175029.6617ae3b@dino>
	<AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>
Message-ID: <AANLkTimG_uqaJYjpNRN1BM4xzp+gOnDBV7Eodiev+tK8@mail.gmail.com>

On Wed, Sep 8, 2010 at 10:00 AM, Nathan Schneider <nathan at cmu.edu> wrote:

> Mark,
>
> I have approached these cases by using the backslash line-continuation
> operator:
>
> with FakeContext("a") as a, \
>   FakeContext("b") as b:
>   pass
>
> Nathan
>

I'm in the "\ is evil" at all costs camp so I'd suggest either the nested
with statements or alternatively do this:

fc = FakeContext
with fc("a") as a, fc("b") as b:
    pass


> On Wed, Sep 8, 2010 at 12:50 PM, Mark Summerfield <mark at qtrac.eu> wrote:
> > Hi,
> >
> > I can't see a _nice_ way of splitting a with statement over multiple
> > lines:
> >
> > class FakeContext:
> >    def __init__(self, name):
> >        self.name = name
> >    def __enter__(self):
> >        print("enter", self.name)
> >    def __exit__(self, *args):
> >        print("exit", self.name)
> >
> > with FakeContext("a") as a, FakeContext("b") as b:
> >    pass # works fine
> >
> >
> > with FakeContext("a") as a,
> >     FakeContext("b") as b:
> >    pass # syntax error
> >
> >
> > with (FakeContext("a") as a,
> >      FakeContext("b") as b):
> >    pass # syntax error
> >
> > The use case where this mattered to me was this:
> >
> >    with open(args.actual, encoding="utf-8") as afh,
> >         open(args.expected, encoding="utf-8") as efh:
> >        actual = [line.rstrip("\n\r") for line in afh.readlines()]
> >        expected = [line.rstrip("\n\r") for line in efh.readlines()]
> >
> > Naturally, I could split the line in an ugly place:
> >
> >    with open(args.actual, encoding="utf-8") as afh, open(args.expected,
> >            encoding="utf-8") as efh:
> >
> > but it seems a shame to do so. Or am I missing something?
> >
> > I'm using Python 3.1.2.
> >
> > --
> > Mark Summerfield, Qtrac Ltd, www.qtrac.eu
> >    C++, Python, Qt, PyQt - training and consultancy
> >        "Rapid GUI Programming with Python and Qt" - ISBN 0132354187
> >            http://www.qtrac.eu/pyqtbook.html
> > _______________________________________________
> > Python-ideas mailing list
> > Python-ideas at python.org
> > http://mail.python.org/mailman/listinfo/python-ideas
> >
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>

From mark at qtrac.eu  Thu Sep  9 07:49:51 2010
From: mark at qtrac.eu (Mark Summerfield)
Date: Thu, 9 Sep 2010 06:49:51 +0100
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>
References: <20100908175029.6617ae3b@dino>
	<AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>
Message-ID: <20100909064951.1e1b4df3@dino>

Hi Nathan,

On Wed, 8 Sep 2010 13:00:25 -0400
Nathan Schneider <nathan at cmu.edu> wrote:
> Mark,
> 
> I have approached these cases by using the backslash
> line-continuation operator:
> 
> with FakeContext("a") as a, \
>    FakeContext("b") as b:
>    pass

Yes, of course, and that's the way I've done it. But it seems a pity to
do it this way when the documentation explicitly discourages the use of
the backslash for line continuation:
http://docs.python.org/py3k/howto/doanddont.html
(look at the very last item)


> 
> Nathan
> 
> On Wed, Sep 8, 2010 at 12:50 PM, Mark Summerfield <mark at qtrac.eu>
> wrote:
> > Hi,
> >
> > I can't see a _nice_ way of splitting a with statement over multiple
> > lines:
> >
> > class FakeContext:
> >     def __init__(self, name):
> >         self.name = name
> >     def __enter__(self):
> >         print("enter", self.name)
> >     def __exit__(self, *args):
> >         print("exit", self.name)
> >
> > with FakeContext("a") as a, FakeContext("b") as b:
> >     pass # works fine
> >
> >
> > with FakeContext("a") as a,
> >      FakeContext("b") as b:
> >     pass # syntax error
> >
> >
> > with (FakeContext("a") as a,
> >       FakeContext("b") as b):
> >     pass # syntax error
> >
> > The use case where this mattered to me was this:
> >
> >     with open(args.actual, encoding="utf-8") as afh,
> >          open(args.expected, encoding="utf-8") as efh:
> >         actual = [line.rstrip("\n\r") for line in afh.readlines()]
> >         expected = [line.rstrip("\n\r") for line in efh.readlines()]
> >
> > Naturally, I could split the line in an ugly place:
> >
> >     with open(args.actual, encoding="utf-8") as afh, open(args.expected,
> >             encoding="utf-8") as efh:
> >
> > but it seems a shame to do so. Or am I missing something?
> >
> > I'm using Python 3.1.2.
> >
> > --
> > Mark Summerfield, Qtrac Ltd, www.qtrac.eu
> >     C++, Python, Qt, PyQt - training and consultancy
> >         "Rapid GUI Programming with Python and Qt" - ISBN 0132354187
> >             http://www.qtrac.eu/pyqtbook.html
> > _______________________________________________
> > Python-ideas mailing list
> > Python-ideas at python.org
> > http://mail.python.org/mailman/listinfo/python-ideas
> >



-- 
Mark Summerfield, Qtrac Ltd, www.qtrac.eu
    C++, Python, Qt, PyQt - training and consultancy
        "Programming in Python 3" - ISBN 0321680561
            http://www.qtrac.eu/py3book.html


From ben+python at benfinney.id.au  Thu Sep  9 09:55:38 2010
From: ben+python at benfinney.id.au (Ben Finney)
Date: Thu, 09 Sep 2010 17:55:38 +1000
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
References: <20100908175029.6617ae3b@dino>
	<AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>
	<AANLkTimG_uqaJYjpNRN1BM4xzp+gOnDBV7Eodiev+tK8@mail.gmail.com>
Message-ID: <87k4mv9wqt.fsf@benfinney.id.au>

"Gregory P. Smith" <greg at krypto.org>
writes:

> On Wed, Sep 8, 2010 at 10:00 AM, Nathan Schneider <nathan at cmu.edu> wrote:
> > I have approached these cases by using the backslash line-continuation
> > operator:
> >
> > with FakeContext("a") as a, \
> >   FakeContext("b") as b:
> >   pass
>
> I'm in the "\ is evil" at all costs camp [...]

I agree, especially when we have a much neater continuation mechanism
that could work just fine here::

    with (FakeContext("a") as a,
             FakeContext("b") as b):
         pass

-- 
 \      "[Entrenched media corporations will] maintain the status quo, |
  `\       or die trying. Either is better than actually WORKING for a |
_o__)                  living." --ringsnake.livejournal.com, 2007-11-12 |
Ben Finney



From andy at insectnation.org  Thu Sep  9 11:06:25 2010
From: andy at insectnation.org (Andy Buckley)
Date: Thu, 09 Sep 2010 10:06:25 +0100
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <AANLkTikVrUkzP-=77-86ggbK6DoXTSMZEFg7+Eokooj8@mail.gmail.com>
References: <20100908175029.6617ae3b@dino>
	<i68jkk$dnf$1@dough.gmane.org>	<AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>
	<AANLkTikVrUkzP-=77-86ggbK6DoXTSMZEFg7+Eokooj8@mail.gmail.com>
Message-ID: <4C88A391.5070209@insectnation.org>

On 09/09/10 00:47, Mike Graham wrote:
> On Wed, Sep 8, 2010 at 5:30 PM, Nick Coghlan <ncoghlan at gmail.com> wrote:
>> I've sometimes wondered if we should consider the idea of making line
>> continuation implicit between keywords and their associated colons.
> 
> This would also have the nice aesthetic quality of making colons serve
> a purpose.

Good point! I'm regularly niggled that backslash continuations are
needed for long conditional statements where parentheses are not
logically necessary (and look disturbingly unpythonic). There's no
ambiguity in allowing statements to extend until the colon, particularly
if Greg's "at least as far" indentation rule is applied. +1 from me.

Andy



From g.brandl at gmx.net  Thu Sep  9 14:08:25 2010
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 09 Sep 2010 14:08:25 +0200
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <4C881A13.4060709@canterbury.ac.nz>
References: <20100908175029.6617ae3b@dino>
	<i68jkk$dnf$1@dough.gmane.org>	<AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>	<4C880B67.5070607@mrabarnett.plus.com>
	<4C881A13.4060709@canterbury.ac.nz>
Message-ID: <i6aiuj$k2r$1@dough.gmane.org>

On 09.09.2010 01:19, Greg Ewing wrote:
> MRAB wrote:
>> On 08/09/2010 22:30, Nick Coghlan wrote:
>> 
>>> I've sometimes wondered if we should consider the idea of making line
>>> continuation implicit between keywords and their associated colons.
>>>
>> If a colon was omitted by mistake, how much later would the parser
>> report a syntax error?
> 
> It might be best to allow this only if the continuation
> lines are indented at least as far as the starting line.

That is dangerous; it makes the whitespace rules more complicated.

Georg

-- 
Thus spake the Lord: Thou shalt indent with four spaces. No more, no less.
Four shall be the number of spaces thou shalt indent, and the number of thy
indenting shall be four. Eight shalt thou not indent, nor either indent thou
two, excepting that thou then proceed to four. Tabs are right out.



From g.brandl at gmx.net  Thu Sep  9 14:14:50 2010
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 09 Sep 2010 14:14:50 +0200
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <20100909064951.1e1b4df3@dino>
References: <20100908175029.6617ae3b@dino>	<AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>
	<20100909064951.1e1b4df3@dino>
Message-ID: <i6ajal$maa$1@dough.gmane.org>

On 09.09.2010 07:49, Mark Summerfield wrote:
> Hi Nathan,
> 
> On Wed, 8 Sep 2010 13:00:25 -0400
> Nathan Schneider <nathan at cmu.edu> wrote:
>> Mark,
>> 
>> I have approached these cases by using the backslash
>> line-continuation operator:
>> 
>> with FakeContext("a") as a, \
>>    FakeContext("b") as b:
>>    pass
> 
> Yes, of course, and that's the way I've done it. But it seems a pity to
> do it this way when the documentation explicitly discourages the use of
> the backslash for line continuation:
> http://docs.python.org/py3k/howto/doanddont.html
> (look at the very last item)

Which is actually factually incorrect and should be rewritten.  The only
situation where stray whitespace after a backslash is valid syntax is
within a string literal (and there, there is no alternative).

So at least the "stray whitespace leads to silently buggy code" reason
not to use backslashes is wrong.
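
(A quick check of that claim, runnable as-is:)

# A backslash followed by stray whitespace is an immediate SyntaxError
# outside a string literal -- nothing fails silently:
try:
    compile("x = 1 + \\ \n2\n", "<demo>", "exec")
except SyntaxError as e:
    print("rejected:", e.msg)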

Georg

-- 
Thus spake the Lord: Thou shalt indent with four spaces. No more, no less.
Four shall be the number of spaces thou shalt indent, and the number of thy
indenting shall be four. Eight shalt thou not indent, nor either indent thou
two, excepting that thou then proceed to four. Tabs are right out.



From g.brandl at gmx.net  Thu Sep  9 14:17:37 2010
From: g.brandl at gmx.net (Georg Brandl)
Date: Thu, 09 Sep 2010 14:17:37 +0200
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <87k4mv9wqt.fsf@benfinney.id.au>
References: <20100908175029.6617ae3b@dino>	<AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>	<AANLkTimG_uqaJYjpNRN1BM4xzp+gOnDBV7Eodiev+tK8@mail.gmail.com>
	<87k4mv9wqt.fsf@benfinney.id.au>
Message-ID: <i6ajfr$maa$3@dough.gmane.org>

On 09.09.2010 09:55, Ben Finney wrote:
> "Gregory P. Smith" <greg at krypto.org>
> writes:
> 
>> On Wed, Sep 8, 2010 at 10:00 AM, Nathan Schneider <nathan at cmu.edu> wrote:
>> > I have approached these cases by using the backslash line-continuation
>> > operator:
>> >
>> > with FakeContext("a") as a, \
>> >   FakeContext("b") as b:
>> >   pass
>>
>> I'm in the "\ is evil" at all costs camp [...]
> 
> I agree, especially when we have a much neater continuation mechanism
> that could work just fine here::
> 
>     with (FakeContext("a") as a,
>           FakeContext("b") as b):
>         pass

No, it could not work just fine.  You are basically banning tuples from the
context expression (remember that the "as" clause is optional).

Maybe one could argue that this is not a problem because tuples are not
context managers anyway, but how would this work then:

i = 0 or 1
with (a, b)[i]:

Georg

-- 
Thus spake the Lord: Thou shalt indent with four spaces. No more, no less.
Four shall be the number of spaces thou shalt indent, and the number of thy
indenting shall be four. Eight shalt thou not indent, nor either indent thou
two, excepting that thou then proceed to four. Tabs are right out.



From ncoghlan at gmail.com  Thu Sep  9 14:53:37 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Thu, 9 Sep 2010 22:53:37 +1000
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <i6aiuj$k2r$1@dough.gmane.org>
References: <20100908175029.6617ae3b@dino> <i68jkk$dnf$1@dough.gmane.org>
	<AANLkTineJgzc+Sd3n=fNfKZZpykh=0E_=3NFy-dNCXJ+@mail.gmail.com>
	<4C880B67.5070607@mrabarnett.plus.com>
	<4C881A13.4060709@canterbury.ac.nz> <i6aiuj$k2r$1@dough.gmane.org>
Message-ID: <AANLkTimcrM4y0-9gTtR_9g1H7kY09rjTUnu7L9jYCcod@mail.gmail.com>

On Thu, Sep 9, 2010 at 10:08 PM, Georg Brandl <g.brandl at gmx.net> wrote:
> On 09.09.2010 01:19, Greg Ewing wrote:
>> MRAB wrote:
>>> On 08/09/2010 22:30, Nick Coghlan wrote:
>>>
>>>> I've sometimes wondered if we should consider the idea of making line
>>>> continuation implicit between keywords and their associated colons.
>>>>
>>> If a colon was omitted by mistake, how much later would the parser
>>> report a syntax error?
>>
>> It might be best to allow this only if the continuation
>> lines are indented at least as far as the starting line.
>
> That is dangerous, it makes the whitespace rules more complicated.

I'm actually not sure it is even *possible* in general to implement my
suggestion given the deliberate limitations of Python's parser.
Parentheses normally work their indentation-ignoring magic by dropping
down into expression evaluation scope where indentation isn't
significant (import is a special case where this doesn't quite happen,
but it's a rather constrained one).

This is definitely a wart in the with statement syntax, but it really
isn't clear how best to resolve it.

You can at least use parentheses in the individual context
expressions, even though you can't wrap the whole thing:

>>> from contextlib import contextmanager
>>> @contextmanager
... def FakeContext(a):
...   yield a
...
>>> with FakeContext(1) as x, (
...      FakeContext(2)) as y:
...   print(x, y)
...
1 2


Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From grosser.meister.morti at gmx.net  Thu Sep  9 15:02:24 2010
From: grosser.meister.morti at gmx.net (Mathias Panzenböck)
Date: Thu, 09 Sep 2010 15:02:24 +0200
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <i6ajfr$maa$3@dough.gmane.org>
References: <20100908175029.6617ae3b@dino>	<AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>	<AANLkTimG_uqaJYjpNRN1BM4xzp+gOnDBV7Eodiev+tK8@mail.gmail.com>	<87k4mv9wqt.fsf@benfinney.id.au>
	<i6ajfr$maa$3@dough.gmane.org>
Message-ID: <4C88DAE0.9070607@gmx.net>

On 09/09/2010 02:17 PM, Georg Brandl wrote:
> On 09.09.2010 09:55, Ben Finney wrote:
>> "Gregory P. Smith"<greg at krypto.org>
>> writes:
>>
>>> On Wed, Sep 8, 2010 at 10:00 AM, Nathan Schneider<nathan at cmu.edu>  wrote:
>>>> I have approached these cases by using the backslash line-continuation
>>>> operator:
>>>>
>>>> with FakeContext("a") as a, \
>>>>    FakeContext("b") as b:
>>>>    pass
>>>
>>> I'm in the "\ is evil" at all costs camp [...]
>>
>> I agree, especially when we have a much neater continuation mechanism
>> that could work just fine here::
>>
>>      with (FakeContext("a") as a,
>>            FakeContext("b") as b):
>>          pass
>
> No, it could not work just fine.  You are basically banning tuples from the
> context expression (remember that the "as" clause is optional).
>
> Maybe one could argue that this is not a problem because tuples are not
> context managers anyway, but how would this work then:
>
> i = 0 or 1
> with (a, b)[i]:
>
> Georg
>

Just write:
with ((a, b)[i]):

It's ugly but it would work. ;)

	-panzi


From mal at egenix.com  Thu Sep  9 15:32:15 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Thu, 09 Sep 2010 15:32:15 +0200
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <20100908175029.6617ae3b@dino>
References: <20100908175029.6617ae3b@dino>
Message-ID: <4C88E1DF.3090502@egenix.com>

Mark Summerfield wrote:
> Hi,
> 
> I can't see a _nice_ way of splitting a with statement over multiple
> lines:
>
> class FakeContext:
>     def __init__(self, name):
>         self.name = name
>     def __enter__(self):
>         print("enter", self.name)
>     def __exit__(self, *args):
>         print("exit", self.name)
>
> with FakeContext("a") as a, FakeContext("b") as b:
>     pass # works fine
>
>
> with FakeContext("a") as a,
>      FakeContext("b") as b:
>     pass # syntax error
>
>
> with (FakeContext("a") as a,
>       FakeContext("b") as b):
>     pass # syntax error
>
> The use case where this mattered to me was this:
>
>     with open(args.actual, encoding="utf-8") as afh,
>          open(args.expected, encoding="utf-8") as efh:
>         actual = [line.rstrip("\n\r") for line in afh.readlines()]
>         expected = [line.rstrip("\n\r") for line in efh.readlines()]
> 
> Naturally, I could split the line in an ugly place:
> 
>     with open(args.actual, encoding="utf-8") as afh, open(args.expected,
> 	    encoding="utf-8") as efh:
> 
> but it seems a shame to do so. Or am I missing something?

Why do you need to put everything on one line?

afh = open(args.actual, encoding="utf-8")
efh = open(args.expected, encoding="utf-8")

with afh, efh:
   ...

In the context of files, the only purpose of the with statement
is to close them when leaving the block.

>>> a = open('/etc/passwd')
>>> b = open('/etc/group')
>>> with a,b: print a.readline(), b.readline()
...
at:x:25:25:Batch jobs daemon:/var/spool/atjobs:/bin/bash
at:!:25:

>>> a
<closed file '/etc/passwd', mode 'r' at 0x7f0093e62390>
>>> b
<closed file '/etc/group', mode 'r' at 0x7f0093e62420>

-- 
Marc-Andre Lemburg
eGenix.com

Professional Python Services directly from the Source  (#1, Sep 09 2010)
>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
________________________________________________________________________
2010-08-19: Released mxODBC 3.1.0              http://python.egenix.com/
2010-09-15: DZUG Tagung, Dresden, Germany                   6 days to go

::: Try our new mxODBC.Connect Python Database Interface for free ! ::::


   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
           Registered at Amtsgericht Duesseldorf: HRB 46611
               http://www.egenix.com/company/contact/


From fuzzyman at voidspace.org.uk  Thu Sep  9 15:41:52 2010
From: fuzzyman at voidspace.org.uk (Michael Foord)
Date: Thu, 9 Sep 2010 14:41:52 +0100
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <4C88E1DF.3090502@egenix.com>
References: <20100908175029.6617ae3b@dino>
	<4C88E1DF.3090502@egenix.com>
Message-ID: <AANLkTi=sHcw1T7+e=R3ZCKQTOrO_BjMkLCFTFQu-Yu5a@mail.gmail.com>

On 9 September 2010 14:32, M.-A. Lemburg <mal at egenix.com> wrote:

> [snip...]
> Why do you need to put everything on one line ?
>
> afh = open(args.actual, encoding="utf-8")
> efh = open(args.expected, encoding="utf-8")
>
> with afh, efh:
>   ...
>
> In the context of files, the only purpose of the with statement
> is to close them when leaving the block.
>
> >>> a = open('/etc/passwd')
> >>> b = open('/etc/group')
>

If my understanding is correct (which is perhaps unlikely...), using a
single line will close a if opening b fails. Whereas doing them separately
before the with statement risks leaving the first un-exited if creating the
second fails.
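
A minimal sketch of the difference (hypothetical paths; assumes path_a
opens successfully while path_b does not):

def one_line(path_a, path_b):
    # The first file is already "entered" before open(path_b) runs,
    # so its __exit__ closes it when the second open() raises.
    try:
        with open(path_a) as a, open(path_b) as b:
            pass
    except IOError:
        print("first file closed?", a.closed)   # True

def two_step(path_a, path_b):
    # The second open() raises before any with block is entered,
    # so the first file stays open until the GC reclaims it.
    a = open(path_a)
    b = open(path_b)
    with a, b:
        pass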

Michael


> >>> with a,b: print a.readline(), b.readline()
> ...
> at:x:25:25:Batch jobs daemon:/var/spool/atjobs:/bin/bash
> at:!:25:
>
> >>> a
> <closed file '/etc/passwd', mode 'r' at 0x7f0093e62390>
> >>> b
> <closed file '/etc/group', mode 'r' at 0x7f0093e62420>
>
> --
> Marc-Andre Lemburg
> eGenix.com
>
> Professional Python Services directly from the Source  (#1, Sep 09 2010)
> >>> Python/Zope Consulting and Support ...        http://www.egenix.com/
> >>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
> >>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
> ________________________________________________________________________
> 2010-08-19: Released mxODBC 3.1.0              http://python.egenix.com/
> 2010-09-15: DZUG Tagung, Dresden, Germany                   6 days to go
>
> ::: Try our new mxODBC.Connect Python Database Interface for free ! ::::
>
>
>   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
>    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
>           Registered at Amtsgericht Duesseldorf: HRB 46611
>               http://www.egenix.com/company/contact/
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>



-- 
http://www.voidspace.org.uk

From mal at egenix.com  Thu Sep  9 15:53:49 2010
From: mal at egenix.com (M.-A. Lemburg)
Date: Thu, 09 Sep 2010 15:53:49 +0200
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <AANLkTi=sHcw1T7+e=R3ZCKQTOrO_BjMkLCFTFQu-Yu5a@mail.gmail.com>
References: <20100908175029.6617ae3b@dino>	<4C88E1DF.3090502@egenix.com>
	<AANLkTi=sHcw1T7+e=R3ZCKQTOrO_BjMkLCFTFQu-Yu5a@mail.gmail.com>
Message-ID: <4C88E6ED.8000807@egenix.com>

Michael Foord wrote:
> On 9 September 2010 14:32, M.-A. Lemburg <mal at egenix.com> wrote:
> 
>> [snip...]
>> Why do you need to put everything on one line ?
>>
>> afh = open(args.actual, encoding="utf-8")
>> efh = open(args.expected, encoding="utf-8")
>>
>> with afh, efh:
>>   ...
>>
>> In the context of files, the only purpose of the with statement
>> is to close them when leaving the block.
>>
>>>>> a = open('/etc/passwd')
>>>>> b = open('/etc/group')
>>
> 
> If my understanding is correct (which is perhaps unlikely...), using a
> single line will close a if opening b fails. Whereas doing them separately
> before the with statement risks leaving the first un-exited if creating the
> second fails.

Right, but if you stuff everything on a single line, your
error handling will have a hard time figuring out which of
the two failed to open.
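
A sketch of what that separate error handling could look like (args as
in Mark's example; the messages and exit policy are illustrative only):

import sys

try:
    afh = open(args.actual, encoding="utf-8")
except IOError as e:
    sys.exit("cannot open actual file: %s" % e)
try:
    efh = open(args.expected, encoding="utf-8")
except IOError as e:
    afh.close()   # don't leak the first file
    sys.exit("cannot open expected file: %s" % e)

with afh, efh:
    ...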

I was under the impression that Mark wanted to "protect" the
inner block of the with statement, not the context manager
creation itself.

As usual: hiding away too much stuff in your closet makes things
look tidy, but causes a hell of a mess if you ever need to open
it again :-)

> Michael
> 
> 
>>>>> with a,b: print a.readline(), b.readline()
>> ...
>> at:x:25:25:Batch jobs daemon:/var/spool/atjobs:/bin/bash
>> at:!:25:
>>
>>>>> a
>> <closed file '/etc/passwd', mode 'r' at 0x7f0093e62390>
>>>>> b
>> <closed file '/etc/group', mode 'r' at 0x7f0093e62420>
>>
>> --
>> Marc-Andre Lemburg
>> eGenix.com
>>
>> Professional Python Services directly from the Source  (#1, Sep 09 2010)
>>>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
>>>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
>>>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
>> ________________________________________________________________________
>> 2010-08-19: Released mxODBC 3.1.0              http://python.egenix.com/
>> 2010-09-15: DZUG Tagung, Dresden, Germany                   6 days to go
>>
>> ::: Try our new mxODBC.Connect Python Database Interface for free ! ::::
>>
>>
>>   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
>>    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
>>           Registered at Amtsgericht Duesseldorf: HRB 46611
>>               http://www.egenix.com/company/contact/
>> _______________________________________________
>> Python-ideas mailing list
>> Python-ideas at python.org
>> http://mail.python.org/mailman/listinfo/python-ideas
>>
> 
> 
> 

-- 
Marc-Andre Lemburg
eGenix.com

Professional Python Services directly from the Source  (#1, Sep 09 2010)
>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
________________________________________________________________________
2010-08-19: Released mxODBC 3.1.0              http://python.egenix.com/
2010-09-15: DZUG Tagung, Dresden, Germany                   6 days to go

::: Try our new mxODBC.Connect Python Database Interface for free ! ::::


   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
           Registered at Amtsgericht Duesseldorf: HRB 46611
               http://www.egenix.com/company/contact/


From mark at qtrac.eu  Thu Sep  9 16:13:54 2010
From: mark at qtrac.eu (Mark Summerfield)
Date: Thu, 9 Sep 2010 15:13:54 +0100
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <4C88E6ED.8000807@egenix.com>
References: <20100908175029.6617ae3b@dino> <4C88E1DF.3090502@egenix.com>
	<AANLkTi=sHcw1T7+e=R3ZCKQTOrO_BjMkLCFTFQu-Yu5a@mail.gmail.com>
	<4C88E6ED.8000807@egenix.com>
Message-ID: <20100909151354.6d0ce7a8@dino>

On Thu, 09 Sep 2010 15:53:49 +0200
"M.-A. Lemburg" <mal at egenix.com> wrote:
> Michael Foord wrote:
> > On 9 September 2010 14:32, M.-A. Lemburg <mal at egenix.com> wrote:
> > 
> >> [snip...]
> >> Why do you need to put everything on one line ?
> >>
> >> afh = open(args.actual, encoding="utf-8")
> >> efh = open(args.expected, encoding="utf-8")
> >>
> >> with afh, efh:
> >>   ...
> >>
> >> In the context of files, the only purpose of the with statement
> >> is to close them when leaving the block.
> >>
> >>>>> a = open('/etc/passwd')
> >>>>> b = open('/etc/group')
> >>
> > 
> > If my understanding is correct (which is perhaps unlikely...),
> > using a single line will close a if opening b fails. Whereas doing
> > them separately before the with statement risks leaving the first
> > un-exited if creating the second fails.
> 
> Right, but if you stuff everything on a single line, your
> error handling will have a hard time figuring out which of
> the two failed to open.
> 
> I was under the impression that Mark wanted to "protect" the
> inner block of the with statement, not the context manager
> creation itself.

Actually, I was more interested in the aesthetics. I've become
habituated to _never_ using \ continuations and found it unsightly to
need one here.

> As usual: hiding away too much stuff in your closet makes things
> look tidy, but causes a hell of a mess if you ever need to open
> it again :-)

:-)

> 
> > Michael
> > 
> > 
> >>>>> with a,b: print a.readline(), b.readline()
> >> ...
> >> at:x:25:25:Batch jobs daemon:/var/spool/atjobs:/bin/bash
> >> at:!:25:
> >>
> >>>>> a
> >> <closed file '/etc/passwd', mode 'r' at 0x7f0093e62390>
> >>>>> b
> >> <closed file '/etc/group', mode 'r' at 0x7f0093e62420>
> >>
> >> --
> >> Marc-Andre Lemburg
> >> eGenix.com
> >>
> >> Professional Python Services directly from the Source  (#1, Sep 09
> >> 2010)
> >>>>> Python/Zope Consulting and Support ...
> >>>>> http://www.egenix.com/
> >>>>> mxODBC.Zope.Database.Adapter ...
> >>>>> http://zope.egenix.com/ mxODBC, mxDateTime,
> >>>>> mxTextTools ...        http://python.egenix.com/
> >> ________________________________________________________________________
> >> 2010-08-19: Released mxODBC 3.1.0
> >> http://python.egenix.com/ 2010-09-15
> >> <http://python.egenix.com/%0A2010-09-15>: DZUG Tagung, Dresden,
> >> Germany                   6 days to go
> >>
> >> ::: Try our new mxODBC.Connect Python Database Interface for
> >> free ! ::::
> >>
> >>
> >>   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
> >>    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
> >>           Registered at Amtsgericht Duesseldorf: HRB 46611
> >>               http://www.egenix.com/company/contact/
> >> _______________________________________________
> >> Python-ideas mailing list
> >> Python-ideas at python.org
> >> http://mail.python.org/mailman/listinfo/python-ideas
> >>
> > 
> > 
> > 
> 



-- 
Mark Summerfield, Qtrac Ltd, www.qtrac.eu
    C++, Python, Qt, PyQt - training and consultancy
        "Programming in Python 3" - ISBN 0321680561
            http://www.qtrac.eu/py3book.html


From fuzzyman at voidspace.org.uk  Thu Sep  9 16:34:25 2010
From: fuzzyman at voidspace.org.uk (Michael Foord)
Date: Thu, 9 Sep 2010 15:34:25 +0100
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <4C88E6ED.8000807@egenix.com>
References: <20100908175029.6617ae3b@dino> <4C88E1DF.3090502@egenix.com>
	<AANLkTi=sHcw1T7+e=R3ZCKQTOrO_BjMkLCFTFQu-Yu5a@mail.gmail.com>
	<4C88E6ED.8000807@egenix.com>
Message-ID: <AANLkTimZ1upOMqSCwtgcApJw4Aaz39DW00pos0Yv1oFY@mail.gmail.com>

On 9 September 2010 14:53, M.-A. Lemburg <mal at egenix.com> wrote:

> Michael Foord wrote:
> > On 9 September 2010 14:32, M.-A. Lemburg <mal at egenix.com> wrote:
> >
> >> [snip...]
> >> Why do you need to put everything on one line ?
> >>
> >> afh = open(args.actual, encoding="utf-8")
> >> efh = open(args.expected, encoding="utf-8")
> >>
> >> with afh, efh:
> >>   ...
> >>
> >> In the context of files, the only purpose of the with statement
> >> is to close them when leaving the block.
> >>
> >>>>> a = open('/etc/passwd')
> >>>>> b = open('/etc/group')
> >>
> >
> > If my understanding is correct (which is perhaps unlikely...), using a
> > single line will close a if opening b fails. Whereas doing them
> > separately before the with statement risks leaving the first un-exited
> > if creating the second fails.
>
> Right, but if you stuff everything on a single line, your
> error handling will have a hard time figuring out which of
> the two failed to open.
>

If you *need* to distinguish at a higher level, then you have no choice. I
was really just pointing out that there are *semantic* differences as well,
and in fact the code you posted is less safe than the one-line version. You
lose some of the error handling built-in to context manager creation.

Michael


>
> I was under the impression that Mark wanted to "protect" the
> inner block of the with statement, not the context manager
> creation itself.
>
> As usual: hiding away too much stuff in your closet makes things
> look tidy, but causes a hell of a mess if you ever need to open
> it again :-)
>
> > Michael
> >
> >
> >>>>> with a,b: print a.readline(), b.readline()
> >> ...
> >> at:x:25:25:Batch jobs daemon:/var/spool/atjobs:/bin/bash
> >> at:!:25:
> >>
> >>>>> a
> >> <closed file '/etc/passwd', mode 'r' at 0x7f0093e62390>
> >>>>> b
> >> <closed file '/etc/group', mode 'r' at 0x7f0093e62420>
> >>
> >> --
> >> Marc-Andre Lemburg
> >> eGenix.com
> >>
> >> Professional Python Services directly from the Source  (#1, Sep 09 2010)
> >>>>> Python/Zope Consulting and Support ...        http://www.egenix.com/
> >>>>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
> >>>>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
> >> ________________________________________________________________________
> >> 2010-08-19: Released mxODBC 3.1.0              http://python.egenix.com/
> >> 2010-09-15: DZUG Tagung, Dresden, Germany                   6 days to go
> >>
> >> ::: Try our new mxODBC.Connect Python Database Interface for free ! ::::
> >>
> >>
> >>   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
> >>    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
> >>           Registered at Amtsgericht Duesseldorf: HRB 46611
> >>               http://www.egenix.com/company/contact/
> >> _______________________________________________
> >> Python-ideas mailing list
> >> Python-ideas at python.org
> >> http://mail.python.org/mailman/listinfo/python-ideas
> >>
> >
> >
> >
>
> --
> Marc-Andre Lemburg
> eGenix.com
>
> Professional Python Services directly from the Source  (#1, Sep 09 2010)
> >>> Python/Zope Consulting and Support ...        http://www.egenix.com/
> >>> mxODBC.Zope.Database.Adapter ...             http://zope.egenix.com/
> >>> mxODBC, mxDateTime, mxTextTools ...        http://python.egenix.com/
> ________________________________________________________________________
> 2010-08-19: Released mxODBC 3.1.0              http://python.egenix.com/
> 2010-09-15: DZUG Tagung, Dresden, Germany                   6 days to go
>
> ::: Try our new mxODBC.Connect Python Database Interface for free ! ::::
>
>
>   eGenix.com Software, Skills and Services GmbH  Pastor-Loeh-Str.48
>    D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg
>           Registered at Amtsgericht Duesseldorf: HRB 46611
>               http://www.egenix.com/company/contact/
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>



-- 
http://www.voidspace.org.uk

From tjreedy at udel.edu  Thu Sep  9 22:55:15 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Thu, 09 Sep 2010 16:55:15 -0400
Subject: [Python-ideas] with statement syntax forces ugly line breaks?
In-Reply-To: <i6ajal$maa$1@dough.gmane.org>
References: <20100908175029.6617ae3b@dino>	<AANLkTinm-P0yz01P-iU7A-kNpopWJeYEkvsL47BN7h+G@mail.gmail.com>	<20100909064951.1e1b4df3@dino>
	<i6ajal$maa$1@dough.gmane.org>
Message-ID: <i6bhjj$bgr$1@dough.gmane.org>

On 9/9/2010 8:14 AM, Georg Brandl wrote:
> On 09.09.2010 07:49, Mark Summerfield wrote:
>> Hi Nathan,
>>
>> On Wed, 8 Sep 2010 13:00:25 -0400
>> Nathan Schneider<nathan at cmu.edu>  wrote:
>>> Mark,
>>>
>>> I have approached these cases by using the backslash
>>> line-continuation operator:
>>>
>>> with FakeContext("a") as a, \
>>>     FakeContext("b") as b:
>>>     pass

Adding a space after the backslash makes the line above a SyntaxError.
No silent error here.
>>
>> Yes, of course, and that's the way I've done it. But it seems a pity to
>> do it this way when the documentation explicitly discourages the use of
>> the backslash for line continuation:
>> http://docs.python.org/py3k/howto/doanddont.html
>> (look at the very last item)

If no one uses \ for the end-of-line escape, it should be removed ...
But I am not suggesting that.

> Which is actually factually incorrect and should be rewritten.  The only
> situation where stray whitespace after a backslash is valid syntax is
> within a string literal (and there, there is no alternative).
>
> So at least the "stray whitespace leads to silently buggy code" reason
> not to use backslashes is wrong.
>
> Georg
>


-- 
Terry Jan Reedy



From cool-rr at cool-rr.com  Fri Sep 10 18:37:44 2010
From: cool-rr at cool-rr.com (cool-RR)
Date: Fri, 10 Sep 2010 18:37:44 +0200
Subject: [Python-ideas] Why not f(*my_list, *my_other_list) ?
Message-ID: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>

I noticed that it's impossible to call a Python function with two starred
argument lists, like this: `f(*my_list, *my_other_list)`. I mean, if someone
wants to feed two lists of arguments into a function, why not?

I understand why you can't have two stars in a function definition; but why
can't you have two (or more) stars in a function call?


Ram.

From python at mrabarnett.plus.com  Fri Sep 10 18:54:33 2010
From: python at mrabarnett.plus.com (MRAB)
Date: Fri, 10 Sep 2010 17:54:33 +0100
Subject: [Python-ideas] Why not f(*my_list, *my_other_list) ?
In-Reply-To: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
References: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
Message-ID: <4C8A62C9.1040206@mrabarnett.plus.com>

On 10/09/2010 17:37, cool-RR wrote:
> I noticed that it's impossible to call a Python function with two
> starred argument lists, like this: `f(*my_list, *my_other_list)`. I
> mean, if someone wants to feed two lists of arguments into a function,
> why not?
>
> I understand why you can't have two stars in a function definition; but
> why can't you have two (or more) stars in a function call?
>
Would there be any advantage over `f(*(my_list + my_other_list))`?

(Sent to the wrong list originally :-()


From benjamin at python.org  Fri Sep 10 19:03:20 2010
From: benjamin at python.org (Benjamin Peterson)
Date: Fri, 10 Sep 2010 17:03:20 +0000 (UTC)
Subject: [Python-ideas] Why not f(*my_list,*my_other_list) ?
References: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
Message-ID: <loom.20100910T190239-534@post.gmane.org>

cool-RR <cool-rr at ...> writes:

> 
> I noticed that it's impossible to call a Python function with two starred
> argument lists, like this: `f(*my_list, *my_other_list)`. I mean, if someone
> wants to feed two lists of arguments into a function, why not?

Okay, so why would you want to?



From phd at phd.pp.ru  Fri Sep 10 18:57:13 2010
From: phd at phd.pp.ru (Oleg Broytman)
Date: Fri, 10 Sep 2010 20:57:13 +0400
Subject: [Python-ideas] Why not f(*my_list, *my_other_list) ?
In-Reply-To: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
References: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
Message-ID: <20100910165713.GA24612@phd.pp.ru>

On Fri, Sep 10, 2010 at 06:37:44PM +0200, cool-RR wrote:
> f(*my_list, *my_other_list)

   Not every one-liner needs its own syntax. Just call

f(*(my_list + my_other_list))

Oleg.
-- 
     Oleg Broytman            http://phd.pp.ru/            phd at phd.pp.ru
           Programmers don't die, they just GOSUB without RETURN.


From stefan_ml at behnel.de  Fri Sep 10 19:16:52 2010
From: stefan_ml at behnel.de (Stefan Behnel)
Date: Fri, 10 Sep 2010 19:16:52 +0200
Subject: [Python-ideas] Why not f(*my_list,*my_other_list) ?
In-Reply-To: <loom.20100910T190239-534@post.gmane.org>
References: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
	<loom.20100910T190239-534@post.gmane.org>
Message-ID: <i6dp64$3lc$2@dough.gmane.org>

Benjamin Peterson, 10.09.2010 19:03:
> cool-RR<cool-rr at ...>  writes:
>
>>
>> I noticed that it's impossible to call a Python function with two starred
>> argument lists, like this: `f(*my_list, *my_other_list)`. I mean, if someone
>> wants to feed two lists of arguments into a function, why not?
>
> Okay, so why would you want to?

Well, it can happen. It doesn't merit a syntax extension, though. You can 
just do

     args_for_f = tuple(my_list) + tuple(my_other_list)

     f(*args_for_f)

(using tuple() here in case both are not really lists)

Stefan



From daniel at stutzbachenterprises.com  Fri Sep 10 19:34:42 2010
From: daniel at stutzbachenterprises.com (Daniel Stutzbach)
Date: Fri, 10 Sep 2010 12:34:42 -0500
Subject: [Python-ideas] Why not f(*my_list,*my_other_list) ?
In-Reply-To: <i6dp64$3lc$2@dough.gmane.org>
References: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
	<loom.20100910T190239-534@post.gmane.org>
	<i6dp64$3lc$2@dough.gmane.org>
Message-ID: <AANLkTimaWf0jMQKfr+WxTBbmarD9CL=sM2t6W4uLCPcJ@mail.gmail.com>

On Fri, Sep 10, 2010 at 12:16 PM, Stefan Behnel <stefan_ml at behnel.de> wrote:

>    args_for_f = tuple(my_list) + tuple(my_other_list)
>    f(*args_for_f)
>

An alternative with better performance is:

from itertools import chain
f(*chain(my_list, my_other_list))
--
Daniel Stutzbach, Ph.D.
President, Stutzbach Enterprises, LLC <http://stutzbachenterprises.com>

From sergio at gruposinternet.com.br  Fri Sep 10 19:43:30 2010
From: sergio at gruposinternet.com.br (Sérgio Surkamp)
Date: Fri, 10 Sep 2010 14:43:30 -0300
Subject: [Python-ideas] Why not f(*my_list, *my_other_list) ?
In-Reply-To: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
References: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
Message-ID: <20100910144330.640866f2@icedearth.corp.grupos.com.br>

On Fri, 10 Sep 2010 18:37:44 +0200
cool-RR <cool-rr at cool-rr.com> wrote:

> I noticed that it's impossible to call a Python function with two
> starred argument lists, like this: `f(*my_list, *my_other_list)`. I
> mean, if someone wants to feed two lists of arguments into a
> function, why not?
> 
> I understand why you can't have two stars in a function definition;
> but why can't you have two (or more) stars in a function call?
> 
> 
> Ram.

How should the compiler treat that? Put half of the arguments in the
first list and the other half in the second list?

Regards,
-- 
  .:''''':.
.:'        `     Sérgio Surkamp | Network Manager
::    ........   sergio at gruposinternet.com.br
`:.        .:'
  `:,   ,.:'     *Grupos Internet S.A.*
    `: :'        R. Lauro Linhares, 2123 Torre B - Sala 201
     : :         Trindade - Florianópolis - SC
     :.'
     ::          +55 48 3234-4109
     :
     '           http://www.gruposinternet.com.br


From mikegraham at gmail.com  Fri Sep 10 21:28:09 2010
From: mikegraham at gmail.com (Mike Graham)
Date: Fri, 10 Sep 2010 15:28:09 -0400
Subject: [Python-ideas] Why not f(*my_list,*my_other_list) ?
In-Reply-To: <AANLkTimaWf0jMQKfr+WxTBbmarD9CL=sM2t6W4uLCPcJ@mail.gmail.com>
References: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
	<loom.20100910T190239-534@post.gmane.org>
	<i6dp64$3lc$2@dough.gmane.org>
	<AANLkTimaWf0jMQKfr+WxTBbmarD9CL=sM2t6W4uLCPcJ@mail.gmail.com>
Message-ID: <AANLkTinWmF2M=k0QeKq871kjU38wMYyRf6bNC40h8N8Z@mail.gmail.com>

On Fri, Sep 10, 2010 at 1:34 PM, Daniel Stutzbach
<daniel at stutzbachenterprises.com> wrote:
> An alternative with better performance is:
>
> from itertools import chain
> f(*chain(my_list, my_other_list))

Maybe.


From tjreedy at udel.edu  Fri Sep 10 23:25:35 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Fri, 10 Sep 2010 17:25:35 -0400
Subject: [Python-ideas] Why not f(*my_list, *my_other_list) ?
In-Reply-To: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
References: <AANLkTikTaNMuEFO_pnCS0FNbLHmXX-gCfLYg=RCt_c8F@mail.gmail.com>
Message-ID: <i6e7og$5ng$1@dough.gmane.org>

On 9/10/2010 12:37 PM, cool-RR wrote:
> I noticed that it's impossible to call a Python function with two
> starred argument lists, like this: `f(*my_list, *my_other_list)`. I
> mean, if someone wants to feed two lists of arguments into a function,
> why not?
>
> I understand why you can't have two stars in a function definition; but
> why can't you have two (or more) stars in a function call?

Beyond
0. Not needed (as others explained),
some speculations:

1. Calls are designed to mirror definitions. No multiple stars in
definitions means no multiple stars in calls.

2. Multiple stars begin to look like typing errors.

3. No one ever thought to support such.

4. It would make the call process even more complex, and it is slow 
enough already.

5. It might conflict with the current implementation.

-- 
Terry Jan Reedy



From guido at python.org  Sat Sep 11 01:25:04 2010
From: guido at python.org (Guido van Rossum)
Date: Fri, 10 Sep 2010 16:25:04 -0700
Subject: [Python-ideas] [Python-Dev] Python needs a standard
	asynchronous return object
In-Reply-To: <4C8AB874.9010703@openvpn.net>
References: <4C8AB874.9010703@openvpn.net>
Message-ID: <AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>

Moving to python-ideas.

Have you seen http://www.python.org/dev/peps/pep-3148/ ? That seems
exactly what you want.

--Guido

On Fri, Sep 10, 2010 at 4:00 PM, James Yonan <james at openvpn.net> wrote:
> I'd like to propose that the Python community standardize on a "deferred"
> object for asynchronous return values, modeled after the well-thought-out
> Twisted Deferred class.
>
> With more and more Python libraries implementing asynchronicity (for example
> Futures -- PEP 3148), it's crucial to have a standard deferred object in
> place so that code using a single asynchronous reactor can interoperate with
> different asynchronous libraries.
>
> I think a lot of people don't realize how much cooler and more elegant it is
> to return a deferred object from an asynchronous function rather than using
> a generic callback approach (where you pass a function argument to the
> asynchronous function telling it where to call when the asynchronous
> operation completes).
>
> While asynchronous systems have been shown to have excellent scalability
> properties, the callback-based programming style often used in asynchronous
> programming has been criticized for breaking up the sequential readability
> of program logic.
>
> This problem is elegantly addressed by using Deferred Generators.  Since
> Python 2.5 added enhanced generators (i.e. the capability for "yield" to
> return a value), the infrastructure is now in place to allow an asynchronous
> function to be written in a sequential style, without the use of explicit
> callbacks.
>
> See the following blog article for a nice write-up on the capability:
>
> http://blog.mekk.waw.pl/archives/14-Twisted-inlineCallbacks-and-deferredGenerator.html
>
> Mekk's Twisted Deferred example:
>
> @defer.inlineCallbacks
> def someFunction():
>     a = 1
>     b = yield deferredReturningFunction(a)
>     c = yield anotherDeferredReturningFunction(a, b)
>     defer.returnValue(c)
>
> What's cool about this is that between the two yield statements, the Twisted
> reactor is in control meaning that other pending asynchronous tasks can be
> attended to or the thread's remaining time slice can be yielded to the
> kernel, yet this is all accomplished without the use of multi-threading.
> Another interesting aspect of this approach is that since it leverages on
> Python's enhanced generators, an exception thrown inside either of the
> deferred-returning functions will be propagated through to someFunction()
> where it can be handled with try/except.
>
> Think about what this means -- this sort of emulates the "stackless" design
> pattern you would expect in Erlang or Stackless Python without leaving
> standard Python.  And it's made possible under the hood by Python Enhanced
> Generators.
>
> Needless to say, it would be great to see this coolness be part of the
> standard Python library, instead of having every Python asynchronous library
> implement its own ad-hoc callback system.
>
> James Yonan
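
To make the quoted try/except point concrete, a minimal sketch using
Twisted's real inlineCallbacks decorator; fetch_page here is a stand-in
that fires its Deferreds synchronously, just so the example is
self-contained:

    from twisted.internet import defer

    def fetch_page(url):
        # Stand-in for a real asynchronous fetch: returns an already-fired
        # Deferred, carrying a Failure for the "bad" URL.
        if "bad" in url:
            return defer.fail(IOError("unreachable: %s" % url))
        return defer.succeed("<html>%s</html>" % url)

    @defer.inlineCallbacks
    def fetch_with_fallback(primary, backup):
        try:
            page = yield fetch_page(primary)   # a failure inside the
        except IOError:                        # Deferred is re-raised here
            page = yield fetch_page(backup)
        defer.returnValue(page)

    def show(page):
        print(page)   # "<html>http://backup.example/</html>"

    d = fetch_with_fallback("http://bad.example/", "http://backup.example/")
    d.addCallback(show)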



-- 
--Guido van Rossum (python.org/~guido)


From ncoghlan at gmail.com  Sat Sep 11 02:07:19 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 11 Sep 2010 10:07:19 +1000
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
Message-ID: <AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>

On Sat, Sep 11, 2010 at 9:25 AM, Guido van Rossum <guido at python.org> wrote:
> Moving to python-ideas.
>
> Have you seen http://www.python.org/dev/peps/pep-3148/ ? That seems
> exactly what you want.

James did mention that in the post, although he didn't say what
deferreds really added beyond what futures provide, and why the
"add_done_callback" method isn't adequate to provide interoperability
between futures and deferreds (which would be odd, since Brian made
changes to that part of PEP 3148 to help with that interoperability
after discussions with Glyph).

Between PEP 380 and PEP 3148 I'm not really seeing a lot more scope
for standardisation in this space though.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From jnoller at gmail.com  Sat Sep 11 18:03:12 2010
From: jnoller at gmail.com (Jesse Noller)
Date: Sat, 11 Sep 2010 09:03:12 -0700
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
	<AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
Message-ID: <AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>

On Fri, Sep 10, 2010 at 5:07 PM, Nick Coghlan <ncoghlan at gmail.com> wrote:
> On Sat, Sep 11, 2010 at 9:25 AM, Guido van Rossum <guido at python.org> wrote:
>> Moving to python-ideas.
>>
>> Have you seen http://www.python.org/dev/peps/pep-3148/ ? That seems
>> exactly what you want.
>
> James did mention that in the post, although he didn't say what
> deferreds really added beyond what futures provide, and why the
> "add_done_callback" method isn't adequate to provide interoperability
> between futures and deferreds (which would be odd, since Brian made
> changes to that part of PEP 3148 to help with that interoperability
> after discussions with Glyph).
>
> Between PEP 380 and PEP 3148 I'm not really seeing a lot more scope
> for standardisation in this space though.
>
> Cheers,
> Nick.

That was my initial reaction as well, but I'm more than open to
hearing from Jean Paul/Glyph and the other twisted folks on this.


From guido at python.org  Sun Sep 12 04:26:50 2010
From: guido at python.org (Guido van Rossum)
Date: Sat, 11 Sep 2010 19:26:50 -0700
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
	<AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
	<AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>
Message-ID: <AANLkTin7eRBcpt1K_RC=buE5BasmTDBwE_TzHr97BAyy@mail.gmail.com>

(Summary: I want to make an apology, and reopen the debate. Possibly
relevant: PEP 342, PEP 380, PEP 3148, PEP 3152.)

On Sat, Sep 11, 2010 at 9:03 AM, Jesse Noller <jnoller at gmail.com> wrote:
> On Fri, Sep 10, 2010 at 5:07 PM, Nick Coghlan <ncoghlan at gmail.com> wrote:
>> On Sat, Sep 11, 2010 at 9:25 AM, Guido van Rossum <guido at python.org> wrote:
>>> Moving to python-ideas.
>>>
>>> Have you seen http://www.python.org/dev/peps/pep-3148/ ? That seems
>>> exactly what you want.
>>
>> James did mention that in the post,

Whoops. I was a bit quick at the trigger there.

>> although he didn't say what
>> deferreds really added beyond what futures provide, and why the
>> "add_done_callback" method isn't adequate to provide interoperability
>> between futures and deferreds (which would be odd, since Brian made
>> changes to that part of PEP 3148 to help with that interoperability
>> after discussions with Glyph).
>>
>> Between PEP 380 and PEP 3148 I'm not really seeing a lot more scope
>> for standardisation in this space though.
>>
>> Cheers,
>> Nick.
>
> That was my initial reaction as well, but I'm more than open to
> hearing from Jean Paul/Glyph and the other twisted folks on this.

Re-reading the OP's post[0] and the blog[1] he references, I notice
that he did not mention PEP 380 (which for the blog's example doesn't
actually add much except adding a nicer way to return a value from a
generator) but he did mention the awesomeness of not needing threads
when using deferreds. He sounds as if the python-dev community had
never heard of that style of handling concurrency, which seems
backwards: the generator-based style of doing it was introduced in PEP
342 which enabled Twisted's inline callbacks. (Though he does mention
Python Enhanced Generators which could be an implicit reference to PEP
342 -- "Coroutines via Enhanced Generators".)

But thinking about this more I don't know that it will be easy to mix
PEP 3148, which is solidly thread-based, with a PEP 342 style
scheduler (whether or not the PEP 380 enhancements are applied, or
even PEP 3152). And if we take the OP's message at face value, his
point isn't so much that Twisted is great, but that in order to
benefit maximally from PEP 342 there needs to be a standard way of
using callbacks. I think that's probably true. And comparing the
blog's examples to PEP 3148, I find Twisted's terminology rather
confusing compared to the PEP's clean Futures API (where IMO you can
ignore almost everything except result()).

Maybe it's possible to write a little framework that lets you create
Futures using either threads, processes (both supported by PEP 3148)
or generators. But I haven't tried it. And maybe the need to use
'yield' for everything that may block when using generators, but not
when using threads or processes, will make this awkward. So maybe
we'll be stuck with at least two Future-like APIs: PEP 3148 and
something else, generator-based. Or maybe PEP 3152.
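
As a rough sketch of what the generator leg of such a framework might
look like (PEP 3148 supplies the executor and Future; the little run()
trampoline is invented here purely for illustration):

    from concurrent.futures import ThreadPoolExecutor

    executor = ThreadPoolExecutor(max_workers=4)

    def run(gen):
        # Drive a generator-based coroutine: each yielded Future resumes
        # the generator from its done-callback instead of blocking.
        def step(value=None, exc=None):
            try:
                future = gen.throw(exc) if exc else gen.send(value)
            except StopIteration:
                return
            def on_done(f):
                try:
                    result = f.result()
                except BaseException as e:
                    step(exc=e)
                else:
                    step(value=result)
            future.add_done_callback(on_done)
        step()

    def slow_double(x):
        import time
        time.sleep(0.1)
        return 2 * x

    def task():
        a = yield executor.submit(slow_double, 21)   # suspends, doesn't block
        print("got %r" % a)

    run(task())

The awkwardness shows up exactly where noted above: forget a 'yield' and
call .result() instead, and you silently block a worker thread.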

So, yes, there may be something here, and let's reopen the discussion.
And I apologize for shooting first and asking questions second.

[0] http://mail.python.org/pipermail/python-dev/2010-September/103576.html
[1] http://blog.mekk.waw.pl/archives/14-Twisted-inlineCallbacks-and-deferredGenerator.html
-- 
--Guido van Rossum (python.org/~guido)


From solipsis at pitrou.net  Sun Sep 12 13:03:38 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 12 Sep 2010 13:03:38 +0200
Subject: [Python-ideas] [Python-Dev] Python needs a standard
	asynchronous return object
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
	<AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
	<AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>
	<AANLkTin7eRBcpt1K_RC=buE5BasmTDBwE_TzHr97BAyy@mail.gmail.com>
Message-ID: <20100912130338.714643f8@pitrou.net>

On Sat, 11 Sep 2010 19:26:50 -0700
Guido van Rossum <guido at python.org> wrote:
> 
> But thinking about this more I don't know that it will be easy to mix
> PEP 3148, which is solidly thread-based, with a PEP 342 style
> scheduler (whether or not the PEP 380 enhancements are applied, or
> even PEP 3152).

I'm not sure why. The implementation is certainly thread-based, but
functions such as `wait(fs, timeout=None, return_when=ALL_COMPLETED)`
could be implemented in terms of a single-threaded event loop / job
scheduler.

Actually, Twisted has a similar primitive in DeferredList, although
more powerful since the DeferredList itself is a Deferred, and can
therefore be further combined, etc.:

http://twistedmatrix.com/documents/10.0.0/api/twisted.internet.defer.DeferredList.html

> And comparing the
> blog's examples to PEP 3148, I find Twisted's terminology rather
> confusing compared to the PEP's clean Futures API (where IMO you can
> ignore almost everything except result()).

Well, apart from the API which may be considered a taste issue (I have
used Deferreds long before I heard about Futures, so perhaps I'm a bit
biased), the following API doc in PEP 3148 shows that the Future model
of callbacks is less rich than Twisted's:

"add_done_callback(fn)

    Attaches a callable fn to the future that will be called when the
    future is cancelled or finishes running. fn will be called with the
    future as its only argument.

    Added callables are called in the order that they were added and
    are always called in a thread belonging to the process that added
    them. If the callable raises an Exception then it will be logged
    and ignored. If the callable raises another BaseException then
    behavior is not defined."

With Twisted Deferreds, when a callback or errback raises an error, its
exception isn't "logged and ignored", it is passed to the remaining
errback chain attached to the Deferred. This is part of what makes
Deferreds more complicated to understand, but it also makes them more
powerful.

Another key point is that a callback can itself return another Deferred
object, in which case the next callback (or errback, in case of error)
will be called only once the other Deferred produces a result. This is
all handled transparently and you can freely mix callbacks that
immediately return a value, and callbacks that return a Deferred whose
final value will be available later. And the other Deferred can have
its own callback/errback chain, etc.

(just for the record, the "final value" of a Deferred is the value
returned by the last callback in the chain)
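
A minimal illustration of that chaining, with an inner Deferred that has
already fired so the example stays synchronous:

    from twisted.internet.defer import Deferred, succeed

    def double(x):
        return x * 2

    def and_then_some(x):
        # A callback may return a Deferred; the outer chain waits for it
        # and hands its final value to the next callback.
        return succeed(x + 1)

    def show(x):
        print(x)

    d = Deferred()
    d.addCallback(double)
    d.addCallback(and_then_some)
    d.addCallback(show)
    d.callback(10)   # show() prints 21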


I think the main reason, though, that people find Deferreds
inconvenient is that they force you to think in terms of
asynchronicity (well, almost: you can of course hack yourself
some code which blocks until a Deferred has a value, but it's
extremely discouraged). They would like to have officially
supported methods like `result(timeout=None)` which make simple things
(like quick scripts to fetch a bunch of URLs) simpler. Twisted is
generally used for server applications where such code is out of the
question (in an async model, that is).
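
The kind of quick script meant here, in the blocking Futures idiom (the
URLs are placeholders):

    from concurrent.futures import ThreadPoolExecutor
    from urllib.request import urlopen

    urls = ["http://python.org/", "http://pypi.python.org/"]

    def fetch(url):
        return urlopen(url).read()

    with ThreadPoolExecutor(max_workers=5) as executor:
        futures = [executor.submit(fetch, url) for url in urls]
        pages = [f.result() for f in futures]   # simply block until done

    print([len(page) for page in pages])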

Regards

Antoine.




From guido at python.org  Sun Sep 12 17:49:56 2010
From: guido at python.org (Guido van Rossum)
Date: Sun, 12 Sep 2010 08:49:56 -0700
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <20100912130338.714643f8@pitrou.net>
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
	<AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
	<AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>
	<AANLkTin7eRBcpt1K_RC=buE5BasmTDBwE_TzHr97BAyy@mail.gmail.com>
	<20100912130338.714643f8@pitrou.net>
Message-ID: <AANLkTimN=Hb3jWDCtKt5PiE_CGJgiMXZyewX7QcxOCc+@mail.gmail.com>

On Sun, Sep 12, 2010 at 4:03 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Sat, 11 Sep 2010 19:26:50 -0700
> Guido van Rossum <guido at python.org> wrote:
>>
>> But thinking about this more I don't know that it will be easy to mix
>> PEP 3148, which is solidly thread-based, with a PEP 342 style
>> scheduler (whether or not the PEP 380 enhancements are applied, or
>> even PEP 3152).
>
> I'm not sure why. The implementation is certainly thread-based, but
> functions such as `wait(fs, timeout=None, return_when=ALL_COMPLETED)`
> could be implemented in termes of a single-threaded event loop / job
> scheduler.

Sure, but the tricky thing is to make it pluggable so that PEP 3148
and Twisted and other frameworks can use it all together, and a single
call will accept a mixture of Futures.

I also worry that "impure" code will have a hard time -- e.g. when
mixing generator-based coroutines and thread-based futures, it would
be quite bad if a coroutine called .result() on a Future or the
.wait() function instead of yielding to the scheduler.

> Actually, Twisted has a similar primitive in DeferredList, although
> more powerful since the DeferredList itself is a Deferred, and can
> therefore be further combined, etc.:
>
> http://twistedmatrix.com/documents/10.0.0/api/twisted.internet.defer.DeferredList.html

This sounds similar to the way you can create derived futures in Java.

>> And comparing the
>> blog's examples to PEP 3148, I find Twisted's terminology rather
>> confusing compared to the PEP's clean Futures API (where IMO you can
>> ignore almost everything except result()).
>
> Well, apart from the API which may be considered a taste issue (I have
> used Deferreds long before I heard about Futures, so perhaps I'm a bit
> biased),

I heard of Deferred long before PEP 3148 was even conceived, but I
find Twisted's terminology terribly confusing while I find the PEP's
names easy to understand.

> the following API doc in PEP 3148 shows that the Future model
> of callbacks is less rich than Twisted's:
>
> "add_done_callback(fn)
>
>     Attaches a callable fn to the future that will be called when the
>     future is cancelled or finishes running. fn will be called with the
>     future as its only argument.
>
>     Added callables are called in the order that they were added and
>     are always called in a thread belonging to the process that added
>     them. If the callable raises an Exception then it will be logged
>     and ignored. If the callable raises another BaseException then
>     behavior is not defined."
>
> With Twisted Deferreds, when a callback or errback raises an error, its
> exception isn't "logged and ignored", it is passed to the remaining
> errback chain attached to the Deferred. This is part of what makes
> Deferreds more complicated to understand, but it also makes them more
> powerful.

Yeah, please do explain why Twisted has so much machinery to handle exceptions?

ISTM that the main difference is that add_done_callback() isn't meant
for callbacks that return a value. So then the exceptions that might
be raised are kind of "out of band". For any API that returns a value
I agree that raising an exception should be handled -- but in the PEP
342 world we can do that by passing exceptions back into coroutine
using throw(), so no separate "success" and "failure" callbacks are
needed.
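
The throw() mechanics in isolation:

    def coroutine():
        try:
            data = yield "request"            # suspended here...
        except ValueError as e:
            print("handled inside coroutine: %s" % e)

    g = coroutine()
    print(g.send(None))          # run to the first yield; prints "request"
    try:
        g.throw(ValueError("bad response"))   # raised at the yield point
    except StopIteration:
        pass   # the coroutine caught the error and finished normally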

> Another key point is that a callback can itself return another Deferred
> object, in which case the next callback (or errback, in case of error)
> will be called only once the other Deferred produces a result. This is
> all handled transparently and you can freely mix callbacks that
> immediately return a value, and callbacks that return a Deferred whose
> final value will be available later. And the other Deferred can have
> its own callback/errback chain, etc.

Yeah, that is part of what makes it so utterly confusing. PEP 380
supports a similar thing but much cleaner, without ever using
callbacks.

> (just for the record, the "final value" of a Deferred is the value
> returned by the last callback in the chain)
>
>
> I think the main reason, though, that people find Deferreds
> inconvenient is that they force you to think in terms of
> asynchronicity (well, almost: you can of course hack yourself
> some code which blocks until a Deferred has a value, but it's
> extremely discouraged). They would like to have officially
> supported methods like `result(timeout=None)` which make simple things
> (like quick scripts to fetch a bunch of URLs) simpler. Twisted is
> generally used for server applications where such code is out of
> question (in an async model, that is).

Actually I think the main reason is historic: Twisted introduced
callback-based asynchronous (thread-less) programming when there was
no alternative in Python, and they invented both the mechanisms and
the terminology as they were figuring it all out. That is no mean
feat. But with PEP 342 (generator-based coroutines) and especially PEP
380 (yield from) there *is* an alternative, and while Twisted has
added APIs to support generators, it hasn't started to deprecate its
other APIs, and its terminology becomes hard to follow for people
(like me, frankly) who first learned this stuff through PEP 342.

-- 
--Guido van Rossum (python.org/~guido)


From solipsis at pitrou.net  Sun Sep 12 18:17:51 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 12 Sep 2010 18:17:51 +0200
Subject: [Python-ideas] [Python-Dev] Python needs a standard
	asynchronous return object
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
	<AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
	<AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>
	<AANLkTin7eRBcpt1K_RC=buE5BasmTDBwE_TzHr97BAyy@mail.gmail.com>
	<20100912130338.714643f8@pitrou.net>
	<AANLkTimN=Hb3jWDCtKt5PiE_CGJgiMXZyewX7QcxOCc+@mail.gmail.com>
Message-ID: <20100912181751.2aa5bb32@pitrou.net>

On Sun, 12 Sep 2010 08:49:56 -0700
Guido van Rossum <guido at python.org> wrote:
> 
> Sure, but the tricky thing is to make it pluggable so that PEP 3148
> and Twisted and other frameworks can use it all together, and a single
> call will accept a mixture of Futures.

Having a common abstraction (Future or Deferred) allows for
scheduling-agnostic libraries which consume and/or produce these
abstractions (*). I'm not sure it is desirable to mix scheduling models
in a single process (let alone a single thread), though.

(*) Of course, the abstraction is somewhat leaky, since being called from
different threads, depending on the scheduling model, could have adverse
consequences.

> ISTM that the main difference is that add_done_callback() isn't meant
> for callbacks that return a value. So then the exceptions that might
> be raised are kind of "out of band".

It implies that it's mostly useful for simple callbacks (which would
e.g. print out a success report, or set an Event to wake up another
thread). The Twisted model allows the major part of processing to occur
in the callbacks themselves, in which case proper error handling and
propagation is mandatory.
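
For instance, the Event case in PEP 3148 terms:

    import threading
    from concurrent.futures import ThreadPoolExecutor

    done = threading.Event()

    def report(future):
        # A "simple callback": nothing consumes its return value.
        print("result: %r" % future.result())
        done.set()

    with ThreadPoolExecutor(max_workers=1) as executor:
        future = executor.submit(pow, 2, 10)
        future.add_done_callback(report)
        done.wait()   # another thread can sleep here until completion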

Regards

Antoine.




From guido at python.org  Sun Sep 12 18:48:20 2010
From: guido at python.org (Guido van Rossum)
Date: Sun, 12 Sep 2010 09:48:20 -0700
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <20100912181751.2aa5bb32@pitrou.net>
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
	<AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
	<AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>
	<AANLkTin7eRBcpt1K_RC=buE5BasmTDBwE_TzHr97BAyy@mail.gmail.com>
	<20100912130338.714643f8@pitrou.net>
	<AANLkTimN=Hb3jWDCtKt5PiE_CGJgiMXZyewX7QcxOCc+@mail.gmail.com>
	<20100912181751.2aa5bb32@pitrou.net>
Message-ID: <AANLkTinc-DWYvoQi0XoX9APSn9jCbGi8QN1Zbb7sipb8@mail.gmail.com>

On Sun, Sep 12, 2010 at 9:17 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Sun, 12 Sep 2010 08:49:56 -0700
> Guido van Rossum <guido at python.org> wrote:
>>
>> Sure, but the tricky thing is to make it pluggable so that PEP 3148
>> and Twisted and other frameworks can use it all together, and a single
>> call will accept a mixture of Futures.
>
> Having a common abstraction (Future or Deferred) allows for
> scheduling-agnostic libraries which consume and/or produce these
> abstractions (*). I'm not sure it is desirable to mix scheduling models
> in a single process (let alone a single thread), though.

IIRC even Twisted supports putting stuff in a thread if you really
need it. And have you looked at Go's Goroutines? They are a hybrid --
they don't map 1:1 to OS threads, but they aren't pure coroutines
either, so that if a goroutine blocks on I/O the others will still
make progress.

> (*) Of course, the abstraction is somewhat leaky, since being called from
> different threads, depending on the scheduling model, could have adverse
> consequences.

Yeah, this is always a problem with pure async frameworks -- if one
callback or coroutine blocks by mistake, the whole world is blocked.
(So Goroutines attempt to fix this; I have no idea how successful they
are.)

>> ISTM that the main difference is that add_done_callback() isn't meant
>> for callbacks that return a value. So then the exceptions that might
>> be raised are kind of "out of band".
>
> It implies that it's mostly useful for simple callbacks (which would
> e.g. print out a success report, or set an Event to wake up another
> thread). The Twisted model allows the major part of processing to occur
> in the callbacks themselves, in which case proper error handling and
> propagation is mandatory.

A generator-based coroutines approach can do this too (just put the
work between the yields in the generator) and has all the proper
exception-propagation stuff built in since PEP 342 (PEP 380 will just
make it easier).

And a Futures-based approach can do it too -- it's not described in
PEP 3148, but you can easily design an API for wrappable Futures.
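
One conceivable shape for that wrapping, sketched on top of
add_done_callback (the then() combinator is invented here, not part of
PEP 3148):

    from concurrent.futures import Future, ThreadPoolExecutor

    def then(future, fn):
        # Produce a new Future for fn(future's result); an exception from
        # either stage lands in the new Future instead of being lost.
        chained = Future()
        def on_done(f):
            try:
                chained.set_result(fn(f.result()))
            except BaseException as e:
                chained.set_exception(e)
        future.add_done_callback(on_done)
        return chained

    with ThreadPoolExecutor(max_workers=1) as executor:
        f = then(executor.submit(pow, 2, 10), lambda r: r + 1)
        print(f.result())   # 1025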

-- 
--Guido van Rossum (python.org/~guido)


From yoavglazner at gmail.com  Mon Sep 13 14:09:23 2010
From: yoavglazner at gmail.com (yoav glazner)
Date: Mon, 13 Sep 2010 14:09:23 +0200
Subject: [Python-ideas] Why not break cycles with one __del__?
Message-ID: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>

Hi!

I was thinking, why not let python gc break cycles with only one
object.__del__ ?
I don't see a problem with calling the __del__ method and then proceed
as usual (break the cycle if it wasn't already broken by __del__)

Many Thanks,

Yoav Glazner

From jimjjewett at gmail.com  Mon Sep 13 18:16:36 2010
From: jimjjewett at gmail.com (Jim Jewett)
Date: Mon, 13 Sep 2010 12:16:36 -0400
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
Message-ID: <AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>

On Mon, Sep 13, 2010 at 8:09 AM, yoav glazner <yoavglazner at gmail.com> wrote:
> why not let python gc break cycles with only one
> object.__del__ ?

If you can point to the code that prevents this, please report a bug.

The last time I checked, there were proposals to either add a
__close__ or weaken __del__ to handle multi-__del__ cycles -- but
single-__del__ cycles were already handled OK.

-jJ


From solipsis at pitrou.net  Mon Sep 13 19:05:49 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Mon, 13 Sep 2010 19:05:49 +0200
Subject: [Python-ideas] Why not break cycles with one __del__?
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
Message-ID: <20100913190549.15f218ce@pitrou.net>

On Mon, 13 Sep 2010 12:16:36 -0400
Jim Jewett <jimjjewett at gmail.com> wrote:
> 
> The last time I checked, there were proposals to either add a
> __close__ or weaken __del__ to handle multi-__del__ cycles -- but
> single-__del__ cycles were already handled OK.

They aren't:

>>> class C(list):
...   def __del__(self): pass
... 
>>> c = C()
>>> c.append(c)
>>> del c
>>> import gc
>>> gc.collect()
1
>>> gc.garbage
[[[...]]]
>>> type(gc.garbage[0])
<class '__main__.C'>





From tim.peters at gmail.com  Mon Sep 13 19:25:54 2010
From: tim.peters at gmail.com (Tim Peters)
Date: Mon, 13 Sep 2010 13:25:54 -0400
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <20100913190549.15f218ce@pitrou.net>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
Message-ID: <AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>

[Jim Jewett]
>> The last time I checked ...
>> single-__del__ cycles were already handled OK.

[Antoine Pitrou]
> They aren't: ...

Antoine's right, unless things have changed dramatically since last
time I was intimate with that code.  CPython's "cyclic garbage
detection" makes no attempt to analyze cycle structure.  It infers
that all trash it sees must be in cycles simply because the trash
hasn't already been collected by the regular refcount-based gc.  The
presence of __del__ on a trash object then disqualifies it from
further analysis, but there's no analysis of cycle structure
regardless.

Of course it doesn't _have_ to be that way.  Nobody cared enough yet
to add a pile of new code to special-case cycles with a single
__del__.


From benjamin at python.org  Mon Sep 13 21:22:02 2010
From: benjamin at python.org (Benjamin)
Date: Mon, 13 Sep 2010 19:22:02 +0000 (UTC)
Subject: [Python-ideas] Why not break cycles with one __del__?
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
Message-ID: <loom.20100913T212138-458@post.gmane.org>

Tim Peters <tim.peters at ...> writes:
> Of course it doesn't _have_ to be that way.  Nobody cared enough yet
> to add a pile of new code to special-case cycles with a single
> __del__.

And hopefully no one will. That would be very brittle. 






From solipsis at pitrou.net  Mon Sep 13 22:28:08 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Mon, 13 Sep 2010 22:28:08 +0200
Subject: [Python-ideas] Why not break cycles with one __del__?
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
	<loom.20100913T212138-458@post.gmane.org>
Message-ID: <20100913222808.2459784a@pitrou.net>

On Mon, 13 Sep 2010 19:22:02 +0000 (UTC)
Benjamin <benjamin at python.org> wrote:
> Tim Peters <tim.peters at ...> writes:
> > Of course it doesn't _have_ to be that way.  Nobody cared enough yet
> > to add a pile of new code to special-case cycles with a single
> > __del__.
> 
> And hopefully no one will. That would be very brittle. 

Why would it be?





From fuzzyman at voidspace.org.uk  Mon Sep 13 22:36:35 2010
From: fuzzyman at voidspace.org.uk (Michael Foord)
Date: Mon, 13 Sep 2010 21:36:35 +0100
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <loom.20100913T212138-458@post.gmane.org>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
	<loom.20100913T212138-458@post.gmane.org>
Message-ID: <AANLkTimb8ag+WnOx_t6K_bJJJy218tdUL_-YVnQBnyHE@mail.gmail.com>

On 13 September 2010 20:22, Benjamin <benjamin at python.org> wrote:

> Tim Peters <tim.peters at ...> writes:
> > Of course it doesn't _have_ to be that way.  Nobody cared enough yet
> > to add a pile of new code to special-case cycles with a single
> > __del__.
>
> And hopefully no one will. That would be very brittle.
>
>
More brittle than what PyPy, IronPython (and presumably Jython) do? (Which
is to make cycles collectable by arbitrarily breaking them, IIUC.)

Michael





-- 
http://www.voidspace.org.uk

From yoavglazner at gmail.com  Mon Sep 13 22:56:09 2010
From: yoavglazner at gmail.com (yoav glazner)
Date: Mon, 13 Sep 2010 22:56:09 +0200
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <AANLkTimb8ag+WnOx_t6K_bJJJy218tdUL_-YVnQBnyHE@mail.gmail.com>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
	<loom.20100913T212138-458@post.gmane.org>
	<AANLkTimb8ag+WnOx_t6K_bJJJy218tdUL_-YVnQBnyHE@mail.gmail.com>
Message-ID: <AANLkTinVNyOSh9iD11+4fysUkfpTvJ_1z_6jBJ3Vr_+g@mail.gmail.com>

> And hopefully no one will. That would be very brittle.

Why do you hope for that? That is the "one obvious way to do it".

From benjamin at python.org  Mon Sep 13 23:31:45 2010
From: benjamin at python.org (Benjamin Peterson)
Date: Mon, 13 Sep 2010 21:31:45 +0000 (UTC)
Subject: [Python-ideas] Why not break cycles with one __del__?
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
	<loom.20100913T212138-458@post.gmane.org>
	<20100913222808.2459784a@pitrou.net>
Message-ID: <loom.20100913T233114-697@post.gmane.org>

Antoine Pitrou <solipsis at ...> writes:

> 
> On Mon, 13 Sep 2010 19:22:02 +0000 (UTC)
> Benjamin <benjamin at ...> wrote:
> > Tim Peters <tim.peters at ...> writes:
> > > Of course it doesn't _have_ to be that way.  Nobody cared enough yet
> > > to add a pile of new code to special-case cycles with a single
> > > __del__.
> > 
> > And hopefully no one will. That would be very brittle. 
> 
> Why would it be?

Because if your cycle suddenly had more than one __del__, it would stop being
collected.






From ncoghlan at gmail.com  Mon Sep 13 23:39:00 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 14 Sep 2010 07:39:00 +1000
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
Message-ID: <AANLkTinVvSKxKUNgyZE5_PyZiHT55FPvj7omB9O=bmnD@mail.gmail.com>

On Tue, Sep 14, 2010 at 3:25 AM, Tim Peters <tim.peters at gmail.com> wrote:
> [Jim Jewett]
>>> The last time I checked ...
>>> single-__del__ cycles were already handled OK.
>
> [Antoine Pitrou]
>> They aren't: ...
>
> Antoine's right, unless things have changed dramatically since last
> time I was intimate with that code. ?CPython's "cyclic garbage
> detection" makes no attempt to analyze cycle structure. ?It infers
> that all trash it sees must be in cycles simply because the trash
> hasn't already been collected by the regular refcount-based gc. ?The
> presence of __del__ on a trash object then disqualifies it from
> further analysis, but there's no analysis of cycle structure
> regardless.

I had a skim through that code last night, and as far as I can tell it
still works that way. However, it should be noted that the cyclic GC
actually does release everything *else* in the cycle - it's solely the
objects with __del__ methods that remain alive.

There does appear to be a *little* bit of structural analysis going on -
it looks like the "finalizers" list ends up containing both objects
with __del__ methods, as well as all other objects in the cyclic trash
that are reachable from the objects with __del__ methods.

> Of course it doesn't _have_ to be that way.  Nobody cared enough yet
> to add a pile of new code to special-case cycles with a single
> __del__.

Just from skimming the code, I wonder if, once finalizers has been
figured out, the GC could further partition that list into "to_delete"
(no __del__ method), "to_finalize" (__del__ method, but all referrers
in cycle have no __del__ method) and "uncollectable" (multiple __del__
methods in cycle). Alternatively, when building finalizers, build two
lists: one for objects with __del__ methods and one for objects that
are reachable from objects with __del__ methods. Objects that appear
only in the first list could safely have their finalisers invoked,
while those that also appear in the latter could not.

This is definitely a case of "code talks" though - there's no
fundamental problem with the idea, but also no great incentive for
anyone to code it when __del__ is comparatively easy to avoid
(although not trivial, see Raymond's recent modifications to
OrderedDict to avoid exactly this issue).

Or, accept that __del__ is evil, and try to come up with a workable
proposal for that better weakref callback based scheme Jim mentioned.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From greg.ewing at canterbury.ac.nz  Tue Sep 14 04:44:25 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 14 Sep 2010 14:44:25 +1200
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <AANLkTinVvSKxKUNgyZE5_PyZiHT55FPvj7omB9O=bmnD@mail.gmail.com>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
	<AANLkTinVvSKxKUNgyZE5_PyZiHT55FPvj7omB9O=bmnD@mail.gmail.com>
Message-ID: <4C8EE189.40408@canterbury.ac.nz>

Nick Coghlan wrote:
> Alternatively, when building finalizers, build two
> lists: one for objects with __del__ methods and one for objects that
> are reachable from objects with __del__ methods.

But since it's a cycle, isn't *everything* in the cycle
going to be reachable from everything else?

-- 
Greg


From tim.peters at gmail.com  Tue Sep 14 05:04:08 2010
From: tim.peters at gmail.com (Tim Peters)
Date: Mon, 13 Sep 2010 23:04:08 -0400
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <4C8EE189.40408@canterbury.ac.nz>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
	<AANLkTinVvSKxKUNgyZE5_PyZiHT55FPvj7omB9O=bmnD@mail.gmail.com>
	<4C8EE189.40408@canterbury.ac.nz>
Message-ID: <AANLkTimJ_BmVwqoDNjmzRihZ3NE-p6pqoGjZDduUmLpp@mail.gmail.com>

[Nick Coghlan]
>> Alternatively, when building finalizers, build two
>> lists: one for objects with __del__ methods and one for objects that
>> are reachable from objects with __del__ methods.

[Greg Ewing]
> But since it's a cycle, isn't *everything* in the cycle
> going to be reachable from everything else?

Note that I was sloppy in saying that CPython's cyclic gc only sees
trash objects in cycles.  More accurately, it sees trash objects in
cycles, and objects (which may or may not be in cycles) reachable only
from trash objects in cycles.  For example, if objects A and B point
to each other, that's a cycle.  If A also happens to point to D, where
D has a __del__ method, and nothing else points to D, then that's a
case where D is not in a cycle, but is nevertheless trash if A and B
are trash.  And if A and B lack finalizers, then CPython's cyclic gc
will reclaim D, despite that it does have a __del__.

That pattern is exploitable too.  If, e.g., you have some resource R
that needs to be cleaned up, owned by an object A that may participate
in cycles, it's often possible to put R in a different, very simple
object with a __del__ method, and have A point to that latter object
instead.


From guido at python.org  Tue Sep 14 05:07:10 2010
From: guido at python.org (Guido van Rossum)
Date: Mon, 13 Sep 2010 20:07:10 -0700
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <AANLkTimJ_BmVwqoDNjmzRihZ3NE-p6pqoGjZDduUmLpp@mail.gmail.com>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
	<AANLkTinVvSKxKUNgyZE5_PyZiHT55FPvj7omB9O=bmnD@mail.gmail.com>
	<4C8EE189.40408@canterbury.ac.nz>
	<AANLkTimJ_BmVwqoDNjmzRihZ3NE-p6pqoGjZDduUmLpp@mail.gmail.com>
Message-ID: <AANLkTi=07LWmOBc14wNGmANurKQ2qPZQ=T7zmkoDu60+@mail.gmail.com>

On Mon, Sep 13, 2010 at 8:04 PM, Tim Peters <tim.peters at gmail.com> wrote:
> [Nick Coghlan]
>>> Alternatively, when building finalizers, build two
>>> lists: one for objects with __del__ methods and one for objects that
>>> are reachable from objects with __del__ methods.
>
> [Greg Ewing]
>> But since it's a cycle, isn't *everything* in the cycle
>> going to be reachable from everything else?
>
> Note that I was sloppy in saying that CPython's cyclic gc only sees
> trash objects in cycles.  More accurately, it sees trash objects in
> cycles, and objects (which may or may not be in cycles) reachable only
> from trash objects in cycles.  For example, if objects A and B point
> to each other, that's a cycle.  If A also happens to point to D, where
> D has a __del__ method, and nothing else points to D, then that's a
> case where D is not in a cycle, but is nevertheless trash if A and B
> are trash.  And if A and B lack finalizers, then CPython's cyclic gc
> will reclaim D, despite that it does have a __del__.
>
> That pattern is exploitable too.  If, e.g., you have some resource R
> that needs to be cleaned up, owned by an object A that may participate
> in cycles, it's often possible to put R in a different, very simple
> object with a __del__ method, and have A point to that latter object
> instead.

Yeah, I think we even recommended this pattern at some point. ISTR we
designed the new io library to exploit it.

-- 
--Guido van Rossum (python.org/~guido)


From greg.ewing at canterbury.ac.nz  Tue Sep 14 06:16:37 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Tue, 14 Sep 2010 16:16:37 +1200
Subject: [Python-ideas] Using * in indexes
Message-ID: <4C8EF725.3050807@canterbury.ac.nz>

I just found myself writing a method like this:

   def __getitem__(self, index):
     return self.data[(Ellipsis,) + index + (slice(None),)]

I would have liked to write it like this:

    self.data[..., index, :]

because that would make it much easier to see what's
being done. However, that won't work if index is itself
a tuple of index elements.

So I'd like to be able to do this:

    self.data[..., *index, :]
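
Spelling out the equivalence with a concrete array (numpy is used here
only as a stand-in for self.data):

    import numpy as np

    a = np.arange(24).reshape(2, 3, 4)
    index = (1, 2)

    manual = a[(Ellipsis,) + index + (slice(None),)]   # today's spelling
    direct = a[..., 1, 2, :]          # what a[..., *index, :] would mean
    print(np.array_equal(manual, direct))              # True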

-- 
Greg


From scott+python-ideas at scottdial.com  Tue Sep 14 07:12:37 2010
From: scott+python-ideas at scottdial.com (Scott Dial)
Date: Tue, 14 Sep 2010 01:12:37 -0400
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <AANLkTi=07LWmOBc14wNGmANurKQ2qPZQ=T7zmkoDu60+@mail.gmail.com>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>	<20100913190549.15f218ce@pitrou.net>	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>	<AANLkTinVvSKxKUNgyZE5_PyZiHT55FPvj7omB9O=bmnD@mail.gmail.com>	<4C8EE189.40408@canterbury.ac.nz>	<AANLkTimJ_BmVwqoDNjmzRihZ3NE-p6pqoGjZDduUmLpp@mail.gmail.com>
	<AANLkTi=07LWmOBc14wNGmANurKQ2qPZQ=T7zmkoDu60+@mail.gmail.com>
Message-ID: <4C8F0445.2000905@scottdial.com>

On 9/13/2010 11:07 PM, Guido van Rossum wrote:
> On Mon, Sep 13, 2010 at 8:04 PM, Tim Peters <tim.peters at gmail.com> wrote:
>> [Nick Coghlan]
>>>> Alternatively, when building finalizers, build two
>>>> lists: one for objects with __del__ methods and one for objects that
>>>> are reachable from objects with __del__ methods.
>>
>> [Greg Ewing]
>>> But since it's a cycle, isn't *everything* in the cycle
>>> going to be reachable from everything else?
>>
>> That pattern is exploitable too.  If, e.g., you have some resource R
>> that needs to be cleaned up, owned by an object A that may participate
>> in cycles, it's often possible to put R in a different, very simple
>> object with a __del__ method, and have A point to that latter object
>> instead.
> 
> Yeah, I think we even recommended this pattern at some point. ISTR we
> designed the new io library to exploit it.
> 

Yes, this topic came up some while back on this list and Tim's solution
is exactly the design pattern I suggested then:

http://mail.python.org/pipermail/python-ideas/2009-October/006222.html

-- 
Scott Dial
scott at scottdial.com
scodial at cs.indiana.edu


From ncoghlan at gmail.com  Tue Sep 14 11:51:19 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 14 Sep 2010 19:51:19 +1000
Subject: [Python-ideas] Why not break cycles with one __del__?
In-Reply-To: <4C8EE189.40408@canterbury.ac.nz>
References: <AANLkTikfUX4pOqL-kr54ua68MObzuhOFXN9c5GK1AmN8@mail.gmail.com>
	<AANLkTiky2W6k7sxsjFA4-jqS5r9w0rk3dfVmn2oF6Gdt@mail.gmail.com>
	<20100913190549.15f218ce@pitrou.net>
	<AANLkTikALR0FSkL8jzYnh2atPEDz=63P8V3oaKQNh3NC@mail.gmail.com>
	<AANLkTinVvSKxKUNgyZE5_PyZiHT55FPvj7omB9O=bmnD@mail.gmail.com>
	<4C8EE189.40408@canterbury.ac.nz>
Message-ID: <AANLkTin3pwShxd0f29ni-aMf6cPgftApPUqRg7qQ3NET@mail.gmail.com>

On Tue, Sep 14, 2010 at 12:44 PM, Greg Ewing
<greg.ewing at canterbury.ac.nz> wrote:
> Nick Coghlan wrote:
>>
>> Alternatively, when building finalizers, build two
>> lists: one for objects with __del__ methods and one for objects that
>> are reachable from objects with __del__ methods.
>
> But since it's a cycle, isn't *everything* in the cycle
> going to be reachable from everything else?

In addition to what Tim said, there may be more than one cycle being
collected. So you can have situations like objects, A, B C in one
cycle and D, E, F in a different cycle. Suppose A, B and D all have
__del__ methods. Then your two lists would be:

__del__ method: A, B, D
Reachable from objects with __del__ method: A, B, C, E, F

It's just another way of viewing what the OP described: cycles
containing only a single object with __del__ don't actually have an
ordering problem, so you can just call it before you destroy any of
the objects.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From mikegraham at gmail.com  Tue Sep 14 15:54:49 2010
From: mikegraham at gmail.com (Mike Graham)
Date: Tue, 14 Sep 2010 09:54:49 -0400
Subject: [Python-ideas] Using * in indexes
In-Reply-To: <4C8EF725.3050807@canterbury.ac.nz>
References: <4C8EF725.3050807@canterbury.ac.nz>
Message-ID: <AANLkTikLqtMGkeCreoPcUCrfdExMF7aXGs5fFGmQnX-D@mail.gmail.com>

On Tue, Sep 14, 2010 at 12:16 AM, Greg Ewing
<greg.ewing at canterbury.ac.nz> wrote:
> I just found myself writing a method like this:
>
>   def __getitem__(self, index):
>     return self.data[(Ellipsis,) + index + (slice(None),)]
>
> I would have liked to write it like this:
>
>   self.data[..., index, :]
>
> because that would make it much easier to see what's
> being done. However, that won't work if index is itself
> a tuple of index elements.
>
> So I'd like to be able to do this:
>
>   self.data[..., *index, :]

If in indexes, why not when making other tuples?

Mike


From alexander.belopolsky at gmail.com  Tue Sep 14 16:09:05 2010
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Tue, 14 Sep 2010 10:09:05 -0400
Subject: [Python-ideas] Using * in indexes
In-Reply-To: <AANLkTikLqtMGkeCreoPcUCrfdExMF7aXGs5fFGmQnX-D@mail.gmail.com>
References: <4C8EF725.3050807@canterbury.ac.nz>
	<AANLkTikLqtMGkeCreoPcUCrfdExMF7aXGs5fFGmQnX-D@mail.gmail.com>
Message-ID: <AANLkTinw1S3q3+r+OsDx877tiEFmpMzXyee8m6aAPTvN@mail.gmail.com>

On Tue, Sep 14, 2010 at 9:54 AM, Mike Graham <mikegraham at gmail.com> wrote:
..
>> So I'd like to be able to do this:
>>
>>   self.data[..., *index, :]
>
> If in indexes, why not when making other tuples?

I believe this and other unpacking generalizations are implemented in
issue #2292: http://bugs.python.org/issue2292


From greg.ewing at canterbury.ac.nz  Wed Sep 15 00:15:10 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 15 Sep 2010 10:15:10 +1200
Subject: [Python-ideas] Using * in indexes
In-Reply-To: <AANLkTikLqtMGkeCreoPcUCrfdExMF7aXGs5fFGmQnX-D@mail.gmail.com>
References: <4C8EF725.3050807@canterbury.ac.nz>
	<AANLkTikLqtMGkeCreoPcUCrfdExMF7aXGs5fFGmQnX-D@mail.gmail.com>
Message-ID: <4C8FF3EE.9020209@canterbury.ac.nz>

Mike Graham wrote:
> On Tue, Sep 14, 2010 at 12:16 AM, Greg Ewing
> <greg.ewing at canterbury.ac.nz> wrote:
> 
>>  self.data[..., *index, :]
> 
> If in indexes, why not when making other tuples?

It would be handy to be able to use it when making other
tuples, yes. There's a particularly strong motivation
for it in relation to indexes, though, because otherwise
you not only end up having to use ugly (foo,) constructs,
but you lose the ability to use any of the special
indexing syntax.

There's also a performance penalty if you end up having
to look up 'slice' a bunch of times.

-- 
Greg


From greg.ewing at canterbury.ac.nz  Wed Sep 15 00:16:22 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Wed, 15 Sep 2010 10:16:22 +1200
Subject: [Python-ideas] Using * in indexes
In-Reply-To: <AANLkTinw1S3q3+r+OsDx877tiEFmpMzXyee8m6aAPTvN@mail.gmail.com>
References: <4C8EF725.3050807@canterbury.ac.nz>
	<AANLkTikLqtMGkeCreoPcUCrfdExMF7aXGs5fFGmQnX-D@mail.gmail.com>
	<AANLkTinw1S3q3+r+OsDx877tiEFmpMzXyee8m6aAPTvN@mail.gmail.com>
Message-ID: <4C8FF436.40305@canterbury.ac.nz>

Alexander Belopolsky wrote:

> I believe this and other unpacking generalizations are implemented in
> issue #2292: http://bugs.python.org/issue2292

Yes, it appears so. Did a PEP for that ever materialise,
or is everyone waiting until after the moratorium?

-- 
Greg


From tjreedy at udel.edu  Wed Sep 15 06:23:18 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Wed, 15 Sep 2010 00:23:18 -0400
Subject: [Python-ideas] Using * in indexes
In-Reply-To: <4C8FF436.40305@canterbury.ac.nz>
References: <4C8EF725.3050807@canterbury.ac.nz>	<AANLkTikLqtMGkeCreoPcUCrfdExMF7aXGs5fFGmQnX-D@mail.gmail.com>	<AANLkTinw1S3q3+r+OsDx877tiEFmpMzXyee8m6aAPTvN@mail.gmail.com>
	<4C8FF436.40305@canterbury.ac.nz>
Message-ID: <i6phno$u9q$1@dough.gmane.org>

On 9/14/2010 6:16 PM, Greg Ewing wrote:
> Alexander Belopolsky wrote:
>
>> I believe this and other unpacking generalizations are implemented in
>> issue #2292: http://bugs.python.org/issue2292
>
> Yes, it appears so. Did a PEP for that ever materialise,
> or is everyone waiting until after the moratorium?

The only PEP I know of is the one for what has been done:
http://www.python.org/dev/peps/pep-3132/ Extended Iterable Unpacking
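
For the record, PEP 3132 covers assignment targets only, not calls or
subscripts:

    first, *middle, last = range(5)
    print(first, middle, last)   # 0 [1, 2, 3] 4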


-- 
Terry Jan Reedy



From glyph at twistedmatrix.com  Wed Sep 15 23:56:52 2010
From: glyph at twistedmatrix.com (Glyph Lefkowitz)
Date: Wed, 15 Sep 2010 17:56:52 -0400
Subject: [Python-ideas] [Python-Dev] Python needs a standard
	asynchronous return object
In-Reply-To: <AANLkTin7eRBcpt1K_RC=buE5BasmTDBwE_TzHr97BAyy@mail.gmail.com>
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
	<AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
	<AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>
	<AANLkTin7eRBcpt1K_RC=buE5BasmTDBwE_TzHr97BAyy@mail.gmail.com>
Message-ID: <9AF93392-544C-4539-98B2-19DB2563172D@twistedmatrix.com>

Thanks for the ping about this (I don't think I subscribe to python-ideas, so someone may have to moderate my post in).  Sorry for the delay in responding, but I've been kinda busy and cooking up these examples took a bit of thinking.

And thanks, James, for restarting this discussion.  I obviously find it interesting :).

I'm going to mix in some other stuff I found on the web archives, since it's easiest just to reply in one message.  I'm sorry that this response is a bit sprawling and doesn't have a single clear narrative; the thread thus far didn't seem to lend itself to one.

For those of you who don't want to read my usual novel-length post, you can probably stop shortly after the end of the first block of code examples.

On Sep 11, 2010, at 10:26 PM, Guido van Rossum wrote:

>>> although he didn't say what
>>> deferreds really added beyond what futures provide, and why the
>>> "add_done_callback" method isn't adequate to provide interoperability
>>> between futures and deferreds (which would be odd, since Brian made
>>> changes to that part of PEP 3148 to help with that interoperability
>>> after discussions with Glyph).
>>> 
>>> Between PEP 380 and PEP 3148 I'm not really seeing a lot more scope
>>> for standardisation in this space though.
>>> 
>>> Cheers,
>>> Nick.
>> 
>> That was my initial reaction as well, but I'm more than open to
>> hearing from Jean Paul/Glyph and the other twisted folks on this.

> But thinking about this more I don't know that it will be easy to mix
> PEP 3148, which is solidly thread-based, with a PEP 342 style
> scheduler (whether or not the PEP 380 enhancements are applied, or
> even PEP 3152). And if we take the OP's message at face value, his
> point isn't so much that Twisted is great, but that in order to
> benefit maximally from PEP 342 there needs to be a standard way of
> using callbacks. I think that's probably true. And comparing the
> blog's examples to PEP 3148, I find Twisted's terminology rather
> confusing compared to the PEP's clean Futures API (where IMO you can
> ignore almost everything except result()).

That blog post was written to demonstrate why programs using generators are "... far easier to read and write ..." than ones using Deferreds, so it stands to reason it would choose an example where that helps :).

When you want to write systems that manage varying levels of parallelism within a single computation, generators can start to get pretty hairy and the "normal" Deferred way of doing things looks more straightforward.

Thinking in terms of asynchronicity is tricky, and generators can be a useful tool for promoting that understanding, but they only make it superficially easier.  For example:

>>> def serial():
>>>     results = set()
>>>     for x in ...:
>>>         results.add((yield do_something_async(x)))
>>>     return results

If you're writing an application whose parallelism calls for an asynchronous approach, after all, you presumably don't want to be standing around waiting for each network round trip to complete.  How do you re-write this so that there are always at least N outstanding do_something_async calls running in parallel?

You can sorta do it like this:

>>> def parallel(N):
>>>     results = set()
>>>     outstanding = []
>>>     for x in ...:
>>>         if len(outstanding) > N:
>>>            results.add((yield outstanding.pop(0)))
>>>         else:
>>>            outstanding.append(do_something_async(x))

but that will always block on one particular do_something_async, when you really want to say "let me know when any outstanding call is complete".  So I could handwave about 'yield any_completed(outstanding)'...

>>> def parallel(N):
>>>     results = set()
>>>     outstanding = set()
>>>     for x in ...:
>>>         if len(outstanding) > N:
>>>            results.add((yield any_completed(outstanding)))
>>>         else:
>>>            outstanding.add(do_something_async(x))

but that just begs the question of how you implement any_completed(), and I can't think of a way to do that with generators, without getting into the specifics of some Deferred-or-Future-like asynchronous result object.  You could implement such a function with such primitives, and here's what it looks like with Deferreds:

>>> def any_completed(setOfDeferreds):
>>>     d = Deferred()
>>>     called = []
>>>     def fireme(result, whichDeferred):
>>>         if not called:
>>>             called.append(True)
>>>             setOfDeferreds.remove(whichDeferred)
>>>             d.callback(result)
>>>         return result
>>>     for subd in setOfDeferreds:
>>>         subd.addBoth(fireme, subd)
>>>     return d

Here's how you do the top-level task in Twisted, without generators, in the truly-parallel fashion (keep in mind this combines the functionality of 'any_completed' and 'parallel', so it's a bit shorter):

>>> def parallel(N):
>>>     ds = DeferredSemaphore(N)
>>>     l = []
>>>     def release(result):
>>>         ds.release()
>>>         return result
>>>     def after(sem, it):
>>>         return do_something_async(it)
>>>     for x in ...:
>>>     l.append(ds.acquire().addCallback(after, x).addBoth(release))
>>>     return gatherResults(l).addCallback(set)

Some informal benchmarking has shown this method to be considerably faster (on the order of 1/2 to 1/3 as much CPU time) than at least our own inlineCallbacks generator-scheduling method.  Take this with the usual fist-sized grain of salt that you do any 'informal' benchmarks, but the difference is significant enough that I do try to refactor into this style in my own code, and I have seen performance benefits from doing this on more specific benchmarks.

This is all untested, and that's far too many lines of code to expect to work without testing, but hopefully it gives a pretty good impression of the differences in flavor between the different styles.

> Yeah, please do explain why Twisted has so much machinery to handle exceptions?

There are a lot of different implied questions here, so I'll answer a few of those.

Why does twisted.python.failure exist?  The answer to that is that we wanted an object that represented an exception as raised at a particular point, associated with a particular stack, that could live on without necessarily capturing all the state in that stack.  If you're going to report failures asynchronously, you don't necessarily want to hold a reference to every single thing in a potentially giant stack while you're waiting to send it to some network endpoint.  Also, in 1.5.2 we had no way of chaining exceptions, and this code is that old.  Finally, even if you can chain exceptions, it's a serious performance hit to have to re-raise and re-catch the same exception 4 or 5 times in order to translate it or handle it at many different layers of the stack, so a Failure is intended to encapsulate that state such that it can just be returned, in performance-sensitive areas.  (This is sort of a weak point though, since the performance of Failure itself is so terrible, for unrelated reasons.)
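
To make that concrete, here is a tiny sketch (the scenario is invented,
but Failure(), trap(), and getTraceback() are the real APIs):

    from twisted.python.failure import Failure

    try:
        1 / 0
    except ZeroDivisionError:
        f = Failure()  # snapshots the active exception and its traceback

    # later, long after the except block has exited:
    f.trap(ZeroDivisionError)  # re-raises unless the type matches
    print(f.getTraceback())    # the formatted traceback is still available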

Why is twisted.python.failure such a god damned mess?  The answer to that is ... uh, sorry.  Yes, it is.  We should clean it up.  It was written a long time ago and the equivalent module now could be _much_ shorter, simpler, and less of a performance problem.  It just never seems to be the highest priority.  Maybe after we're done porting to py3 :).  My one defense here is that it's still a slight improvement over the stdlib 'traceback' module ;-).

Why do Deferreds have an errback chain rather than just handing you an exception object in the callback chain?  Basically, this is for the same reason that Python has exceptions instead of just making you check return codes.  We wanted it to be easy to say:

    d = getPage("http://...")
    def ok(page):
        doSomething(...)
    d.addCallback(ok)

and know that the argument to 'ok' would always be what getPage promised (you don't need to typecheck it for exception-ness) and the default error behavior would be to simply bail out with a traceback, not to barrel through your success-path code wreaking havoc.
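
To illustrate the error path (a sketch in the same vein; getPage and
doSomething are placeholders as above, and log.err is Twisted's stock
failure logger):

    from twisted.python import log

    d = getPage("http://...")
    def ok(page):
        doSomething(...)
    def failed(reason):
        # runs only on the error path; without it, the default is to
        # log the traceback and stop, not to barrel into ok()
        log.err(reason)
    d.addCallbacks(ok, failed)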

> ISTM that the main difference is that add_done_callback() isn't meant for callbacks that return a value.


add_done_callback works fine with callbacks that return a value.  If it didn't, I'd be concerned, because then it would have the barrel-through-the-success-path flaw.  But, I assume the idiomatic asynchronous-code-using-Futures would look like this:

    f = some_future_thing(...)
    def my_callback(future):
        result = future.result()
        do_something(result)
    f.add_done_callback(my_callback)

This is one extra line of code as compared to the Twisted version, and chaining involves a bit more gymnastics (somehow creating more futures to return further up the stack, I guess, I haven't thought about it too hard), but it does allow you to handle exceptions with a simple 'except:', rather than calling some exception-handling methods, so I can see why some people would prefer it.
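
In other words, something like this (same made-up names as above, and
SomeError/handle_failure are hypothetical):

    f = some_future_thing(...)
    def my_callback(future):
        try:
            result = future.result()  # re-raises the original exception here
        except SomeError:
            handle_failure()
        else:
            do_something(result)
    f.add_done_callback(my_callback)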

> Maybe it's possible to write a little framework that lets you create Futures using either threads, processes (both supported by PEP 3148) or generators. But I haven't tried it. And maybe the need to use 'yield' for everything that may block when using generators, but not when using threads or processes, will make this awkward.

You've already addressed the main point that I really wanted to mention here, but I'd like to emphasize it.  Blocking and not-blocking are fundamentally different programming styles, and if you sometimes allow blocking on asynchronous results, that means you are effectively always programming in the blocking-and-threaded style and not getting much benefit from the code which does choose to be politely non-blocking.

I was somewhat pleased with the changes made to the Futures PEP because you could use them as an asynchronous result, and have things that implemented the Future API but raised an exception if you tried to wait on them.  That would at least allow some layer of stdlib compatibility.  If you are disciplined and careful, this would let you write async code which used a common interoperability mechanism, and if you weren't careful, it would blow up when you tried to use it the wrong way.
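
A sketch of what I mean, using the PEP 3148 Future as a base class (the
subclass name and behavior here are my own invention):

    from concurrent.futures import Future

    class AsyncOnlyFuture(Future):
        """A Future that blows up rather than block."""
        def result(self, timeout=None):
            if not self.done():
                raise RuntimeError("would block: use add_done_callback()")
            return Future.result(self)  # already done, so this never blocks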

But - and I am guessing that this is the main thrust of this discussion - I do think that having Deferred in the standard library would be much, much better if we can do that.

> So maybe we'll be stuck with at least two Future-like APIs: PEP 3148 and something else, generator-based.

Having something "generator-based" is, in my opinion, an abstraction inversion.  The things which you are yielding from these generators are asynchronous results.  There should be a specific type for asynchronous results which can be easily interacted with.  Generators are syntactic sugar for doing that interaction in a way which doesn't involve defining tons of little functions.  This is useful, and it makes the concept more accessible, so I don't say "just" syntactic sugar: but nevertheless, the generators need to be 'yield'ing something, and the type of thing that they're yielding is a Deferred-or-something-like-it.

I don't think that this is really two 'Future-like APIs'.  At least, they're not redundant, any more than having both socket.makefile() and socket.recv() is redundant.

If Future had a deferred() method rather than an add_done_callback() method, then it would always be very clear whether you had a synchronous-but-possibly-not-ready or a purely-asynchronous result.  Although it would be equally easy to just have a function that turned a Future into a Deferred by calling add_done_callback().  You can go from any arbitrary Future to a full-featured Deferred, but not the other way around.
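
That adapter is short enough to sketch here (glossing over the detail
that PEP 3148 may run done-callbacks in a worker thread, where you'd
want reactor.callFromThread):

    from twisted.internet.defer import Deferred

    def future_to_deferred(future):
        d = Deferred()
        def done(f):
            try:
                result = f.result()  # cannot block: f is already done
            except Exception:
                d.errback()          # wraps the active exception in a Failure
            else:
                d.callback(result)
        future.add_done_callback(done)
        return d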

> Or maybe PEP 3152.


I don't like PEP 3152 aesthetically on many levels, but I can't deny that it would do the job.  'cocall', though, really?  It would be nice if it read like an actual word, i.e. "yield to" or "invoke" or even just "call" or something.

In another message, where Guido is replying to Antoine:

>> I think the main reason, though, that people find Deferreds inconvenient is that they force you to think in terms of asynchronicity (...)
> 
> Actually I think the main reason is historic: Twisted introduced callback-based asynchronous (thread-less) programming when there was no alternative in Python, and they invented both the mechanisms and the terminology as they were figuring it all out.  That is no mean feat. But with PEP 342 (generator-based coroutines) and especially PEP 380 (yield from) there *is* an alternative, and while Twisted has added APIs to support generators, it hasn't started to deprecate its other APIs, and its terminology becomes hard to follow for people (like me, frankly) who first learned this stuff through PEP 342.

I really have to go with Antoine on this one: people were confused about Deferreds long before PEP 342 came along :).  Given that Javascript environments have mostly adopted the Twisted terminology (oddly, Node.js doesn't, but Dojo and MochiKit both have pretty literal-minded Deferred translations), there are plenty of people who are familiar with the terminology but still get confused.

See the beginning of the message for why we're not deprecating our own APIs.

Once again, sorry for not compressing this down further!  If you got this far, you win a prize :).

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-ideas/attachments/20100915/15ea2789/attachment.html>

From glyph at twistedmatrix.com  Thu Sep 16 00:13:23 2010
From: glyph at twistedmatrix.com (Glyph Lefkowitz)
Date: Wed, 15 Sep 2010 18:13:23 -0400
Subject: [Python-ideas] [Python-Dev] Python needs a standard
	asynchronous return object
In-Reply-To: <20100915220952.2058.14020740.divmod.xquotient.544@localhost.localdomain>
References: <4C8AB874.9010703@openvpn.net>
	<AANLkTinohL6+8JRN6UKCeRKv5-ULUb6bjFZ+_RsewFiV@mail.gmail.com>
	<AANLkTi=E696ywpwEeXtKw_fi0MZTbEdAyVhG833pRrYy@mail.gmail.com>
	<AANLkTingRm2DVRnG7Zm8sJZbTR5StNmGATGt1QmBUhUh@mail.gmail.com>
	<AANLkTin7eRBcpt1K_RC=buE5BasmTDBwE_TzHr97BAyy@mail.gmail.com>
	<9AF93392-544C-4539-98B2-19DB2563172D@twistedmatrix.com>
	<20100915220952.2058.14020740.divmod.xquotient.544@localhost.localdomain>
Message-ID: <FEDAABE8-9356-4429-B337-CEB9EA8FA9A4@twistedmatrix.com>


On Sep 15, 2010, at 6:09 PM, exarkun at twistedmatrix.com wrote:

> 
> Glyph meant this:
> 
>   def parallel(N):
>       ds = DeferredSemaphore(N)
>       l = []
>       for x in ...:
>           l.append(ds.run(do_something_async, x))
>       return gatherResults(l).addCallback(set)
> 
> Jean-Paul

I knew it should have looked shorter and sweeter.  Thanks.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-ideas/attachments/20100915/687db6c5/attachment.html>

From daniel at stutzbachenterprises.com  Thu Sep 16 17:35:14 2010
From: daniel at stutzbachenterprises.com (Daniel Stutzbach)
Date: Thu, 16 Sep 2010 10:35:14 -0500
Subject: [Python-ideas] list.sort with a int or str key
Message-ID: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>

list.sort, sorted, and similar methods currently have a "key" argument that
accepts a callable.  Often, that leads to code looking like this:

mylist.sort(key=lambda x: x[1])
myotherlist.sort(key=lambda x: x.length)

I would like to propose that the "key" parameter be generalized to accept
str and int types, so the above code could be rewritten as follows:

mylist.sort(key=1)
myotherlist.sort(key='length')

I find the latter to be much more readable.  As a bonus, performance for
those cases would also improve.
--
Daniel Stutzbach <http://stutzbachenterprises.com>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-ideas/attachments/20100916/5f43c1ee/attachment.html>

From mwm-keyword-python.b4bdba at mired.org  Thu Sep 16 17:41:37 2010
From: mwm-keyword-python.b4bdba at mired.org (Mike Meyer)
Date: Thu, 16 Sep 2010 11:41:37 -0400
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
Message-ID: <20100916114137.51f6f90e@bhuda.mired.org>

On Thu, 16 Sep 2010 10:35:14 -0500
Daniel Stutzbach <daniel at stutzbachenterprises.com> wrote:

> list.sort, sorted, and similar methods currently have a "key" argument that
> accepts a callable.  Often, that leads to code looking like this:
> 
> mylist.sort(key=lambda x: x[1])
> myotherlist.sort(key=lambda x: x.length)
>
> I would like to propose that the "key" parameter be generalized to accept
> str and int types, so the above code could be rewritten as follows:
> 
> mylist.sort(key=1)
> myotherlist.sort(key='length')

-1

I think the idiom using the operator module tools:

mylist.sort(key=itemgetter(1))
mylist.sort(key=attrgetter('length'))

is more readable than your proposal - it makes what's going on
explicit.

	<mike
-- 
Mike Meyer <mwm at mired.org>		http://www.mired.org/consulting.html
Independent Network/Unix/Perforce consultant, email for more information.

O< ascii ribbon campaign - stop html mail - www.asciiribbon.org


From guido at python.org  Thu Sep 16 17:44:15 2010
From: guido at python.org (Guido van Rossum)
Date: Thu, 16 Sep 2010 08:44:15 -0700
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
Message-ID: <AANLkTiniL5EkAVZu5CiKqjc4soZzMiaGo1WLBvLB9Aq=@mail.gmail.com>

On Thu, Sep 16, 2010 at 8:35 AM, Daniel Stutzbach
<daniel at stutzbachenterprises.com> wrote:
> list.sort, sorted, and similar methods currently have a "key" argument that
> accepts a callable.  Often, that leads to code looking like this:
>
> mylist.sort(key=lambda x: x[1])
> myotherlist.sort(key=lambda x: x.length)
>
> I would like to propose that the "key" parameter be generalized to accept
> str and int types, so the above code could be rewritten as follows:
>
> mylist.sort(key=1)
> myotherlist.sort(key='length')
>
> I find the latter to be much more readable.

-1. I think this is too cryptic.

> As a bonus, performance for those cases would also improve.

Have you measured this? Remember that the key function is only called
N times while the number of comparisons (using the values returned
from the key function) is O(N log N).
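
For example, a quick (and admittedly crude) way to measure it:

    import timeit

    setup = ("from operator import itemgetter; "
             "data = [(i, i % 7) for i in range(10000)]")
    for stmt in ("sorted(data, key=lambda x: x[1])",
                 "sorted(data, key=itemgetter(1))"):
        print(stmt, timeit.timeit(stmt, setup=setup, number=100))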

-- 
--Guido van Rossum (python.org/~guido)


From robert.kern at gmail.com  Thu Sep 16 17:51:55 2010
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 16 Sep 2010 10:51:55 -0500
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
Message-ID: <i6teet$mol$1@dough.gmane.org>

On 9/16/10 10:35 AM, Daniel Stutzbach wrote:
> list.sort, sorted, and similar methods currently have a "key" argument that
> accepts a callable.  Often, that leads to code looking like this:
>
> mylist.sort(key=lambda x: x[1])
> myotherlist.sort(key=lambda x: x.length)
>
> I would like to propose that the "key" parameter be generalized to accept str
> and int types, so the above code could be rewritten as follows:
>
> mylist.sort(key=1)
> myotherlist.sort(key='length')
>
> I find the latter to be much more readable.  As a bonus, performance for those
> cases would also improve.

I find the latter significantly less readable because they are special cases 
that I need to remember. Right now, you can achieve the performance and arguably 
better readability using operator.itemgetter() and operator.attrgetter():

   from operator import attrgetter, itemgetter

   mylist.sort(key=itemgetter(1))
   myotherlist.sort(key=attrgetter('length'))

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
  that is made terrible by our own mad attempt to interpret it as though it had
  an underlying truth."
   -- Umberto Eco



From bruce at leapyear.org  Thu Sep 16 18:05:53 2010
From: bruce at leapyear.org (Bruce Leban)
Date: Thu, 16 Sep 2010 09:05:53 -0700
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
Message-ID: <AANLkTikTAVZUL+RKOxLHcaeFYboTdW4Bsoaaj+EicZDf@mail.gmail.com>

-1

key='length' could reasonably mean
    lambda a:a.length
or
    lambda a:a['length']

an explicit lambda or itemgetter/attrgetter is clearer.

--- Bruce
http://www.vroospeak.com
http://j.mp/gruyere-security



On Thu, Sep 16, 2010 at 8:35 AM, Daniel Stutzbach <
daniel at stutzbachenterprises.com> wrote:

> list.sort, sorted, and similar methods currently have a "key" argument that
> accepts a callable.  Often, that leads to code looking like this:
>
> mylist.sort(key=lambda x: x[1])
> myotherlist.sort(key=lambda x: x.length)
>
> I would like to propose that the "key" parameter be generalized to accept
> str and int types, so the above code could be rewritten as follows:
>
> mylist.sort(key=1)
> myotherlist.sort(key='length')
>
> I find the latter to be much more readable.  As a bonus, performance for
> those cases would also improve.
> --
> Daniel Stutzbach <http://stutzbachenterprises.com>
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-ideas/attachments/20100916/0ac89de3/attachment.html>

From solipsis at pitrou.net  Thu Sep 16 18:11:29 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 16 Sep 2010 18:11:29 +0200
Subject: [Python-ideas] list.sort with a int or str key
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
Message-ID: <20100916181129.5e39c6d4@pitrou.net>

On Thu, 16 Sep 2010 10:35:14 -0500
Daniel Stutzbach
<daniel at stutzbachenterprises.com> wrote:
> list.sort, sorted, and similar methods currently have a "key" argument that
> accepts a callable.  Often, that leads to code looking like this:
> 
> mylist.sort(key=lambda x: x[1])
> myotherlist.sort(key=lambda x: x.length)
> 
> I would like to propose that the "key" parameter be generalized to accept
> str and int types, so the above code could be rewritten as follows:
> 
> mylist.sort(key=1)
> myotherlist.sort(key='length')

It is not obvious whether key='length' should use __getitem__ or
__getattr__. Your example claims attribute lookup but an indexed lookup
would be more consistent with key=1.

I'm quite skeptical towards this. Special cases make things harder to
remember, and foreign code more difficult to read.

Regards

Antoine.




From daniel at stutzbachenterprises.com  Thu Sep 16 18:12:37 2010
From: daniel at stutzbachenterprises.com (Daniel Stutzbach)
Date: Thu, 16 Sep 2010 11:12:37 -0500
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
Message-ID: <AANLkTimB_2bhyvH-dT4JKvv-zxN8dNSDtuLEV+UfVVMh@mail.gmail.com>

Since most everyone else finds it less readable, I withdraw the proposal.

Thanks for the feedback,
--
Daniel Stutzbach, Ph.D.
President, Stutzbach Enterprises, LLC <http://stutzbachenterprises.com>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-ideas/attachments/20100916/9a716b53/attachment.html>

From raymond.hettinger at gmail.com  Thu Sep 16 20:28:32 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Thu, 16 Sep 2010 11:28:32 -0700
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
Message-ID: <5B7D2EAA-672E-4744-9D11-A9C4CA4CD7D4@gmail.com>


On Sep 16, 2010, at 8:35 AM, Daniel Stutzbach wrote:

> list.sort, sorted, and similar methods currently have a "key" argument that accepts a callable.  Often, that leads to code looking like this:
> 
> mylist.sort(key=lambda x: x[1])
> myotherlist.sort(key=lambda x: x.length)
> 
> I would like to propose that the "key" parameter be generalized to accept str and int types, so the above code could be rewritten as follows:
> 
> mylist.sort(key=1)
> myotherlist.sort(key='length')

-1 

The key= parameter is a protocol that is used across multiple tools: min(), max(), groupby(), nsmallest(), nlargest(), etc.  All of those would need to change to stay in sync.

> I find the latter to be much more readable.

It also becomes harder to learn.

Multiple signatures (int or str or other callable) create more problems than they solve.

>   As a bonus, performance for those cases would also improve.

ISTM, the performance would be about the same as you already get from attrgetter(), itemgetter(), and methodcaller().  Also, those three tools are already more flexible than the proposal, for example:

  attrgetter('lastname', 'firstname')   # key = lambda r: (r.lastname, r.firstname)
  itemgetter(0, 7)                      # key = lambda r: (r[0], r[7])
  methodcaller('get_stats', 'size')     # key = lambda r: r.get_stats('size')

We've already got a way to do it, so the proposal is basically about saving a few characters in exchange for complexifying the protocol with a form of multiple dispatch.


Raymond



From tjreedy at udel.edu  Fri Sep 17 05:11:22 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Thu, 16 Sep 2010 23:11:22 -0400
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <5B7D2EAA-672E-4744-9D11-A9C4CA4CD7D4@gmail.com>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
	<5B7D2EAA-672E-4744-9D11-A9C4CA4CD7D4@gmail.com>
Message-ID: <i6um8r$s88$1@dough.gmane.org>

On 9/16/2010 2:28 PM, Raymond Hettinger wrote:

> The key= parameter is a protocol that is used across multiple tools: min(), max(), groupby(), nsmallest(), nlargest(), etc.  All of those would need to change to stay in sync.
...

> ISTM, the performance would be about the same as you already get from attrgetter(), itemgetter(), and methodcaller().  Also, those three tools are already more flexible than the proposal, for example:
>
>  attrgetter('lastname', 'firstname')  # key = lambda r: (r.lastname, r.firstname)
>  itemgetter(0, 7)                     # key = lambda r: (r[0], r[7])
>  methodcaller('get_stats', 'size')    # key = lambda r: r.get_stats('size')

It is easy to not know about these. I think the doc set could usefully 
use an expanded entry on *key functions* (that would be a 
cross-reference link) that includes examples like the above. Currently, 
for example, the min entry has "The optional keyword-only key argument 
specifies a one-argument ordering function like that used for 
list.sort()." but there is no link and going to list.sort only adds 
"that is used to extract a comparison key from each list element: 
key=str.lower. The default value is None." Perhaps we could expand that 
and make the existing cross-references into links.

-- 
Terry Jan Reedy



From masklinn at masklinn.net  Fri Sep 17 06:49:21 2010
From: masklinn at masklinn.net (Masklinn)
Date: Fri, 17 Sep 2010 10:19:21 +0530
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <i6um8r$s88$1@dough.gmane.org>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
	<5B7D2EAA-672E-4744-9D11-A9C4CA4CD7D4@gmail.com>
	<i6um8r$s88$1@dough.gmane.org>
Message-ID: <CE890293-FCFE-4986-A530-92884E00DECE@masklinn.net>

On 2010-09-17, at 08:41, Terry Reedy wrote:
> On 9/16/2010 2:28 PM, Raymond Hettinger wrote:
>> The key= parameter is a protocol that is used across multiple tools: min(), max(), groupby(), nsmallest(), nlargest(), etc.  All of those would need to change to stay in sync.
> ...
> 
>> ISTM, the performance would be about the same as you already get from attrgetter(), itemgetter(), and methodcaller().  Also, those three tools are already more flexible than the proposal, for example:
>> 
>> attrgetter('lastname', 'firstname')  # key = lambda r: (r.lastname, r.firstname)
>> itemgetter(0, 7)                     # key = lambda r: (r[0], r[7])
>> methodcaller('get_stats', 'size')    # key = lambda r: r.get_stats('size')
> 
> It is easy to not know about these. I think the doc set could usefully use an expanded entry on *key functions* (that would be a cross-reference link) that includes examples like the above.

+1, in my experience, the operator module in general is fairly unknown and the attrgetter/itemgetter/methodcaller family criminally so.

It doesn't help that they're kind-of lost in a big bunch of text at the very bottom of the module.

From raymond.hettinger at gmail.com  Fri Sep 17 11:04:04 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Fri, 17 Sep 2010 02:04:04 -0700
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <i6um8r$s88$1@dough.gmane.org>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
	<5B7D2EAA-672E-4744-9D11-A9C4CA4CD7D4@gmail.com>
	<i6um8r$s88$1@dough.gmane.org>
Message-ID: <98438F80-5D4F-48D1-B7E3-37E991F65ED1@gmail.com>


>> ISTM, the performance would be about the same as you already get from attrgetter(), itemgetter(), and methodcaller().  Also, those three tools are already more flexible than the proposal, for example:
>> 
>> attrgetter('lastname', 'firstname')  # key = lambda r: (r.lastname, r.firstname)
>> itemgetter(0, 7)                     # key = lambda r: (r[0], r[7])
>> methodcaller('get_stats', 'size')    # key = lambda r: r.get_stats('size')
> 
> It is easy to not know about these.

FWIW, those and other sorting related topics are covered in the sorting-howto:
http://wiki.python.org/moin/HowTo/Sorting/

We link to that from the main docs for sorted():
http://docs.python.org/library/functions.html#sorted


> I think the doc set could usefully use an expanded entry on *key functions*

That might also make a useful entry to the glossary.


Raymond


P.S.   I don't know that it applies here but one limitation of the docs
is that they can get too voluminous.  Already, it is a significant time
investment just to read the doc page on builtin functions.  You can
kill a whole afternoon just reading the docs for unittest and logging.
The gestalt of the language gets lost when the docs get too fat.
Instead, I like the howto write-ups because they bring together many 
thoughts on a single topic. 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-ideas/attachments/20100917/e587674e/attachment.html>

From ncoghlan at gmail.com  Fri Sep 17 14:14:23 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 17 Sep 2010 22:14:23 +1000
Subject: [Python-ideas] list.sort with a int or str key
In-Reply-To: <i6um8r$s88$1@dough.gmane.org>
References: <AANLkTik27ch-Qkvzs45rjUfwkxsymhq0YQK+xBUac7Fx@mail.gmail.com>
	<5B7D2EAA-672E-4744-9D11-A9C4CA4CD7D4@gmail.com>
	<i6um8r$s88$1@dough.gmane.org>
Message-ID: <AANLkTi=HHWQi2DYqBDD+Yv74vDC8pBQ2e7E62H84-Rfm@mail.gmail.com>

On Fri, Sep 17, 2010 at 1:11 PM, Terry Reedy <tjreedy at udel.edu> wrote:
> It is easy to not know about these. I think the doc set could usefully use
> an expanded entry on *key functions* (that would be a cross-reference link)
> that includes examples like the above. Currently, for example, the min entry
> has "The optional keyword-only key argument specifies a one-argument
> ordering function like that used for list.sort()." but there is no link and
> going to list.sort only adds "that is used to extract a comparison key from
> each list element: key=str.lower. The default value is None." Perhaps we
> could expand that and make the existing cross-references into links.

Tracker issue to capture this idea: http://bugs.python.org/issue9886

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From lie.1296 at gmail.com  Fri Sep 17 16:40:48 2010
From: lie.1296 at gmail.com (Lie Ryan)
Date: Sat, 18 Sep 2010 00:40:48 +1000
Subject: [Python-ideas] Cofunctions: It's alive! Its alive!
In-Reply-To: <AANLkTimHqO_0ZREJLiR3mH_jsPVfTtPXODcNh1F5fodT@mail.gmail.com>
References: <4C5D0759.30606@canterbury.ac.nz>	<AANLkTi=V=WXhSa2LPk6_OGhRDRW91vAGa0eKHT0+HuEu@mail.gmail.com>
	<AANLkTi=8i7pRiC4AiDCxB=B6gE42OWDuuFQjbEY6CBp4@mail.gmail.com>
	<AANLkTikP2d6i+x+=vbmcs7ey3TksoVHFJ1kmxOmApQUU@mail.gmail.com>
	<AANLkTikk595h5VOUvGzQnsqYRL+kXLC3zs=udzxTT1=z@mail.gmail.com>
	<4C60FE37.2020303@canterbury.ac.nz>
	<AANLkTimHqO_0ZREJLiR3mH_jsPVfTtPXODcNh1F5fodT@mail.gmail.com>
Message-ID: <i6vujt$pr3$1@dough.gmane.org>

On 08/11/10 01:57, Guido van Rossum wrote:
> - Would it be sufficient if codef was a decorator instead of a
> keyword? (This new keyword in particular chafes me, since we've been
> so successful at overloading 'def' for so many meanings -- functions,
> methods, class methods, static methods, properties...)

+1. I'd like to see this implemented as a decorator (perhaps with special
casing by the VM if necessary), and see how this cofunction will be used
in wider practice before deciding whether the syntactic sugar is necessary.

The decorator could live as a built-in function or as stdlib module
(from cofunction import cofunction), and be clearly marked as experimental.
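
A strawman of what such a decorator could look like (everything here is
invented for illustration; a real one would presumably need VM support):

    from functools import wraps

    def cofunction(f):
        """Experimental marker so a scheduler can recognise cofunctions."""
        @wraps(f)
        def wrapper(*args, **kwargs):
            return f(*args, **kwargs)  # still just produces a generator
        wrapper.is_cofunction = True
        return wrapper

    @cofunction
    def countdown(n):
        while n:
            yield n
            n -= 1

    print(getattr(countdown, 'is_cofunction', False))  # True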



From raymond.hettinger at gmail.com  Fri Sep 17 21:44:53 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Fri, 17 Sep 2010 12:44:53 -0700
Subject: [Python-ideas] New 3.x restriction in list comprehensions
Message-ID: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>

In Python2, you can transform:

  r = []
  for x in 2, 4, 6:
       r.append(x*x+1)

into:

   r = [x*x+1 for x in 2, 4, 6]

In Python3, the first still works but the second gives a SyntaxError.
It wants the 2, 4, 6 to have parentheses.

The good parts of the change:
 + it matches what genexps do
 + that simplifies the grammar a bit (listcomps bodies and genexp bodies)
 + a listcomp can be reliably transformed to a genexp

The bad parts:
 + The restriction wasn't necessary (we could undo it)
 + It makes 2-to-3 conversion a bit harder
 + It no longer parallels other paren-free tuple constructions:
        return x, y
        yield x, y
        t = x, y
           ...
 + In particular, it no longer parallels regular for-loop syntax

The last part is the one that seems the most problematic.
If you write for-loops day in and day out with the unrestricted
syntax, you (or at least me) will tend to do the wrong thing when
writing a list comprehension.  It is a bit jarring to get the SyntaxError
when the code looks correct -- it took me a bit of fiddling to figure out
what was going on.

My question for the group is whether it would be a good
idea to drop the new restriction.


Raymond



From raymond.hettinger at gmail.com  Fri Sep 17 22:00:08 2010
From: raymond.hettinger at gmail.com (Raymond Hettinger)
Date: Fri, 17 Sep 2010 13:00:08 -0700
Subject: [Python-ideas] New 3.x restriction on number of keyword arguments
Message-ID: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>

One of the use cases for named tuples is to have them be automatically created from a SQL query or CSV header.  Sometimes (but not often), those can have a huge number of columns.  In Python 2.x, it worked just fine -- we had a test for a named tuple with 5000 fields.  In Python 3.x, there is a SyntaxError when there are more than 255 fields.

The origin of the change was a hack to fit positional argument counts and keyword-only argument counts in a single oparg in the python opcode encoding.

ISTM, this is an implementation specific hack and there is no reason that other implementations would have the same restriction (unless their starting point is Python's bytecode).  

The good news is that long argument lists are uncommon.  They probably only arise in cases with dynamically created functions and classes.  Most people are unaffected.

The bad news is that an implementation detail has become visible and added a language restriction.  The 255 limit seems weird to me in a version of Python that has gone to lengths to unify ints and longs so that char/short/long boundaries stop manifesting themselves to users.

Is there any support here for trying to get smarter about the keyword-only argument implementation?  The 255 limit does not seem unreasonably low, but then it was once thought that no one would ever need more than 640k of RAM.  If the new restriction isn't necessary, it would be great to remove it.
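
For anyone who wants to see it, the limit is easy to reproduce (this is
CPython 3.1 behavior; other implementations and versions may differ):

    src = "def f(%s): pass" % ", ".join("a%d=None" % i for i in range(300))
    try:
        exec(src)
        print("no limit on this interpreter")
    except SyntaxError as exc:
        print("SyntaxError:", exc)  # "more than 255 arguments" on CPython 3.x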


Raymond 

From matthew.russell at ovi.com  Fri Sep 17 22:03:34 2010
From: matthew.russell at ovi.com (Matthew Russell)
Date: Fri, 17 Sep 2010 21:03:34 +0100
Subject: [Python-ideas] New 3.x restriction in list comprehensions
In-Reply-To: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
References: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
Message-ID: <1284753814.365.142.camel@stone>

Personally, I tend to always add parens to tuple expressions since
it removes any and all ambiguity about when they're required or not.

I'd actually prefer it if parens were always required, but can
appreciate that might/would offend those who prefer otherwise.

>>> for (a, b) in d.items():
...      process(a, b)

>>> def items(t):
...    return (a, b)

Always using parens means that when refactoring one can avoid the
extra mental step of 'are the parens required in use with python feature
<F>?'

Additionally, in some language features, the use of parens has become
required to squash warts:

>>> try:
...     a = b[k]
... except (KeyError, IndexError), no_item:
...     a = handle(no_item)


Regards,
Matt

On Fri, 2010-09-17 at 12:44 -0700, Raymond Hettinger wrote: 
> In Python2, you can transform:
>   r = []
>   for x in 2, 4, 6:
>        r.append(x*x+1)
> 
> into:
> 
>    r = [x*x+1 for x in 2, 4, 6]
> 
> In Python3, the first still works but the second gives a SyntaxError.
> It wants the 2, 4, 6 to have parentheses.
> 
> The good parts of the change:
>  + it matches what genexps do
>  + that simplifies the grammar a bit (listcomps bodies and genexp bodies)
>  + a listcomp can be reliably transformed to a genexp
> 
> The bad parts:
>  + The restriction wasn't necessary (we could undo it)
>  + It makes 2-to-3 conversion a bit harder
>  + It no longer parallels other paren-free tuple constructions:
>         return x, y
>         yield x, y
>         t = x, y
>            ...
>  + In particular, it no longer parallels regular for-loop syntax
> 
> The last part is the one that seems the most problematic.
> If you write for-loops day in and day out with the unrestricted
> syntax, you (or at least me) will tend to do the wrong thing when
> writing a list comprehension.  It is a bit jarring to get the SyntaxError
> when the code looks correct -- it took me a bit of fiddling to figure out
> what was going on.
> 
> My question for the group is whether it would be a good
> idea to drop the new restriction.
> 
> 
> Raymond
> 
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas


--------------------------------------------------------------
Ovi Mail: Making email access easy
http://mail.ovi.com



From python at mrabarnett.plus.com  Fri Sep 17 22:23:49 2010
From: python at mrabarnett.plus.com (MRAB)
Date: Fri, 17 Sep 2010 21:23:49 +0100
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
References: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
Message-ID: <4C93CE55.1030308@mrabarnett.plus.com>

On 17/09/2010 21:00, Raymond Hettinger wrote:
> One of the use cases for named tuples is to have them be
> automatically created from a SQL query or CSV header.  Sometimes (but
> not often), those can have a huge number of columns.  In Python 2.x,
> it worked just fine -- we had a test for a named tuple with 5000
> fields.  In Python 3.x, there is a SyntaxError when there are more
> than 255 fields.
>
> The origin of the change was a hack to fit positional argument counts
> and keyword-only argument counts in a single oparg in the python
> opcode encoding.
>
> ISTM, this is an implementation specific hack and there is no reason
> that other implementations would have the same restriction (unless
> their starting point is Python's bytecode).
>
> The good news is that long argument lists are uncommon.  They
> probably only arise in cases with dynamically created functions and
> classes.  Most people are unaffected.
>
> The bad news is that an implementation detail has become visible and
> added a language restriction.  The 255 limit seems weird to me in a
> version of Python that has gone to lengths to unify ints and longs so
> that char/short/long boundaries stop manifesting themselves to
> users.
>
> Is there any support here for trying to get smarter about the
> keyword-only argument implementation?  The 255 limit does not seem
> unreasonably low, but then it was once thought that no one would ever
> need more than 640k of RAM.  If the new restriction isn't necessary,
> it would be great to remove it.
>
Strings can be any length, lists can be any length, even the humble int
can be any length!

It does seem unPythonic to have a low limit like that.

I think that the implementation hack needs a bit of a rethink if that's
what it's causing, IMHO.


From python at mrabarnett.plus.com  Fri Sep 17 22:27:37 2010
From: python at mrabarnett.plus.com (MRAB)
Date: Fri, 17 Sep 2010 21:27:37 +0100
Subject: [Python-ideas] New 3.x restriction in list comprehensions
In-Reply-To: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
References: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
Message-ID: <4C93CF39.5090406@mrabarnett.plus.com>

On 17/09/2010 20:44, Raymond Hettinger wrote:
> In Python2, you can transform:
>
>    r = []
>    for x in 2, 4, 6:
>         r.append(x*x+1)
>
> into:
>
>     r = [x*x+1 for x in 2, 4, 6]
>
> In Python3, the first still works but the second gives a SyntaxError.
> It wants the 2, 4, 6 to have parentheses.
>
> The good parts of the change:
>   + it matches what genexps do
>   + that simplifies the grammar a bit (listcomps bodies and genexp bodies)
>   + a listcomp can be reliably transformed to a genexp
>
> The bad parts:
>   + The restriction wasn't necessary (we could undo it)
>   + It makes 2-to-3 conversion a bit harder
>   + It no longer parallels other paren-free tuple constructions:
>          return x, y
>          yield x, y
>          t = x, y
>             ...
>   + In particular, it no longer parallels regular for-loop syntax
>
> The last part is the one that seems the most problematic.
> If you write for-loops day in and day out with the unrestricted
> syntax, you (or at least me) will tend to do the wrong thing when
> writing a list comprehension.  It is a bit jarring to get the SyntaxError
> when the code looks correct -- it took me a bit of fiddling to figure out
> what was going on.
>
> My question for the group is whether it would be a good
> idea to drop the new restriction.
>
Listcomps look more like genexps than for loops, so they should
probably have the same syntax restrictions (or lack thereof), IMHO.


From solipsis at pitrou.net  Fri Sep 17 23:11:46 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Fri, 17 Sep 2010 23:11:46 +0200
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
References: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
Message-ID: <20100917231146.23f0cef1@pitrou.net>

On Fri, 17 Sep 2010 13:00:08 -0700
Raymond Hettinger
<raymond.hettinger at gmail.com> wrote:
> One of the use cases for named tuples is to have them be automatically created from a SQL
> query or CSV header.  Sometimes (but not often), those can have a huge number of columns.  In 
> Python 2.x, it worked just fine -- we had a test for a named tuple with 5000 fields.  In
> Python 3.x, there is a SyntaxError when there are more than 255 fields.

I don't understand your explanation. You can't pass a namedtuple using
the **kw convention:

>>> import collections
>>> T = collections.namedtuple('a', 'b c d')
>>> t = T(1,2,3)
>>> def f(**a): pass
... 
>>> f(**t)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: f() argument after ** must be a mapping, not a


Besides, even if that worked, you are doing an intermediate conversion
to a dict, which is wasteful. Why not simply pass the namedtuple as a
regular parameter?

> The bad news is that an implementation detail has become visible and added a language
> restriction.  The 255 limit seems weird to me in a version of Python that has gone to lengths
> to unify ints and longs so that char/short/long boundaries stop manifesting themselves to users.

Well, it sounds like a theoretical worry of no practical value to me.
The **kw notation is meant to marshal passing of actual keyword args,
which are going to be explicitly typed in either at the call site or at
the function definition site (ignoring any proxies in-between). Nobody
is going to type more than 255 keyword arguments by hand. And there's
generated code, but since it's generated they can easily find a
workaround anyway.

> If the new restriction isn't necessary, it would be great to remove it.

I assume the restriction is useful since, according to your explanation,
it improves the encoding of opcodes.

Of course, we could switch bytecode to use a standard 32-bit word
size, but someone has to propose a patch.

Regards

Antoine.




From cs at zip.com.au  Fri Sep 17 23:05:46 2010
From: cs at zip.com.au (Cameron Simpson)
Date: Sat, 18 Sep 2010 07:05:46 +1000
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <4C93CE55.1030308@mrabarnett.plus.com>
References: <4C93CE55.1030308@mrabarnett.plus.com>
Message-ID: <20100917210546.GA32088@cskk.homeip.net>

On 17Sep2010 21:23, MRAB <python at mrabarnett.plus.com> wrote:
| On 17/09/2010 21:00, Raymond Hettinger wrote:
| >One of the use cases for named tuples is to have them be
| >automatically created from a SQL query or CSV header.  Sometimes (but
| >not often), those can have a huge number of columns.  In Python 2.x,
| >it worked just fine -- we had a test for a named tuple with 5000
| >fields.  In Python 3.x, there is a SyntaxError when there are more
| >than 255 fields.
| >
| >The origin of the change was a hack to fit positional argument counts
| >and keyword-only argument counts in a single oparg in the python
| >opcode encoding.
[...]
| >Is there any support here for trying to get smarter about the
| >keyword-only argument implementation? [...]
|
| Strings can be any length, lists can be any length, even the humble int
| can be any length!
| It does seem unPythonic to have a low limit like that.

A big +10 from me. Implementation internals should not cause language
level limitations.

If there's an (entirely reasonable IMHO) desire to keep
the opcode small, the count should be encoded in a compact but extendable
form.  (I speak here with no idea how inflexible the opcode readers are.)

As an example, I use a personal encoding for natural numbers scheme
where values below 128 fit in one byte, 128 or more set the top bit on
leading bytes to indicate followon bytes, so values up to 16383 fit in
two bytes and so on arbitrarily. Compact and simple but unbounded.

Is something like that tractable for the Python opcodes?
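
In Python, for concreteness (this is just my scheme above, not anything
the interpreter actually uses):

    def encode_natural(n):
        # big-endian base-128: high bit set on every byte but the last
        out = [n & 0x7F]
        n >>= 7
        while n:
            out.append((n & 0x7F) | 0x80)
            n >>= 7
        return bytes(reversed(out))

    def decode_natural(data):
        n = 0
        for i, b in enumerate(data):
            n = (n << 7) | (b & 0x7F)
            if not (b & 0x80):
                return n, data[i + 1:]
        raise ValueError("truncated encoding")

    assert encode_natural(127) == b'\x7f'        # one byte below 128
    assert encode_natural(16383) == b'\xff\x7f'  # two bytes up to 16383
    assert decode_natural(encode_natural(300))[0] == 300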

Cheers,
-- 
Cameron Simpson <cs at zip.com.au> DoD#743
http://www.cskk.ezoshosting.com/cs/

I am returning this otherwise good typing paper to you because someone has
printed gibberish all over it and put your name at the top.
        - English Professor, Ohio University


From solipsis at pitrou.net  Fri Sep 17 23:21:33 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Fri, 17 Sep 2010 23:21:33 +0200
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
References: <4C93CE55.1030308@mrabarnett.plus.com>
	<20100917210546.GA32088@cskk.homeip.net>
Message-ID: <20100917232133.6088424a@pitrou.net>

On Sat, 18 Sep 2010 07:05:46 +1000
Cameron Simpson <cs at zip.com.au> wrote:
> 
> As an example, I use a personal encoding for natural numbers scheme
> where values below 128 fit in one byte, 128 or more set the top bit on
> leading bytes to indicate followon bytes, so values up to 16383 fit in
> two bytes and so on arbitrarily. Compact and simple but unbounded.

Well, you are proposing that we (Python core maintainers) live with
additional complication in one of the most central and critical parts of
the interpreter, just so that we satisfy some theoretical impulse for
"consistency". That doesn't sound reasonable.

(and, sure, the variable-length encoding wouldn't be very complicated;
it would still be more complicated than it needs to be, and that's
already a problem)

For the record, have you been hit by this problem, or do you even think
you might be hit by it in the near future?

Thank you

Antoine.




From tjreedy at udel.edu  Fri Sep 17 23:32:04 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Fri, 17 Sep 2010 17:32:04 -0400
Subject: [Python-ideas] New 3.x restriction in list comprehensions
In-Reply-To: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
References: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
Message-ID: <i70mol$duj$1@dough.gmane.org>

On 9/17/2010 3:44 PM, Raymond Hettinger wrote:
> In Python2, you can transform:
>
>    r = []
>    for x in 2, 4, 6:
>         r.append(x*x+1)

   for x in 2,4,6:
     yield x*x+1

also works in 2/3.x
>
> into:
>
>     r = [x*x+1 for x in 2, 4, 6]
>
> In Python3, the first still works but the second gives a SyntaxError.
> It wants the 2, 4, 6 to have parentheses.
>
> The good parts of the change:
>   + it matches what genexps do

Is the restriction necessary for genexps? If the parser could handle
[x*x+1 for x in 2, 4, 6]
is
(x*x+1 for x in 2, 4, 6)
impossible, perhaps due to paren confusion?

>   + that simplifies the grammar a bit (listcomps bodies and genexp bodies)
>   + a listcomp can be reliably transformed to a genexp
>
> The bad parts:
>   + The restriction wasn't necessary (we could undo it)
>   + It makes 2-to-3 conversion a bit harder
>   + It no longer parallels other paren-free tuple constructions:
>          return x, y
>          yield x, y
>          t = x, y
>             ...
>   + It particular, it no longer parallels regular for-loop syntax
>
> The last part is the one that seems the most problematic.
> If you write for-loops day in and day out with the unrestricted
> syntax, you (or least me) will tend to do the wrong thing when
> writing a list comprehension.  It is a bit jarring to get the SyntaxError
> when the code looks correct -- it took me a bit of fiddling to figure-out
> what was going on.
>
> My question for the group is whether it would be a good
> idea to drop the new restriction.

3.x is in a sense more consistent than 2.x in that converting a for loop 
with a bare tuple always requires addition of parentheses rather than 
just sometimes. Never requiring parens would be even better to me if it 
did not make the implementation too messy.

-- 
Terry Jan Reedy



From tjreedy at udel.edu  Fri Sep 17 23:50:00 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Fri, 17 Sep 2010 17:50:00 -0400
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
References: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
Message-ID: <i70nq9$hte$1@dough.gmane.org>

On 9/17/2010 4:00 PM, Raymond Hettinger wrote:
> One of the use cases for named tuples is to have them be
> automatically created from a SQL query or CSV header.  Sometimes (but
> not often), those can have a huge number of columns.  In Python 2.x,
> it worked just fine -- we had a test for a named tuple with 5000
> fields.  In Python 3.x, there is a SyntaxError when there are more
> than 255 fields.

So, when the test failed due to the code change, the test was simply 
removed?

> The origin of the change was a hack to fit positional argument counts
> and keyword-only argument counts in a single oparg in the python
> opcode encoding.

I do not remember any discussion of adding such a language restriction, 
though I could have forgotten or missed it. As near as I can tell, it is 
undocumented. While there are undocumented limits to the interpreter, 
like nesting depth, this one is so low that I would consider the 
discrepancy between doc and behavior a bug.

-- 
Terry Jan Reedy



From alexander.belopolsky at gmail.com  Fri Sep 17 23:50:15 2010
From: alexander.belopolsky at gmail.com (Alexander Belopolsky)
Date: Fri, 17 Sep 2010 17:50:15 -0400
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
References: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
Message-ID: <3F05AB9C-2353-429F-8343-9777C4F2F874@gmail.com>





On Sep 17, 2010, at 4:00 PM, Raymond Hettinger <raymond.hettinger at gmail.com> wrote:
..
> 
> Is there any support here for trying to get smarter about the keyword-only argument implementation?  The 255 limit does not seem unreasonably low, but then it was once thought that no one would ever need more than 640k of RAM.  If the new restriction isn't necessary, it would be great to remove 

This has been requested before, but rejected for the lack of a valid use case. See issue 1636.   I think supporting huge named tuples for the benefit of database applications is a valid use case. 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-ideas/attachments/20100917/d76508c0/attachment.html>

From cs at zip.com.au  Fri Sep 17 23:56:55 2010
From: cs at zip.com.au (Cameron Simpson)
Date: Sat, 18 Sep 2010 07:56:55 +1000
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <20100917232133.6088424a@pitrou.net>
References: <20100917232133.6088424a@pitrou.net>
Message-ID: <20100917215655.GA7813@cskk.homeip.net>

On 17Sep2010 23:21, Antoine Pitrou <solipsis at pitrou.net> wrote:
| On Sat, 18 Sep 2010 07:05:46 +1000
| Cameron Simpson <cs at zip.com.au> wrote:
| > As an example, I use a personal encoding for natural numbers scheme
| > where values below 128 fit in one byte, 128 or more set the top bit on
| > leading bytes to indicate followon bytes, so values up to 16383 fit in
| > two bytes and so on arbitrarily. Compact and simple but unbounded.
| 
| Well, you are proposing that we (Python core maintainers) live with
| additional complication in one of the most central and critical parts of
| the interpreter, just so that we satisfy some theoretical impulse for
| "consistency". That doesn't sound reasonable. [...]
| For the record, have you been hit by this problem, or do you even think
| you might be hit by it in the near future?

Me, no. But arbitrary _syntactic_ constraints in an otherwise flexible
language grate. I was only suggesting a compactness-supporting approach,
not lobbying very hard for making the devs use it.

I'm +10 on removing the syntactic constraint, not on hacking the opcode
definitons.

Cheers,
-- 
Cameron Simpson <cs at zip.com.au> DoD#743
http://www.cskk.ezoshosting.com/cs/

Withdrawing in disgust is not the same as conceding.
        - Jon Adams <jadams at sea06f.sea06.navy.mil>


From dirkjan at ochtman.nl  Sat Sep 18 00:00:57 2010
From: dirkjan at ochtman.nl (Dirkjan Ochtman)
Date: Sat, 18 Sep 2010 00:00:57 +0200
Subject: [Python-ideas] New 3.x restriction in list comprehensions
In-Reply-To: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
References: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
Message-ID: <AANLkTin8M+cRhou9VM4igb8UZ-9zRis4fc4x6zce_WQP@mail.gmail.com>

On Fri, Sep 17, 2010 at 21:44, Raymond Hettinger
<raymond.hettinger at gmail.com> wrote:
> My question for the group is whether it would be a good
> idea to drop the new restriction.

I like the restriction and would actually advocate having it for
regular for-loops too (though that would be a big no-no, I guess).

Here's why I never use them without parenthesis, in python 2:

>>> (1 if True else 3, 4)
(1, 4)
>>> (lambda x: x * x, 6)
(<function <lambda> at 0x100475ed8>, 6)
>>> [i for i in 2, 3]
[2, 3]
>>> (i for i in 2, 3)
  File "<stdin>", line 1
    (i for i in 2, 3)
                 ^
SyntaxError: invalid syntax

And in Python 3:

>>> (1 if True else 3, 4)
(1, 4)
>>> (lambda x: x * x, 6)
(<function <lambda> at 0x7f4ef41785a0>, 6)
>>> [i for i in 2, 3]
  File "<stdin>", line 1
    [i for i in 2, 3]
                 ^
SyntaxError: invalid syntax
>>> (i for i in 2, 3)
  File "<stdin>", line 1
    (i for i in 2, 3)
                 ^
SyntaxError: invalid syntax

Cheers,

Dirkjan


From guido at python.org  Sat Sep 18 02:16:39 2010
From: guido at python.org (Guido van Rossum)
Date: Fri, 17 Sep 2010 17:16:39 -0700
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
References: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
Message-ID: <AANLkTin=67t6uyJGigcNmVbSbOFZ1XV0Spvp-yUtP9kj@mail.gmail.com>

On Fri, Sep 17, 2010 at 1:00 PM, Raymond Hettinger
<raymond.hettinger at gmail.com> wrote:
> One of the use cases for named tuples is to have them be automatically created from a SQL query or CSV header.  Sometimes (but not often), those can have a huge number of columns.  In Python 2.x, it worked just fine -- we had a test for a named tuple with 5000 fields.  In Python 3.x, there is a SyntaxError when there are more than 255 fields.
>
> The origin of the change was a hack to fit positional argument counts and keyword-only argument counts in a single oparg in the python opcode encoding.
>
> ISTM, this is an implementation specific hack and there is no reason that other implementations would have the same restriction (unless their starting point is Python's bytecode).
>
> The good news is that long argument lists are uncommon.  They probably only arise in cases with dynamically created functions and classes.  Most people are unaffected.
>
> The bad news is that an implementation detail has become visible and added a language restriction.  The 255 limit seems weird to me in a version of Python that has gone to lengths to unify ints and longs so that char/short/long boundaries stop manifesting themselves to users.
>
> Is there any support here for trying to get smarter about the keyword-only argument implementation?  The 255 limit does not seem unreasonably low, but then it was once thought that no one would ever need more than 640k of RAM.  If the new restriction isn't necessary, it would be great to remove it.

+256 on removing this limit from the language.

I've come across code generators that produced quite insane-looking
code that worked perfectly fine because Python's grammar has no (or
very large) limits, and I consider this a language feature. I've also
written code where there was a good reason to use **kwds in the
function definition and another good reason to pass **kwds to the call
where the kwds passed could be huge.

-- 
--Guido van Rossum (python.org/~guido)


From guido at python.org  Sat Sep 18 02:18:21 2010
From: guido at python.org (Guido van Rossum)
Date: Fri, 17 Sep 2010 17:18:21 -0700
Subject: [Python-ideas] New 3.x restriction in list comprehensions
In-Reply-To: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
References: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
Message-ID: <AANLkTi=GiLokRBH4XzvhYqQ8Sfz139wkSCxbZWwA0Qe=@mail.gmail.com>

On Fri, Sep 17, 2010 at 12:44 PM, Raymond Hettinger
<raymond.hettinger at gmail.com> wrote:
> In Python2, you can transform:
>
>   r = []
>   for x in 2, 4, 6:
>        r.append(x*x+1)
>
> into:
>
>    r = [x*x+1 for x in 2, 4, 6]
>
> In Python3, the first still works but the second gives a SyntaxError.
> It wants the 2, 4, 6 to have parentheses.
>
> The good parts of the change:
>  + it matches what genexps do
>  + that simplifies the grammar a bit (listcomps bodies and genexp bodies)
>  + a listcomp can be reliably transformed to a genexp
>
> The bad parts:
>  + The restriction wasn't necessary (we could undo it)
>  + It makes 2-to-3 conversion a bit harder
>  + It no longer parallels other paren-free tuple constructions:
>         return x, y
>         yield x, y
>         t = x, y
>            ...
>  + In particular, it no longer parallels regular for-loop syntax
>
> The last part is the one that seems the most problematic.
> If you write for-loops day in and day out with the unrestricted
> syntax, you (or at least me) will tend to do the wrong thing when
> writing a list comprehension.  It is a bit jarring to get the SyntaxError
> when the code looks correct -- it took me a bit of fiddling to figure out
> what was going on.
>
> My question for the group is whether it would be a good
> idea to drop the new restriction.

This was intentional. It parallels genexps and it avoids an ambiguity
(for the human reader -- I know the parser has no problem with it :-).

Please don't change this back. (It would violate the moratorium too...)

-- 
--Guido van Rossum (python.org/~guido)


From ncoghlan at gmail.com  Sat Sep 18 09:28:42 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 18 Sep 2010 17:28:42 +1000
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <20100917231146.23f0cef1@pitrou.net>
References: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
	<20100917231146.23f0cef1@pitrou.net>
Message-ID: <AANLkTikVWfNu00SksvpeMx_nxng_wEu0CtHK1rUsoxoA@mail.gmail.com>

On Sat, Sep 18, 2010 at 7:11 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Fri, 17 Sep 2010 13:00:08 -0700
> Raymond Hettinger
> <raymond.hettinger at gmail.com> wrote:
>> One of the use cases for named tuples is to have them be automatically created from a SQL
>> query or CSV header.  Sometimes (but not often), those can have a huge number of columns.  In
>> Python 2.x, it worked just fine -- we had a test for a named tuple with 5000 fields.  In
>> Python 3.x, there is a SyntaxError when there are more than 255 fields.
>
> I don't understand your explanation. You can't pass a namedtuple using
> the **kw convention:

But you do need to *initialise* the named tuple after you create it.
If it's a big tuple, then all of those field values need to be passed
in either as positional arguments or as keyword arguments. A
restriction to 255 parameters means that named tuples with more than
255 fields become a lot less useful.

Merging the parameter count into the opcode as an optimisation when
the number of parameters is < 256 is fine. *Disallowing* parameter
counts > 255 is not.
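
For concreteness, a minimal reproduction of the failure mode (assuming
a 3.1/3.2-era interpreter, as described in this thread):

    from collections import namedtuple

    fields = ["f%d" % i for i in range(300)]
    Big = namedtuple("Big", fields)   # SyntaxError: the generated
                                      # __new__ has 300 parameters
    # row = Big(*range(300))          # what worked on 2.x, even at 5000 fields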

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From ncoghlan at gmail.com  Sat Sep 18 09:39:11 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sat, 18 Sep 2010 17:39:11 +1000
Subject: [Python-ideas] New 3.x restriction in list comprehensions
In-Reply-To: <AANLkTin8M+cRhou9VM4igb8UZ-9zRis4fc4x6zce_WQP@mail.gmail.com>
References: <1F0CB196-F980-4B3D-B2F1-1969C35FE580@gmail.com>
	<AANLkTin8M+cRhou9VM4igb8UZ-9zRis4fc4x6zce_WQP@mail.gmail.com>
Message-ID: <AANLkTik_=ZkQh+ZJ6WgiXN_atpH9s1dMyN4SN+bB8V88@mail.gmail.com>

On Sat, Sep 18, 2010 at 8:00 AM, Dirkjan Ochtman <dirkjan at ochtman.nl> wrote:
> On Fri, Sep 17, 2010 at 21:44, Raymond Hettinger
> <raymond.hettinger at gmail.com> wrote:
>> My question for the group is whether it would be a good
>> idea to drop the new restriction.
>
> I like the restriction and would actually advocate having it for
> regular for-loops too (though that would be a big no-no, I guess).

Yep, I tend to parenthesise tuples even when it isn't strictly
necessary as well. Even if the parser doesn't care, it makes it a lot
easier for human readers (including myself when I have to go back and
read that code). (I have similar objections to people who rely too
heavily on precedence ordering in complicated expressions - even if
the compiler understands them correctly, many readers won't know the
precedence table off by heart. Judicious use of parentheses turns code
those readers would otherwise have to puzzle over into something that
is obviously correct at a glance.)
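
For instance (illustrative):

    r = [x*x + 1 for x in (2, 4, 6)]   # parens required in 3.x anyway
    t = (x, y)                         # optional, but obvious at a glance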

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From greg.ewing at canterbury.ac.nz  Sat Sep 18 10:29:02 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 18 Sep 2010 20:29:02 +1200
Subject: [Python-ideas] New 3.x restriction on number of
	keyword	arguments
In-Reply-To: <20100917210546.GA32088@cskk.homeip.net>
References: <4C93CE55.1030308@mrabarnett.plus.com>
	<20100917210546.GA32088@cskk.homeip.net>
Message-ID: <4C94784E.1040702@canterbury.ac.nz>

Cameron Simpson wrote:

> If there's an (entirely reasonable IMHO) desire to keep
> the opcode small, the count should be encoded in a compact but extendable
> form.

I suspect it's more because it was easier to do it that
way than to track down all the places that assume a bytecode
never has more than one 16-bit operand.

-- 
Greg


From lie.1296 at gmail.com  Sat Sep 18 16:23:59 2010
From: lie.1296 at gmail.com (Lie Ryan)
Date: Sun, 19 Sep 2010 00:23:59 +1000
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
References: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
Message-ID: <i72i06$5bs$1@dough.gmane.org>

On 09/18/10 06:00, Raymond Hettinger wrote:
> The good news is that long argument lists are uncommon.  They
> probably only arise in cases with dynamically created functions and
> classes.  Most people are unaffected.

How about showing a Warning when trying to create a large namedtuple?
The Warning could contain a reference to a tracker issue, and explain
that anyone who really, really needs this limitation removed should
say so in that issue. That way we don't complicate the code
unnecessarily without evidence of real usage.

In Python, classes are largely syntactic sugar for a dictionary
anyway; anyone who needs such a large namedtuple should probably
reconsider and use a dictionary, a list, or a real class instead.
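
A sketch of what that might look like inside the namedtuple factory
(the threshold, wording, and helper name are all hypothetical):

    import warnings

    def _warn_if_huge(field_names):
        # 255 mirrors the compiler's argument limit discussed here
        if len(field_names) > 255:
            warnings.warn("namedtuple with %d fields will not compile on "
                          "this interpreter; see the tracker issue about "
                          "the 255-argument limit" % len(field_names))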



From taleinat at gmail.com  Sun Sep 19 11:08:28 2010
From: taleinat at gmail.com (Tal Einat)
Date: Sun, 19 Sep 2010 11:08:28 +0200
Subject: [Python-ideas] New 3.x restriction on number of keyword
	arguments
In-Reply-To: <i72i06$5bs$1@dough.gmane.org>
References: <589C8BF5-F11F-4E10-A7ED-6627EF625E1C@gmail.com>
	<i72i06$5bs$1@dough.gmane.org>
Message-ID: <AANLkTin4dGqYf58e9ZL3Bc5UqUnEmijD+JLiZmDiNHgU@mail.gmail.com>

Lie Ryan wrote:

> [...]

+1 on removing the restriction, just because I find large namedtuples
useful.

I work with large tables of data and often use namedtuples for their
compactness. Python dictionaries have a large memory overhead compared to
tuples. This restriction could seriously hamper my future efforts to migrate
to Python 3.
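
The overhead is easy to see (illustrative; exact numbers vary by build
and version):

    import sys
    from collections import namedtuple

    Point = namedtuple("Point", "x y")
    print(sys.getsizeof(Point(1, 2)))       # tuple-sized record
    print(sys.getsizeof({"x": 1, "y": 2}))  # dict: several times larger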

- Tal Einat

From james at openvpn.net  Mon Sep 20 23:41:35 2010
From: james at openvpn.net (James Yonan)
Date: Mon, 20 Sep 2010 15:41:35 -0600
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
Message-ID: <4C97D50F.1000908@openvpn.net>

I think that Glyph hit the nail on the head when he said that "you can 
go from any arbitrary Future to a full-featured Deferred, but not the 
other way around."

This is exactly my concern, and the reason why I think it's important 
for Python to standardize on an async result type that is sufficiently 
general that it can accommodate the different kinds of async semantics 
in common use in the Python world today.

If you don't think this is a problem, just Google for "twisted vs. 
tornado".  While the debate is sometimes passionate and rude, it points 
to the fragmentation that has occurred in the Python async space due to 
the lack of direction from the standard library.  And there's a real 
cost to this fragmentation -- it's not easy to build an application that 
uses different async frameworks when there's no standardized result 
object or reactor model.

My concern is that PEP 3148 was really designed for the purpose of 
thread and process pooling, and that the Future object is designed with 
the minimum functionality required to achieve this end.  The problem is 
that the Future object starts to look like a stripped-down version of a 
Twisted Deferred.  And that raises the question of why we are 
standardizing on the special case and not the general case.

Wouldn't it be better to break this into two problems:

* Develop a full-featured standard async result type and reactor model 
to facilitate interoperability of different async libraries.  This would 
consist of a standard async result type and an abstract base class for a 
reactor model.

* Let PEP 3148 focus on the problem of thread and process pooling and 
leverage on the above async result type.

The semantics that a general async type should support include:

1. Semantics that allow you to define a callback channel for results 
and, optionally, a separate channel for exceptions as well.

2. Semantics that offer the flexibility of working with async results at 
the callback level or at the generator level (having a separate channel 
for exceptions makes it easy for the generator decorator implementation 
(that facilitates "yield function_returning_async_object()") to dispatch 
exceptions into the caller).

3. Semantics that can easily be used to pass results and exceptions back 
from thread or process pools.

4. Semantics that allow for aggregate processing of parallel 
asynchronous results, such as "fire async result when all of the async 
results in an async set have fired" or "fire async result when the first 
result from an async set has fired."

Deferreds presently support all of the above.  My point here is not so 
much that Deferreds should be the standard, but that whatever standard 
is chosen, that the semantics be general enough that different async 
Python libraries/platforms can interoperate.
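
To make points 1 and 3 concrete, here is a toy async result with
separate success and error channels -- a sketch for discussion only,
not Twisted's Deferred and not a proposed API:

    class AsyncResult:
        def __init__(self):
            self._waiters = []
            self._fired = False
            self._is_error = False
            self._value = None

        def add_callbacks(self, on_result, on_error=None):
            self._waiters.append((on_result, on_error))
            if self._fired:
                self._flush()
            return self

        def fire(self, value):              # results channel
            self._settle(value, False)

        def fire_error(self, exc_info):     # separate exceptions channel
            self._settle(exc_info, True)

        def _settle(self, value, is_error):
            self._fired, self._value, self._is_error = True, value, is_error
            self._flush()

        def _flush(self):
            while self._waiters:
                on_result, on_error = self._waiters.pop(0)
                if self._is_error:
                    if on_error is not None:
                        on_error(self._value)
                else:
                    on_result(self._value)

A worker thread or event loop calls fire()/fire_error(); either end of
a thread or process pool could do the same, which is the sense in which
point 3 comes along for free.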

James

> Thanks for the ping about this (I don't think I subscribe to python-ideas, so someone may have to moderate my post in).  Sorry for the delay in responding, but I've been kinda busy and cooking up these examples took a bit of thinking.
> 
> And thanks, James, for restarting this discussion.  I obviously find it interesting :).
> 
> I'm going to mix in some other stuff I found on the web archives, since it's easiest just to reply in one message.  I'm sorry that this response is a bit sprawling and doesn't have a single clear narrative; the thread thus far didn't seem to lend itself to one.
> 
> For those of you who don't want to read my usual novel-length post, you can probably stop shortly after the end of the first block of code examples.
> 
> On Sep 11, 2010, at 10:26 PM, Guido van Rossum wrote:
> 
>>>> although he didn't say what
>>>> deferreds really added beyond what futures provide, and why the
>>>> "add_done_callback" method isn't adequate to provide interoperability
>>>> between futures and deferreds (which would be odd, since Brian made
>>>> changes to that part of PEP 3148 to help with that interoperability
>>>> after discussions with Glyph).
>>>> 
>>>> Between PEP 380 and PEP 3148 I'm not really seeing a lot more scope
>>>> for standardisation in this space though.
>>>> 
>>>> Cheers,
>>>> Nick.
>>> 
>>> That was my initial reaction as well, but I'm more than open to
>>> hearing from Jean Paul/Glyph and the other twisted folks on this.
> 
>> But thinking about this more I don't know that it will be easy to mix
>> PEP 3148, which is solidly thread-based, with a PEP 342 style
>> scheduler (whether or not the PEP 380 enhancements are applied, or
>> even PEP 3152). And if we take the OP's message at face value, his
>> point isn't so much that Twisted is great, but that in order to
>> benefit maximally from PEP 342 there needs to be a standard way of
>> using callbacks. I think that's probably true. And comparing the
>> blog's examples to PEP 3148, I find Twisted's terminology rather
>> confusing compared to the PEP's clean Futures API (where IMO you can
>> ignore almost everything except result()).
> 
> That blog post was written to demonstrate why programs using generators are "... far easier to read and write ..." than ones using Deferreds, so it stands to reason it would choose an example where that helps :).
> 
> When you want to write systems that manage varying levels of parallelism within a single computation, generators can start to get pretty hairy and the "normal" Deferred way of doing things looks more straightforward.
> 
> Thinking in terms of asynchronicity is tricky, and generators can be a useful tool for promoting that understanding, but they only make it superficially easier.  For example:
> 
>>>> def serial():
>>>>     results = set()
>>>>     for x in ...:
>>>>         results.add((yield do_something_async(x)))
>>>>     return results
> 
> If you're writing an application whose parallelism calls for an asynchronous approach, after all, you presumably don't want to be standing around waiting for each network round trip to complete.  How do you re-write this so that there are always at least N outstanding do_something_async calls running in parallel?
> 
> You can sorta do it like this:
> 
>>>> def parallel(N):
>>>>     results = set()
>>>>     outstanding = []
>>>>     for x in ...:
>>>>         outstanding.append(do_something_async(x))
>>>>         if len(outstanding) > N:
>>>>             results.add((yield outstanding.pop(0)))
>>>>     while outstanding:
>>>>         results.add((yield outstanding.pop(0)))
> 
> but that will always block on one particular do_something_async, when you really want to say "let me know when any outstanding call is complete".  So I could handwave about 'yield any_completed(outstanding)'...
> 
>>>> def parallel(N):
>>>>     results = set()
>>>>     outstanding = set()
>>>>     for x in ...:
>>>>         outstanding.add(do_something_async(x))
>>>>         if len(outstanding) > N:
>>>>             results.add((yield any_completed(outstanding)))
>>>>     while outstanding:
>>>>         results.add((yield any_completed(outstanding)))
> 
> but that just raises the question of how you implement any_completed(), and I can't think of a way to do that with generators, without getting into the specifics of some Deferred-or-Future-like asynchronous result object.  You could implement such a function with such primitives, and here's what it looks like with Deferreds:
> 
>>>> def any_completed(setOfDeferreds):
>>>>     d = Deferred()
>>>>     called = []
>>>>     def fireme(result, whichDeferred):
>>>>         if not called:
>>>>             called.append(True)
>>>>             setOfDeferreds.remove(whichDeferred)
>>>>             d.callback(result)
>>>>         return result
>>>>     for subd in setOfDeferreds:
>>>>         subd.addBoth(fireme, subd)
>>>>     return d
> 
> Here's how you do the top-level task in Twisted, without generators, in the truly-parallel fashion (keep in mind this combines the functionality of 'any_completed' and 'parallel', so it's a bit shorter):
> 
>>>> def parallel(N):
>>>>     ds = DeferredSemaphore(N)
>>>>     l = []
>>>>     def release(result):
>>>>         ds.release()
>>>>         return result
>>>>     def after(sem, it):
>>>>         return do_something_async(it)
>>>>     for x in ...:
>>>>         l.append(ds.acquire().addCallback(after, x).addBoth(release))
>>>>     return gatherResults(l).addCallback(set)
> 
> Some informal benchmarking has shown this method to be considerably faster (on the order of 1/2 to 1/3 as much CPU time) than at least our own inlineCallbacks generator-scheduling method.  Take this with the usual fist-sized grain of salt that you do any 'informal' benchmarks, but the difference is significant enough that I do try to refactor into this style in my own code, and I have seen performance benefits from doing this on more specific benchmarks.
> 
> This is all untested, and that's far too many lines of code to expect to work without testing, but hopefully it gives a pretty good impression of the differences in flavor between the different styles.
> 
>> Yeah, please do explain why Twisted has so much machinery to handle exceptions?
> 
> There are a lot of different implied questions here, so I'll answer a few of those.
> 
> Why does twisted.python.failure exist?  The answer to that is that we wanted an object that represented an exception as raised at a particular point, associated with a particular stack, that could live on without necessarily capturing all the state in that stack.  If you're going to report failures asynchronously, you don't necessarily want to hold a reference to every single thing in a potentially giant stack while you're waiting to send it to some network endpoint.  Also, in 1.5.2 we had no way of chaining exceptions, and this code is that old.  Finally, even if you can chain exceptions, it's a serious performance hit to have to re-raise and re-catch the same exception 4 or 5 times in order to translate it or handle it at many different layers of the stack, so a Failure is intended to encapsulate that state such that it can just be returned, in performance-sensitive areas.  (This is sort of a weak point though, since the performance of Failure itself is so terrible, for unrelated reasons.)
> 
> Why is twisted.python.failure such a god damned mess?  The answer to that is ... uh, sorry.  Yes, it is.  We should clean it up.  It was written a long time ago and the equivalent module now could be _much_ shorter, simpler, and less of a performance problem.  It just never seems to be the highest priority.  Maybe after we're done porting to py3 :).  My one defense here is that it's still a slight improvement over the stdlib 'traceback' module ;-).
> 
> Why do Deferreds have an errback chain rather than just handing you an exception object in the callback chain?  Basically, this is for the same reason that Python has exceptions instead of just making you check return codes.  We wanted it to be easy to say:
> 
>>>> d = getPage("http://...")
>>>> def ok(page):
>>>>     doSomething(...)
>>>> d.addCallback(ok)
> 
> and know that the argument to 'ok' would always be what getPage promised (you don't need to typecheck it for exception-ness) and the default error behavior would be to simply bail out with a traceback, not to barrel through your success-path code wreaking havoc.
> 
>> ISTM that the main difference is that add_done_callback() isn't meant for callbacks that return a value.
> 
> 
> add_done_callback works fine with callbacks that return a value.  If it didn't, I'd be concerned, because then it would have the barrel-through-the-success-path flaw.  But, I assume the idiomatic asynchronous-code-using-Futures would look like this:
> 
>>>> f = some_future_thing(...)
>>>> def my_callback(future):
>>>>     result = future.result()
>>>>     do_something(result)
>>>> f.add_done_callback(my_callback)
> 
> This is one extra line of code as compared to the Twisted version, and chaining involves a bit more gymnastics (somehow creating more futures to return further up the stack, I guess, I haven't thought about it too hard), but it does allow you to handle exceptions with a simple 'except:', rather than calling some exception-handling methods, so I can see why some people would prefer it.
> 
>> Maybe it's possible to write a little framework that lets you create Futures using either threads, processes (both supported by PEP 3148) or generators. But I haven't tried it. And maybe the need to use 'yield' for everything that may block when using generators, but not when using threads or processes, will make this awkward.
> 
> You've already addressed the main point that I really wanted to mention here, but I'd like to emphasize it.  Blocking and not-blocking are fundamentally different programming styles, and if you sometimes allow blocking on asynchronous results, that means you are effectively always programming in the blocking-and-threaded style and not getting much benefit from the code which does choose to be politely non-blocking.
> 
> I was somewhat pleased with the changes made to the Futures PEP because you could use them as an asynchronous result, and have things that implemented the Future API but raised an exception if you tried to wait on them.  That would at least allow some layer of stdlib compatibility.  If you are disciplined and careful, this would let you write async code which used a common interoperability mechanism, and if you weren't careful, it would blow up when you tried to use it the wrong way.
> 
> But - and I am guessing that this is the main thrust of this discussion - I do think that having Deferred in the standard library would be much, much better if we can do that.
> 
>> So maybe we'll be stuck with at least two Future-like APIs: PEP 3148 and something else, generator-based.
> 
> Having something "generator-based" is, in my opinion, an abstraction inversion.  The things which you are yielding from these generators are asynchronous results.  There should be a specific type for asynchronous results which can be easily interacted with.  Generators are syntactic sugar for doing that interaction in a way which doesn't involve defining tons of little functions.  This is useful, and it makes the concept more accessible, so I don't say "just" syntactic sugar: but nevertheless, the generators need to be 'yield'ing something, and the type of thing that they're yielding is a Deferred-or-something-like-it.
> 
> I don't think that this is really two 'Future-like APIs'.  At least, they're not redundant, any more than having both socket.makefile() and socket.recv() is redundant.
> 
> If Future had a deferred() method rather than an add_done_callback() method, then it would always be very clear whether you had a synchronous-but-possibly-not-ready or a purely-asynchronous result.  Although it would be equally easy to just have a function that turned a Future into a Deferred by calling add_done_callback().  You can go from any arbitrary Future to a full-featured Deferred, but not the other way around.
> 
>> Or maybe PEP 3152.
> 
> 
> I don't like PEP 3152 aesthetically on many levels, but I can't deny that it would do the job.  'cocall', though, really?  It would be nice if it read like an actual word, i.e. "yield to" or "invoke" or even just "call" or something.
> 
> In another message, where Guido is replying to Antoine:
> 
>>> I think the main reason, though, that people find Deferreds inconvenient is that they force you to think in terms of asynchronicity (...)
>> 
>> Actually I think the main reason is historic: Twisted introduced callback-based asynchronous (thread-less) programming when there was no alternative in Python, and they invented both the mechanisms and the terminology as they were figuring it all out.  That is no mean feat. But with PEP 342 (generator-based coroutines) and especially PEP 380 (yield from) there *is* an alternative, and while Twisted has added APIs to support generators, it hasn't started to deprecate its other APIs, and its terminology becomes hard to follow for people (like me, frankly) who first learned this stuff through PEP 342.
> 
> I really have to go with Antoine on this one: people were confused about Deferreds long before PEP 342 came along :).  Given that Javascript environments have mostly adopted the Twisted terminology (oddly, Node.js doesn't, but Dojo and MochiKit both have pretty literal-minded Deferred translations), there are plenty of people who are familiar with the terminology but still get confused.
> 
> See the beginning of the message for why we're not deprecating our own APIs.
> 
> Once again, sorry for not compressing this down further!  If you got this far, you win a prize :).


From guido at python.org  Tue Sep 21 01:49:04 2010
From: guido at python.org (Guido van Rossum)
Date: Mon, 20 Sep 2010 16:49:04 -0700
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <4C97D50F.1000908@openvpn.net>
References: <4C97D50F.1000908@openvpn.net>
Message-ID: <AANLkTin3h9vi7u+-2Mwg4hXPXDFNZ8QfH-69-kfc9nCp@mail.gmail.com>

On Mon, Sep 20, 2010 at 2:41 PM, James Yonan <james at openvpn.net> wrote:
> I think that Glyph hit the nail on the head when he said that "you can go
> from any arbitrary Future to a full-featured Deferred, but not the other way
> around."

Where by "go from X to Y" you mean "take a program written using X and
change it to use Y", right?

> This is exactly my concern, and the reason why I think it's important for
> Python to standardize on an async result type that is sufficiently general
> that it can accommodate the different kinds of async semantics in common use
> in the Python world today.

I think I get your gist.

Unfortunately there are only a small number of people who know enough
about async semantics to write the PEP that is needed.

> If you don't think this is a problem, just Google for "twisted vs. tornado".
> While the debate is sometimes passionate and rude,

Is it ever distanced and polite? :-)

> it points to the
> fragmentation that has occurred in the Python async space due to the lack of
> direction from the standard library. And there's a real cost to this
> fragmentation -- it's not easy to build an application that uses different
> async frameworks when there's no standardized result object or reactor
> model.

But, circularly, the lack of direction from the standard library
exists because nobody has contributed an async framework to the
standard library since asyncore was added in, oh, 1999.

> My concern is that PEP 3148 was really designed for the purpose of thread
> and process pooling, and that the Future object is designed with the minimum
> functionality required to achieve this end. The problem is that the Future
> object starts to look like a stripped-down version of a Twisted Deferred.
> And that raises the question of why we are standardizing on the special case
> and not the general case.

Because we could reach agreement fairly quickly on PEP 3148. There are
some core contributors who know threads and processes inside out, and
after several rounds of comments (a lot, really) they were satisfied.

At this point it is probably best to forget about PEP 3148 if you want
to improve the async situation in the stdlib, and start thinking about
that async PEP instead.

> Wouldn't it be better to break this into two problems:
>
> * Develop a full-featured standard async result type and reactor model to
> facilitate interoperability of different async libraries. This would
> consist of a standard async result type and an abstract base class for a
> reactor model.

Unless you want to propose to include Twisted into the stdlib, this is
not going to be ready for inclusion into Python 3.2.

> * Let PEP 3148 focus on the problem of thread and process pooling and
> leverage on the above async result type.

But PEP 3148 *is* ready for inclusion in Python 3.2. So you've got the
ordering wrong. It doesn't make sense to hold up PEP 3148, waiting for
the perfect solution to appear. In fact, the changes that were made to
PEP 3148 at Glyph's suggestion are probably all you are going to get
regarding PEP 3148.

> The semantics that a general async type should support include:
>
> 1. Semantics that allow you to define a callback channel for results and,
> optionally, a separate channel for exceptions as well.
>
> 2. Semantics that offer the flexibility of working with async results at the
> callback level or at the generator level (having a separate channel for
> exceptions makes it easy for the generator decorator implementation (that
> facilitates "yield function_returning_async_object()") to dispatch
> exceptions into the caller).
>
> 3. Semantics that can easily be used to pass results and exceptions back
> from thread or process pools.
>
> 4. Semantics that allow for aggregate processing of parallel asynchronous
> results, such as "fire async result when all of the async results in an
> async set have fired" or "fire async result when the first result from an
> async set has fired."
>
> Deferreds presently support all of the above. My point here is not so much
> that Deferreds should be the standard, but that whatever standard is chosen,
> that the semantics be general enough that different async Python
> libraries/platforms can interoperate.

Do you want to champion a PEP? I hope you do -- it will be a long
march but rewarding, especially if you get the Tornado folks to
participate and contribute.

-- 
--Guido van Rossum (python.org/~guido)


From andrew at bemusement.org  Tue Sep 21 07:39:11 2010
From: andrew at bemusement.org (Andrew Bennetts)
Date: Tue, 21 Sep 2010 15:39:11 +1000
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <AANLkTin3h9vi7u+-2Mwg4hXPXDFNZ8QfH-69-kfc9nCp@mail.gmail.com>
References: <4C97D50F.1000908@openvpn.net>
	<AANLkTin3h9vi7u+-2Mwg4hXPXDFNZ8QfH-69-kfc9nCp@mail.gmail.com>
Message-ID: <20100921053911.GD18831@aihal.home.puzzling.org>

Guido van Rossum wrote:
[...]
> 
> Unless you want to propose to include Twisted into the stdlib, this is
> not going to be ready for inclusion into Python 3.2.

I don't think anyone has suggested "include Twisted".  What is being suggested
is "include twisted.internet.defer, or something about as useful."

Let's consider just how hard it would be to just add
twisted/internet/defer.py to the stdlib (possibly as 'deferred.py').  It's
already almost a standalone module, especially if pared back to just the
Deferred class and maybe one or two of the most useful helpers (e.g.
gatherResults, to take a list of Deferreds and turn them into a single Deferred
that fires when they have all fired).

The two most problematic dependencies would be:

 1) twisted.python.log, which for these purposes could be replaced with a call
    to a user-replaceable hook whenever an unhandled error occurs (similar to
    sys.excepthook; see the sketch below).
 2) twisted.python.failure... this one is harder.  As glyph said, it provides
    "an object that represent[s] an exception as raised at a particular point,
    associated with a particular stack".  But also, as he said, it's a mess and
    could use a clean up.  Cleaning it up or thinking of a simpler replacement
    is not insurmountable, but probably too ambitious for Python 3.2's schedule.
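
The hook in 1) might be as small as this (hypothetical names, modelled
on sys.excepthook; a sketch, not an existing Twisted or stdlib API):

    import sys
    import traceback

    def _default_unhandled_error(exc_info):
        sys.stderr.write("Unhandled error in Deferred:\n")
        traceback.print_exception(*exc_info, file=sys.stderr)

    # Library code would call unhandled_error_hook(sys.exc_info())
    # instead of twisted.python.log; applications may rebind it.
    unhandled_error_hook = _default_unhandled_error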

My point is that adding the Deferred abstraction to the stdlib is a *much*
smaller and more reasonable proposition than "include Twisted."

-Andrew.



From jnoller at gmail.com  Tue Sep 21 15:25:13 2010
From: jnoller at gmail.com (Jesse Noller)
Date: Tue, 21 Sep 2010 09:25:13 -0400
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <20100921053911.GD18831@aihal.home.puzzling.org>
References: <4C97D50F.1000908@openvpn.net>
	<AANLkTin3h9vi7u+-2Mwg4hXPXDFNZ8QfH-69-kfc9nCp@mail.gmail.com>
	<20100921053911.GD18831@aihal.home.puzzling.org>
Message-ID: <AANLkTikS1Wh+JFK7RgE=iVRLbSiEgdSntc0WQmHEZZXy@mail.gmail.com>

On Tue, Sep 21, 2010 at 1:39 AM, Andrew Bennetts <andrew at bemusement.org> wrote:
> [...]
>
> My point is that adding the Deferred abstraction to the stdlib is a *much*
> smaller and more reasonable proposition than "include Twisted."
>
> -Andrew.

No one was seriously proposing including Twisted wholesale. There has
been discussion, off and on *for years*, about including a
stripped-down deferred object; and yet no one has stepped up to *do
it*. It might be hilariously easy, it might be a 40-line module, but
it doesn't matter if no one steps up to write the PEP, commit the
code, and commit to maintaining it.

jesse


From ncoghlan at gmail.com  Tue Sep 21 15:40:28 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 21 Sep 2010 23:40:28 +1000
Subject: [Python-ideas] [Python-Dev] Python needs a standard
 asynchronous return object
In-Reply-To: <AANLkTikS1Wh+JFK7RgE=iVRLbSiEgdSntc0WQmHEZZXy@mail.gmail.com>
References: <4C97D50F.1000908@openvpn.net>
	<AANLkTin3h9vi7u+-2Mwg4hXPXDFNZ8QfH-69-kfc9nCp@mail.gmail.com>
	<20100921053911.GD18831@aihal.home.puzzling.org>
	<AANLkTikS1Wh+JFK7RgE=iVRLbSiEgdSntc0WQmHEZZXy@mail.gmail.com>
Message-ID: <AANLkTimqevMD8xtr1m5P5-uaVsQnhXcrmQkt-mR=5VBb@mail.gmail.com>

On Tue, Sep 21, 2010 at 11:25 PM, Jesse Noller <jnoller at gmail.com> wrote:
> There has
> been discussion, off and on *for years*, about including a
> stripped-down deferred object; and yet no one has stepped up to *do
> it*. It might be hilariously easy, it might be a 40-line module, but
> it doesn't matter if no one steps up to write the PEP, commit the
> code, and commit to maintaining it.

Indeed. Thread and process pools had similarly been talked about for
quite some time before Brian stepped up to actually do the work of
writing and championing PEP 3148.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From michael.s.gilbert at gmail.com  Tue Sep 21 20:44:52 2010
From: michael.s.gilbert at gmail.com (Michael Gilbert)
Date: Tue, 21 Sep 2010 14:44:52 -0400
Subject: [Python-ideas] Including elementary mathematical functions in the
 python data model
Message-ID: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>

Hi,

It would be really nice if elementary mathematical functions such as
sine/cosine (via __sin__ and __cos__) were available as base parts of
the Python data model [0].  This would make it easier to write new math
classes, and it would eliminate the ugliness of things like self.exp().

This would also eliminate the need for separate math and cmath
libraries since those could be built into the default float and complex
types.  Of course if those libs were removed, that would be a potential
backwards compatibility issue.

It would also help new users who just want to do math and don't know
that they need to import separate modules just for elementary math
functionality.

I think full coverage of the elementary function set would be the goal
(i.e. exp, sqrt, ln, trig, and hyperbolic functions).  This would not
include special functions since that would be overkill, and they are
already handled well by scipy and numpy.
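
A sketch of how the protocol might look (the __sin__ hook and the
generic sin() here are hypothetical, not part of the actual data
model):

    import math

    def sin(x):
        # defer to a __sin__ hook when the type provides one
        impl = getattr(type(x), "__sin__", None)
        if impl is not None:
            return impl(x)
        return math.sin(x)

    class Angle:
        def __init__(self, radians):
            self.radians = radians
        def __sin__(self):
            return math.sin(self.radians)

    print(sin(0.0))         # falls back to math.sin
    print(sin(Angle(0.0)))  # dispatches to Angle.__sin__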

Anyway, just a thought.

Best wishes,
Mike

[0] http://docs.python.org/reference/datamodel.html


From ncoghlan at gmail.com  Tue Sep 21 23:53:09 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Wed, 22 Sep 2010 07:53:09 +1000
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
In-Reply-To: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
Message-ID: <AANLkTim9+mRDHvfS+C9FFPhWP1djZ=vhZebP8Km2n91=@mail.gmail.com>

On Wed, Sep 22, 2010 at 4:44 AM, Michael Gilbert
<michael.s.gilbert at gmail.com> wrote:
> [...]

I think the basic problem here is that, by comparison to the basic
syntax-driven options, the additional functionality covered by the
math, cmath and decimal modules is much harder to implement both
correctly and efficiently. It's hard enough making good algorithms
that work on a single data type with a known representation, let alone
ones which work on arbitrary data types.

Also, needing exp, sqrt, ln, trig and hyperbolic functions is
*significantly* less common than the core mathematical options, so
telling people to do "from math import *" if they want to do a lot of
mathematical operations at the interactive prompt isn't much of a
hurdle.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From cs at zip.com.au  Thu Sep 23 00:31:36 2010
From: cs at zip.com.au (Cameron Simpson)
Date: Thu, 23 Sep 2010 08:31:36 +1000
Subject: [Python-ideas] Python needs a standard asynchronous return
	object
In-Reply-To: <4C97D50F.1000908@openvpn.net>
References: <4C97D50F.1000908@openvpn.net>
Message-ID: <20100922223136.GA23975@cskk.homeip.net>

On 20Sep2010 15:41, James Yonan <james at openvpn.net> wrote:
[...]
| * Develop a full-featured standard async result type and reactor
| model to facilitate interoperability of different async libraries.
| This would consist of a standard async result type and an abstract
| base class for a reactor model.
| 
| * Let PEP 3148 focus on the problem of thread and process pooling
| and leverage on the above async result type.
| 
| The semantics that a general async type should support include:
| 
| 1. Semantics that allow you to define a callback channel for results
| and, optionally, a separate channel for exceptions as well.
| 
| 2. Semantics that offer the flexibility of working with async
| results at the callback level or at the generator level (having a
| separate channel for exceptions makes it easy for the generator
| decorator implementation (that facilitates "yield
| function_returning_async_object()") to dispatch exceptions into the
| caller).
| 
| 3. Semantics that can easily be used to pass results and exceptions
| back from thread or process pools.
[...]

Just to address this particular aspect (return types and notification),
I have my own futures-like module, where the equivalent of a Future is
called a LateFunction.

There are only 3 basic types of return in my model:

  there's a .report() method in the main (Executor equivalent) class
  that yields LateFunctions as they complete.

  A LateFunction has two basic get-the-result methods. Having made a
  LateFunction:
    LF = Later.defer(func)

  You can either go:
    result = LF()
  This waits for func's completion and returns func's return value.
  If func raises an exception, this raises that exception.

  Or you can go:
    result, exc_info = LF.wait()
  which returns:
    result, None
  if func completed without exception and
    None, exc_info
  if an exception was raised, where exc_info is a 3-tuple as from
  sys.exc_info().

At any rate, when looking for completion you can either get
LateFunctions as they complete via .report(), or plain function
results (which may raise exceptions), or (result, exc_info) pairs
(results xor exceptions).

This makes implementing the separate streams (results vs exceptions)
model trivial if desired, while keeping the LateFunction interface
simple (few interface methods).
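
For example, routing results and exceptions to separate channels from
.wait() is only a few lines (illustrative; on_result/on_error stand
for whatever channels the caller has set up):

    result, exc_info = LF.wait()
    if exc_info is None:
        on_result(result)    # results channel
    else:
        on_error(exc_info)   # exceptions channel (sys.exc_info() 3-tuple)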

Yes, I know there's no timeout stuff in there :-(

Cheers,
-- 
Cameron Simpson <cs at zip.com.au> DoD#743
http://www.cskk.ezoshosting.com/cs/

By God, Mr. Chairman, at this moment I stand astonished at my own moderation!
        - Baron Robert Clive of Plassey


From tristanz at gmail.com  Thu Sep 23 06:41:19 2010
From: tristanz at gmail.com (Tristan Zajonc)
Date: Thu, 23 Sep 2010 00:41:19 -0400
Subject: [Python-ideas] Python needs a standard asynchronous return
	object
In-Reply-To: <20100922223136.GA23975@cskk.homeip.net>
References: <4C97D50F.1000908@openvpn.net>
	<20100922223136.GA23975@cskk.homeip.net>
Message-ID: <AANLkTimA4d9wTbrWD=+4ewKMMK1LPSnAcwaywgAnYKQG@mail.gmail.com>

I'm not an expert on this subject by any stretch, but have been
following the discussion with interest.

One of the more interesting ideas out of Microsoft in the last few
years is their Reactive Framework
(http://msdn.microsoft.com/en-us/devlabs/ee794896.aspx), which
implements IObserver and IObservable as the dual to IEnumerator and
IEnumerable.  This makes operators on events just as composable as
operators on enumerables.  It also comes after several other attempts
to formalize a standard async programming pattern.  The ideas seem
pretty generic, since they've released a javascript version of the
approach as well.

The basic interface is very simple, consisting of a subscribe method
on IObservable and on_next, on_completed, and on_error methods for
IObserver.  The power comes from the extension methods, similar to
itertools, defined in the Observable class (http://bit.ly/acBhbP).
These methods provide a huge range of composable functionality.
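
A minimal Python rendering of those interfaces, with one combinator to
show how the composition works (a sketch that mirrors the C# names;
not an existing Python library):

    class Observable:
        def __init__(self, subscribe):
            # subscribe(observer) wires up an observer that provides
            # on_next/on_error/on_completed methods
            self._subscribe = subscribe

        def subscribe(self, observer):
            return self._subscribe(observer)

        def filter(self, predicate):
            source = self
            class _Filtered:
                def __init__(self, observer):
                    self.observer = observer
                def on_next(self, value):
                    if predicate(value):
                        self.observer.on_next(value)
                def on_error(self, error):
                    self.observer.on_error(error)
                def on_completed(self):
                    self.observer.on_completed()
            return Observable(lambda obs: source.subscribe(_Filtered(obs)))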

For instance, using a chaining style, consider an async webclient
module that takes a bunch of urls:

responses = webclient.get(['http://www1.cnn.com', 'http://www2.cnn.com'])
responses.filter(lambda x: x.status == 200).first().do(lambda x: print(x.body))

The filter is nonblocking and returns another observable.  The first()
blocks and returns after the first document is received.  The do calls
a method. Multiple async streams can be composed together in all sorts
of ways.  For instance,

http = webclient.get(['http://www.cnn.com', 'http://www.nyt.com'])
https = webclient.get(['https://www.cnn.com', 'https://www.nyt.com'])
http.zip(https).filter(lambda x, y: x.status == 200 and y.status ==
200).start(lambda x, y: slow_save(x, y))

This never blocks.  It downloads both the https and http versions of
web pages, zips them into a new observable, filters sites with both
http and https, and then saves asynchronously the remaining sites.  I
personally find this easy to reason about, and much easier than
manually specifying a callback chain.  Errors and completed events
propagate through these chains intuitively. "Marble diagrams" help
with intuition here (http://bit.ly/cl7Oad).

All you need to do is implement the observable interface and you get
all the composability for free. Or you can just use any number of
simple methods to convert things to observables
(http://bit.ly/7VMnKv), such as observable.start(lambda: print("hi")).
 Or use decorators.  If the observable interface became standard, all
future async libraries would be composable, and there would also be a
growing collection of observable tools.

As somebody who is new to async programming, I quite quickly grasped
this reactive approach even though I was otherwise completely
unfamiliar with C#.   While it may be due to my lack of experience, I
still get confused when thinking about callback chains and error
channels.  For instance, I have no idea how to zip an async http call
and a mongodb call into a simple observable that returns a tuple when
both respond and then alerts the user.  This would be as simple as

webclient.get().zip(mongodb.get()).start(flash_completed_message)

or maybe it's more pythonic to write

obstools.start(obstools.zip(mongodb.get(), webclient.get),
flash_completed_message)

although I've never liked this inside-out style.

But perhaps I missed the point of this thread?

Tristan

On Wed, Sep 22, 2010 at 6:31 PM, Cameron Simpson <cs at zip.com.au> wrote:
> [...]


From tristanz at gmail.com  Thu Sep 23 07:04:34 2010
From: tristanz at gmail.com (Tristan Zajonc)
Date: Thu, 23 Sep 2010 01:04:34 -0400
Subject: [Python-ideas] Python needs a standard asynchronous return
	object
In-Reply-To: <AANLkTimA4d9wTbrWD=+4ewKMMK1LPSnAcwaywgAnYKQG@mail.gmail.com>
References: <4C97D50F.1000908@openvpn.net>
	<20100922223136.GA23975@cskk.homeip.net>
	<AANLkTimA4d9wTbrWD=+4ewKMMK1LPSnAcwaywgAnYKQG@mail.gmail.com>
Message-ID: <AANLkTi=E+w7VDS3BWB69k68YuHkXKwTmkHz69Lx5u8rc@mail.gmail.com>

I should note that it should be possible to convert twisted,
eventlet, monocle, and other existing async libraries to
observables pretty easily.  The Javascript Rx library, for instance,
already wraps the events from dojo, extjs, google maps, jquery, google
translate, microsoft translate, mootools, prototype, raphael,
virtualearth, and yui3, and keeps adding others to enable
composability between different event driven widgets/frameworks.

Tristan

On Thu, Sep 23, 2010 at 12:41 AM, Tristan Zajonc <tristanz at gmail.com> wrote:
> [...]


From ziade.tarek at gmail.com  Thu Sep 23 16:37:21 2010
From: ziade.tarek at gmail.com (=?ISO-8859-1?Q?Tarek_Ziad=E9?=)
Date: Thu, 23 Sep 2010 16:37:21 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
Message-ID: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>

Hello,

ABC __subclasshook__ implementations will only check that the method
is present in the class. That's the case for example in
collections.Container. It will check that the __contains__ method is
present but that's it. It won't check that the method has only one
argument, e.g. __contains__(self, x).

The problem is that the implemented method could have a different list
of arguments and will eventually fail.

Using inspect, we could check in __subclasshook__ that the arguments
defined are the same as the ones defined in the abstract method --
the name and the ordering.

I can even think of a small function in ABC for that:
same_signature(method1, method2) => True or False
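
A sketch of that helper using inspect.getfullargspec (comparing only
names and ordering, as proposed; defaults and annotations are
ignored):

    import inspect

    def same_signature(method1, method2):
        spec1 = inspect.getfullargspec(method1)
        spec2 = inspect.getfullargspec(method2)
        return (spec1.args == spec2.args
                and spec1.varargs == spec2.varargs
                and spec1.varkw == spec2.varkw)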

Regards
Tarek

-- 
Tarek Ziadé | http://ziade.org


From guido at python.org  Thu Sep 23 16:53:37 2010
From: guido at python.org (Guido van Rossum)
Date: Thu, 23 Sep 2010 07:53:37 -0700
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
Message-ID: <AANLkTikVadkRd9Ou1DVduxuXV7PfiP4-TX-ZGoMRxRsw@mail.gmail.com>

That is not a new idea. So far I have always rejected it because I
worry about both false positives and false negatives. Trying to
enforce that the method *behaves* as it should (or even its return
type) is hopeless; there can be a variety of reasons to modify the
argument list while still conforming to (the intent of) the interface.
I also worry that it will slow everything down.

That said, if you want to provide a standard mechanism that can
*optionally* be turned on to check argument conformance, e.g. by using
a class or method decorator on the subclass, I would be fine with that
(as long as it runs purely at class-definition time; it shouldn't slow
down class instantiation or method calls). It will probably even find
some bugs. It will also surely have to be tuned to avoid certain
classes of false positives.
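
One possible shape for that opt-in mechanism -- a hypothetical class
decorator, run once at class-definition time, that compares argument
names against an ABC's abstract methods:

    import inspect

    def check_signatures(abc):
        def decorator(cls):
            for name in getattr(abc, "__abstractmethods__", ()):
                impl = getattr(cls, name, None)
                base = getattr(abc, name, None)
                if impl is None or base is None:
                    continue
                if (inspect.getfullargspec(impl).args
                        != inspect.getfullargspec(base).args):
                    raise TypeError("signature of %s.%s does not match %s"
                                    % (cls.__name__, name, abc.__name__))
            return cls
        return decorator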

--Guido

On Thu, Sep 23, 2010 at 7:37 AM, Tarek Ziad? <ziade.tarek at gmail.com> wrote:
> Hello,
>
> ABC __subclasshook__ implementations only check that the method is
> present in the class. That's the case, for example, in
> collections.Container: it checks that the __contains__ method is
> present, but that's it. It won't check that the method takes exactly
> one argument, e.g. __contains__(self, x).
>
> The problem is that the implemented method could have a different
> argument list and will eventually fail when called.
>
> Using inspect, we could check in __subclasshook__ that the arguments
> defined are the same as the ones defined in the abstract method --
> the names and the ordering.
>
> I can even think of a small function in ABC for that:
> same_signature(method1, method2) => True or False
>
> Regards
> Tarek
>
> --
> Tarek Ziadé | http://ziade.org
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>



-- 
--Guido van Rossum (python.org/~guido)


From daniel at stutzbachenterprises.com  Thu Sep 23 16:54:55 2010
From: daniel at stutzbachenterprises.com (Daniel Stutzbach)
Date: Thu, 23 Sep 2010 09:54:55 -0500
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
Message-ID: <AANLkTinn0jjpFe7n6mX_v_7UhZGb9B12Mt8ZpbMgP6-4@mail.gmail.com>

On Thu, Sep 23, 2010 at 9:37 AM, Tarek Ziadé <ziade.tarek at gmail.com> wrote:

> The problem is that the implemented method could have a different
> argument list and will eventually fail when called.


A slightly different argument list is okay if it is more permissive.  For
example, the collections.Sequence ABC defines an index method with one
parameter.  However, the list implementation's index method takes one
mandatory parameter plus two optional parameters.  I'm not sure how easy it
would be to detect a valid but more general signature.

You might be interested in the related Issue 9731 ("Add ABCMeta.has_methods
and tests that use it").

-- 
Daniel Stutzbach, Ph.D.
President, Stutzbach Enterprises, LLC <http://stutzbachenterprises.com/>

From ziade.tarek at gmail.com  Thu Sep 23 17:01:29 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Thu, 23 Sep 2010 17:01:29 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTikVadkRd9Ou1DVduxuXV7PfiP4-TX-ZGoMRxRsw@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<AANLkTikVadkRd9Ou1DVduxuXV7PfiP4-TX-ZGoMRxRsw@mail.gmail.com>
Message-ID: <AANLkTi=m2egDVP+RisK3cpFzGG1CgT1OaHs+2UN26NVs@mail.gmail.com>

On Thu, Sep 23, 2010 at 4:53 PM, Guido van Rossum <guido at python.org> wrote:
> That is not a new idea. So far I have always rejected it because I
> worry about both false positives and false negatives. Trying to
> enforce that the method *behaves* as it should (or even its return
> type) is hopeless; there can be a variety of reasons to modify the
> argument list while still conforming to (the intent of) the interface.
> I also worry that it will slow everything down.

Right

>
> That said, if you want to provide a standard mechanism that can
> *optionally* be turned on to check argument conformance, e.g. by using
> a class or method decorator on the subclass, I would be fine with that
> (as long as it runs purely at class-definition time; it shouldn't slow
> down class instantiation or method calls). It will probably even find
> some bugs. It will also surely have to be tuned to avoid certain
> classes of false positives.

I'll experiment with this and come back :)

Regards
Tarek


From ziade.tarek at gmail.com  Thu Sep 23 17:08:03 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Thu, 23 Sep 2010 17:08:03 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTinn0jjpFe7n6mX_v_7UhZGb9B12Mt8ZpbMgP6-4@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<AANLkTinn0jjpFe7n6mX_v_7UhZGb9B12Mt8ZpbMgP6-4@mail.gmail.com>
Message-ID: <AANLkTikcN6S+FCOZJJSF7RHf8wfSbdhMSGyyaMn_nrBi@mail.gmail.com>

On Thu, Sep 23, 2010 at 4:54 PM, Daniel Stutzbach
<daniel at stutzbachenterprises.com> wrote:
> On Thu, Sep 23, 2010 at 9:37 AM, Tarek Ziadé <ziade.tarek at gmail.com> wrote:
>>
>> The problem is that the implemented method could have a different list
>> of arguments and will eventually fail.
>
> A slightly different argument list is okay if it is more permissive.  For
> example, the collections.Sequence ABC defines an index method with one
> parameter.  However, the list implementation's index method takes one
> mandatory parameter plus two optional parameters.  I'm not sure how easy it
> would be to detect a valid but more general signature.

Well, with inspect it's possible to see whether the extra parameters
have default values, so that calls without them still work.
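
Something along these lines, say (a rough sketch; compatible_signature
is a made-up name):

    import inspect

    def compatible_signature(abstract, concrete):
        # The concrete method may add parameters, as long as every
        # extra one has a default, so calls written against the
        # abstract signature keep working.
        a = inspect.getfullargspec(abstract)
        c = inspect.getfullargspec(concrete)
        if c.args[:len(a.args)] != a.args:
            return False
        extra = len(c.args) - len(a.args)
        return extra <= len(c.defaults or ())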

> You might be interested in the related Issue 9731 ("Add ABCMeta.has_methods
> and tests that use it").

Ah... interesting. has_methods could possibly have an option to check
the signatures too.

--will hack on that when I find some time--

> --
> Daniel Stutzbach, Ph.D.
> President, Stutzbach Enterprises, LLC
>



-- 
Tarek Ziadé | http://ziade.org


From solipsis at pitrou.net  Thu Sep 23 17:39:55 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 23 Sep 2010 17:39:55 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
Message-ID: <20100923173955.4fc0bb03@pitrou.net>

On Thu, 23 Sep 2010 16:37:21 +0200
Tarek Ziadé <ziade.tarek at gmail.com> wrote:
> 
> The problem is that the implemented method could have a different
> argument list and will eventually fail when called.
> 
> Using inspect, we could check in __subclasshook__ that the arguments
> defined are the same as the ones defined in the abstract method --
> the names and the ordering.

I don't think we should steer in the type checking direction.
After all, the Python philosophy of dynamicity (dynamism?) is
articulated around the idea that checking types "ahead of time" is
useless. IMO, ABCs should be used more as a convention for documenting
what capabilities a class claims to expose, than for type checking.

(also, you'll have a hard time checking methods with *args or **kwargs
parameters)

Regards

Antoine.




From ziade.tarek at gmail.com  Thu Sep 23 18:18:49 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Thu, 23 Sep 2010 18:18:49 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <20100923173955.4fc0bb03@pitrou.net>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
Message-ID: <AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>

On Thu, Sep 23, 2010 at 5:39 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Thu, 23 Sep 2010 16:37:21 +0200
Tarek Ziadé <ziade.tarek at gmail.com> wrote:
>>
>> The problem is that the implemented method could have a different
>> argument list and will eventually fail when called.
>>
>> Using inspect, we could check in __subclasshook__ that the arguments
>> defined are the same as the ones defined in the abstract method --
>> the names and the ordering.
>
> I don't think we should steer in the type checking direction.
> After all, the Python philosophy of dynamicity (dynamism?) is
> articulated around the idea that checking types "ahead of time" is
> useless. IMO, ABCs should be used more as a convention for documenting
> what capabilities a class claims to expose, than for type checking.

I think it goes further than documentation at this point. ABCs are
present and used in the stdlib, not just in the docs.
So asking a class about its capabilities is a feature we provide for
third-party code.

Also, I'm not sure what you mean by "ahead of time", but ABCs can
be used with issubclass() to check that an object quacks like it
should.

This is not opposed to dynamicity.
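
For example (collections.Container here; it lives in collections.abc
in later versions):

    from collections import Container

    class Bag:
        def __contains__(self, x):
            return False

    # Container.__subclasshook__ only looks for __contains__, so Bag
    # is recognized even though it never inherits from Container:
    assert issubclass(Bag, Container)
    assert isinstance(Bag(), Container)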


>
> (also, you'll have a hard time checking methods with *args or **kwargs
> parameters)

True, but I don't expect an ABC to define abstract methods with vague
arguments. And if it does, there's no point checking them in that
case. So it should definitely be something optional.

Regards,
Tarek

>
> Regards
>
> Antoine.
>
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>



-- 
Tarek Ziad? | http://ziade.org


From solipsis at pitrou.net  Thu Sep 23 18:32:49 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 23 Sep 2010 18:32:49 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
Message-ID: <1285259569.3178.9.camel@localhost.localdomain>

On Thursday, 23 September 2010 at 18:18 +0200, Tarek Ziadé wrote:
> >> Using inspect, we could check in __subclasshook__ that the arguments
> >> defined are the same as the ones defined in the abstract method --
> >> the names and the ordering.
> >
> > I don't think we should steer in the type checking direction.
> > After all, the Python philosophy of dynamicity (dynamism?) is
> > articulated around the idea that checking types "ahead of time" is
> > useless. IMO, ABCs should be used more as a convention for documenting
> > what capabilities a class claims to expose, than for type checking.
> 
> I think it goes further than documentation at this point. ABC is
> present and used in the stdlib, not the doc.
> So asking a class about its capabilities is a feature we provide for
> third-party code.

This feature already exists, as you mention, using issubclass() or
isinstance(). What you are asking for is a different feature: check that
a class has an appropriate implementation of the advertised
capabilities. Traditionally, this is best left to unit testing (or other
forms of test-based checking).

Do you have a use case where unit testing would not be appropriate for
this?

> > (also, you'll have a hard time checking methods with *args or **kwargs
> > parameters)
> 
> True, but I don't expect an ABC to define abstract methods with vague
> arguments.

It depends on the arguments. And the implementation could definitely use
*args or **kwargs arguments, especially if it acts as a proxy.

Regards

Antoine.




From ziade.tarek at gmail.com  Thu Sep 23 19:51:35 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Thu, 23 Sep 2010 19:51:35 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <1285259569.3178.9.camel@localhost.localdomain>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
Message-ID: <AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>

On Thu, Sep 23, 2010 at 6:32 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
...
> This feature already exists, as you mention, using issubclass() or
> isinstance(). What you are asking for is a different feature: check that
> a class has an appropriate implementation of the advertised
> capabilities. Traditionally, this is best left to unit testing (or other
> forms of test-based checking).
>
> Do you have an use case where unit testing would not be appropriate for
> this?

Why are you thinking about unit tests? Don't you ever use
issubclass/isinstance in your programs?

Checking signatures using ABCs when you create a plugin system is one
use case, for instance.

>
>> > (also, you'll have a hard time checking methods with *args or **kwargs
>> > parameters)
>>
>> True, but I don't expect an ABC to define abstract methods with vague
>> arguments.
>
> It depends on the arguments. And the implementation could definitely use
> *args or **kwargs arguments, especially if it acts as a proxy.

Sure, but ISTM that most of the time signatures are well defined, and
proxies live in an upper layer.

Regards
Tarek


From solipsis at pitrou.net  Thu Sep 23 20:01:33 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 23 Sep 2010 20:01:33 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
Message-ID: <1285264893.3178.14.camel@localhost.localdomain>

On Thursday, 23 September 2010 at 19:51 +0200, Tarek Ziadé wrote:
> On Thu, Sep 23, 2010 at 6:32 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> ...
> > This feature already exists, as you mention, using issubclass() or
> > isinstance(). What you are asking for is a different feature: check that
> > a class has an appropriate implementation of the advertised
> > capabilities. Traditionally, this is best left to unit testing (or other
> > forms of test-based checking).
> >
> > Do you have a use case where unit testing would not be appropriate for
> > this?
> 
> Why are you thinking about unit tests? Don't you ever use
> issubclass/isinstance in your programs?

Sorry, you don't seem to be answering the question.
Why wouldn't the implementor of the class use unit tests to check that
his/her class implements the desired ABC?

> Checking signatures using ABCs when you create a plugin system is one
> use case, for instance.

Again, why do you want to check signatures? Do you not trust plugin
authors to write plugins?

Also, why do you think checking signatures is actually useful? It only
checks that the signature is right, not that the expected semantics are
observed. The argument for checking method signatures in advance is as
weak as the argument for checking types at compile time.

> > It depends on the arguments. And the implementation could definitely use
> > *args or **kwargs arguments, especially if it acts as a proxy.
> 
> Sure, but ISTM that most of the time signatures are well defined, and
> proxies live in an upper layer.

Not really. If I write a file object wrapper that proxies some methods
to another file object, I don't want to re-type all method signatures
(including default args) by hand.

Regards

Antoine.




From tjreedy at udel.edu  Thu Sep 23 20:39:01 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Thu, 23 Sep 2010 14:39:01 -0400
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>	<20100923173955.4fc0bb03@pitrou.net>	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
Message-ID: <i7g6s6$nml$1@dough.gmane.org>

If I were writing a class intended to implement a particular ABC, I
would be happy to have an automated check function that might catch 
errors. 100% testing is hard to achieve.

-- 
Terry Jan Reedy



From solipsis at pitrou.net  Thu Sep 23 20:52:24 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 23 Sep 2010 20:52:24 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
	<i7g6s6$nml$1@dough.gmane.org>
Message-ID: <20100923205224.3fc27060@pitrou.net>

On Thu, 23 Sep 2010 14:39:01 -0400
Terry Reedy <tjreedy at udel.edu> wrote:
> If I were writing a class intended to implement a particular ABC, I
> would be happy to have an automated check function that might catch 
> errors. 100% testing is hard to achieve.

How would an automatic check function solve anything, if you don't test
that the class does what is expected?

Again, this is exactly the argument for compile-time type checking, and
it is routinely pointed out that it is mostly useless.





From ziade.tarek at gmail.com  Thu Sep 23 20:59:07 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Thu, 23 Sep 2010 20:59:07 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <1285264893.3178.14.camel@localhost.localdomain>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
	<1285264893.3178.14.camel@localhost.localdomain>
Message-ID: <AANLkTinW0zNMYRpwr64xY7hmKGboBpSk66eocHF9jisX@mail.gmail.com>

On Thu, Sep 23, 2010 at 8:01 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Thursday, 23 September 2010 at 19:51 +0200, Tarek Ziadé wrote:
>> On Thu, Sep 23, 2010 at 6:32 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
>> ...
>> > This feature already exists, as you mention, using issubclass() or
>> > isinstance(). What you are asking for is a different feature: check that
>> > a class has an appropriate implementation of the advertised
>> > capabilities. Traditionally, this is best left to unit testing (or other
>> > forms of test-based checking).
>> >
>> > Do you have a use case where unit testing would not be appropriate for
>> > this?
>>
>> Why are you thinking about unit tests? Don't you ever use
>> issubclass/isinstance in your programs?
>
> Sorry, you don't seem to be answering the question.
> Why wouldn't the implementor of the class use unit tests to check that
> his/her class implements the desired ABC?

That's fine indeed. Now, why wouldn't the implementor of an
application use ABCs to check that the third-party class he's about to
load into his app implements the desired ABC?


>
>> Checking signatures using ABCs when you create a plugin system is one
>> use case, for instance.
>
> Again, why do you want to check signatures? Do you not trust plugin
> authors to write plugins?
>
> Also, why do you think checking signatures is actually useful? It only
> checks that the signature is right, not that the expected semantics are
> observed. The argument for checking method signatures in advance is as
> weak as the argument for checking types at compile time.

Sorry, but it seems that you are now advocating against ABCs altogether.

Checking the methods, and optionally their arguments, is just a deeper
operation on something that already exists.

It's fine to use those checks only in your tests, but why do you object
to someone wanting to use them in their app?

This is completely orthogonal to the discussion, which is: extend a
method checker to also check arguments.

>
>> > It depends on the arguments. And the implementation could definitely use
>> > *args or **kwargs arguments, especially if it acts as a proxy.
>>
>> Sure, but ISTM that most of the time signatures are well defined, and
>> proxies live in an upper layer.
>
> Not really. If I write a file object wrapper that proxies some methods
> to another file object, I don't want to re-type all method signatures
> (including default args) by hand.

In that case I am curious to see why you would have file I/O methods
with extra *args/**kwargs. You should handle this kind of setup in
the constructor and keep the method signatures similar (and avoid the
extra re-typing, actually).

Regards
Tarek

>
> Regards
>
> Antoine.
>
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>



-- 
Tarek Ziadé | http://ziade.org


From ziade.tarek at gmail.com  Thu Sep 23 21:00:12 2010
From: ziade.tarek at gmail.com (Tarek Ziadé)
Date: Thu, 23 Sep 2010 21:00:12 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <20100923205224.3fc27060@pitrou.net>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
	<i7g6s6$nml$1@dough.gmane.org> <20100923205224.3fc27060@pitrou.net>
Message-ID: <AANLkTi=7dqEooqxFAfC6FB_sLsDyqH4vCx+PCUfPh-Mh@mail.gmail.com>

On Thu, Sep 23, 2010 at 8:52 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Thu, 23 Sep 2010 14:39:01 -0400
> Terry Reedy <tjreedy at udel.edu> wrote:
>> If I were writing a class intended to implement a particular ABC, I
>> would be happy to have an automated check function that might catch
>> errors. 100% testing is hard to achieve.
>
> How would an automatic check function solve anything, if you don't test
> that the class does what is expected?
>
> Again, this is exactly the argument for compile-time type checking, and
> it is routinely pointed out that it is mostly useless.

So are you in favor of removing every kind of type checking
mechanism from Python?

>
>
>
> _______________________________________________
> Python-ideas mailing list
> Python-ideas at python.org
> http://mail.python.org/mailman/listinfo/python-ideas
>



-- 
Tarek Ziadé | http://ziade.org


From daniel at stutzbachenterprises.com  Thu Sep 23 21:03:52 2010
From: daniel at stutzbachenterprises.com (Daniel Stutzbach)
Date: Thu, 23 Sep 2010 14:03:52 -0500
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <20100923205224.3fc27060@pitrou.net>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
	<i7g6s6$nml$1@dough.gmane.org> <20100923205224.3fc27060@pitrou.net>
Message-ID: <AANLkTikEkgF3hTm3Fd=v2X_7gY-1T2+hhXD_4QheD4tt@mail.gmail.com>

On Thu, Sep 23, 2010 at 1:52 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:

> How would an automatic check function solve anything, if you don't test
> that the class does what is expected?
>

Automated checks are a good way to help ensure that your test coverage is
good.  If the automated check fails and all the other tests pass, it means
there's been an oversight in both functionality and tests.

This isn't a purely theoretical concern.  See Issues 9212 and 9213 for cases
where a class purported to support an ABC but wasn't actually supplying all
the methods.

-- 
Daniel Stutzbach, Ph.D.
President, Stutzbach Enterprises, LLC <http://stutzbachenterprises.com/>

From solipsis at pitrou.net  Thu Sep 23 21:26:23 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Thu, 23 Sep 2010 21:26:23 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTinW0zNMYRpwr64xY7hmKGboBpSk66eocHF9jisX@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
	<1285264893.3178.14.camel@localhost.localdomain>
	<AANLkTinW0zNMYRpwr64xY7hmKGboBpSk66eocHF9jisX@mail.gmail.com>
Message-ID: <1285269983.3178.46.camel@localhost.localdomain>

On Thursday, 23 September 2010 at 20:59 +0200, Tarek Ziadé wrote:
> 
> That's fine indeed. Now, why wouldn't the implementor of an
> application use ABCs to check that the third-party class he's about to
> load into his app implements the desired ABC?

Why would he? What does it provide him exactly? A false sense of
security / robustness?

> > Also, why do you think checking signatures is actually useful? It only
> > checks that the signature is right, not that the expected semantics are
> > observed. The argument for checking method signature in advance is as
> > weak as the argument for checking types at compile time.
> 
> Sorry, but it seems that you are now advocating against ABCs altogether.

As I said, I believe ABCs are useful mainly for documentation purposes;
that is, for conveying /intent/.
Thinking that ABCs guarantee anything about quality or conformity of the
implementation sounds wrong to me.

(the other reason for using ABCs is to provide default implementations
of some methods, like the io ABCs do)

> This is completely orthogonal to the discussion, which is: extend a
> method checker to also check arguments.

It's not really orthogonal. I'm opposing the idea that programmatically
checking the conformity of method signatures is useful; I also think
it's *not* a good thing to advocate to Python programmers coming from
other languages.

> In that case I am curious to see why you would have file I/O methods
> with extra *args/**kwargs.

def seek(self, *args):
    # forward whatever arguments the caller passed, untouched
    return self.realfileobj.seek(*args)

> So are you in favor of removing every kind of type checking
> mechanism from Python?

"Type" checking is simply done when necessary. It is duck typing.
Even in the case of ABCs, method calls are still duck-typed. For
example, if you look at the io ABCs and concrete classes, a
BufferedReader won't check that the object you give it to wrap is
actually a RawIOBase.

Regards

Antoine.




From guido at python.org  Thu Sep 23 21:26:48 2010
From: guido at python.org (Guido van Rossum)
Date: Thu, 23 Sep 2010 12:26:48 -0700
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <20100923205224.3fc27060@pitrou.net>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
	<i7g6s6$nml$1@dough.gmane.org> <20100923205224.3fc27060@pitrou.net>
Message-ID: <AANLkTikQmJWVuvuf91-26uPW_mzx=XUu0RuuM3YphX8x@mail.gmail.com>

On Thu, Sep 23, 2010 at 11:52 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Thu, 23 Sep 2010 14:39:01 -0400
> Terry Reedy <tjreedy at udel.edu> wrote:
>> If I were writing a class intended to implement a particular ABC, I
>> would be happy to have an automated check function that might catch
>> errors. 100% testing is hard to achieve.
>
> How would an automatic check function solve anything, if you don't test
> that the class does what is expected?
>
> Again, this is exactly the argument for compile-time type checking, and
> it is routinely pointed out that it is mostly useless.

That may be the party line of dynamic-language diehards, but that
doesn't make it true. There are plenty of times when compile-time
checking can save the day, and typically, the larger a system, the
more useful it becomes. Antoine, can you back off your attempts to
prove that the proposed feature is useless and instead help design
the details of the feature (or if you can't or don't want to help
there, just stay out of the discussion)?

-- 
--Guido van Rossum (python.org/~guido)


From ncoghlan at gmail.com  Thu Sep 23 23:42:12 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Fri, 24 Sep 2010 07:42:12 +1000
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
Message-ID: <AANLkTim-Gik+p8DSfE5c_A2cg-OYLc6z4NZsET2kLqzO@mail.gmail.com>

On Fri, Sep 24, 2010 at 2:18 AM, Tarek Ziadé <ziade.tarek at gmail.com> wrote:
> I think it goes further than documentation at this point. ABCs are
> present and used in the stdlib, not just in the docs.
> So asking a class about its capabilities is a feature we provide for
> third-party code.

Minor nit - we can only ask a fairly limited subset of questions along
these lines (i.e. does *this* class/instance implement *this* ABC?).
More interesting questions like "which ABCs does this class/instance
explicitly implement?" are currently impossible (see
http://bugs.python.org/issue5405).

Back on topic - I like Guido's approach. While we can debate the
merits of LBYL signature checking forever without reaching agreement
(for the record, my opinion is that static checks should be thought of
as a bunch of implicit unit tests that you get "for free"), providing
a way to explicitly request ABC signature checks in the abc module
probably isn't a bad idea. If nothing else, invoking that check can
become a recommended part of the unit test suite for classes that
claim to implement ABCs. Is getting the method signatures right
*sufficient* for ABC compliance? No. Is it *necessary*? Yes. It's the
latter point that makes this feature potentially worth standardising.

Cheers,
Nick.

-- 
Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia


From tjreedy at udel.edu  Fri Sep 24 01:15:08 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Thu, 23 Sep 2010 19:15:08 -0400
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <20100923205224.3fc27060@pitrou.net>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>	<20100923173955.4fc0bb03@pitrou.net>	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>	<1285259569.3178.9.camel@localhost.localdomain>	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>	<i7g6s6$nml$1@dough.gmane.org>
	<20100923205224.3fc27060@pitrou.net>
Message-ID: <i7gn1t$v7m$1@dough.gmane.org>

On 9/23/2010 2:52 PM, Antoine Pitrou wrote:
> On Thu, 23 Sep 2010 14:39:01 -0400
> Terry Reedy<tjreedy at udel.edu>  wrote:
>> If I were writing a class intended to implement a particular ABC, I
>> would be happy to have an automated check function that might catch
>> errors. 100% testing is hard to achieve.
>
> How would an automatic check function solve anything, if you don't test
> that the class does what is expected?

If all tests are written with calls by position, as is my habit and 
general preference, they will not catch argument name mismatches that 
would trip up someone who prefers call by keyword or any 
introspection-by-name process.
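
A hypothetical example of the kind of mismatch that slips through:

    class MyMapping:
        # Suppose the ABC documents get(key, default=None), but the
        # implementor renamed the first parameter:
        def get(self, k, default=None):
            return default

    m = MyMapping()
    m.get('a')        # positional call: every such test passes
    m.get(key='a')    # TypeError: unexpected keyword argument 'key'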

-- 
Terry Jan Reedy



From tjreedy at udel.edu  Fri Sep 24 02:24:13 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Thu, 23 Sep 2010 20:24:13 -0400
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTikQmJWVuvuf91-26uPW_mzx=XUu0RuuM3YphX8x@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>	<20100923173955.4fc0bb03@pitrou.net>	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>	<1285259569.3178.9.camel@localhost.localdomain>	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>	<i7g6s6$nml$1@dough.gmane.org>
	<20100923205224.3fc27060@pitrou.net>
	<AANLkTikQmJWVuvuf91-26uPW_mzx=XUu0RuuM3YphX8x@mail.gmail.com>
Message-ID: <i7gr3g$cet$1@dough.gmane.org>

On 9/23/2010 3:26 PM, Guido van Rossum wrote:
> On Thu, Sep 23, 2010 at 11:52 AM, Antoine Pitrou<solipsis at pitrou.net>  wrote:
>> On Thu, 23 Sep 2010 14:39:01 -0400
>> Terry Reedy<tjreedy at udel.edu>  wrote:
>>> If I were writing a class intended to implement a particular ABC, I
>>> would be happy to have an automated check function that might catch
>>> errors. 100% testing is hard to achieve.
>>
>> How would an automatic check function solve anything, if you don't test
>> that the class does what is expected?
>>
>> Again, this is exactly the argument for compile-time type checking, and
>> it is routinely pointed out that it is mostly useless.
>
> That may be the party line of dynamic-language diehards, but that
> doesn't make it true. There are plenty of times when compile-time
> checking can save the day, and typically, the larger a system, the
> more useful it becomes.

Sometimes you surprise me with your non-dogmatic practicality. I do 
hope, though, that you continue to reject C-like braces {;-}.

 > Antoine, can you back off your attempts to
> prove that the proposed feature is useless and instead help designing
> the details of the feature (or if you can't or don't want to help
> there, just stay out of the discussion)?

Yes, let the cat scratch his itch and see what he produces.

Since unit tests have been brought up, I have an idea and a question.
Can this work? Split the current test suite for a concrete class that 
implements one of the ABCs into concrete-specific and ABC-general 
portions, with the abstract part parameterized by concrete class.

For instance, split test/test_dict.py into test_dict.py and 
test_Mapping.py, where the latter has all tests that test compliance 
with the Mapping ABC (or whatever it is called) and the former keeps all 
the dict-specific extension tests. Rewrite test_Mapping so it is not 
dict specific, so one could write something like

class MyMapping:
    "Implement exactly the Mapping ABC with no extras."
    ...

if __name__ == '__main__':
    from test import test_Mapping as tM
    tM.concrete = MyMapping
    tM.runtests()
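
On the test_Mapping side, a rough sketch of what such a parameterized
module could contain (all names here are hypothetical):

    # test_Mapping.py (sketch)
    import unittest

    concrete = None  # the implementation under test; set by the caller

    class MappingComplianceTests(unittest.TestCase):
        def test_len_of_empty_mapping(self):
            self.assertEqual(len(concrete()), 0)

        def test_missing_key_not_contained(self):
            self.assertFalse('missing' in concrete())

    def runtests():
        suite = unittest.TestLoader().loadTestsFromTestCase(
            MappingComplianceTests)
        unittest.TextTestRunner().run(suite)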

This is similar to but not the same as splitting tests into generic and 
CPython parts, the latter for reuse by other implementations of the 
interpreter. (For dicts, test_dict.py could still be so split, or a 
portion of it made conditional on the platform.) This idea is for reuse 
of tests by other implementations of ABCs, whatever interpreter 
implementation they run under.

The underlying question is whether ABCs are intended to be an integral 
part of Python 3 or just an optional extra tucked away in a corner 
(which is how many, including me, still tend to view them). If the 
former, then to me they should, if possible, be supported by a semantic 
validation test suite.

In a way, I am agreeing with Antoine's objection that signature 
validation is not enough, but with the opposite suggestion: extend, 
rather than reject, Tarek's idea of providing automated test tools that 
make writing and using ABCs easier.

-- 
Terry Jan Reedy



From digitalxero at gmail.com  Fri Sep 24 05:42:41 2010
From: digitalxero at gmail.com (Dj Gilcrease)
Date: Thu, 23 Sep 2010 23:42:41 -0400
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
Message-ID: <AANLkTi=5Y5pPPsO71uGOb9wCc55Bo2t8dDvWAFMePq=Z@mail.gmail.com>

On Thu, Sep 23, 2010 at 1:51 PM, Tarek Ziadé <ziade.tarek at gmail.com> wrote:
> Why are you thinking about unit tests? Don't you ever use
> issubclass/isinstance in your programs?
>
> Checking signatures using ABCs when you create a plugin system is one
> use case, for instance.

This is something that I have implemented (before ABCs) in plugin
systems I use. When loading a plugin I validate that all methods exist
and that each method has the correct number of required arguments. I
generally don't check argument names, as my plugin systems all pass by
position instead of by keyword. If the signature I am checking contains
*args, it automatically passes the check. If the plugin fails the check,
I don't load it.
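
Something like this, roughly (passes_check is a made-up name):

    import inspect

    def passes_check(method, required_count):
        # Anything that takes *args passes automatically; otherwise
        # the number of mandatory (non-defaulted) arguments must match.
        spec = inspect.getfullargspec(method)
        if spec.varargs is not None:
            return True
        mandatory = len(spec.args) - len(spec.defaults or ())
        return mandatory == required_count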

On Thu, Sep 23, 2010 at 2:01 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> Again, why do you want to check signatures? Do you not trust plugin
> authors to write plugins?

No, no I don't. I have had several plugin authors come to me
complaining that the plugin system is broken because it won't load
their plugin (even with a fairly detailed error message).


Dj Gilcrease
 ____
( |     \  o    ()   |  o  |`|
  |      |      /`\_/|      | |   ,__   ,_,   ,_,   __,    ,   ,_,
_|      | |    /      |  |   |/   /      /   |   |_/  /    |   / \_|_/
(/\___/  |/  /(__,/  |_/|__/\___/    |_/|__/\__/|_/\,/  |__/
         /|
         \|


From andrew at bemusement.org  Fri Sep 24 07:58:00 2010
From: andrew at bemusement.org (Andrew Bennetts)
Date: Fri, 24 Sep 2010 15:58:00 +1000
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <i7gr3g$cet$1@dough.gmane.org>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
	<i7g6s6$nml$1@dough.gmane.org> <20100923205224.3fc27060@pitrou.net>
	<AANLkTikQmJWVuvuf91-26uPW_mzx=XUu0RuuM3YphX8x@mail.gmail.com>
	<i7gr3g$cet$1@dough.gmane.org>
Message-ID: <20100924055800.GA2633@aihal.home.puzzling.org>

Terry Reedy wrote:
[...]
> Since unit tests have been brought up, I have an idea and a question.
> Can this work? Split the current test suite for a concrete class
> that implements one of the ABCs into concrete-specific and
> ABC-general portions, with the abstract part parameterized by
> concrete class.

FWIW, bzr's test suite has this facility, and bzr plugins that implement
various bzr interfaces will have tests for those interfaces
automatically applied.  (Being a Python 2.4+ project, bzr doesn't
actually use the ABCs feature, but we certainly use the concept of
"interface with many implemenations".)

E.g. if you define a new Transport (in bzr terms, a thing like FTP,
HTTP, etc) you probably want to make sure it complies with bzrlib's
expectations for Transports.  So you can include a get_test_permutations
function in your module that returns a list of (transport_class,
server_class) pairs.  [Unsurprisingly you need a test server to run
against, although for transports like LocalTransport (local filesystem
access) they can be very simple.]

It works very well, and is very useful both for bzrlib itself and
plugins.  We have "per-implementation" tests for: branch, bzrdir,
repository, interrepository, merger, transport, tree, workingtree,
uifactory, and more.  Look for bzrlib/tests/per_*.

It's not necessarily easy to write all those tests.  The more complex an
interface, the more likely it is you'll have many tests for that
interface that don't really apply to all implementations -- for instance
some Transports are read-only, or don't support list_dir, etc.  So tests
that involve those need to specifically check for that capability and
raise NotApplicable, and finding the exact right way to do that can be
tricky.  It's often easier to say "if isinstance(thing,
ParticularImplementation): ...", but that quickly erodes the
applicability of those tests for new implementations.

Also tricky is when the setup or even assertions for some tests needs to
vary considerably by implementation: how complicated is your
parameterisation interface going to have to be?

bzr has found it worthwhile, so I do encourage trying it.  I'd use
Robert Collins' http://launchpad.net/testscenarios library if I were
providing this infrastructure in a suite that doesn't already have this
approach; it's basically a distillation of the infrastructure developed
in bzrlib.tests.

-Andrew.


From g.brandl at gmx.net  Fri Sep 24 09:15:00 2010
From: g.brandl at gmx.net (Georg Brandl)
Date: Fri, 24 Sep 2010 09:15:00 +0200
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
Message-ID: <i7hj68$k4l$1@dough.gmane.org>

On 23.09.2010 16:37, Tarek Ziadé wrote:
> Hello,
> 
> ABC __subclasshook__ implementations only check that the method is
> present in the class. That's the case, for example, in
> collections.Container: it checks that the __contains__ method is
> present, but that's it. It won't check that the method takes exactly
> one argument, e.g. __contains__(self, x).
> 
> The problem is that the implemented method could have a different
> argument list and will eventually fail when called.

I'm not concerned about this in the least.  Whoever implements a special
method with the wrong signature has more pressing problems than a false-
positive ABC subclass check.  And AFAIK, our ABCs only check for special
methods.

> Using inspect, we could check in __subclasshook__ that the arguments
> defined are the same as the ones defined in the abstract method --
> the names and the ordering.

"ordering"?

Georg


-- 
Thus spake the Lord: Thou shalt indent with four spaces. No more, no less.
Four shall be the number of spaces thou shalt indent, and the number of thy
indenting shall be four. Eight shalt thou not indent, nor either indent thou
two, excepting that thou then proceed to four. Tabs are right out.



From daniel at stutzbachenterprises.com  Fri Sep 24 16:17:19 2010
From: daniel at stutzbachenterprises.com (Daniel Stutzbach)
Date: Fri, 24 Sep 2010 09:17:19 -0500
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <i7gr3g$cet$1@dough.gmane.org>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>
	<20100923173955.4fc0bb03@pitrou.net>
	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>
	<1285259569.3178.9.camel@localhost.localdomain>
	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>
	<i7g6s6$nml$1@dough.gmane.org> <20100923205224.3fc27060@pitrou.net>
	<AANLkTikQmJWVuvuf91-26uPW_mzx=XUu0RuuM3YphX8x@mail.gmail.com>
	<i7gr3g$cet$1@dough.gmane.org>
Message-ID: <AANLkTikSw4-G6tFj_14CoW7PsmBRcxLOR-h4wbHy7E1P@mail.gmail.com>

On Thu, Sep 23, 2010 at 7:24 PM, Terry Reedy <tjreedy at udel.edu> wrote:

> Can this work? Split the current test suite for a concrete class that
> implements one of the ABCs into concrete-specific and ABC-general portions,
> with the abstract part parameterized by concrete class.
>
> For instance, split test/test_dict.py into test_dict.py and
> test_Mapping.py, where the latter has all tests that test compliance with
> the Mapping ABC (or whatever it is called) and the former keeps all the
> dict-specific extension tests. Rewrite test_Mapping so it is not dict
> specific, so one could write something like
>

As a heavy user of the ABCs in the collections module, that would be
awesome. :-)  It would make my life a lot easier when I'm writing tests to
go along with an ABC-derived class.  I have 8 such classes on PyPI
(heapdict.heapdict and blist.*), plus more in private repositories.

There is some code vaguely along those lines in the existing unit tests.
For example, Lib/test/seq_tests.py contains tests common to sequences.
However, that was written before collections.Sequence came along, and the
pre-2.6 definition of "sequence" only loosely correlates with a
collections.Sequence.

-- 
Daniel Stutzbach, Ph.D.
President, Stutzbach Enterprises, LLC <http://stutzbachenterprises.com/>

From tjreedy at udel.edu  Fri Sep 24 18:20:49 2010
From: tjreedy at udel.edu (Terry Reedy)
Date: Fri, 24 Sep 2010 12:20:49 -0400
Subject: [Python-ideas] ABC: what about the method arguments ?
In-Reply-To: <AANLkTikSw4-G6tFj_14CoW7PsmBRcxLOR-h4wbHy7E1P@mail.gmail.com>
References: <AANLkTinTqnRq8aGUTT0uzZp=sVask+82We6FPmv5CQfb@mail.gmail.com>	<20100923173955.4fc0bb03@pitrou.net>	<AANLkTi=rEYCNrd8_hcUUi4wsdjOa00a=OPumhcF=joQ1@mail.gmail.com>	<1285259569.3178.9.camel@localhost.localdomain>	<AANLkTintC7n26fQ5KAybne=-qAWDTnEEE+Adsp=Cxjtz@mail.gmail.com>	<i7g6s6$nml$1@dough.gmane.org>
	<20100923205224.3fc27060@pitrou.net>	<AANLkTikQmJWVuvuf91-26uPW_mzx=XUu0RuuM3YphX8x@mail.gmail.com>	<i7gr3g$cet$1@dough.gmane.org>
	<AANLkTikSw4-G6tFj_14CoW7PsmBRcxLOR-h4wbHy7E1P@mail.gmail.com>
Message-ID: <i7ij53$1up$1@dough.gmane.org>

On 9/24/2010 10:17 AM, Daniel Stutzbach wrote:
> On Thu, Sep 23, 2010 at 7:24 PM, Terry Reedy <tjreedy at udel.edu> wrote:
>
>     Can this work? Split the current test suite for a concrete class
>     that implements one of the ABCs into concrete-specific and
>     ABC-general portions, with the abstract part parameterized by
>     concrete class.
>
>     For instance, split test/test_dict.py into test_dict.py and
>     test_Mapping.py, where the latter has all tests that test compliance
>     with the Mapping ABC (or whatever it is called) and the former keeps
>     all the dict-specific extension tests. Rewrite test_Mapping so it is
>     not dict specific, so one could write something like

Reading the responses, I realized that I am already doing a simplified 
version of my suggestion for functions rather than classes. For didactic 
purposes, I am writing multiple implementations of multiple abstract 
functions. I embody a test for a particular function in an iterable of 
input-output pairs (where the 'output' can also be an exception class). 
I use that with a custom super test function that tests one or more 
callables against the pairs. It works great and it is easy to add 
another implementation or more pairs.
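
In outline, the harness looks something like this (names made up):

    def run_pairs(funcs, pairs):
        # Test one or more callables against (args, expected) pairs,
        # where 'expected' may also be an exception class.
        for func in funcs:
            for args, expected in pairs:
                if (isinstance(expected, type)
                        and issubclass(expected, Exception)):
                    try:
                        func(*args)
                    except expected:
                        continue
                    raise AssertionError("%s%r did not raise %s"
                                         % (func.__name__, args,
                                            expected.__name__))
                elif func(*args) != expected:
                    raise AssertionError("%s%r != %r"
                                         % (func.__name__, args,
                                            expected))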

> As a heavy user of the ABCs in the collections module, that would be
> awesome. :-)  It would make my life a lot easier when I'm writing tests
> to go along with an ABC-derived class.  I have 8 such classes on PyPi
> (heapdict.heapdict and blist.*), plus more in private repositories.
>
> There is some code vaguely along those lines in the existing unit tests.
>   For example, Lib/test/seq_tests.py contains tests common to sequences.
>   However, that was written before collections.Sequence came along and
> the pre-2.6 definition of "sequence" only loosely correlates with a
> collections.Sequence.

Well, pick one existing test file, revise and extend and perhaps split, 
start a tracker issue with proposed patch, get comments, and perhaps 
commit it. If you do, add terry.reedy as nosy.

-- 
Terry Jan Reedy



From greg.ewing at canterbury.ac.nz  Sat Sep 25 03:55:55 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 25 Sep 2010 13:55:55 +1200
Subject: [Python-ideas] [Python-Dev] os.path function for "get the real filename"
In-Reply-To: <877hia4tte.fsf_-_@benfinney.id.au>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <877hia4tte.fsf_-_@benfinney.id.au>
Message-ID: <4C9D56AB.2060602@canterbury.ac.nz>

Ben Finney wrote:

> Your heuristics seem to assume there will only ever be a maximum of one
> match, which is false. I present the following example:
> 
>     $ ls foo/
>         bAr.dat  BaR.dat  bar.DAT

There should perhaps be an extra step at the beginning:

0) Test whether the specified path refers to an existing
file. If not, raise an exception.

If that passes, and the file system is case-sensitive, then
there must be a directory entry that is an exact match, so
it will be returned by step 1.

If the file system is case-insensitive, then there can be
at most one entry that matches except for case, and it must
be the one we're looking for, so there is no need for the
extra test in step 2.

So the revised algorithm is:

0) Test whether the specified path refers to an existing
    file. If not, raise an exception.

1) Search the directory for an exact match, return it if found.

2) Search for a match ignoring case, and return one if found.

3) Otherwise, raise an exception.
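
In code, a rough sketch (the function name is made up):

    import os

    def getrealfilename(path):
        # 0) The path must refer to an existing file.
        if not os.path.exists(path):
            raise OSError("no such file: %r" % path)
        head, tail = os.path.split(path)
        entries = os.listdir(head or '.')
        # 1) Exact match first.
        if tail in entries:
            return os.path.join(head, tail)
        # 2) Fall back to a match ignoring case.
        for name in entries:
            if name.lower() == tail.lower():
                return os.path.join(head, name)
        # 3) Otherwise fail (shouldn't happen if step 0 passed).
        raise OSError("no directory entry matching %r" % path)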

There's also some prior art that might be worth looking at:
On Windows, Python checks to see whether the file name of an
imported module has the same case as the name being imported,
which is a similar problem in some ways.

> It seems to me this whole thing should be hashed out on "python-ideas".

Good point -- I've redirected the discussion there.

-- 
Greg



From greg.ewing at canterbury.ac.nz  Sat Sep 25 03:56:06 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sat, 25 Sep 2010 13:56:06 +1200
Subject: [Python-ideas] [Python-Dev] os.path.normcase rationale?
In-Reply-To: <AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
Message-ID: <4C9D56B6.9050908@canterbury.ac.nz>

Guido van Rossum wrote:

> Maybe the API could be called os.path.unnormpath(), since it is in a
> sense the opposite of normpath() (which removes case) ?

Cute, but not very intuitive. Something like actualpath()
might be better -- although that's somewhat arbitrarily
different from realpath().

-- 
Greg


From python at mrabarnett.plus.com  Sat Sep 25 04:14:51 2010
From: python at mrabarnett.plus.com (MRAB)
Date: Sat, 25 Sep 2010 03:14:51 +0100
Subject: [Python-ideas] [Python-Dev] os.path.normcase rationale?
In-Reply-To: <4C9D56B6.9050908@canterbury.ac.nz>
References: <4C9531A7.10405@simplistix.co.uk>	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>	<4C9C79DA.7000506@simplistix.co.uk>	<20100924121737.309071FA5C2@kimball.webabinitio.net>	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>	<4C9D21E8.1080005@canterbury.ac.nz>
	<4C9D298A.3010407@g.nevcal.com>	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
Message-ID: <4C9D5B1B.3020709@mrabarnett.plus.com>

On 25/09/2010 02:56, Greg Ewing wrote:
> Guido van Rossum wrote:
>
>> Maybe the API could be called os.path.unnormpath(), since it is in a
>> sense the opposite of normpath() (which removes case) ?
>
> Cute, but not very intuitive. Something like actualpath()
> might be better -- although that's somewhat arbitrarily
> different from realpath().
>
'actualcase' perhaps? Does it need to end in 'path'?


From solipsis at pitrou.net  Sat Sep 25 12:11:42 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sat, 25 Sep 2010 12:11:42 +0200
Subject: [Python-ideas] [Python-Dev] os.path.normcase rationale?
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
Message-ID: <20100925121142.74fe35e1@pitrou.net>

On Sat, 25 Sep 2010 13:56:06 +1200
Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Guido van Rossum wrote:
> 
> > Maybe the API could be called os.path.unnormpath(), since it is in a
> > sense the opposite of normpath() (which removes case) ?
> 
> Cute, but not very intuitive. Something like actualpath()
> might be better -- although that's somewhat arbitrarily
> different from realpath().

Again, why not simply improve realpath()?




From ben+python at benfinney.id.au  Sat Sep 25 16:00:57 2010
From: ben+python at benfinney.id.au (Ben Finney)
Date: Sun, 26 Sep 2010 00:00:57 +1000
Subject: [Python-ideas] 'os.path.foo' function to get the name of a filesystem entry (was: [Python-Dev] os.path.normcase rationale?)
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
Message-ID: <87vd5t3oti.fsf_-_@benfinney.id.au>

Antoine Pitrou <solipsis at pitrou.net>
writes:

> Again, why not simply improve realpath()?

Because that already does what it says it does.

The behaviour being asked for is distinct from what "os.path.normcase"
and "os.path.realpath" are meant to do, so that behaviour belongs in a
different place from those two.

-- 
 \           "Value your freedom or you will lose it, teaches history. |
  `\     "Don't bother us with politics," respond those who don't want |
_o__)                              to learn." --Richard Stallman, 2002 |
Ben Finney



From solipsis at pitrou.net  Sat Sep 25 16:11:57 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sat, 25 Sep 2010 16:11:57 +0200
Subject: [Python-ideas] reusing realpath()
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
	<87vd5t3oti.fsf_-_@benfinney.id.au>
Message-ID: <20100925161157.17059398@pitrou.net>

On Sun, 26 Sep 2010 00:00:57 +1000
Ben Finney <ben+python at benfinney.id.au> wrote:
> Antoine Pitrou <solipsis at pitrou.net>
> writes:
> 
> > Again, why not simply improve realpath()?
> 
> Because that already does what it says it does.

So what? The behaviour of fetching the canonical name can be added to
the behaviour of resolving symlinks. It wouldn't be incompatible with
the current behaviour AFAICT. And it would be better than adding yet
another function to our ménagerie of path-normalizing functions.
We already have abspath(), normpath(), normcase(), realpath() -- all
with very descriptive names as you might notice. We don't need another
function.

Regards

Antoine.




From guido at python.org  Sat Sep 25 22:55:30 2010
From: guido at python.org (Guido van Rossum)
Date: Sat, 25 Sep 2010 13:55:30 -0700
Subject: [Python-ideas] reusing realpath()
In-Reply-To: <20100925161157.17059398@pitrou.net>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
	<87vd5t3oti.fsf_-_@benfinney.id.au>
	<20100925161157.17059398@pitrou.net>
Message-ID: <AANLkTi=SPocJ=C+d-yzHxVbtwV3TdZ=rEEqukmDwumHc@mail.gmail.com>

On Sat, Sep 25, 2010 at 7:11 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Sun, 26 Sep 2010 00:00:57 +1000
> Ben Finney <ben+python at benfinney.id.au> wrote:
>> Antoine Pitrou <solipsis at pitrou.net>
>> writes:
>>
>> > Again, why not simply improve realpath()?
>>
>> Because that already does what it says it does.
>
> So what? The behaviour of fetching the canonical name can be added to
> the behaviour of resolving symlinks. It wouldn't be incompatible with
> the current behaviour AFAICT. And it would be better than adding yet
> another function to our ménagerie of path-normalizing functions.
> We already have abspath(), normpath(), normcase(), realpath() -- all
> with very descriptive names as you might notice. We don't need another
> function.

There's no need to get all emotional or sarcastic about it. You might
have noticed the risks of sarcasm on this list recently.

Instead, it should be possible to analyze how realpath() is currently
used and see if changing it as desired is likely to break any code.

TBH, I am personally on the fence and would like to see an analysis
including the current and desired behavior in the following cases:

- Windows
- OS X
- Other Unixoid systems

Also take into account:

- Filesystems whose case behavior is the opposite of the platform
default (all three support such filesystems through system
configuration and/or mounting)
- Relative paths
- Paths containing symlinks

In any case it is much easier to design and implement the best
possible functionality if you don't also have to be backward
compatible with an existing function. I think it might be useful to
call this new API (let's call it "casefulpath" while we wait for a
better name to come to us :-) on a relative path without having the
answer turned into an absolute path -- if that's desired it's easy
enough to call abspath() or realpath() on the result.

-- 
--Guido van Rossum (python.org/~guido)


From solipsis at pitrou.net  Sat Sep 25 23:04:03 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sat, 25 Sep 2010 23:04:03 +0200
Subject: [Python-ideas] reusing realpath()
In-Reply-To: <AANLkTi=SPocJ=C+d-yzHxVbtwV3TdZ=rEEqukmDwumHc@mail.gmail.com>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
	<87vd5t3oti.fsf_-_@benfinney.id.au>
	<20100925161157.17059398@pitrou.net>
	<AANLkTi=SPocJ=C+d-yzHxVbtwV3TdZ=rEEqukmDwumHc@mail.gmail.com>
Message-ID: <1285448643.17320.1.camel@localhost.localdomain>

On Saturday 25 September 2010 at 13:55 -0700, Guido van Rossum wrote:
> 
> There's no need to get all emotional or sarcastic about it. You might
> have noticed the risks of sarcasm on this list recently.

Ironic considering the naming of the language :)
Anyway:

> I think it might be useful to
> call this new API (let's call it "casefulpath" while we wait for a
> better name to come to us :-)

realcase() ?




From pjenvey at underboss.org  Sat Sep 25 23:57:42 2010
From: pjenvey at underboss.org (Philip Jenvey)
Date: Sat, 25 Sep 2010 14:57:42 -0700
Subject: [Python-ideas] reusing realpath()
In-Reply-To: <20100925161157.17059398@pitrou.net>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
	<87vd5t3oti.fsf_-_@benfinney.id.au>
	<20100925161157.17059398@pitrou.net>
Message-ID: <E10B9E3E-5848-40A1-AA39-FF55B06FF41C@underboss.org>


On Sep 25, 2010, at 7:11 AM, Antoine Pitrou wrote:

> On Sun, 26 Sep 2010 00:00:57 +1000
> Ben Finney <ben+python at benfinney.id.au> wrote:
>> Antoine Pitrou <solipsis at pitrou.net>
>> writes:
>> 
>>> Again, why not simply improve realpath()?
>> 
>> Because that already does what it says it does.
> 
> So what? The behaviour of fetching the canonical name can be added to
> the behaviour of resolving symlinks. It wouldn't be incompatible with
> the current behaviour AFAICT. And it would be better than adding yet
> another function to our ménagerie of path-normalizing functions.
> We already have abspath(), normpath(), normcase(), realpath() -- all
> with very descriptive names as you might notice. We don't need another
> function.

realpath's docs describe its result as "the canonical path of the specified filename, eliminating any symbolic links encountered in the path (if they are supported by the operating system)".

"Canonical" should describe the behavior we're after, with the correct case of the filename as it is actually stored on disk.

But isn't realpath modeled after POSIX realpath(3)? realpath(3) doesn't seem to clearly guarantee the original name as stored on disk either. However, realpath(3) on OS X 10.6 with case-insensitive HFS+ does return the original name as it was stored. Do any other platforms do this, and do we care about maintaining parity with realpath(3)?

--
Philip Jenvey

From greg.ewing at canterbury.ac.nz  Sun Sep 26 01:02:00 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Sun, 26 Sep 2010 11:02:00 +1200
Subject: [Python-ideas] reusing realpath()
In-Reply-To: <20100925161157.17059398@pitrou.net>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
	<87vd5t3oti.fsf_-_@benfinney.id.au>
	<20100925161157.17059398@pitrou.net>
Message-ID: <4C9E7F68.9030308@canterbury.ac.nz>

Antoine Pitrou wrote:

> So what? The behaviour of fetching the canonical name can be added to
> the behaviour of resolving symlinks.

Finding the actual name (I wouldn't call it "canonical",
since that term could be ambiguous) requires reading the
contents of entire directories at each step, which could
be noticeably less efficient than what realpath() currently
does. Users who only want symlinks expanded might object
to that.
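
For instance, recovering the on-disk spelling of each component might
look roughly like the following (an illustrative, POSIX-only sketch;
actualcase() is a hypothetical helper, not an existing os.path
function):

    import os

    def actualcase(path):
        # Rebuild `path` one component at a time, spelling each
        # component the way it is actually stored on disk.
        result = os.sep if os.path.isabs(path) else ""
        for part in path.strip(os.sep).split(os.sep):
            parent = result or "."
            # This per-level directory scan is the expensive step.
            for entry in os.listdir(parent):
                if os.path.normcase(entry) == os.path.normcase(part):
                    part = entry
                    break
            result = os.path.join(result, part)
        return result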

An option could be added to realpath(), but then we're
into constant-parameter territory.

-- 
Greg


From ncoghlan at gmail.com  Sun Sep 26 10:17:49 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 26 Sep 2010 18:17:49 +1000
Subject: [Python-ideas] reusing realpath()
In-Reply-To: <4C9E7F68.9030308@canterbury.ac.nz>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
	<87vd5t3oti.fsf_-_@benfinney.id.au>
	<20100925161157.17059398@pitrou.net>
	<4C9E7F68.9030308@canterbury.ac.nz>
Message-ID: <AANLkTingcjvesxuzOaX-TwoVnVnm5GohGV0uQgvANN2Q@mail.gmail.com>

On Sun, Sep 26, 2010 at 9:02 AM, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Antoine Pitrou wrote:
>
>> So what? The behaviour of fetching the canonical name can be added to
>> the behaviour of resolving symlinks.
>
> Finding the actual name (I wouldn't call it "canonical",
> since that term could be ambiguous) requires reading the
> contents of entire directories at each step, which could
> be noticeably less efficient than what realpath() currently
> does. Users who only want symlinks expanded might object
> to that.
>
> An option could be added to realpath(), but then we're
> into constant-parameter territory.

Constant parameter territory isn't *necessarily* a bad thing if the
number of parameters is sufficiently high. In particular, if you have
one basic command (say, "give me the canonical path for this
possibly-non-canonical path I already have") with a gazillion
different variants (*ahem*), then a single function with well-named
boolean parameters (to explain "this is what I really mean by
'canonical path'") is likely to be much easier for people to remember
than trying to create a concise-yet-meaningful mnemonic for each
variant.

So we shouldn't dismiss out of hand the idea of a keyword-only
"swiss-army" path normalisation function that can at least be queried
via help() if you forget the exact spelling for the various
parameters.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From dickinsm at gmail.com  Sun Sep 26 13:05:11 2010
From: dickinsm at gmail.com (Mark Dickinson)
Date: Sun, 26 Sep 2010 12:05:11 +0100
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
In-Reply-To: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
Message-ID: <AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>

On Tue, Sep 21, 2010 at 7:44 PM, Michael Gilbert
<michael.s.gilbert at gmail.com> wrote:
> It would be really nice if elementary mathematical operations such as
> sin/cosine (via __sin__ and __cos__) were available as base parts of
> the python data model [0]. This would make it easier to write new math
> classes, and it would eliminate the ugliness of things like self.exp().
>
> This would also eliminate the need for separate math and cmath
> libraries since those could be built into the default float and complex
> types.

Hmm.  Are you proposing adding 'sin', 'cos', etc. as new builtins?  If
so, I think this is a nonstarter:  the number of Python builtins is
deliberately kept quite small, and adding all these functions (we
could argue about which ones, but it seems to me that you're talking
about around 18 new builtins---e.g., 6 trig and inverse trig, 6
hyperbolic and inverse hyperbolic, exp, expm1, log, log10, log1p,
sqrt) would enlarge it considerably.  For many users, those functions
would just be additional bloat in builtins, and there's a possibility of
confusion with existing variables with the same name ('log' seems like
a particular candidate for this; 'sin' less likely, but who knows ;-).

A less invasive proposal would be just to introduce __sin__, etc.
magic methods and have math.sin delegate to <type>.__sin__;  i.e.,
have math.sin work in exactly the same way that math.floor and
math.ceil currently work.  That would be quite nice for e.g., the
decimal module:  you'd be able to write something like:

from math import sqrt
root = (-b + sqrt(b*b - 4*a*c)) / (2*a)

to compute the root of a quadratic equation, and it would work
regardless of whether a, b, c were Decimal instances or floats.

I'm not sure how I feel about the entailed magic method explosion, though.
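
For concreteness, the delegation could look something like this sketch
(hypothetical; no __sin__ hook exists today):

    import math as _math

    def sin(x):
        # Mirror the math.floor()/math.ceil() pattern: look the hook
        # up on the type, falling back to the float implementation.
        try:
            hook = type(x).__sin__
        except AttributeError:
            return _math.sin(float(x))
        return hook(x)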

Mark


From ncoghlan at gmail.com  Sun Sep 26 14:07:50 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 26 Sep 2010 22:07:50 +1000
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
In-Reply-To: <AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
	<AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
Message-ID: <AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>

On Sun, Sep 26, 2010 at 9:05 PM, Mark Dickinson <dickinsm at gmail.com> wrote:
> A less invasive proposal would be just to introduce __sin__, etc.
> magic methods and have math.sin delegate to <type>.__sin__;  i.e.,
> have math.sin work in exactly the same way that math.floor and
> math.ceil currently work.  That would be quite nice for e.g., the
> decimal module:  you'd be able to write something like:
>
> from math import sqrt
> root = (-b + sqrt(b*b - 4*a*c)) / (2*a)
>
> to compute the root of a quadratic equation, and it would work
> regardless of whether a, b, c were Decimal instances or floats.
>
> I'm not sure how I feel about the entailed magic method explosion, though.

Couple that with the extra function call overhead (since these
wouldn't have real typeslots) and it still seems like a less than
stellar idea.

As another use case for solid, efficient generic function support
though... great idea :)

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From solipsis at pitrou.net  Sun Sep 26 14:25:29 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 26 Sep 2010 14:25:29 +0200
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
	<AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
	<AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
Message-ID: <20100926142529.79ffaabd@pitrou.net>

On Sun, 26 Sep 2010 22:07:50 +1000
Nick Coghlan <ncoghlan at gmail.com> wrote:
> On Sun, Sep 26, 2010 at 9:05 PM, Mark Dickinson <dickinsm at gmail.com> wrote:
> > A less invasive proposal would be just to introduce __sin__, etc.
> > magic methods and have math.sin delegate to <type>.__sin__;  i.e.,
> > have math.sin work in exactly the same way that math.floor and
> > math.ceil currently work.  That would be quite nice for e.g., the
> > decimal module:  you'd be able to write something like:
> >
> > from math import sqrt
> > root = (-b + sqrt(b*b - 4*a*c)) / (2*a)
> >
> > to compute the root of a quadratic equation, and it would work
> > regardless of whether a, b, c were Decimal instances or floats.
> >
> > I'm not sure how I feel about the entailed magic method explosion, though.
> 
> Couple that with the extra function call overhead (since these
> wouldn't have real typeslots) and it still seems like a less than
> stellar idea.
> 
> As another use case for solid, efficient generic function support
> though... great idea :)

At the cost of even more execution overhead? :)

Regards

Antoine.




From ncoghlan at gmail.com  Sun Sep 26 14:34:06 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Sun, 26 Sep 2010 22:34:06 +1000
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
In-Reply-To: <20100926142529.79ffaabd@pitrou.net>
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
	<AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
	<AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
	<20100926142529.79ffaabd@pitrou.net>
Message-ID: <AANLkTi=uS+SjOYKBjkcXrecy5ZXNWnpRXyK8K_kkijVk@mail.gmail.com>

On Sun, Sep 26, 2010 at 10:25 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
>> Couple that with the extra function call overhead (since these
>> wouldn't have real typeslots) and it still seems like a less than
>> stellar idea.
>>
>> As another use case for solid, efficient generic function support
>> though... great idea :)
>
> At the cost of even more execution overhead? :)

I did put that "efficient" in there for a reason! Now, I'm not saying
anything about how *reasonable* that idea is, but I can dream ;)

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From solipsis at pitrou.net  Sun Sep 26 14:38:46 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Sun, 26 Sep 2010 14:38:46 +0200
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
	<AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
	<AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
	<20100926142529.79ffaabd@pitrou.net>
	<AANLkTi=uS+SjOYKBjkcXrecy5ZXNWnpRXyK8K_kkijVk@mail.gmail.com>
Message-ID: <20100926143846.7021d807@pitrou.net>

On Sun, 26 Sep 2010 22:34:06 +1000
Nick Coghlan <ncoghlan at gmail.com> wrote:
> On Sun, Sep 26, 2010 at 10:25 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> >> Couple that with the extra function call overhead (since these
> >> wouldn't have real typeslots) and it still seems like a less than
> >> stellar idea.
> >>
> >> As another use case for solid, efficient generic function support
> >> though... great idea :)
> >
> > At the cost of even more execution overhead? :)
> 
> I did put that "efficient" in there for a reason! Now, I'm not saying
> anything about how *reasonable* that idea is, but I can dream ;)

Well, I can't see how it could be less than the overhead involved in a
sqrt(x) -> x.__sqrt__() indirection anyway.

When I read Mark's example, I wondered why he didn't simply write
x**0.5 instead of sqrt(x), but it turns out it doesn't work on
decimals.

cheers

Antoine.




From masklinn at masklinn.net  Sun Sep 26 14:33:14 2010
From: masklinn at masklinn.net (Masklinn)
Date: Sun, 26 Sep 2010 14:33:14 +0200
Subject: [Python-ideas] Including elementary mathematical functions in
	the python data model
In-Reply-To: <AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
	<AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
	<AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
Message-ID: <369EA2B0-54ED-4204-96F9-408C4B8CB5BE@masklinn.net>

On 2010-09-26, at 14:07 , Nick Coghlan wrote:
> On Sun, Sep 26, 2010 at 9:05 PM, Mark Dickinson <dickinsm at gmail.com> wrote:
>> A less invasive proposal would be just to introduce __sin__, etc.
>> magic methods and have math.sin delegate to <type>.__sin__;  i.e.,
>> have math.sin work in exactly the same way that math.floor and
>> math.ceil currently work.  That would be quite nice for e.g., the
>> decimal module:  you'd be able to write something like:
>> 
>> from math import sqrt
>> root = (-b + sqrt(b*b - 4*a*c)) / (2*a)
>> 
>> to compute the root of a quadratic equation, and it would work
>> regardless of whether a, b, c were Decimal instances or floats.
>> 
>> I'm not sure how I feel about the entailed magic method explosion, though.
> 
> Couple that with the extra function call overhead (since these
> wouldn't have real typeslots) and it still seems like a less than
> stellar idea.
> 
> As another use case for solid, efficient generic function support
> though... great idea :)
> 
> Cheers,
> Nick.

Couldn't that also be managed via ABCs for numerical types? Make sqrt et al. methods of those types, and ride off into the sunset, no? The existing `math` functions could check for the presence of those methods (or the input types being instances of the ABCs they need), and fall back on the current implementations if they don't match.
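
A rough sketch of that fallback (illustrative only; of the numeric
types, only Decimal currently has such a method):

    import math as _math

    def sqrt(x):
        if hasattr(type(x), "sqrt"):
            return x.sqrt()       # e.g. decimal.Decimal.sqrt()
        return _math.sqrt(x)      # current float implementation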

From jason.orendorff at gmail.com  Sun Sep 26 17:48:35 2010
From: jason.orendorff at gmail.com (Jason Orendorff)
Date: Sun, 26 Sep 2010 10:48:35 -0500
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
In-Reply-To: <AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
	<AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
	<AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
Message-ID: <AANLkTinpNheZX6ksJqOBMd2Y_whP8PeAopmaeUfhbxNb@mail.gmail.com>

On Sun, Sep 26, 2010 at 7:07 AM, Nick Coghlan <ncoghlan at gmail.com> wrote:
> On Sun, Sep 26, 2010 at 9:05 PM, Mark Dickinson <dickinsm at gmail.com> wrote:
>> A less invasive proposal would be just to introduce __sin__, etc.
>> magic methods [...]
>>
>> I'm not sure how I feel about the entailed magic method explosion, though.
>
> Couple that with the extra function call overhead (since these
> wouldn't have real typeslots) and it still seems like a less than
> stellar idea.

This could certainly be implemented so as to be fast for floats and
flexible for everything else.

-j


From greg.ewing at canterbury.ac.nz  Mon Sep 27 00:21:47 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 27 Sep 2010 10:21:47 +1200
Subject: [Python-ideas] reusing realpath()
In-Reply-To: <AANLkTingcjvesxuzOaX-TwoVnVnm5GohGV0uQgvANN2Q@mail.gmail.com>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
	<87vd5t3oti.fsf_-_@benfinney.id.au>
	<20100925161157.17059398@pitrou.net>
	<4C9E7F68.9030308@canterbury.ac.nz>
	<AANLkTingcjvesxuzOaX-TwoVnVnm5GohGV0uQgvANN2Q@mail.gmail.com>
Message-ID: <4C9FC77B.1000104@canterbury.ac.nz>

Nick Coghlan wrote:

> Constant parameter territory isn't *necessarily* a bad thing if the
> number of parameters is sufficiently high.

That's true, but the number of parameters wouldn't be
high in this case.

-- 
Greg


From greg.ewing at canterbury.ac.nz  Mon Sep 27 00:29:19 2010
From: greg.ewing at canterbury.ac.nz (Greg Ewing)
Date: Mon, 27 Sep 2010 10:29:19 +1200
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
In-Reply-To: <AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
	<AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
	<AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
Message-ID: <4C9FC93F.9020708@canterbury.ac.nz>

Nick Coghlan wrote:

> Couple that with the extra function call overhead (since these
> wouldn't have real typeslots) and it still seems like a less than
> stellar idea.
> 
> As another use case for solid, efficient generic function support
> though... great idea :)

Could a generic function mechanism be made to have any
less overhead, though?

-- 
Greg


From ncoghlan at gmail.com  Mon Sep 27 14:15:27 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 27 Sep 2010 22:15:27 +1000
Subject: [Python-ideas] reusing realpath()
In-Reply-To: <4C9FC77B.1000104@canterbury.ac.nz>
References: <4C9531A7.10405@simplistix.co.uk>
	<AANLkTim6m00hVqRT9LTfXz=gaEmMEdxrCvk7jpF-3Lch@mail.gmail.com>
	<4C9C79DA.7000506@simplistix.co.uk>
	<20100924121737.309071FA5C2@kimball.webabinitio.net>
	<AANLkTi==y+pDw7h4KiBf0mX+CBVxS9Fw-oUX16zJ8bpi@mail.gmail.com>
	<AANLkTinz1H+j_uVmH+uOgdSU=6Aw0tJvZhqQ-SQpDRdB@mail.gmail.com>
	<4C9D21E8.1080005@canterbury.ac.nz> <4C9D298A.3010407@g.nevcal.com>
	<AANLkTikEjnbxRj_4mgwF0mBFqDS3g2pNzMicb1tkGbAO@mail.gmail.com>
	<4C9D56B6.9050908@canterbury.ac.nz>
	<20100925121142.74fe35e1@pitrou.net>
	<87vd5t3oti.fsf_-_@benfinney.id.au>
	<20100925161157.17059398@pitrou.net>
	<4C9E7F68.9030308@canterbury.ac.nz>
	<AANLkTingcjvesxuzOaX-TwoVnVnm5GohGV0uQgvANN2Q@mail.gmail.com>
	<4C9FC77B.1000104@canterbury.ac.nz>
Message-ID: <AANLkTimTjB24kJHjKX2XoPda_JNoPHF=r4G1+hR2iHkn@mail.gmail.com>

On Mon, Sep 27, 2010 at 8:21 AM, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Nick Coghlan wrote:
>
>> Constant parameter territory isn't *necessarily* a bad thing if the
>> number of parameters is sufficiently high.
>
> That's true, but the number of parameters wouldn't be
> high in this case.

How high is high enough? Just in realpath, normpath, normcase we
already have 3 options, with the "match the existing case-preserving
filename if it exists" variant requested in this discussion making it
4. Supporting platform-appropriate Unicode normalisation would make it
5.

Note that I'm not saying the swiss-army function is necessarily the
right answer here, but remembering "use os.realpath to get canonical
filenames" and then having a bunch of flags to enable/disable various
aspects of the normalisation (defaulting to the current implementation
of only expanding symlinks) fits my brain more easily than remembering
the distinctions between the tasks that currently correspond to each
function name. If there really isn't a name that makes sense for the
new variant, then maybe adding some constant parameters to one of the
existing methods is the way to go.

realpath and normpath are the two most likely candidates to use as a
basis for such an approach. If realpath was used as a basis, then it
would gain keyword-only parameters along the lines of
"expand_links=True", "collapse=False", "lower_case=False",
"match_case=False". Setting both lower_case=True and match_case=True
would trigger ValueError, but the API with separate boolean flags is
easier to use than one with a single tri-state parameter for the case
conversion. If normpath was used as a basis instead, then symlink
expansion would remain a separate operation and normpath would gain
"collapse=True", "lower_case=False", "match_case=False" as
keyword-only parameters.
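
Spelled out, the realpath-based variant might look like this
(signature sketch only):

    def realpath(path, *, expand_links=True, collapse=False,
                 lower_case=False, match_case=False):
        if lower_case and match_case:
            raise ValueError("lower_case and match_case are "
                             "mutually exclusive")
        ...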

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From ncoghlan at gmail.com  Mon Sep 27 14:20:14 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Mon, 27 Sep 2010 22:20:14 +1000
Subject: [Python-ideas] Including elementary mathematical functions in
 the python data model
In-Reply-To: <4C9FC93F.9020708@canterbury.ac.nz>
References: <20100921144452.3cfd118b.michael.s.gilbert@gmail.com>
	<AANLkTikpj2LV2FGYOpz6eby+hO54paXUz1WxRrmy0t01@mail.gmail.com>
	<AANLkTimDg9kapXLdvBbs7akT=M0nYJUycjrYCJUwh0k8@mail.gmail.com>
	<4C9FC93F.9020708@canterbury.ac.nz>
Message-ID: <AANLkTi=Sskee-GZkL2g8ggtSF4KSmgrUk5SGXoyHNJX9@mail.gmail.com>

On Mon, Sep 27, 2010 at 8:29 AM, Greg Ewing <greg.ewing at canterbury.ac.nz> wrote:
> Nick Coghlan wrote:
>
>> Couple that with the extra function call overhead (since these
>> wouldn't have real typeslots) and it still seems like a less than
>> stellar idea.
>>
>> As another use case for solid, efficient generic function support
>> though... great idea :)
>
> Could a generic function mechanism be made to have any
> less overhead, though?

See my response to Antoine - probably not. Although, as has been
pointed out by others, by doing the check for PyFloat_CheckExact early
and running the fast path immediately if that check passes, you can
avoid most of the overhead in the common case, even when using
pseudo-typeslots. So performance impact likely isn't a major factor
here after all.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From denis.spir at gmail.com  Tue Sep 28 10:27:07 2010
From: denis.spir at gmail.com (spir)
Date: Tue, 28 Sep 2010 10:27:07 +0200
Subject: [Python-ideas] multiline string notation
Message-ID: <20100928102707.5b3467ac@o>

Hello,



multiline string

While recently studying a game scripting language (*) and designing a toy language of mine, I realised the following two facts, which may be relevant for Python as well:



-1- no need for a separate multiline string notation

A single string format can deal with text including newlines, without any syntactic or parsing (**) issue: a string notation simply ends at the second quote.
I have no idea why Python introduced the distinction (and would like to know); possibly for historical reasons? The only advantage of """...""" seems to be that this format allows literal quotes in strings; am I right on this?



-2- trimming of indentation

On my computer, calling the following function:
    def write():
        if True:
            print """To be or not to be,
            that is the question."""
results in the following output:
    |To be or not to be,
    |        that is the question.
This is certainly not the programmer's intent. To get what is expected, one should write instead:
    def write():
        if True:
            print """To be or not to be,
    that is the question."""
...which distorts the visual presentation of code by breaking correct indentation.
To have a multiline text written on multiple lines and preserve indentation, one needs to use more complicated forms like:
    def write():
        if True:
            print "To be or not to be,\n" + \
            "that is the question."
(Actually, the '+' can be omitted here, but this fact is not commonly known.)

My project uses a visual structure à la Python (and no curly braces). Indentation is removed by the parser from the significant part of the code, even inside strings (and also comments). This allows the programmer to preserve a clean source outline while having multiline text written as is. In other words, the following routine would work as you would guess (':' is the assignment sign):
    write : action
         if true
            terminal.write "To be or not to be,
            that is the question."

I imagine the Python parser replaces indentation by block-delimiting tokens (analogous in role to C braces). My language's parser thus has a preprocessing phase that would transform the above piece of code into:
    write : action
    {
    if true
    {
    terminal.write "To be or not to be,
    that is the question."
    }
    }
The preprocessing routine is actually easier than it would be with Python's rules, since one can trim indents systematically, without any exception for strings (and comments).



Thank you for reading,
Denis

(*) namely WML, scripting language of the game called Wesnoth
(**) This is true for 1-pass parsers (like PEG), as well as for 2-pass ones (with separate lexical phase).
-- -- -- -- -- -- --
vit esse estrany

spir.wikidot.com



From mwm-keyword-python.b4bdba at mired.org  Tue Sep 28 10:58:42 2010
From: mwm-keyword-python.b4bdba at mired.org (Mike Meyer)
Date: Tue, 28 Sep 2010 04:58:42 -0400
Subject: [Python-ideas] multiline string notation
In-Reply-To: <20100928102707.5b3467ac@o>
References: <20100928102707.5b3467ac@o>
Message-ID: <20100928045842.346bb9d0@bhuda.mired.org>

On Tue, 28 Sep 2010 10:27:07 +0200
spir <denis.spir at gmail.com> wrote:

> Hello,
> 
> 
> 
> multiline string
> 
> While recently studying a game scripting language (*) and designing a toy language of mine, I realised the following two facts, which may be relevant for Python as well:
> 
> 
> 
> -1- no need for a separate multiline string notation
> 
> A single string format can deal with text including newlines, without any syntactic or parsing (**) issue: a string notation simply ends at the second quote.
> I have no idea why Python introduced the distinction (and would like to know); possibly for historical reasons? The only advantage of """...""" seems to be that this format allows literal quotes in strings; am I right on this?

No, you're not. The ' form allows literal "'s, and vice versa. The
reason for the triple-quoted string is to allow simple multi-line
string literals.

The reason you want both single and multi-line string literals is so
the parser can properly flag the error line when you forget to
terminate the far more common single-line literal. Not as important
now that nearly everything does syntax coloring, but still a nice
feature.

> -2- trimming of indentation
> 
> On my computer, calling the following function:
>     def write():
>         if True:
>             print """To be or not to be,
>             that is the question."""
> results in the following output:
>     |To be or not to be,
>     |        that is the question.
> This is certainly not the programmer's intent. To get what is expected, one should write instead:
>     def write():
>         if True:
>             print """To be or not to be,
>     that is the question."""
> ...which distorts the visual presentation of code by breaking correct indentation.
> To have a multiline text written on multiple lines and preserve indentation, one needs to use more complicated forms like:
>     def write():
>         if True:
>             print "To be or not to be,\n" + \
>             "that is the question."
> (Actually, the '+' can be omitted here, but this fact is not commonly known.)

And in 3.x, where print is a function instead of a statement, it could
be (leaving off the optional "+"):

def write():
    if True:
        print("To be or not to be,\n"
              "that is the question.")

So -1 for this idea.

       <mike


-- 
Mike Meyer <mwm at mired.org>		http://www.mired.org/consulting.html
Independent Network/Unix/Perforce consultant, email for more information.

O< ascii ribbon campaign - stop html mail - www.asciiribbon.org


From ncoghlan at gmail.com  Tue Sep 28 12:49:04 2010
From: ncoghlan at gmail.com (Nick Coghlan)
Date: Tue, 28 Sep 2010 20:49:04 +1000
Subject: [Python-ideas] multiline string notation
In-Reply-To: <20100928102707.5b3467ac@o>
References: <20100928102707.5b3467ac@o>
Message-ID: <AANLkTi=QBwgv4Fg7guhZeup=4hRk=W0NSq51YSf9jJqE@mail.gmail.com>

These two questions are ones where good arguments can be made in both
directions.

Having explicit notation for multi-line strings is primarily a benefit
for readability and error detection. The readability benefit is that
it flags to the reader that the next string literal may cover several
lines. As Mike noted, the error detection benefit is that the parser
can more readily detect a missing end-quote from a normal string
instead of inadvertently treating the entire rest of the file as part
of the string and giving a relatively useless error regarding EOF
while parsing a string.

Stripping leading whitespace even inside strings is potentially
convenient for the programmer, but breaks the tokenisation stream.
String literals are meant to be atomic. Having the parser digging
inside them to declare certain whitespace to not be part of the string
despite its presence in the source code is certainly a valid design
choice a language could make when defining its grammar, but would
actually be a fairly significant change for Python.

For Python, these two rules are a case of "status quo wins a
stalemate". Changing Python's behaviour in this area would be
difficult and time-consuming for negligible benefit, so it really
isn't worth doing.

Cheers,
Nick.

-- 
Nick Coghlan   |   ncoghlan at gmail.com   |   Brisbane, Australia


From taleinat at gmail.com  Tue Sep 28 12:57:08 2010
From: taleinat at gmail.com (Tal Einat)
Date: Tue, 28 Sep 2010 12:57:08 +0200
Subject: [Python-ideas] multiline string notation
In-Reply-To: <20100928102707.5b3467ac@o>
References: <20100928102707.5b3467ac@o>
Message-ID: <AANLkTimri80Rq0-C_VcyeNj7vzYs0HK3PhfeFUz48ssM@mail.gmail.com>

>
> -2- trimming of indentation
>
> On my computer, calling the following function:
>    def write():
>        if True:
>            print """To be or not to be,
>            that is the question."""
> results in the following output:
>    |To be or not to be,
>    |        that is the question.
> This is certainly not the programmer's intent. To get what is expected, one
> should write instead:
>    def write():
>        if True:
>            print """To be or not to be,
>    that is the question."""
> ...which distorts the visual presentation of code by breaking correct
> indentation.
> To have a multiline text written on multiple lines and preserve
> indentation, one needs to use more complicated forms like:
>    def write():
>        if True:
>            print "To be or not to be,\n" + \
>            "that is the question."
> (Actually, the '+' can be omitted here, but this fact is not commonly
> known.)
>
>
Have you heard of textwrap.dedent()? I usually would write this as:

def write():
    if True:
        print textwrap.dedent("""\
            To be or not to be,
            that is the question.""")

- Tal

From solipsis at pitrou.net  Tue Sep 28 14:57:04 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Tue, 28 Sep 2010 14:57:04 +0200
Subject: [Python-ideas] Prefetching on buffered IO files
References: <20100928004119.3963a4ad@pitrou.net>
	<AANLkTin6UQ73yH3DFrP8s_Wswwq0qdODH=i+en8_qZyW@mail.gmail.com>
Message-ID: <20100928145704.2fb2e382@pitrou.net>


Hello,

(moved to python-ideas)

On Mon, 27 Sep 2010 17:39:45 -0700
Guido van Rossum <guido at python.org> wrote:
> On Mon, Sep 27, 2010 at 3:41 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> > While trying to solve #3873 (poor performance of pickle on file
> > objects, due to the overhead of calling read() with very small values),
> > it occurred to me that the prefetching facilities offered by
> > BufferedIOBase are not flexible and efficient enough.
> 
> I haven't read the whole bug but there seem to be lots of different
> smaller issues there, right?

The bug entry is quite old and at first the slowness had to do with the
pure Python IO layer. Now the remaining performance difference with
Python 2 is entirely caused by the following core issue:

> It seems that one (unfortunate)
> constraint is that reading pickles cannot use buffered I/O (at least
> not on a non-seekable file) because the API has been documented to
> leave the file positioned right after the last byte of the pickled
> data, right?

Right.

> > Indeed, if you use seek() and read(), 1) you limit yourself to seekable
> > files 2) performance can be hampered by very bad seek() performance
> > (this is true on GzipFile).
> 
> Ow... I've always assumed that seek() is essentially free, because
> that's how a typical OS kernel implements it. If seek() is bad on
> GzipFile, how hard would it be to fix this?

The worst case is backwards seeks. Forward seeks are implemented as a
simple read(), which makes them O(k) where k is the displacement. For
buffering applications where k is bounded by the buffer size, it is
O(1) (still with, of course, a non-trivial multiplier).

Backwards seeks are implemented as rewinding the whole file (seek(0))
and then reading again up to the requested position, which makes them
O(n) with n the absolute target position. When your requirement is to
rewind by a bounded number of bytes in order to undo some readahead,
this is rather catastrophic.
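
Schematically, the behaviour described above amounts to something like
this (a simplified sketch, not the exact gzip module code):

    def seek(self, offset):
        if offset < self.offset:    # backwards seek: start over
            self.rewind()           # seek(0) on the raw file
        count = offset - self.offset
        while count > 1024:         # then decompress forward -> O(n)
            self.read(1024)
            count -= 1024
        self.read(count)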

I don't know how the gzip algorithm works under the hood; my impression
is that optimizing backwards seeks would have us save checkpoints of
the decompressor state and restore them if needed. It doesn't sound like a
trivial improvement, and would involve tradeoffs w.r.t.
performance of sequential reads.

  (I haven't looked at BZ2File, which has a totally different -- and
  outdated -- implementation)

It's why I would favour the peek() (or peek()-like, as in the prefetch()
idea) approach anyway. Not only does it work on unseekable files, but
implementing peek() when you have an internal buffer is quite simple
(see GzipFile.peek here: http://bugs.python.org/issue9962).

peek() could also be added to BytesIO even though it claims to
implement RawIOBase rather than BufferedIOBase.
(but of course, when you have a BytesIO, you can simply feed its
getvalue() or getbuffer() directly to pickle.loads)

> How common is the use case where you need to read a gzipped pickle
> *and* you need to leave the unzipped stream positioned exactly at the
> end of the pickle?

I really don't know. But I don't think we can break the API for a
special case without potentially causing nasty surprises for the user.

Also, my intuition is that pickling directly from a stream is partly
meant for cases where you want to access data following the pickle
data in the stream.

> > If instead you use peek() and read(), the situation is better, but you
> > end up doing multiple copies of data; also, you must call read() to
> > advance the file pointer even though you don't care about the results.
> 
> Have you measured how bad the situation is if you do implement it this way?

It is actually quite good compared to the status quo (3x to 10x), and as
good as the seek/read solution for regular files (and, of course, much
better for gzipped files once GzipFile.peek is implemented):
http://bugs.python.org/issue3873#msg117483

So, for solving the unpickle performance issue, it is sufficient.
Chances are the bottleneck for further improvements would be in the
unpickling logic itself. It feels a bit clunky, though.

Direct timing shows that peek()+read() has a non-trivial cost compared
to read():

$ ./python -m timeit -s "f=open('Misc/HISTORY', 'rb')" "f.seek(0)" \
  "while f.read(4096): pass"
1000 loops, best of 3: 277 usec per loop
$ ./python -m timeit -s "f=open('Misc/HISTORY', 'rb')" "f.seek(0)" \
  "while f.read(4096): f.peek(4096)"
1000 loops, best of 3: 361 usec per loop

(that's on a C extension type where peek() is almost a single call to
PyBytes_FromStringAndSize)

> > So I would propose adding the following method to BufferedIOBase:
> >
> > prefetch(self, buffer, skip, minread)
> >
> > Skip `skip` bytes from the stream.  Then, try to read at
> > least `minread` bytes and write them into `buffer`. The file
> > pointer is advanced by at most `skip + minread`, or less if
> > the end of file was reached. The total number of bytes written
> > in `buffer` is returned, which can be more than `minread`
> > if additional bytes could be prefetched (but, of course,
> > cannot be more than `len(buffer)`).
> >
> > Arguments:
> > - `buffer`: a writable buffer (e.g. bytearray)
> > - `skip`: number of bytes to skip (must be >= 0)
> > - `minread`: number of bytes to read (must be >= 0 and <= len(buffer))
> 
> I like the idea of an API that combines seek and read into a mutable
> buffer. However the semantics of this call seem really weird: there is
> no direct relationship between where it leaves the stream position and
> how much data it reads into the buffer. can you explain how exactly
> this will help solve the gzipped pickle performance problem?

The general idea with buffering is that:
- you want to skip the previously prefetched bytes (through peek()
  or prefetch()) which have been consumed -> hence the `skip` argument
- you want to consume a known number of bytes from the stream (for
  example a 4-bytes little-endian integer) -> hence the `minread`
  argument
- you would like to prefetch some more bytes if cheaply possible, so as
  to avoid calling read() or prefetch() too much; but you don't know
  yet if you will consume those bytes, so the file pointer shouldn't be
  advanced for them

If you don't prefetch more than the minimum needed amount of bytes, you
don't solve the performance problem at all (unpickling needs many tiny
reads). If you advance the file pointer after the whole prefetched data
(even though it might not be entirely consumed), you need to seek()
back at the end: it doesn't work on unseekable files, and is very slow
on some seekable file types.

So, the proposal is like a combination of forward seek() + read() +
peek() in a single call. With the advantages that:
- it works on non-seekable files (things like SocketIO)
- it allows the caller to operate in its own buffer (this is nice in C)
- it returns the data naturally concatenated, so you don't have to do
  it yourself if needed
- it gives more guarantees than peek() as to the min and max number of
  bytes returned; peek(), as it is not allowed to advance the file
  pointer, can return as little as 1 byte (even if you ask for 4096,
  and even if EOF isn't reached)

I also find it interesting that implementing a single primitive would be
enough for creating custom buffered types (by deriving other methods
from it), but the aesthetics of this can be controversial.
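
For reference, the intended semantics could be expressed on top of an
existing BufferedReader roughly as follows (an illustrative sketch
only; a real implementation would work on the internal buffer
directly):

    def prefetch(f, buffer, skip, minread):
        f.read(skip)                   # drop already-consumed bytes
        data = f.read(minread)         # advance by at most `minread`
        room = len(buffer) - len(data)
        extra = f.peek(room)[:room]    # lookahead, pointer unchanged
        buffer[:len(data)] = data
        buffer[len(data):len(data) + len(extra)] = extra
        return len(data) + len(extra)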

Regards

Antoine.




From guido at python.org  Tue Sep 28 16:08:08 2010
From: guido at python.org (Guido van Rossum)
Date: Tue, 28 Sep 2010 07:08:08 -0700
Subject: [Python-ideas] Prefetching on buffered IO files
In-Reply-To: <20100928145704.2fb2e382@pitrou.net>
References: <20100928004119.3963a4ad@pitrou.net>
	<AANLkTin6UQ73yH3DFrP8s_Wswwq0qdODH=i+en8_qZyW@mail.gmail.com>
	<20100928145704.2fb2e382@pitrou.net>
Message-ID: <AANLkTi=fggmgyr3yXaFrD5f0cbFprejnGZA1vonLA-Z7@mail.gmail.com>

On Tue, Sep 28, 2010 at 5:57 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:
> On Mon, 27 Sep 2010 17:39:45 -0700
> Guido van Rossum <guido at python.org> wrote:
>> On Mon, Sep 27, 2010 at 3:41 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:
>> > While trying to solve #3873 (poor performance of pickle on file
>> > objects, due to the overhead of calling read() with very small values),
>> > it occurred to me that the prefetching facilities offered by
>> > BufferedIOBase are not flexible and efficient enough.
>>
>> I haven't read the whole bug but there seem to be lots of different
>> smaller issues there, right?
>
> The bug entry is quite old and at first the slowness had to do with the
> pure Python IO layer. Now the remaining performance difference with
> Python 2 is entirely caused by the following core issue:
>
>> It seems that one (unfortunate)
>> constraint is that reading pickles cannot use buffered I/O (at least
>> not on a non-seekable file) because the API has been documented to
>> leave the file positioned right after the last byte of the pickled
>> data, right?
>
> Right.
>
>> > Indeed, if you use seek() and read(), 1) you limit yourself to seekable
>> > files 2) performance can be hampered by very bad seek() performance
>> > (this is true on GzipFile).
>>
>> Ow... I've always assumed that seek() is essentially free, because
>> that's how a typical OS kernel implements it. If seek() is bad on
>> GzipFile, how hard would it be to fix this?
>
> The worst case is backwards seeks. Forward seeks are implemented as a
> simple read(), which makes them O(k) where k is the displacement. For
> buffering applications where k is bounded by the buffer size, it is
> O(1) (still with, of course, a non-trivial multiplier).
>
> Backwards seeks are implemented as rewinding the whole file (seek(0))
> and then reading again up to the requested position, which makes them
> O(n) with n the absolute target position. When your requirement is to
> rewind by a bounded number of bytes in order to undo some readahead,
> this is rather catastrophic.
>
> I don't know how the gzip algorithm works under the hood; my impression
> is that optimizing backwards seeks would have us save checkpoints of
> the decompressor state and restore them if needed. It doesn't sound like a
> trivial improvement, and would involve tradeoffs w.r.t.
> performance of sequential reads.
>
>   (I haven't looked at BZ2File, which has a totally different -- and
>   outdated -- implementation)
>
> It's why I would favour the peek() (or peek()-like, as in the prefetch()
> idea) approach anyway. Not only does it work on unseekable files, but
> implementing peek() when you have an internal buffer is quite simple
> (see GzipFile.peek here: http://bugs.python.org/issue9962).
>
> peek() could also be added to BytesIO even though it claims to
> implement RawIOBase rather than BufferedIOBase.
> (but of course, when you have a BytesIO, you can simply feed its
> getvalue() or getbuffer() directly to pickle.loads)
>
>> How common is the use case where you need to read a gzipped pickle
>> *and* you need to leave the unzipped stream positioned exactly at the
>> end of the pickle?
>
> I really don't know. But I don't think we can break the API for a
> special case without potentially causing nasty surprises for the user.
>
> Also, my intuition is that pickling directly from a stream is partly
> meant for cases where you want to access data following the pickle
> data in the stream.
>
>> > If instead you use peek() and read(), the situation is better, but you
>> > end up doing multiple copies of data; also, you must call read() to
>> > advance the file pointer even though you don't care about the results.
>>
>> Have you measured how bad the situation is if you do implement it this way?
>
> It is actually quite good compared to the status quo (3x to 10x), and as
> good as the seek/read solution for regular files (and, of course, much
> better for gzipped files once GzipFile.peek is implemented):
> http://bugs.python.org/issue3873#msg117483
>
> So, for solving the unpickle performance issue, it is sufficient.
> Chances are the bottleneck for further improvements would be in the
> unpickling logic itself. It feels a bit clunky, though.
>
> Direct timing shows that peek()+read() has a non-trivial cost compared
> to read():
>
> $ ./python -m timeit -s "f=open('Misc/HISTORY', 'rb')" "f.seek(0)" \
> ?"while f.read(4096): pass"
> 1000 loops, best of 3: 277 usec per loop
> $ ./python -m timeit -s "f=open('Misc/HISTORY', 'rb')" "f.seek(0)" \
> ?"while f.read(4096): f.peek(4096)"
> 1000 loops, best of 3: 361 usec per loop
>
> (that's on a C extension type where peek() is almost a single call to
> PyBytes_FromStringAndSize)
>
>> > So I would propose adding the following method to BufferedIOBase:
>> >
>> > prefetch(self, buffer, skip, minread)
>> >
> >> > Skip `skip` bytes from the stream.  Then, try to read at
>> > least `minread` bytes and write them into `buffer`. The file
>> > pointer is advanced by at most `skip + minread`, or less if
>> > the end of file was reached. The total number of bytes written
>> > in `buffer` is returned, which can be more than `minread`
>> > if additional bytes could be prefetched (but, of course,
>> > cannot be more than `len(buffer)`).
>> >
>> > Arguments:
>> > - `buffer`: a writable buffer (e.g. bytearray)
>> > - `skip`: number of bytes to skip (must be >= 0)
>> > - `minread`: number of bytes to read (must be >= 0 and <= len(buffer))
>>
>> I like the idea of an API that combines seek and read into a mutable
>> buffer. However the semantics of this call seem really weird: there is
>> no direct relationship between where it leaves the stream position and
>> how much data it reads into the buffer. can you explain how exactly
>> this will help solve the gzipped pickle performance problem?
>
> The general idea with buffering is that:
> - you want to skip the previously prefetched bytes (through peek()
>   or prefetch()) which have been consumed -> hence the `skip` argument
> - you want to consume a known number of bytes from the stream (for
>   example a 4-bytes little-endian integer) -> hence the `minread`
>   argument
> - you would like to prefetch some more bytes if cheaply possible, so as
>   to avoid calling read() or prefetch() too much; but you don't know
>   yet if you will consume those bytes, so the file pointer shouldn't be
>   advanced for them
>
> If you don't prefetch more than the minimum needed amount of bytes, you
> don't solve the performance problem at all (unpickling needs many tiny
> reads). If you advance the file pointer after the whole prefetched data
> (even though it might not be entirely consumed), you need to seek()
> back at the end: it doesn't work on unseekable files, and is very slow
> on some seekable file types.
>
> So, the proposal is like a combination of forward seek() + read() +
> peek() in a single call. With the advantages that:
> - it works on non-seekable files (things like SocketIO)
> - it allows the caller to operate in its own buffer (this is nice in C)
> - it returns the data naturally concatenated, so you don't have to do
>   it yourself if needed
> - it gives more guarantees than peek() as to the min and max number of
>   bytes returned; peek(), as it is not allowed to advance the file
>   pointer, can return as little as 1 byte (even if you ask for 4096,
>   and even if EOF isn't reached)
>
> I also find it interesting that implementing a single primitive would be
> enough for creating custom buffered types (by deriving other methods
> from it), but the aesthetics of this can be controversial.

Thanks for the long explanation. I have some further questions:

It seems this won't make any difference for a truly unbuffered stream,
right? A truly unbuffered stream would not have a buffer where it
could save the bytes that were prefetched past the stream position, so
it wouldn't return any optional extra bytes, so there would be no
speedup. And for a buffered stream, it would be much simpler to just
read ahead in large chunks and seek back once you've found the end.
(Actually for a buffered stream I suppose that many short read() and
small seek() calls aren't actually slow since most of the time they
work within the buffer.) So it seems the API is specifically designed
to improve the situation with GzipFile since it maintains the fiction
of an unbuffered file but in fact has some internal buffer space. I
wonder if it wouldn't be better to add an extra buffer to GzipFile so
small seek() and read() calls can be made more efficient?

In fact, this makes me curious as to the use that unpickling can make
of the prefetch() call -- I suppose you had to implement some kind of
layer on top of prefetch() that behaves more like a plain unbuffered
file?

I want to push back on this more, primarily because a new primitive
I/O operation has high costs: it can never be removed, it has to be
added to every stream implementation, developers need to learn to use
the new operation, and so on. A local change that only affects
GzipFile doesn't have any of these problems.

Also, if you can believe the multi-core crowd, a very different
possible future development might be to run the gunzip algorithm and
the unpickle algorithm in parallel, on separate cores. Truly such a
solution would require totally *different* new I/O primitives, which
might have a higher chance of being reusable outside the context of
pickle.

-- 
--Guido van Rossum (python.org/~guido)


From daniel at stutzbachenterprises.com  Tue Sep 28 16:26:30 2010
From: daniel at stutzbachenterprises.com (Daniel Stutzbach)
Date: Tue, 28 Sep 2010 09:26:30 -0500
Subject: [Python-ideas] [Python-Dev] Prefetching on buffered IO files
In-Reply-To: <20100928004119.3963a4ad@pitrou.net>
References: <20100928004119.3963a4ad@pitrou.net>
Message-ID: <AANLkTi=754pvRhwejtD0uVrb9=0+uD1JQR3JSfdjYC4-@mail.gmail.com>

On Mon, Sep 27, 2010 at 5:41 PM, Antoine Pitrou <solipsis at pitrou.net> wrote:

> While trying to solve #3873 (poor performance of pickle on file
> objects, due to the overhead of calling read() with very small values),
>

After looking over the relevant code, it looks to me like the overhead of
calling the read() method compared to calling fread() in Python 2 is the
overhead of calling PyObject_Call along with the construction of argument
tuples and deconstruction of the return value.  I don't think the extra
interface would benefit code written in Python as much.  Even if  Python
code gets the data into a buffer more easily, it's going to pay those costs
to manipulate the buffered data.  It would mostly help modules written in C,
such as pickle, which right now are heavily bottlenecked getting the data
into a buffer.

Comparing the C code for Python 2's cPickle and Python 3's pickle, I see
that Python 2 has paths for unpickling from a FILE *, cStringIO, and
"other".  Python effectively only has a code path for "other", so it's not
surprising that it's slower.  In the worst case, I am sure that if we
re-added specialized code paths that we could make it just as fast as Python
2, although that would make the code messy.

Some ideas:
- Use readinto() instead of read(), to avoid extra
allocations/deallocations (see the sketch after this list)
- But first, fix bufferediobase_readinto() so it doesn't work by calling the
read() method and/or follow up on the TODO in buffered_readinto()
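
The readinto() pattern from the first idea looks like this (a sketch;
the file name and consume() are placeholders):

    buf = bytearray(4096)
    with open("data.bin", "rb") as f:
        while True:
            n = f.readinto(buf)     # refill the same buffer each time
            if not n:
                break
            consume(buf[:n])        # hypothetical consumer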

If you want a new API, I think a new C API for I/O objects with C-friendly
arguments would be better than a new Python-level API.

In a nutshell, if you feel the need to make a buffer around BufferedReader,
then I agree there's a problem, but I don't think helping you make a buffer
around BufferedReader is the right solution. ;-)

-- 
Daniel Stutzbach, Ph.D.
President, Stutzbach Enterprises, LLC <http://stutzbachenterprises.com/>

From solipsis at pitrou.net  Tue Sep 28 16:32:49 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Tue, 28 Sep 2010 16:32:49 +0200
Subject: [Python-ideas] Prefetching on buffered IO files
In-Reply-To: <AANLkTi=fggmgyr3yXaFrD5f0cbFprejnGZA1vonLA-Z7@mail.gmail.com>
References: <20100928004119.3963a4ad@pitrou.net>
	<AANLkTin6UQ73yH3DFrP8s_Wswwq0qdODH=i+en8_qZyW@mail.gmail.com>
	<20100928145704.2fb2e382@pitrou.net>
	<AANLkTi=fggmgyr3yXaFrD5f0cbFprejnGZA1vonLA-Z7@mail.gmail.com>
Message-ID: <1285684369.3141.22.camel@localhost.localdomain>

On Tuesday 28 September 2010 at 07:08 -0700, Guido van Rossum wrote:
> 
> Thanks for the long explanation. I have some further questions:
> 
> It seems this won't make any difference for a truly unbuffered stream,
> right? A truly unbuffered stream would not have a buffer where it
> could save the bytes that were prefetched past the stream position, so
> it wouldn't return any optional extra bytes, so there would be no
> speedup.

Indeed. But you can trivially wrap an unbuffered stream inside a
BufferedReader, and get peek() even when the raw stream is unseekable.

> And for a buffered stream, it would be much simpler to just
> read ahead in large chunks and seek back once you've found the end.

Well, no: that only works if your stream is seekable and seek() is fast
enough.  It wouldn't work on SocketIO, for example (even wrapped inside a
BufferedReader, since BufferedReader will refuse to seek() if seekable()
returns False).

> I
> wonder if it wouldn't be better to add an extra buffer to GzipFile so
> small seek() and read() calls can be made more efficient?

The problem is that, since the buffer of the unpickler and the buffer of
the GzipFile are not aware of each other, the unpickler could easily ask
to seek() backwards past the current GzipFile buffer, and fall back on
the slow algorithm.

The "extra buffer" can trivially consist in wrapping the GzipFile inside
a BufferedReader (which is actually recommended if you want e.g. very
fast readlines()), but it doesn't solve the above issue.
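
Concretely, that wrapping is just (my sketch; the filename is a
placeholder):

    import gzip, io

    gz = gzip.GzipFile("data.txt.gz", "rb")
    f = io.BufferedReader(gz)    # buffers on top of GzipFile's own buffer
    for line in f:               # readline()s are now served from the
        pass                     # BufferedReader buffer, not from gzip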

> In fact, this makes me curious as to the use that unpickling can make
> of the prefetch() call -- I suppose you had to implement some kind of
> layer on top of prefetch() that behaves more like a plain unbuffered
> file?

I didn't implement prefetch() at all; it would be premature :)
But, if the stream had prefetch(), the unpickling would be simplified: I
would only have to call prefetch() once when refilling the buffer,
rather than two read()'s followed by a peek().

(I could try to coalesce the two reads, but it would complicate the code
a bit more...)
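
For the curious, the read()+peek() refill is shaped roughly like this (a
simplified sketch, not the actual _pickle.c logic; the names are mine):

    def refill(f, required, recommended):
        data = f.read(required)        # blocking: these bytes are mandatory
        if len(data) < required:
            raise EOFError("pickle data was truncated")
        extra = f.peek(recommended)    # opportunistic look-ahead; does NOT
                                       # consume the bytes it returns
        # The bookkeeping burden: `extra` is still in the stream, so the
        # caller must remember how much of it was used and skip those
        # bytes on the next refill.  A single prefetch(required,
        # recommended) call with consuming semantics would remove exactly
        # this bookkeeping.
        return data, extra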

> I want to push back on this more, primarily because a new primitive
> I/O operation has high costs: it can never be removed, it has to be
> added to every stream implementation, developers need to learn to use
> the new operation, and so on.

I agree with this (except that most developers don't really need to
learn to use it: common uses of readable files are content with read()
and readline(), and need neither peek() nor prefetch()). I don't intend
to push this for 3.2; I'm throwing the idea around with a hypothetical
3.3 landing if it seems useful.

> Also, if you can believe the multi-core crowd, a very different
> possible future development might be to run the gunzip algorithm and
> the unpickle algorithm in parallel, on separate cores. Truly such a
> solution would require totally *different* new I/O primitives, which
> might have a higher chance of being reusable outside the context of
> pickle.

Well, it's a bit of a pie-in-the-sky perspective :)
Furthermore, such a solution wouldn't improve CPU efficiency, so if your
workload is already able to utilize all CPU cores (which it can easily
do if you are in a VM, or have multiple busy daemons), it doesn't buy
you anything.

Regards

Antoine.




From solipsis at pitrou.net  Tue Sep 28 17:06:44 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Tue, 28 Sep 2010 17:06:44 +0200
Subject: [Python-ideas] Prefetching on buffered IO files
In-Reply-To: <AANLkTi=754pvRhwejtD0uVrb9=0+uD1JQR3JSfdjYC4-@mail.gmail.com>
References: <20100928004119.3963a4ad@pitrou.net>
	<AANLkTi=754pvRhwejtD0uVrb9=0+uD1JQR3JSfdjYC4-@mail.gmail.com>
Message-ID: <1285686404.3141.56.camel@localhost.localdomain>


> I don't think the extra interface would benefit code written in Python
> as much.  Even if Python code gets the data into a buffer more
> easily, it's going to pay those costs to manipulate the buffered data.
> It would mostly help modules written in C, such as pickle, which right
> now are heavily bottlenecked getting the data into a buffer.

Right. It would, however, benefit /file objects/ written in Python
(since the cost of calling a peek() written in pure Python is certainly
significant compared to the cost of the actual peeking operation).

> - But first, fix bufferediobase_readinto() so it doesn't work by
> calling the read() method and/or follow up on the TODO in
> buffered_readinto()

Patches welcome :)

> Comparing the C code for Python 2's cPickle and Python 3's pickle, I
> see that Python 2 has paths for unpickling from a FILE *, cStringIO,
> and "other".  Python 3 effectively has only the code path for "other",
> so it's not surprising that it's slower.  In the worst case, I am sure
> that if we re-added specialized code paths we could make it just as
> fast as Python 2, although that would make the code messy.

It would be very ugly, IMO. And it would still be slower than the clean
solution, which is to have a buffer size big enough that the overhead of
making a read() method call is dwarfed by the processing cost of the
data (that's how TextIOWrapper works).
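
The pattern is simply this (my sketch; consume() stands in for the real
per-chunk processing):

    CHUNK = 64 * 1024

    def process(f, consume=len):
        while True:
            block = f.read(CHUNK)     # one method call per 64 KiB...
            if not block:
                break
            consume(block)            # ...amortized over the whole block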

(for the record, with the read()+peek() patch, unpickle is already
faster than Python 2, but that's comparing apples to oranges because
Python 3 got other unpickle optimizations in the meantime)

> If you want a new API, I think a new C API for I/O objects with
> C-friendly arguments would be better than a new Python-level API.

I really think we should keep a unified API. A low-level C API would be
difficult to get right, would make implementations more complicated, and
consumers would have to keep fallback code for objects not implementing
the C API, which would complicate things on their side too.

Conversely, one purpose of my prefetch() proposal, besides optimizing
some workloads, is to *simplify* writing of buffered IO code.

> In a nutshell, if you feel the need to make a buffer around
> BufferedReader, then I agree there's a problem, but I don't think
> helping you make a buffer around BufferedReader is the right
> solution. ;-)

In a layered approach, it's hard not to end up with multiple levels of
buffering (think TextIOWrapper + BufferedReader + OS page-level
caching) :) I agree that shared buffers sound more efficient but, again,
I fear they would be a lot of work to get right. If you look at the
BufferedReader code, it's already non-trivial, and bugs in this area can
be really painful.

Regards

Antoine.




From daniel at stutzbachenterprises.com  Tue Sep 28 17:19:51 2010
From: daniel at stutzbachenterprises.com (Daniel Stutzbach)
Date: Tue, 28 Sep 2010 10:19:51 -0500
Subject: [Python-ideas] Prefetching on buffered IO files
In-Reply-To: <1285686404.3141.56.camel@localhost.localdomain>
References: <20100928004119.3963a4ad@pitrou.net>
	<AANLkTi=754pvRhwejtD0uVrb9=0+uD1JQR3JSfdjYC4-@mail.gmail.com>
	<1285686404.3141.56.camel@localhost.localdomain>
Message-ID: <AANLkTikfCMxm-DkhEQitUDwxkh-N7HWg=XpR8QozpDNn@mail.gmail.com>

On Tue, Sep 28, 2010 at 10:06 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:

> > - But first, fix bufferediobase_readinto() so it doesn't work by
> > calling the read() method and/or follow up on the TODO in
> > buffered_readinto()
>
> Patches welcome :)


I'm not likely to get to it soon, but I've opened Issue 9971 to at least
keep track of it.

-- 
Daniel Stutzbach, Ph.D.
President, Stutzbach Enterprises, LLC <http://stutzbachenterprises.com/>

From guido at python.org  Tue Sep 28 18:44:38 2010
From: guido at python.org (Guido van Rossum)
Date: Tue, 28 Sep 2010 09:44:38 -0700
Subject: [Python-ideas] Prefetching on buffered IO files
In-Reply-To: <1285684369.3141.22.camel@localhost.localdomain>
References: <20100928004119.3963a4ad@pitrou.net>
	<AANLkTin6UQ73yH3DFrP8s_Wswwq0qdODH=i+en8_qZyW@mail.gmail.com>
	<20100928145704.2fb2e382@pitrou.net>
	<AANLkTi=fggmgyr3yXaFrD5f0cbFprejnGZA1vonLA-Z7@mail.gmail.com>
	<1285684369.3141.22.camel@localhost.localdomain>
Message-ID: <AANLkTikB+=zPUdYkPJeCq917-muL4iitfDwf+W_KeTU6@mail.gmail.com>

On Tue, Sep 28, 2010 at 7:32 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:
[Guido]
>> wonder if it wouldn't be better to add an extra buffer to GzipFile so
>> small seek() and read() calls can be made more efficient?
>
> The problem is that, since the buffer of the unpickler and the buffer of
> the GzipFile are not aware of each other, the unpickler could easily ask
> to seek() backwards past the current GzipFile buffer, and fall back on
> the slow algorithm.

But AFAICT unpickle doesn't use seek()?

[...]
> But, if the stream had prefetch(), the unpickling would be simplified: I
> would only have to call prefetch() once when refilling the buffer,
> rather than two read()'s followed by a peek().
>
> (I could try to coalesce the two reads, but it would complicate the code
> a bit more...)

Where exactly would the peek be used? (I must be confused because I
can't find either peek or seek in _pickle.c.)

It still seems to me that the "right" way to solve this would be to
insert a transparent extra buffer somewhere, probably in the GzipFile
code, and work on reducing the call overhead.

>> I want to push back on this more, primarily because a new primitive
>> I/O operation has high costs: it can never be removed, it has to be
>> added to every stream implementation, developers need to learn to use
>> the new operation, and so on.
>
> I agree with this (except that most developers don't really need to
> learn to use it: common uses of readable files are content with read()
> and readline(), and need neither peek() nor prefetch()). I don't intend
> to push this for 3.2; I'm throwing the idea around with a hypothetical
> 3.3 landing if it seems useful.

So far it seems more awkward than useful.

>> Also, if you can believe the multi-core crowd, a very different
>> possible future development might be to run the gunzip algorithm and
>> the unpickle algorithm in parallel, on separate cores. Truly such a
>> solution would require totally *different* new I/O primitives, which
>> might have a higher chance of being reusable outside the context of
>> pickle.
>
> Well, it's a bit of a pie-in-the-sky perspective :)
> Furthermore, such a solution wouldn't improve CPU efficiency, so if your
> workload is already able to utilize all CPU cores (which it can easily
> do if you are in a VM, or have multiple busy daemons), it doesn't buy
> you anything.

Agreed it's pie in the sky... Though the interface between the two
CPUs might actually be designed to be faster than the current buffered
I/O. I have (mostly :-) fond memories of async I/O on a mainframe I
used in the '70s which worked this way.

-- 
--Guido van Rossum (python.org/~guido)


From solipsis at pitrou.net  Tue Sep 28 22:33:39 2010
From: solipsis at pitrou.net (Antoine Pitrou)
Date: Tue, 28 Sep 2010 22:33:39 +0200
Subject: [Python-ideas] Prefetching on buffered IO files
References: <20100928004119.3963a4ad@pitrou.net>
	<AANLkTin6UQ73yH3DFrP8s_Wswwq0qdODH=i+en8_qZyW@mail.gmail.com>
	<20100928145704.2fb2e382@pitrou.net>
	<AANLkTi=fggmgyr3yXaFrD5f0cbFprejnGZA1vonLA-Z7@mail.gmail.com>
	<1285684369.3141.22.camel@localhost.localdomain>
	<AANLkTikB+=zPUdYkPJeCq917-muL4iitfDwf+W_KeTU6@mail.gmail.com>
Message-ID: <20100928223339.3f621915@pitrou.net>

On Tue, 28 Sep 2010 09:44:38 -0700
Guido van Rossum <guido at python.org> wrote:
> 
> But AFAICT unpickle doesn't use seek()?
> 
> [...]
> > But, if the stream had prefetch(), the unpickling would be simplified: I
> > would only have to call prefetch() once when refilling the buffer,
> > rather than two read()'s followed by a peek().
> >
> > (I could try to coalesce the two reads, but it would complicate the code
> > a bit more...)
> 
> Where exactly would the peek be used? (I must be confused because I
> can't find either peek or seek in _pickle.c.)

peek/seek are not used currently (in SVN). Each of them is used in
one of the two prefetching approaches proposed to solve the unpickling
performance problem.

(the first approach uses seek() and read(), the second approach uses
read() and peek(); as already explained, I tend to consider the second
approach much better, and the prefetch() proposal comes in part from the
experience gathered on that approach)

> It still seems to me that the "right" way to solve this would be to
> insert a transparent extra buffer somewhere, probably in the GzipFile
> code, and work on reducing the call overhead.

No, because if you don't have any buffering on the unpickling side
(rather than on the GzipFile or BufferedReader side), then you still
have the method call overhead no matter what. And this overhead is
rather big when you're reading data byte by byte, or word by word
(which unpickling very frequently does).

(for the record, GzipFile already has an internal buffer. But calling
GzipFile.read() still has a large overhead compared to reading
data directly from a prefetch buffer inside the unpickler object)
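
A quick way to feel that overhead (my micro-benchmark sketch; absolute
numbers will vary):

    import io, timeit

    data = b"x" * (10 ** 6)

    def one_byte_at_a_time():
        f = io.BytesIO(data)
        while f.read(1):              # one method call per byte
            pass

    def from_local_buffer():
        buf = io.BytesIO(data).read() # one bulk read into a local buffer
        i, n = 0, len(buf)
        while i < n:                  # pure index arithmetic, no calls
            i += 1

    print(timeit.timeit(one_byte_at_a_time, number=3))
    print(timeit.timeit(from_local_buffer, number=3))

Even with BytesIO, the cheapest possible "file", the per-call version
loses noticeably.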

Regards

Antoine.