From ned at nedbatchelder.com Sun May 1 00:49:11 2011 From: ned at nedbatchelder.com (Ned Batchelder) Date: Sat, 30 Apr 2011 18:49:11 -0400 Subject: [Python-Dev] sys.settrace: behavior doesn't match docs Message-ID: <4DBC91E7.9060402@nedbatchelder.com> This week I learned something new about trace functions (how to write a C trace function that survives a sys.settrace(sys.gettrace()) round-trip), and while writing up what I learned, I was surprised to discover that trace functions don't behave the way I thought, or the way the docs say they behave. The docs say: The trace function is invoked (with /event/ set to 'call') whenever a new local scope is entered; it should return a reference to a local trace function to be used that scope, or None if the scope shouldn't be traced. The local trace function should return a reference to itself (or to another function for further tracing in that scope), or None to turn off tracing in that scope. It's that last part that's wrong: returning None from the trace function only has an effect on the first call in a new frame. Once the trace function returns a function for a frame, returning None from subsequent calls is ignored. A "local trace function" can't turn off tracing in its scope. To demonstrate: import sys UPTO_LINE = 1 def t(frame, event, arg): num = frame.f_lineno print("line %d" % num) if num < UPTO_LINE: return t def try_it(): print("twelve") print("thirteen") print("fourteen") print("fifteen") UPTO_LINE = 1 sys.settrace(t) try_it() UPTO_LINE = 13 sys.settrace(t) try_it() Produces: line 11 twelve thirteen fourteen fifteen line 11 line 12 twelve line 13 thirteen line 14 fourteen line 15 fifteen line 15 The first call to try_it() returns None immediately, preventing tracing for the rest of the function. The second call returns None at line 13, but the rest of the function is traced anyway. This behavior is the same in all versions from 2.3 to 3.2, in fact, the 100 lines of code in sysmodule.c responsible for Python tracing functions are completely unchanged through those versions. (A deeper mystery that I haven't looked into yet is why Python 3.x intersperses all of these lines with "line 18" interjections.) I'm writing this email because I'm not sure whether this is a behavior bug or a doc bug. One of them is wrong, since they disagree. The documented behavior makes sense, and is what people have all along thought the trace function did. The actual behavior is a bit more complicated to explain, but is what people have actually been experiencing. FWIW, PyPy implements the documented behavior. Should we fix the code or the docs? I'd be glad to supply a patch for either. --Ned. -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Sun May 1 02:43:27 2011 From: guido at python.org (Guido van Rossum) Date: Sat, 30 Apr 2011 17:43:27 -0700 Subject: [Python-Dev] sys.settrace: behavior doesn't match docs In-Reply-To: <4DBC91E7.9060402@nedbatchelder.com> References: <4DBC91E7.9060402@nedbatchelder.com> Message-ID: I think you need to go back farther in time. :-) In Python 2.0 the call_trace function in ceval.c has a completely different signature (but the docs are the same). I haven't checked all history but somewhere between 2.0 and 2.3, SET_LINENO-less tracing was added, and that's where the implementation must have gone wrong. So I think we should fix the code. --Guido On Sat, Apr 30, 2011 at 3:49 PM, Ned Batchelder wrote: > This week I learned something new about trace functions (how to write a C > trace function that survives a sys.settrace(sys.gettrace()) round-trip), and > while writing up what I learned, I was surprised to discover that trace > functions don't behave the way I thought, or the way the docs say they > behave. > > The docs say: > > The trace function is invoked (with event set to 'call') whenever a new > local scope is entered; it should return a reference to a local trace > function to be used that scope, or None if the scope shouldn?t be traced. > > The local trace function should return a reference to itself (or to another > function for further tracing in that scope), or None to turn off tracing in > that scope. > > It's that last part that's wrong: returning None from the trace function > only has an effect on the first call in a new frame.? Once the trace > function returns a function for a frame, returning None from subsequent > calls is ignored.? A "local trace function" can't turn off tracing in its > scope. > > To demonstrate: > > import sys > > UPTO_LINE = 1 > > def t(frame, event, arg): > ??? num = frame.f_lineno > ??? print("line %d" % num) > ??? if num < UPTO_LINE: > ??????? return t > > def try_it(): > ??? print("twelve") > ??? print("thirteen") > ??? print("fourteen") > ??? print("fifteen") > > UPTO_LINE = 1 > sys.settrace(t) > try_it() > > UPTO_LINE = 13 > sys.settrace(t) > try_it() > > Produces: > > line 11 > twelve > thirteen > fourteen > fifteen > line 11 > line 12 > twelve > line 13 > thirteen > line 14 > fourteen > line 15 > fifteen > line 15 > > The first call to try_it() returns None immediately, preventing tracing for > the rest of the function.? The second call returns None at line 13, but the > rest of the function is traced anyway.? This behavior is the same in all > versions from 2.3 to 3.2, in fact, the 100 lines of code in sysmodule.c > responsible for Python tracing functions are completely unchanged through > those versions.? (A deeper mystery that I haven't looked into yet is why > Python 3.x intersperses all of these lines with "line 18" interjections.) > > I'm writing this email because I'm not sure whether this is a behavior bug > or a doc bug.? One of them is wrong, since they disagree.? The documented > behavior makes sense, and is what people have all along thought the trace > function did.? The actual behavior is a bit more complicated to explain, but > is what people have actually been experiencing.? FWIW, PyPy implements the > documented behavior. > > Should we fix the code or the docs?? I'd be glad to supply a patch for > either. > > --Ned. > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/guido%40python.org > > -- --Guido van Rossum (python.org/~guido) From techtonik at gmail.com Sun May 1 12:40:43 2011 From: techtonik at gmail.com (anatoly techtonik) Date: Sun, 1 May 2011 13:40:43 +0300 Subject: [Python-Dev] 2to3 status, repositories and HACKING guide In-Reply-To: References:

Message-ID: Is there any high-level overview of 2to3 tool that people can use as a quick start for writing their own fixers? Source doesn't explain much (to me at least), and some kind of "learn by example" would really help a lot. In particular, I find the syntax of tree matchers the most unclear part. -- anatoly t. On Fri, Mar 25, 2011 at 9:12 PM, Benjamin Peterson wrote: > The main cpython repo. > > 2011/3/25 anatoly techtonik : >> Hi, Benjamin, >> >> Is your repository for 2to3 is still actual? >> http://svn.python.org/view/sandbox/trunk/2to3/ >> >> Which should I use to start hacking on 2to3? >> >> -- >> anatoly t. >> >> >> >> On Wed, Mar 23, 2011 at 9:01 AM, anatoly techtonik wrote: >>> Hi, >>> >>> Currently 2to3 page at http://wiki.python.org/moin/2to3 lists >>> http://svn.python.org/view/sandbox/trunk/2to3 as a repository for 2to3 >>> tool. There is also an outdated repository at http://hg.python.org/ >>> and the page says that the code is finally integrated into CPython 2.6 >>> - you can see it at >>> http://hg.python.org/cpython/file/default/Lib/lib2to3. So, what >>> version is more up-to-date? >>> >>> In svn repository there is a HACKING guide advising to use >>> find_pattern.py script for writing new fixer. However, there is no >>> find_pattern.py in CPython repository, no HACKING guide, no any >>> documentation about how to write fixers or description of PATTERN >>> format. Did I miss something? >>> -- >>> anatoly t. >>> >> > > > > -- > Regards, > Benjamin > From ncoghlan at gmail.com Sun May 1 13:27:44 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 1 May 2011 21:27:44 +1000 Subject: [Python-Dev] Not-a-Number (was PyObject_RichCompareBool identity shortcut) In-Reply-To: References: <4DB7E3EA.3030208@avl.com> <87d3k79jvt.fsf@uwakimon.sk.tsukuba.ac.jp> <4DB90748.4030501@g.nevcal.com> <4DB916DE.1050302@g.nevcal.com> <4DB927F4.3040206@dcs.gla.ac.uk> <871v0la5yg.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On Sat, Apr 30, 2011 at 3:11 AM, Guido van Rossum wrote: > Decimal, for that reason, has a context that lets one specify > different behaviors when a NaN is produced. Would it make sense to add > a float context that also lets one specify what should happen? That > could include returning Inf for 1.0/0.0 (for experts), or raising > exceptions when NaNs are produced (for the numerically naive like > myself). > > I could see a downside too, e.g. the correctness of code that > passingly uses floats might be affected by the context settings. > There's also the question of whether the float context should affect > int operations; floats vs. ints is another can of worms since (in > Python 3) we attempt to tie them together through 1/2 == 0.5, but ints > have a much larger range than floats. Given that we delegate most float() behaviour to the underlying CPU and C libraries (and then the math module tries to cope with any cross-platform discrepancies), introducing context handling isn't easy, and would likely harm the current speed advantage that floats hold over the decimal module. We decided that losing the speed advantage of native integers was worthwhile in order to better unify the semantics of int and long for Py3k, but both the speed differential and the semantic gap between float() and decimal.Decimal() are significantly larger. However, I did find Terry's suggestion of using the warnings module to report some of the floating point corner cases that currently silently produce unexpected results to be an interesting one. If those operations issued a FloatWarning, then users could either silence them or turn them into errors as desired. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From benjamin at python.org Sun May 1 17:44:10 2011 From: benjamin at python.org (Benjamin Peterson) Date: Sun, 1 May 2011 10:44:10 -0500 Subject: [Python-Dev] 2to3 status, repositories and HACKING guide In-Reply-To: References:

Message-ID: 2011/5/1 anatoly techtonik : > Is there any high-level overview of 2to3 tool that people can use as a > quick start for writing their own fixers? No. > > Source doesn't explain much (to me at least), and some kind of "learn > by example" would really help a lot. In particular, I find the syntax of > tree matchers the most unclear part. I think you can learn a lot by reading through the current fixers in lib2to3/fixers/. -- Regards, Benjamin From g.brandl at gmx.net Sun May 1 18:31:20 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 01 May 2011 18:31:20 +0200 Subject: [Python-Dev] Issue Tracker In-Reply-To: References: <4D90EA06.3030003@stoneleaf.us> <20110328223112.76482a9d@pitrou.net> <20110329013756.99EB8D64A7@kimball.webabinitio.net> Message-ID: On 30.04.2011 16:53, anatoly techtonik wrote: > On Tue, Mar 29, 2011 at 4:37 AM, R. David Murray wrote: >> >> The hardest part is debugging the TAL when you make a mistake, but >> even that isn't a whole lot worse than any other templating language. > > How much in % is it worse than Django templating language? I'm just guessing here, but I'd say 47.256 %. Georg From g.brandl at gmx.net Sun May 1 19:57:51 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Sun, 01 May 2011 19:57:51 +0200 Subject: [Python-Dev] Python 3.2.1 Message-ID: Hi, I'd like to release Python 3.2.1 on May 21, with a release candidate on May 14. Please bring any issues you think need to be fixed in it to my attention by assigning "release blocker" status in the tracker. Georg From raymond.hettinger at gmail.com Sun May 1 20:22:02 2011 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Sun, 1 May 2011 11:22:02 -0700 Subject: [Python-Dev] Python 3.2.1 In-Reply-To: References: Message-ID: <5D8F6095-D052-47F6-A65B-D578A4460F20@gmail.com> On May 1, 2011, at 10:57 AM, Georg Brandl wrote: > I'd like to release Python 3.2.1 on May 21, with a release candidate > on May 14. Please bring any issues you think need to be fixed in it > to my attention by assigning "release blocker" status in the tracker. Thanks to http://www.python.org/dev/daily-dmg/ , I've been able to work off of the head every day. Python 3.2.1 is in pretty good shape :-) Raymond -------------- next part -------------- An HTML attachment was scrubbed... URL: From tjreedy at udel.edu Sun May 1 20:45:06 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Sun, 01 May 2011 14:45:06 -0400 Subject: [Python-Dev] Not-a-Number (was PyObject_RichCompareBool identity shortcut) In-Reply-To: References: <4DB7E3EA.3030208@avl.com> <87d3k79jvt.fsf@uwakimon.sk.tsukuba.ac.jp> <4DB90748.4030501@g.nevcal.com> <4DB916DE.1050302@g.nevcal.com> <4DB927F4.3040206@dcs.gla.ac.uk> <871v0la5yg.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: On 5/1/2011 7:27 AM, Nick Coghlan wrote: > However, I did find Terry's suggestion of using the warnings module to > report some of the floating point corner cases that currently silently > produce unexpected results to be an interesting one. If those > operations issued a FloatWarning, then users could either silence them > or turn them into errors as desired. I would like to take credit for that, but I was actually seconding Alexander's insight and idea. I may have added the specific name after looking at the currently list and seeing UnicodeWarning and BytesWarning, so why not a FloatWarning. I did read the warnings doc more carefully to verify that it would really put the user in control, which was apparently the intent of the committee. I am not sure whether FloatWarnings should ignored or printed by default. Ignored would, I guess, match current behavior, unless something else is changed as part of a more extensive overhaul. -f and -ff are available to turn ignored FloatWarning into print or raise exception, as with BytesWarning. I suspect that these would get at lease as much usage as -b and -bb. So I see 4 questions: 1. Add FloatWarning? 2. If yes, default disposition? 3. Add command line options? 4. Use the addition of FloatWarning as an opportunity to change other defaults, given that user will have more options? -- Terry Jan Reedy From brian.curtin at gmail.com Sun May 1 22:51:55 2011 From: brian.curtin at gmail.com (Brian Curtin) Date: Sun, 1 May 2011 15:51:55 -0500 Subject: [Python-Dev] Windows 2000 Support Message-ID: I'm currently writing a post about the process of removing OS/2 and VMS support and thought about a discussion of Windows 2000 some time back. http://mail.python.org/pipermail/python-dev/2010-March/098074.html makes a proposal for beginning to walk away from 2000, but doesn't appear to come to any conclusion. Was anything decided off the list? I don't see anything in PEP-11 and don't see any changes in the installer made around Windows 2000. If nothing was decided, should anything be done for 3.3? -------------- next part -------------- An HTML attachment was scrubbed... URL: From victor.stinner at haypocalc.com Mon May 2 12:06:47 2011 From: victor.stinner at haypocalc.com (Victor Stinner) Date: Mon, 2 May 2011 12:06:47 +0200 Subject: [Python-Dev] Raise OSError or RuntimeError in the OS module? Message-ID: <201105021206.47384.victor.stinner@haypocalc.com> Hi, I introduced recently the signal.pthread_sigmask() function (issue #8407). pthread_sigmask() (the C function) returns an error code using errno codes. I choosed to raise a RuntimeError using this error code, but I am not sure that RuntimeError is the best choice. It is more an OS error than a runtime error: should signal.pthread_sigmask() raise an OSError instead? signal.signal() raises a RuntimeError if setting the signal handler failed. signal.siginterrupt() raises also a RuntimeError on error. signal.setitimer() and signal.getitimer() have their own exception class: signal.ItimerError, raised on setimer() and getitimer() error. Victor From ned at nedbatchelder.com Mon May 2 13:27:40 2011 From: ned at nedbatchelder.com (Ned Batchelder) Date: Mon, 02 May 2011 07:27:40 -0400 Subject: [Python-Dev] sys.settrace: behavior doesn't match docs In-Reply-To: References: <4DBC91E7.9060402@nedbatchelder.com> Message-ID: <4DBE952C.2070005@nedbatchelder.com> Indeed, the 2.0 code is very different, and got this case right. I'm a little surprised no one is arguing that changing this code now could break some applications. Maybe the fact no one noticed the docs were wrong proves that no one ever tried returning None from a local trace function. --Ned. On 4/30/2011 8:43 PM, Guido van Rossum wrote: > I think you need to go back farther in time. :-) In Python 2.0 the > call_trace function in ceval.c has a completely different signature > (but the docs are the same). I haven't checked all history but > somewhere between 2.0 and 2.3, SET_LINENO-less tracing was added, and > that's where the implementation must have gone wrong. So I think we > should fix the code. > > --Guido > > On Sat, Apr 30, 2011 at 3:49 PM, Ned Batchelder wrote: >> This week I learned something new about trace functions (how to write a C >> trace function that survives a sys.settrace(sys.gettrace()) round-trip), and >> while writing up what I learned, I was surprised to discover that trace >> functions don't behave the way I thought, or the way the docs say they >> behave. >> >> The docs say: >> >> The trace function is invoked (with event set to 'call') whenever a new >> local scope is entered; it should return a reference to a local trace >> function to be used that scope, or None if the scope shouldn?t be traced. >> >> The local trace function should return a reference to itself (or to another >> function for further tracing in that scope), or None to turn off tracing in >> that scope. >> >> It's that last part that's wrong: returning None from the trace function >> only has an effect on the first call in a new frame. Once the trace >> function returns a function for a frame, returning None from subsequent >> calls is ignored. A "local trace function" can't turn off tracing in its >> scope. >> >> To demonstrate: >> >> import sys >> >> UPTO_LINE = 1 >> >> def t(frame, event, arg): >> num = frame.f_lineno >> print("line %d" % num) >> if num< UPTO_LINE: >> return t >> >> def try_it(): >> print("twelve") >> print("thirteen") >> print("fourteen") >> print("fifteen") >> >> UPTO_LINE = 1 >> sys.settrace(t) >> try_it() >> >> UPTO_LINE = 13 >> sys.settrace(t) >> try_it() >> >> Produces: >> >> line 11 >> twelve >> thirteen >> fourteen >> fifteen >> line 11 >> line 12 >> twelve >> line 13 >> thirteen >> line 14 >> fourteen >> line 15 >> fifteen >> line 15 >> >> The first call to try_it() returns None immediately, preventing tracing for >> the rest of the function. The second call returns None at line 13, but the >> rest of the function is traced anyway. This behavior is the same in all >> versions from 2.3 to 3.2, in fact, the 100 lines of code in sysmodule.c >> responsible for Python tracing functions are completely unchanged through >> those versions. (A deeper mystery that I haven't looked into yet is why >> Python 3.x intersperses all of these lines with "line 18" interjections.) >> >> I'm writing this email because I'm not sure whether this is a behavior bug >> or a doc bug. One of them is wrong, since they disagree. The documented >> behavior makes sense, and is what people have all along thought the trace >> function did. The actual behavior is a bit more complicated to explain, but >> is what people have actually been experiencing. FWIW, PyPy implements the >> documented behavior. >> >> Should we fix the code or the docs? I'd be glad to supply a patch for >> either. >> >> --Ned. >> >> >> _______________________________________________ >> Python-Dev mailing list >> Python-Dev at python.org >> http://mail.python.org/mailman/listinfo/python-dev >> Unsubscribe: >> http://mail.python.org/mailman/options/python-dev/guido%40python.org >> >> > > From mhammond at skippinet.com.au Mon May 2 14:47:11 2011 From: mhammond at skippinet.com.au (Mark Hammond) Date: Mon, 02 May 2011 22:47:11 +1000 Subject: [Python-Dev] sys.settrace: behavior doesn't match docs In-Reply-To: <4DBE952C.2070005@nedbatchelder.com> References: <4DBC91E7.9060402@nedbatchelder.com> <4DBE952C.2070005@nedbatchelder.com> Message-ID: <4DBEA7CF.4030307@skippinet.com.au> On 2/05/2011 9:27 PM, Ned Batchelder wrote: ... > Maybe the fact no one noticed the docs > were wrong proves that no one ever tried returning None from a local > trace function. Or if they did, they should have complained by now. IMO, if the behaviour regresses from how it is documented and how it previously worked and no reports of the regression exist, we should just fix it without regard to people relying on the "new" functionality... Mark From ncoghlan at gmail.com Mon May 2 15:12:32 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Mon, 2 May 2011 23:12:32 +1000 Subject: [Python-Dev] sys.settrace: behavior doesn't match docs In-Reply-To: <4DBEA7CF.4030307@skippinet.com.au> References: <4DBC91E7.9060402@nedbatchelder.com> <4DBE952C.2070005@nedbatchelder.com> <4DBEA7CF.4030307@skippinet.com.au> Message-ID: On Mon, May 2, 2011 at 10:47 PM, Mark Hammond wrote: > On 2/05/2011 9:27 PM, Ned Batchelder wrote: > ... >> >> Maybe the fact no one noticed the docs >> were wrong proves that no one ever tried returning None from a local >> trace function. > > Or if they did, they should have complained by now. ?IMO, if the behaviour > regresses from how it is documented and how it previously worked and no > reports of the regression exist, we should just fix it without regard to > people relying on the "new" functionality... +1 Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From vinay_sajip at yahoo.co.uk Mon May 2 16:26:56 2011 From: vinay_sajip at yahoo.co.uk (Vinay Sajip) Date: Mon, 2 May 2011 14:26:56 +0000 (UTC) Subject: [Python-Dev] Socket servers in the test suite References:

Message-ID: Nick Coghlan gmail.com> writes: > sure the urllib tests already fire up a local server). Starting down > the path of standardisation of that test functionality would be good. I've made a start with test_logging.py by implementing some potential server classes for use in tests: in the latest test_logging.py, the servers are between comments containing the text "server_helper". The basic approach for implementing socket servers is traditionally to use a request handler class which implements the custom logic, but for some testing applications this is overkill - you just want to be able to pass a handling callable which is, say, a test case method. So the signatures of the servers are all like this: __init__(self, listen_addr, handler, poll_interval ...) Initialise using the specified listen address and handler callable. Internally, a RequestHandler subclass will be used whose handle() delegates to the handler callable passed in. A zero port number can be passed in, and a port attribute will (after binding) have the actual port number used, so that clients can connect on that port. start() Start the server on a separate thread, using the poll_interval specified in the underlying poll()/select() call. Before this is called, the request handler class could be replaced with a subclass if need be. stop(timeout=None) Ask the server to stop and wait for the server thread to terminate. The server also has a ready attribute which is a threading.Event, set just when the server is entering its service loop. Typical mode of use would be: class ClientTestCase(unittest.TestCase): def setUp(self): self.server = TheAppropriateServerClass(('localhost', 0), self.handle_request, 0.01, ...) self.server.start() self.server.ready.wait() self.handled = threading.Event() def tearDown(self): self.server.stop(1.0) # wait up to 1 sec for thread to stop def handle_request(self, request): # Handle the request, e.g. by setting some attributes based on what # was received at the server # Set the flag to say we finished handling self.handled.set() def test_xxx(self): # set up client and send stuff to server # Wait for server to finish doing stuff self.handled.wait() # make assertions based on the attributes # set during request handling The server classes provided are TestSMTPServer, TestTCPServer, TestUDPServer and TestHTTPServer. There are examples of actual usage in test_logging.py: SMTPHandlerTest, SocketHandlerTest, DatagramHandlerTest, SysLogHandlerTest, HTTPHandlerTest. I'd like some comments on this suggested API. I have not yet looked at how to adapt other stdlib code than test_logging to use these classes, but the above usage mode seems convenient and sufficient for testing applications. No doubt people will be able to suggest problems with/improvements to the approach outlined above. Regards, Vinay Sajip From techtonik at gmail.com Mon May 2 18:06:58 2011 From: techtonik at gmail.com (anatoly techtonik) Date: Mon, 2 May 2011 19:06:58 +0300 Subject: [Python-Dev] Issue Tracker In-Reply-To: References: <4D90EA06.3030003@stoneleaf.us> <20110328223112.76482a9d@pitrou.net> <20110329013756.99EB8D64A7@kimball.webabinitio.net> Message-ID: On Sun, May 1, 2011 at 7:31 PM, Georg Brandl wrote: > On 30.04.2011 16:53, anatoly techtonik wrote: >> On Tue, Mar 29, 2011 at 4:37 AM, R. David Murray wrote: >>> >>> The hardest part is debugging the TAL when you make a mistake, but >>> even that isn't a whole lot worse than any other templating language. >> >> How much in % is it worse than Django templating language? > > I'm just guessing here, but I'd say 47.256 %. That means switching to Django templates will make Roundup design plumbing work 47.256% more attractive for potential contributors. -- anatoly t. From benjamin at python.org Mon May 2 18:17:59 2011 From: benjamin at python.org (Benjamin Peterson) Date: Mon, 2 May 2011 11:17:59 -0500 Subject: [Python-Dev] Issue Tracker In-Reply-To: References: <4D90EA06.3030003@stoneleaf.us> <20110328223112.76482a9d@pitrou.net> <20110329013756.99EB8D64A7@kimball.webabinitio.net> Message-ID: 2011/5/2 anatoly techtonik : > On Sun, May 1, 2011 at 7:31 PM, Georg Brandl wrote: >> On 30.04.2011 16:53, anatoly techtonik wrote: >>> On Tue, Mar 29, 2011 at 4:37 AM, R. David Murray wrote: >>>> >>>> The hardest part is debugging the TAL when you make a mistake, but >>>> even that isn't a whole lot worse than any other templating language. >>> >>> How much in % is it worse than Django templating language? >> >> I'm just guessing here, but I'd say 47.256 %. > > That means switching to Django templates will make Roundup design > plumbing work 47.256% more attractive for potential contributors. Perhaps some of those eager contributors would like to volunteer for the task. -- Regards, Benjamin From brian.curtin at gmail.com Mon May 2 18:19:28 2011 From: brian.curtin at gmail.com (Brian Curtin) Date: Mon, 2 May 2011 11:19:28 -0500 Subject: [Python-Dev] Issue Tracker In-Reply-To: References: <4D90EA06.3030003@stoneleaf.us> <20110328223112.76482a9d@pitrou.net> <20110329013756.99EB8D64A7@kimball.webabinitio.net> Message-ID: On Mon, May 2, 2011 at 11:06, anatoly techtonik wrote: > On Sun, May 1, 2011 at 7:31 PM, Georg Brandl wrote: > > On 30.04.2011 16:53, anatoly techtonik wrote: > >> On Tue, Mar 29, 2011 at 4:37 AM, R. David Murray > wrote: > >>> > >>> The hardest part is debugging the TAL when you make a mistake, but > >>> even that isn't a whole lot worse than any other templating language. > >> > >> How much in % is it worse than Django templating language? > > > > I'm just guessing here, but I'd say 47.256 %. > > That means switching to Django templates will make Roundup design > plumbing work 47.256% more attractive for potential contributors. What if these "potential contributors" never surface? Then we've made a 47.256% change in attractiveness, which is a 1423.843% waste of time. -------------- next part -------------- An HTML attachment was scrubbed... URL: From techtonik at gmail.com Mon May 2 19:14:50 2011 From: techtonik at gmail.com (anatoly techtonik) Date: Mon, 2 May 2011 20:14:50 +0300 Subject: [Python-Dev] PEP 386 and dev repository versions workflow Message-ID: http://guide.python-distribute.org/quickstart.html proposes suffixing version of a module in repository with 'dev' in a way that after release of '1.0' version, the repository version is changed to '2.0dev'. This makes sense, but it is not compatible with PEP 386, which suggests using 2.0.devN, where N is a repository revision number. I'd expand PEP 386 to include 2.0dev use case. -- anatoly t. From ziade.tarek at gmail.com Mon May 2 19:19:28 2011 From: ziade.tarek at gmail.com (=?ISO-8859-1?Q?Tarek_Ziad=E9?=) Date: Mon, 2 May 2011 19:19:28 +0200 Subject: [Python-Dev] PEP 386 and dev repository versions workflow In-Reply-To: References: Message-ID: On Mon, May 2, 2011 at 7:14 PM, anatoly techtonik wrote: > http://guide.python-distribute.org/quickstart.html proposes suffixing > version of a module in repository with 'dev' in a way that after > release of '1.0' version, the repository version is changed to > '2.0dev'. This makes sense, but it is not compatible with PEP 386, > which suggests using 2.0.devN, where N is a repository revision > number. I'd expand PEP 386 to include 2.0dev use case. This is a typo I'll fix, thanks for noticing > -- > anatoly t. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/ziade.tarek%40gmail.com > -- Tarek Ziad? | http://ziade.org From g.rodola at gmail.com Mon May 2 20:27:57 2011 From: g.rodola at gmail.com (=?ISO-8859-1?Q?Giampaolo_Rodol=E0?=) Date: Mon, 2 May 2011 20:27:57 +0200 Subject: [Python-Dev] Issue Tracker In-Reply-To: References: <4D90EA06.3030003@stoneleaf.us> <20110328223112.76482a9d@pitrou.net> <20110329013756.99EB8D64A7@kimball.webabinitio.net> Message-ID: 2011/4/30 anatoly techtonik : > On Tue, Mar 29, 2011 at 4:37 AM, R. David Murray wrote: >> >> The hardest part is debugging the TAL when you make a mistake, but >> even that isn't a whole lot worse than any other templating language. > > How much in % is it worse than Django templating language? > -- > anatoly t. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/g.rodola%40gmail.com > Knowing both of them I can say ZPT is one of the few things I like about Zope and I find it a lot more powerful than Django templating system. Other than that, I don't see how changing the templating language can make any difference. If one does not contribute something because of the language used in templates... well, I think it wouldn't have been a particular good contribution anyway. =) --- Giampaolo http://code.google.com/p/pyftpdlib/ http://code.google.com/p/psutil/ From g.brandl at gmx.net Mon May 2 20:41:12 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Mon, 02 May 2011 20:41:12 +0200 Subject: [Python-Dev] Issue Tracker In-Reply-To: References: <4D90EA06.3030003@stoneleaf.us> <20110328223112.76482a9d@pitrou.net> <20110329013756.99EB8D64A7@kimball.webabinitio.net> Message-ID: On 02.05.2011 18:06, anatoly techtonik wrote: > On Sun, May 1, 2011 at 7:31 PM, Georg Brandl wrote: >> On 30.04.2011 16:53, anatoly techtonik wrote: >>> On Tue, Mar 29, 2011 at 4:37 AM, R. David Murray wrote: >>>> >>>> The hardest part is debugging the TAL when you make a mistake, but >>>> even that isn't a whole lot worse than any other templating language. >>> >>> How much in % is it worse than Django templating language? >> >> I'm just guessing here, but I'd say 47.256 %. > > That means switching to Django templates will make Roundup design > plumbing work 47.256% more attractive for potential contributors. That's not true actually. It'll be 89.595 % more attractive. Georg From sijinjoseph at gmail.com Mon May 2 17:27:49 2011 From: sijinjoseph at gmail.com (Sijin Joseph) Date: Mon, 2 May 2011 11:27:49 -0400 Subject: [Python-Dev] Convert Py_Buffer to Py_UNICODE Message-ID: Hi - I am working on a patch where I have an argument that can either be a unicode string or binary data, I parse the argument using the PyArg_ParseTuple method using the s* format specification and get a Py_Buffer. I now need to convert this Py_Buffer object to a Py_Unicode and pass it into a function. What is the best way to do this? If I determine that the passed argument was binary using another flag parameter then I am passing Py_Buffer->buf as a pointer to the start of the data. This is in winsound module, here's the relevant code snippet sound_playsound(PyObject *s, PyObject *args) { Py_buffer *buffer; int flags; int ok; LPCWSTR pszSound; if (PyArg_ParseTuple(args, "s*i:PlaySound", &buffer, &flags)) { if (flags & SND_ASYNC && flags & SND_MEMORY) { /* Sidestep reference counting headache; unfortunately this also prevent SND_LOOP from memory. */ PyBuffer_Release(buffer); PyErr_SetString(PyExc_RuntimeError, "Cannot play asynchronously from memory"); return NULL; } if(flags & SND_MEMORY) { pszSound = buffer->buf; } else { /* pszSound = ????; */ } -- Sijin -------------- next part -------------- An HTML attachment was scrubbed... URL: From mal at egenix.com Mon May 2 21:12:27 2011 From: mal at egenix.com (M.-A. Lemburg) Date: Mon, 02 May 2011 21:12:27 +0200 Subject: [Python-Dev] Convert Py_Buffer to Py_UNICODE In-Reply-To: References: Message-ID: <4DBF021B.90602@egenix.com> Sijin Joseph wrote: > Hi - I am working on a patch where I have an argument that can either be a > unicode string or binary data, I parse the argument using the > PyArg_ParseTuple method using the s* format specification and get a > Py_Buffer. > > I now need to convert this Py_Buffer object to a Py_Unicode and pass it into > a function. What is the best way to do this? If I determine that the passed > argument was binary using another flag parameter then I am passing > Py_Buffer->buf as a pointer to the start of the data. I don't understand why you'd want to convert PyUnicode to PyBytes (encoded as UTF-8), only to decode it again afterwards in order to pass it to some other PyUnicode API. It'd be more efficient to use the "O" parser marker and then use PyObject_GetBuffer() to convert non-PyUnicode objects to a Py_buffer. > This is in winsound module, here's the relevant code snippet > > sound_playsound(PyObject *s, PyObject *args) > { > Py_buffer *buffer; > int flags; > int ok; > LPCWSTR pszSound; > > if (PyArg_ParseTuple(args, "s*i:PlaySound", &buffer, &flags)) { > if (flags & SND_ASYNC && flags & SND_MEMORY) { > /* Sidestep reference counting headache; unfortunately this also > prevent SND_LOOP from memory. */ > PyBuffer_Release(buffer); > PyErr_SetString(PyExc_RuntimeError, "Cannot play asynchronously > from memory"); > return NULL; > } > > if(flags & SND_MEMORY) { > pszSound = buffer->buf; > } > else { > /* pszSound = ????; */ > } > > -- Sijin > > > > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/mal%40egenix.com -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, May 02 2011) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2011-06-20: EuroPython 2011, Florence, Italy 49 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From benjamin at python.org Mon May 2 21:25:44 2011 From: benjamin at python.org (Benjamin Peterson) Date: Mon, 2 May 2011 14:25:44 -0500 Subject: [Python-Dev] Issue Tracker In-Reply-To: References: <4D90EA06.3030003@stoneleaf.us> <20110328223112.76482a9d@pitrou.net> <20110329013756.99EB8D64A7@kimball.webabinitio.net> Message-ID: 2011/5/2 Georg Brandl : > On 02.05.2011 18:06, anatoly techtonik wrote: >> On Sun, May 1, 2011 at 7:31 PM, Georg Brandl wrote: >>> On 30.04.2011 16:53, anatoly techtonik wrote: >>>> On Tue, Mar 29, 2011 at 4:37 AM, R. David Murray wrote: >>>>> >>>>> The hardest part is debugging the TAL when you make a mistake, but >>>>> even that isn't a whole lot worse than any other templating language. >>>> >>>> How much in % is it worse than Django templating language? >>> >>> I'm just guessing here, but I'd say 47.256 %. >> >> That means switching to Django templates will make Roundup design >> plumbing work 47.256% more attractive for potential contributors. > > That's not true actually. > > It'll be 89.595 % more attractive. I don't understand why you're truncating to 3 digits. Let's be honest in that it will be sqrt(2)^(13e/2) % more attractive. -- Regards, Benjamin From tjreedy at udel.edu Mon May 2 22:49:54 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Mon, 02 May 2011 16:49:54 -0400 Subject: [Python-Dev] running/stepping python backwards In-Reply-To: References: Message-ID: <4DBF18F2.9040202@udel.edu> On 4/29/2011 10:13 PM, Adrian Johnston wrote: > This may seem like an odd question, but I?m intrigued by the idea of > using Python as a data definition language with ?undo? support. > > If I were to try and instrument the Python interpreter to be able to > step backwards, would that be an unduly difficult or inefficient thing > to do? The pydev list is for development of the next version of Python. Please direct your question to a more appropriate forum such as python-list. > (Please reply to me directly.) I did this time, but you should not expect that when posting to a public list. -- Terry Jan Reedy From martin at v.loewis.de Mon May 2 23:14:06 2011 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Mon, 02 May 2011 23:14:06 +0200 Subject: [Python-Dev] Windows 2000 Support In-Reply-To: References: Message-ID: <4DBF1E9E.5000006@v.loewis.de> Am 01.05.2011 22:51, schrieb Brian Curtin: > I'm currently writing a post about the process of removing OS/2 and VMS > support and thought about a discussion of Windows 2000 some time > back. http://mail.python.org/pipermail/python-dev/2010-March/098074.html makes > a proposal for beginning to walk away from 2000, but doesn't appear to > come to any conclusion. > > Was anything decided off the list? I don't see anything in PEP-11 and > don't see any changes in the installer made around Windows 2000. That's what you get for not following your own processes. It seems the discussion just stopped, with no action. I vaguely recall having made changes to the installer to produce a warning, but apparently never got to commit these changes. > If nothing was decided, should anything be done for 3.3? Most certainly. It seems we missed the chance of dropping support for W2k, so we still can't actively remove any code. However, I'd a) add it to PEP 11, and b) add a warning to the installer I stand by http://mail.python.org/pipermail/python-dev/2010-March/098101.html i.e. if there are patches that happen not to work on W2k, I'd accept them anyway - anybody interested in W2k would then have to provide fixes before 3.3rc1. So please go ahead and change PEP 11. While you are at it, also threaten to remove support for systems where the COMSPEC points to command.com (#2405). Regards, Martin From drsalists at gmail.com Mon May 2 23:19:38 2011 From: drsalists at gmail.com (Dan Stromberg) Date: Mon, 2 May 2011 14:19:38 -0700 Subject: [Python-Dev] running/stepping python backwards In-Reply-To: <4DBF18F2.9040202@udel.edu> References: <4DBF18F2.9040202@udel.edu> Message-ID: On Mon, May 2, 2011 at 1:49 PM, Terry Reedy wrote: > > (Please reply to me directly.) > > I did this time, but you should not expect that when posting to a public > list. Actually, this is not only appropriate on some lists, on some lists one is actually strongly discouraged from doing anything else. EG: sun-managers, where replies are expected to be private, and the originator of the thread is expected to collect all (private) replies and summarize them, to keep the list traffic low and the S/N ratio high. -------------- next part -------------- An HTML attachment was scrubbed... URL: From barry at python.org Tue May 3 00:35:20 2011 From: barry at python.org (Barry Warsaw) Date: Mon, 2 May 2011 18:35:20 -0400 Subject: [Python-Dev] Python 2.6.7 schedule Message-ID: <20110502183520.1c9efdc0@neurotica.wooz.org> I'd like to make a Python 2.6.7 release candidate this Friday, May 6, with a final release scheduled for May 20. I've put these dates on the Python Release Schedule calendar. This will be a source-only security release. I see no release blockers for Python 2.6, so if you know of anything that must go into 2.6.7, please be sure there is a tracker issue for it, that 2.6 is marked as being affected, and with a release blocker priority. Cheers, -Barry -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: not available URL: From martin at v.loewis.de Tue May 3 01:09:42 2011 From: martin at v.loewis.de (=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=) Date: Tue, 03 May 2011 01:09:42 +0200 Subject: [Python-Dev] Fwd: viewVC shows traceback on non utf-8 module markup In-Reply-To: <4DBB19A5.4010409@voidspace.org.uk> References: <4DBB19A5.4010409@voidspace.org.uk> Message-ID: <4DBF39B6.3050100@v.loewis.de> Am 29.04.2011 22:03, schrieb Michael Foord: > I know that the svn repo is now for legacy purposes only, but I doubt it > is intended that the online source browser should raise exceptions. It's certainly not. However, I don't plan to do anything about it, either (nor would I know that anybody else would). To view the source code of the file, use http://svn.python.org/view/python/trunk/Lib/heapq.py?view=co&content-type=text/plain Regards, Martin From brian.curtin at gmail.com Tue May 3 02:39:33 2011 From: brian.curtin at gmail.com (Brian Curtin) Date: Mon, 2 May 2011 19:39:33 -0500 Subject: [Python-Dev] Windows 2000 Support In-Reply-To: <4DBF1E9E.5000006@v.loewis.de> References: <4DBF1E9E.5000006@v.loewis.de> Message-ID: On Mon, May 2, 2011 at 16:14, "Martin v. L?wis" wrote: > Am 01.05.2011 22:51, schrieb Brian Curtin: > > I'm currently writing a post about the process of removing OS/2 and VMS > > support and thought about a discussion of Windows 2000 some time > > back. http://mail.python.org/pipermail/python-dev/2010-March/098074.htmlmakes > > a proposal for beginning to walk away from 2000, but doesn't appear to > > come to any conclusion. > > > > Was anything decided off the list? I don't see anything in PEP-11 and > > don't see any changes in the installer made around Windows 2000. > > That's what you get for not following your own processes. It seems the > discussion just stopped, with no action. I vaguely recall having made > changes to the installer to produce a warning, but apparently never > got to commit these changes. > > > If nothing was decided, should anything be done for 3.3? > > Most certainly. It seems we missed the chance of dropping support for > W2k, so we still can't actively remove any code. However, I'd > > a) add it to PEP 11, and > b) add a warning to the installer > > I stand by > > http://mail.python.org/pipermail/python-dev/2010-March/098101.html > > i.e. if there are patches that happen not to work on W2k, I'd accept > them anyway - anybody interested in W2k would then have to provide > fixes before 3.3rc1. > > So please go ahead and change PEP 11. While you are at it, also threaten > to remove support for systems where the COMSPEC points to command.com > (#2405). > Done and done - http://hg.python.org/peps/rev/b9390aa12855 I'll have a look at the installer and add some type of message. -------------- next part -------------- An HTML attachment was scrubbed... URL: From nadeem.vawda at gmail.com Tue May 3 16:22:27 2011 From: nadeem.vawda at gmail.com (Nadeem Vawda) Date: Tue, 3 May 2011 16:22:27 +0200 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: References: Message-ID: On Tue, May 3, 2011 at 3:19 PM, victor.stinner wrote: > +# Issue #10276 - check that inputs of 2 GB are handled correctly. > +# Be aware of issues #1202, #8650, #8651 and #10276 > +class ChecksumBigBufferTestCase(unittest.TestCase): > + ? ?int_max = 0x7FFFFFFF > + > + ? ?@unittest.skipUnless(mmap, "mmap() is not available.") > + ? ?def test_big_buffer(self): > + ? ? ? ?if sys.platform[:3] == 'win' or sys.platform == 'darwin': > + ? ? ? ? ? ?requires('largefile', > + ? ? ? ? ? ? ? ? ? ? 'test requires %s bytes and a long time to run' % > + ? ? ? ? ? ? ? ? ? ? str(self.int_max)) > + ? ? ? ?try: > + ? ? ? ? ? ?with open(TESTFN, "wb+") as f: > + ? ? ? ? ? ? ? ?f.seek(self.int_max-4) > + ? ? ? ? ? ? ? ?f.write("asdf") > + ? ? ? ? ? ? ? ?f.flush() > + ? ? ? ? ? ? ? ?try: > + ? ? ? ? ? ? ? ? ? ?m = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) > + ? ? ? ? ? ? ? ? ? ?self.assertEqual(zlib.crc32(m), 0x709418e7) > + ? ? ? ? ? ? ? ? ? ?self.assertEqual(zlib.adler32(m), -2072837729) > + ? ? ? ? ? ? ? ?finally: > + ? ? ? ? ? ? ? ? ? ?m.close() > + ? ? ? ?except (IOError, OverflowError): > + ? ? ? ? ? ?raise unittest.SkipTest("filesystem doesn't have largefile support") > + ? ? ? ?finally: > + ? ? ? ? ? ?unlink(TESTFN) > + > + 0x7FFFFFFF is (2G-1) bytes. For a 2GB buffer, int_max should be 0x80000000. However, if you make this change, crc32() and adler32() raise OverflowErrors (see changeset a0681e7a6ded). This makes the test to erroneously report that the filesystem doesn't support large files. The assertEqual() tests should probably be changed to assertRaises(..., OverflowError). Also, the assignment to m needs to be moved outside of the inner try...finally block. If mmap() fails, the call to m.close() raises a new exception because m has not yet been bound. This seems to be causing failures on some of the 32-bit buildbots. As an aside, in this sort of situation is it better to just go and commit a fix myself, or is raising it on the mailing list first the right way to do things? Cheers, Nadeem From g.brandl at gmx.net Tue May 3 20:30:22 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Tue, 03 May 2011 20:30:22 +0200 Subject: [Python-Dev] Raise OSError or RuntimeError in the OS module? In-Reply-To: <201105021206.47384.victor.stinner@haypocalc.com> References: <201105021206.47384.victor.stinner@haypocalc.com> Message-ID: On 02.05.2011 12:06, Victor Stinner wrote: > Hi, > > I introduced recently the signal.pthread_sigmask() function (issue #8407). > pthread_sigmask() (the C function) returns an error code using errno codes. I > choosed to raise a RuntimeError using this error code, but I am not sure that > RuntimeError is the best choice. It is more an OS error than a runtime error: > should signal.pthread_sigmask() raise an OSError instead? > > signal.signal() raises a RuntimeError if setting the signal handler failed. > signal.siginterrupt() raises also a RuntimeError on error. > > signal.setitimer() and signal.getitimer() have their own exception class: > signal.ItimerError, raised on setimer() and getitimer() error. If it has an errno, it should be a subclass of EnvironmentError. Georg From brian.curtin at gmail.com Tue May 3 20:39:40 2011 From: brian.curtin at gmail.com (Brian Curtin) Date: Tue, 3 May 2011 13:39:40 -0500 Subject: [Python-Dev] Windows 2000 Support In-Reply-To: References: <4DBF1E9E.5000006@v.loewis.de> Message-ID: On Mon, May 2, 2011 at 19:39, Brian Curtin wrote: > On Mon, May 2, 2011 at 16:14, "Martin v. L?wis" wrote: > >> Am 01.05.2011 22:51, schrieb Brian Curtin: >> > I'm currently writing a post about the process of removing OS/2 and VMS >> > support and thought about a discussion of Windows 2000 some time >> > back. >> http://mail.python.org/pipermail/python-dev/2010-March/098074.html makes >> > a proposal for beginning to walk away from 2000, but doesn't appear to >> > come to any conclusion. >> > >> > Was anything decided off the list? I don't see anything in PEP-11 and >> > don't see any changes in the installer made around Windows 2000. >> >> That's what you get for not following your own processes. It seems the >> discussion just stopped, with no action. I vaguely recall having made >> changes to the installer to produce a warning, but apparently never >> got to commit these changes. >> >> > If nothing was decided, should anything be done for 3.3? >> >> Most certainly. It seems we missed the chance of dropping support for >> W2k, so we still can't actively remove any code. However, I'd >> >> a) add it to PEP 11, and >> b) add a warning to the installer >> >> I stand by >> >> http://mail.python.org/pipermail/python-dev/2010-March/098101.html >> >> i.e. if there are patches that happen not to work on W2k, I'd accept >> them anyway - anybody interested in W2k would then have to provide >> fixes before 3.3rc1. >> >> So please go ahead and change PEP 11. While you are at it, also threaten >> to remove support for systems where the COMSPEC points to command.com >> (#2405). >> > > Done and done - http://hg.python.org/peps/rev/b9390aa12855 > I'll have a look at the installer and add some type of message. > It turns out that you did make the change at some point for 2.7 being the last, but there was no corresponding 3.x version chosen. http://hg.python.org/cpython/rev/de53c52fbcbf changed the installer to list 3.3.0 as the last Windows 2000 release on the default branch. -------------- next part -------------- An HTML attachment was scrubbed... URL: From solipsis at pitrou.net Tue May 3 20:57:47 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Tue, 3 May 2011 20:57:47 +0200 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by References:

Message-ID: <20110503205747.65a76522@pitrou.net> Hello, On Tue, 3 May 2011 16:22:27 +0200 Nadeem Vawda wrote: > > As an aside, in this sort of situation is it better to just go and > commit a fix myself, or is raising it on the mailing list first the > right way to do things? Raising it on the mailing-list makes it serve as a kind of post-commit review. Also, it ensures that the committer of the original patch understands the issues with it. cheers Antoine. From victor.stinner at haypocalc.com Tue May 3 22:38:43 2011 From: victor.stinner at haypocalc.com (Victor Stinner) Date: Tue, 03 May 2011 22:38:43 +0200 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: References:

Message-ID: <1304455123.1971.5.camel@marge> Le mardi 03 mai 2011 ? 16:22 +0200, Nadeem Vawda a ?crit : > On Tue, May 3, 2011 at 3:19 PM, victor.stinner > wrote: > > +# Issue #10276 - check that inputs of 2 GB are handled correctly. > > +# Be aware of issues #1202, #8650, #8651 and #10276 > > +class ChecksumBigBufferTestCase(unittest.TestCase): > > + int_max = 0x7FFFFFFF > > + > > + @unittest.skipUnless(mmap, "mmap() is not available.") > > + def test_big_buffer(self): > > + if sys.platform[:3] == 'win' or sys.platform == 'darwin': > > + requires('largefile', > > + 'test requires %s bytes and a long time to run' % > > + str(self.int_max)) > > + try: > > + with open(TESTFN, "wb+") as f: > > + f.seek(self.int_max-4) > > + f.write("asdf") > > + f.flush() > > + try: > > + m = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) > > + self.assertEqual(zlib.crc32(m), 0x709418e7) > > + self.assertEqual(zlib.adler32(m), -2072837729) > > + finally: > > + m.close() > > + except (IOError, OverflowError): > > + raise unittest.SkipTest("filesystem doesn't have largefile support") > > + finally: > > + unlink(TESTFN) > > + > > + > > 0x7FFFFFFF is (2G-1) bytes. For a 2GB buffer, int_max should be > 0x80000000. However, if you make this change, crc32() and adler32() > raise OverflowErrors (see changeset a0681e7a6ded). I don't want to check OverflowError: the test is supposed to compute the checksum of a buffer of 0x7FFFFFFF bytes, to check crc32() and adler32(). 0x7FFFFFFF is the biggest size supported by these functions (zlib doesn't use Py_ssize_t in Python 2.7). If you use a buffer of 0x80000000 bytes, you test PyArg_Parse*() functions, which have already a dedicated test (in test_xml_etree_c, it's not the best file to store such test...). > Also, the assignment to m needs to be moved outside of the inner > try...finally block. Yeah, I noticed this with buildbots: already fixed by dd58f8072216. > As an aside, in this sort of situation is it better to just go and > commit a fix myself, or is raising it on the mailing list first the > right way to do things? I'm not sure that you understood the test, so I think that it's better to ask first on IRC and/or the mailing list. Victor From nadeem.vawda at gmail.com Tue May 3 23:11:48 2011 From: nadeem.vawda at gmail.com (Nadeem Vawda) Date: Tue, 3 May 2011 23:11:48 +0200 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: <1304455123.1971.5.camel@marge> References:

<1304455123.1971.5.camel@marge> Message-ID: On Tue, May 3, 2011 at 10:38 PM, Victor Stinner wrote: > I don't want to check OverflowError: the test is supposed to compute the > checksum of a buffer of 0x7FFFFFFF bytes, to check crc32() and > adler32(). 0x7FFFFFFF is the biggest size supported by these functions > (zlib doesn't use Py_ssize_t in Python 2.7). I see. Since you mentioned issue 10276 in the commit message, I assumed you were testing for the underlying C functions truncating their arguments. It seems that I was mistaken. Sorry for the confusion. Cheers, Nadeem From victor.stinner at haypocalc.com Wed May 4 10:58:42 2011 From: victor.stinner at haypocalc.com (Victor Stinner) Date: Wed, 04 May 2011 10:58:42 +0200 Subject: [Python-Dev] The zombi thread of the Tcl library Message-ID: <1304499523.15694.11.camel@marge> Hi, I have a question: would it be possible to mask all signals in the Tcl thread? To understand the question, let's see the context... I'm working on signals, especially on pthread_sigmask(), and I'm trying to understand test_signal failures. test_signal fails if the _tkinter module is loaded, because _tkinter loads the Tcl library which create a thread waiting events in select(). For example, "python -m test test_pydoc test_signal" fails, because test_pydoc loads ALL Python modules. I opened an issue for test_pydoc: http://bugs.python.org/issue11995 _tkinter.c contains the following code: #if 0 /* This was not a good idea; through bindings, Tcl_Finalize() may invoke Python code but at that point the interpreter and thread state have already been destroyed! */ Py_AtExit(Tcl_Finalize); #endif Tcl_Finalize() exits the thread, but this function is never called in Python. Anyway, it is not possible to unload a module implemented in C. I would like to know if it would be possible to mask all signals in the Tcl thread, or if Tcl supports/uses signals. It is possible to mask all signals in the Tcl thread using: ---------- allsignals = range(1, signal.NSIG) oldmask = signal.pthread_sigmask(signal.SIG_BLOCK, allsignals) import _tkinter signal.pthread_sigmask(signal.SIG_SETMASK, oldmask) ---------- I'm not asking the question for test_signal: I have a patch fixing test_signal, even if the Tcl zombi thread is present (use pthread_kill() to send the signal directly to the main thread). (I wrote "zombi" thread because I was not aware that Tcl uses a thread, nor that test_pydoc loads all modules. The thread is valid, alive, and it's just a joke. The threads is more hidden than zombi.) Victor From marks at dcs.gla.ac.uk Wed May 4 11:08:33 2011 From: marks at dcs.gla.ac.uk (Mark Shannon) Date: Wed, 04 May 2011 10:08:33 +0100 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: <1304499523.15694.11.camel@marge> References: <1304499523.15694.11.camel@marge> Message-ID: <4DC11791.2000109@dcs.gla.ac.uk> Hi, The online documentation specifies which API function borrow and/or steal references (as opposed to the default behaviour). Yet, I cannot find this information anywhere in the source. Any clues as to where I should look? Cheers, Mark From amauryfa at gmail.com Wed May 4 11:35:19 2011 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Wed, 4 May 2011 11:35:19 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: <4DC11791.2000109@dcs.gla.ac.uk> References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> Message-ID: Hi, Le mercredi 4 mai 2011, Mark Shannon a ?crit?: > The online documentation specifies which API function borrow and/or steal references (as opposed to the default behaviour). > Yet, I cannot find this information anywhere in the source. > > Any clues as to where I should look? It's in the file Doc/data/refcounts.dat in some custom format. -- Amaury -- Amaury Forgeot d'Arc From solipsis at pitrou.net Wed May 4 12:05:19 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 4 May 2011 12:05:19 +0200 Subject: [Python-Dev] The zombi thread of the Tcl library References: <1304499523.15694.11.camel@marge> Message-ID: <20110504120519.7a1bc105@pitrou.net> On Wed, 04 May 2011 10:58:42 +0200 Victor Stinner wrote: > > Tcl_Finalize() exits the thread, but this function is never called in > Python. Anyway, it is not possible to unload a module implemented in C. You could expose Tcl_Finalize() for debug purposes and call it in test_signal. Regards Antoine. From victor.stinner at haypocalc.com Wed May 4 13:54:20 2011 From: victor.stinner at haypocalc.com (Victor Stinner) Date: Wed, 04 May 2011 13:54:20 +0200 Subject: [Python-Dev] The zombi thread of the Tcl library In-Reply-To: <20110504120519.7a1bc105@pitrou.net> References: <1304499523.15694.11.camel@marge> <20110504120519.7a1bc105@pitrou.net> Message-ID: <1304510060.15694.13.camel@marge> Le mercredi 04 mai 2011 ? 12:05 +0200, Antoine Pitrou a ?crit : > On Wed, 04 May 2011 10:58:42 +0200 > Victor Stinner wrote: > > > > Tcl_Finalize() exits the thread, but this function is never called in > > Python. Anyway, it is not possible to unload a module implemented in C. > > You could expose Tcl_Finalize() for debug purposes and call it in > test_signal. Good idea. I opened an issue with a patch implementing Tcl_Finalize(): http://bugs.python.org/issue11998 I also added a workaround _tkinter border effect in test_signal. Buildbots look to be happy. Victor From ncoghlan at gmail.com Wed May 4 19:01:58 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Thu, 5 May 2011 03:01:58 +1000 Subject: [Python-Dev] New interest areas in Experts Index Message-ID: I just added two new interest areas in the Expert's Index [1] context managers: for any issues relating to proposals to add context management capabilities to objects in the stdlib, triagers should feel free to add me to the nosy list test coverage: this is specifically for anyone willing to help review and commit test coverage improvement patches (rather than the more general "testing" interest area that was already present) Cheers, Nick. [1] http://docs.python.org/devguide/experts -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From solipsis at pitrou.net Wed May 4 21:35:11 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Wed, 4 May 2011 21:35:11 +0200 Subject: [Python-Dev] cpython (2.7): Issue #11277: test_zlib tests a buffer of 1 GB on 32 bits References: Message-ID: <20110504213511.07e9f2bf@pitrou.net> On Wed, 04 May 2011 21:27:50 +0200 victor.stinner wrote: > http://hg.python.org/cpython/rev/7f3cab59ef3e > changeset: 69834:7f3cab59ef3e > branch: 2.7 > parent: 69827:affec521b330 > user: Victor Stinner > date: Wed May 04 21:27:39 2011 +0200 > summary: > Issue #11277: test_zlib tests a buffer of 1 GB on 32 bits What's the point? The issue with 2GB or 4GB buffers is that they cross the potential limit of a machine type (a signed or unsigned integer). I don't see any benefit in testing a 1GB buffer; the test could probably be removed instead. Regards Antoine. From greg.ewing at canterbury.ac.nz Thu May 5 00:04:51 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 05 May 2011 10:04:51 +1200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: <4DC11791.2000109@dcs.gla.ac.uk> References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> Message-ID: <4DC1CD83.3000603@canterbury.ac.nz> Mark Shannon wrote: > The online documentation specifies which API function borrow and/or > steal references (as opposed to the default behaviour). > Yet, I cannot find this information anywhere in the source. There are comments in some places, e.g. in listobject.h: *** WARNING *** PyList_SetItem does not increment the new item's reference count, but does decrement the reference count of the item it replaces, if not nil. It does *decrement* the reference count if it is *not* inserted in the list. Similarly, PyList_GetItem does not increment the returned item's reference count. If you're looking for evidence in the actual code, there's nothing particular to look for -- it's implicit in the way the function works overall. -- Greg From greg.ewing at canterbury.ac.nz Thu May 5 00:23:01 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Thu, 05 May 2011 10:23:01 +1200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> Message-ID: <4DC1D1C5.9010507@canterbury.ac.nz> Amaury Forgeot d'Arc wrote: > It's in the file Doc/data/refcounts.dat > in some custom format. However, it doesn't seem to quite convey the same information. It lists the "refcount effect" on each parameter, but translating that into the notion of borrowed or stolen references seems to require knowledge of what the function does. For example, PyDict_SetItem has: PyDict_SetItem:PyObject*:p:0: PyDict_SetItem:PyObject*:key:+1: PyDict_SetItem:PyObject*:val:+1: All of these parameters take borrowed references, but the key and val get incremented because they're being stored in the dict. So this file appears to be of limited usefulness. -- Greg From ethan at stoneleaf.us Thu May 5 00:40:42 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Wed, 04 May 2011 15:40:42 -0700 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: <1304455123.1971.5.camel@marge> References:

<1304455123.1971.5.camel@marge> Message-ID: <4DC1D5EA.7060608@stoneleaf.us> Victor Stinner wrote: > Le mardi 03 mai 2011 ? 16:22 +0200, Nadeem Vawda a ?crit : >> On Tue, May 3, 2011 at 3:19 PM, victor.stinner >> wrote: >>> +# Issue #10276 - check that inputs of 2 GB are handled correctly. >>> +# Be aware of issues #1202, #8650, #8651 and #10276 >>> +class ChecksumBigBufferTestCase(unittest.TestCase): >>> + int_max = 0x7FFFFFFF >>> + >>> + @unittest.skipUnless(mmap, "mmap() is not available.") >>> + def test_big_buffer(self): >>> + if sys.platform[:3] == 'win' or sys.platform == 'darwin': >>> + requires('largefile', >>> + 'test requires %s bytes and a long time to run' % >>> + str(self.int_max)) >>> + try: >>> + with open(TESTFN, "wb+") as f: >>> + f.seek(self.int_max-4) >>> + f.write("asdf") >>> + f.flush() >>> + try: >>> + m = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) >>> + self.assertEqual(zlib.crc32(m), 0x709418e7) >>> + self.assertEqual(zlib.adler32(m), -2072837729) >>> + finally: >>> + m.close() >>> + except (IOError, OverflowError): >>> + raise unittest.SkipTest("filesystem doesn't have largefile support") >>> + finally: >>> + unlink(TESTFN) >>> + >>> + >> 0x7FFFFFFF is (2G-1) bytes. For a 2GB buffer, int_max should be >> 0x80000000. However, if you make this change, crc32() and adler32() >> raise OverflowErrors (see changeset a0681e7a6ded). > > I don't want to check OverflowError: the test is supposed to compute the > checksum of a buffer of 0x7FFFFFFF bytes The comment says 'check that inputs of 2 GB are handled correctly' but the file created is 1 byte short of 2Gb. Is the test wrong, or just wrongly commented? Or am I not understanding? ~Ethan~ From victor.stinner at haypocalc.com Thu May 5 11:33:27 2011 From: victor.stinner at haypocalc.com (Victor Stinner) Date: Thu, 05 May 2011 11:33:27 +0200 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: <4DC1D5EA.7060608@stoneleaf.us> References:

<1304455123.1971.5.camel@marge> <4DC1D5EA.7060608@stoneleaf.us> Message-ID: <1304588007.22418.7.camel@marge> Le mercredi 04 mai 2011 ? 15:40 -0700, Ethan Furman a ?crit : > Victor Stinner wrote: > > Le mardi 03 mai 2011 ? 16:22 +0200, Nadeem Vawda a ?crit : > >> On Tue, May 3, 2011 at 3:19 PM, victor.stinner > >> wrote: > >>> +# Issue #10276 - check that inputs of 2 GB are handled correctly. > >>> +# Be aware of issues #1202, #8650, #8651 and #10276 > >>> +class ChecksumBigBufferTestCase(unittest.TestCase): > >>> + int_max = 0x7FFFFFFF > >>> + > >>> + @unittest.skipUnless(mmap, "mmap() is not available.") > >>> + def test_big_buffer(self): > >>> + if sys.platform[:3] == 'win' or sys.platform == 'darwin': > >>> + requires('largefile', > >>> + 'test requires %s bytes and a long time to run' % > >>> + str(self.int_max)) > >>> + try: > >>> + with open(TESTFN, "wb+") as f: > >>> + f.seek(self.int_max-4) > >>> + f.write("asdf") > >>> + f.flush() > >>> + try: > >>> + m = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) > >>> + self.assertEqual(zlib.crc32(m), 0x709418e7) > >>> + self.assertEqual(zlib.adler32(m), -2072837729) > >>> + finally: > >>> + m.close() > >>> + except (IOError, OverflowError): > >>> + raise unittest.SkipTest("filesystem doesn't have largefile support") > >>> + finally: > >>> + unlink(TESTFN) > >>> + > >>> + > >> 0x7FFFFFFF is (2G-1) bytes. For a 2GB buffer, int_max should be > >> 0x80000000. However, if you make this change, crc32() and adler32() > >> raise OverflowErrors (see changeset a0681e7a6ded). > > > > I don't want to check OverflowError: the test is supposed to compute the > > checksum of a buffer of 0x7FFFFFFF bytes > > The comment says 'check that inputs of 2 GB are handled correctly' but > the file created is 1 byte short of 2Gb. Is the test wrong, or just > wrongly commented? Or am I not understanding? If you write a byte after 2 GB of zeros, the file size is 2 GB+the few bytes. This trick is to create quickly a large file: some OSes support sparse files, zeros are not written on disk. But on Mac OS X and Windows, you really write 2 GB+some bytes. Victor From nadeem.vawda at gmail.com Thu May 5 11:43:19 2011 From: nadeem.vawda at gmail.com (Nadeem Vawda) Date: Thu, 5 May 2011 11:43:19 +0200 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: <1304588007.22418.7.camel@marge> References:

<1304455123.1971.5.camel@marge> <4DC1D5EA.7060608@stoneleaf.us> <1304588007.22418.7.camel@marge> Message-ID: On Thu, May 5, 2011 at 11:33 AM, Victor Stinner wrote: > Le mercredi 04 mai 2011 ? 15:40 -0700, Ethan Furman a ?crit : >> The comment says 'check that inputs of 2 GB are handled correctly' but >> the file created is 1 byte short of 2Gb. ?Is the test wrong, or just >> wrongly commented? ?Or am I not understanding? > > If you write a byte after 2 GB of zeros, the file size is 2 GB+the few > bytes. This trick is to create quickly a large file: some OSes support > sparse files, zeros are not written on disk. But on Mac OS X and > Windows, you really write 2 GB+some bytes. Ethan's point is that 0x7FFFFFFF is not 2GB - it is (2G-1) bytes. So the test and the preceding comment are inconsistent. From p.f.moore at gmail.com Thu May 5 11:53:59 2011 From: p.f.moore at gmail.com (Paul Moore) Date: Thu, 5 May 2011 10:53:59 +0100 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: <1304588007.22418.7.camel@marge> References:

<1304455123.1971.5.camel@marge> <4DC1D5EA.7060608@stoneleaf.us> <1304588007.22418.7.camel@marge> Message-ID: On 5 May 2011 10:33, Victor Stinner wrote: > If you write a byte after 2 GB of zeros, the file size is 2 GB+the few > bytes. This trick is to create quickly a large file: some OSes support > sparse files, zeros are not written on disk. But on Mac OS X and > Windows, you really write 2 GB+some bytes. FWIW, on Windows you can create sparse files, using DeviceIoControl(FILE_SET_SPARSE). It's probably too messy to be worth it for this case, though... Paul From giuott at gmail.com Thu May 5 12:14:34 2011 From: giuott at gmail.com (Giuseppe Ottaviano) Date: Thu, 5 May 2011 11:14:34 +0100 Subject: [Python-Dev] What if replacing items in a dictionary returns the new dictionary? In-Reply-To: References: <20110429143406.GA441@iskra.aviel.ru> Message-ID: On Fri, Apr 29, 2011 at 4:05 PM, Roy Hyunjin Han wrote: >> ? You can implement this in your own subclass of dict, no? > > Yes, I just thought it would be convenient to have in the language > itself, but the responses to my post seem to indicate that [not > returning the updated object] is an intended language feature for > mutable types like dict or list. In general nothing stops you to use a proxy object that returns itself after each method call, something like class using(object): def __init__(self, obj): self._wrappee = obj def unwrap(self): return self._wrappee def __getattr__(self, attr): def wrapper(*args, **kwargs): getattr(self._wrappee, attr)(*args, **kwargs) return self return wrapper d = dict() print using(d).update(dict(a=1)).update(dict(b=2)).unwrap() # prints {'a': 1, 'b': 2} l = list() print using(l).append(1).append(2).unwrap() # prints [1, 2] From amauryfa at gmail.com Thu May 5 12:38:32 2011 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Thu, 5 May 2011 12:38:32 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: <4DC1D1C5.9010507@canterbury.ac.nz> References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz> Message-ID: Hi, Le jeudi 5 mai 2011, Greg Ewing a ?crit?: > Amaury Forgeot d'Arc wrote: > > > It's in the file Doc/data/refcounts.dat > in some custom format. > > > However, it doesn't seem to quite convey the same information. > It lists the "refcount effect" on each parameter, but translating > that into the notion of borrowed or stolen references seems > to require knowledge of what the function does. > > For example, PyDict_SetItem has: > > PyDict_SetItem:PyObject*:p:0: > PyDict_SetItem:PyObject*:key:+1: > PyDict_SetItem:PyObject*:val:+1: > > All of these parameters take borrowed references, but the > key and val get incremented because they're being stored > in the dict. This is not always true, for example when the item is already present in the dict. It's not important to know what the function does to the object, Only the action on the reference is relevant. > > So this file appears to be of limited usefulness. -- Amaury -- Amaury Forgeot d'Arc From ethan at stoneleaf.us Thu May 5 14:07:04 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Thu, 05 May 2011 05:07:04 -0700 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: <1304588007.22418.7.camel@marge> References:

<1304455123.1971.5.camel@marge> <4DC1D5EA.7060608@stoneleaf.us> <1304588007.22418.7.camel@marge> Message-ID: <4DC292E8.9010904@stoneleaf.us> Victor Stinner wrote: > Le mercredi 04 mai 2011 ? 15:40 -0700, Ethan Furman a ?crit : >> Victor Stinner wrote: >>> Le mardi 03 mai 2011 ? 16:22 +0200, Nadeem Vawda a ?crit : >>>> On Tue, May 3, 2011 at 3:19 PM, victor.stinner >>>> wrote: >>>>> >>>>> + int_max = 0x7FFFFFFF >>>>> >>>>> + with open(TESTFN, "wb+") as f: >>>>> + f.seek(self.int_max-4) >>>>> + f.write("asdf") >>>>> + f.flush() >>>> >>>> 0x7FFFFFFF is (2G-1) bytes. For a 2GB buffer, int_max should be >>>> 0x80000000. However, if you make this change, crc32() and adler32() >>>> raise OverflowErrors (see changeset a0681e7a6ded). >>> >>> I don't want to check OverflowError: the test is supposed to compute the >>> checksum of a buffer of 0x7FFFFFFF bytes >> >> The comment says 'check that inputs of 2 GB are handled correctly' but >> the file created is 1 byte short of 2Gb. Is the test wrong, or just >> wrongly commented? Or am I not understanding? > > If you write a byte after 2 GB of zeros, the file size is 2 GB+the few > bytes. This trick is to create quickly a large file: some OSes support > sparse files, zeros are not written on disk. But on Mac OS X and > Windows, you really write 2 GB+some bytes. True, but that's not what's happening -- four bytes are being written at int_max - 4, and int_max is one less that 2GB; hence the resulting file is one less than 2GB. ~Ethan~ From victor.stinner at haypocalc.com Thu May 5 14:27:43 2011 From: victor.stinner at haypocalc.com (Victor Stinner) Date: Thu, 05 May 2011 14:27:43 +0200 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: <4DC292E8.9010904@stoneleaf.us> References:

<1304455123.1971.5.camel@marge> <4DC1D5EA.7060608@stoneleaf.us> <1304588007.22418.7.camel@marge> <4DC292E8.9010904@stoneleaf.us> Message-ID: <1304598463.27042.0.camel@marge> Le jeudi 05 mai 2011 ? 05:07 -0700, Ethan Furman a ?crit : > ... hence the resulting file is one less than 2GB. Yep, it's 0x7FFFFFFF because it's INT_MAX, the biggest value storable in an int. The zlib module stores the buffer size into an int in Python 2.7 (and Py_ssize_t in Python 3.3). Victor From ethan at stoneleaf.us Thu May 5 17:17:27 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Thu, 05 May 2011 08:17:27 -0700 Subject: [Python-Dev] [Python-checkins] cpython (2.7): Issue #10276: test_zlib checks that inputs of 2 GB are handled correctly by In-Reply-To: <1304598463.27042.0.camel@marge> References:

<1304455123.1971.5.camel@marge> <4DC1D5EA.7060608@stoneleaf.us> <1304588007.22418.7.camel@marge> <4DC292E8.9010904@stoneleaf.us> <1304598463.27042.0.camel@marge> Message-ID: <4DC2BF87.40100@stoneleaf.us> Victor Stinner wrote: > Le jeudi 05 mai 2011 ? 05:07 -0700, Ethan Furman a ?crit : >> >> ... hence the resulting file is one less than 2GB. > > Yep, it's 0x7FFFFFFF because it's INT_MAX, the biggest value storable in > an int. The zlib module stores the buffer size into an int in Python 2.7 > (and Py_ssize_t in Python 3.3). So we are agreed that the file is not, in fact, 2GB in size... > On Tue, May 3, 2011 at 3:19 PM, victor.stinner > wrote: >> +# Issue #10276 - check that inputs of 2 GB are handled correctly. >> +# Be aware of issues #1202, #8650, #8651 and #10276 So why do the comments say we are testing a 2GB input? ~Ethan~ From starsareblueandfaraway at gmail.com Thu May 5 16:37:04 2011 From: starsareblueandfaraway at gmail.com (Roy Hyunjin Han) Date: Thu, 5 May 2011 10:37:04 -0400 Subject: [Python-Dev] What if replacing items in a dictionary returns the new dictionary? In-Reply-To: References: <20110429143406.GA441@iskra.aviel.ru>

Message-ID: >> 2011/4/29 Roy Hyunjin Han : >> It would be convenient if replacing items in a dictionary returns the >> new dictionary, in a manner analogous to str.replace(). What do you >> think? >> >> # Current behavior >> x = {'key1': 1} >> x.update(key1=3) == None >> x == {'key1': 3} # Original variable has changed >> >> # Possible behavior >> x = {'key1': 1} >> x.replace(key1=3) == {'key1': 3} >> x == {'key1': 1} # Original variable is unchanged >> > 2011/5/5 Giuseppe Ottaviano : > In general nothing stops you to use a proxy object that returns itself > after each method call, something like > > class using(object): > def __init__(self, obj): > self._wrappee = obj > > def unwrap(self): > return self._wrappee > > def __getattr__(self, attr): > def wrapper(*args, **kwargs): > getattr(self._wrappee, attr)(*args, **kwargs) > return self > return wrapper > > > d = dict() > print using(d).update(dict(a=1)).update(dict(b=2)).unwrap() > # prints {'a': 1, 'b': 2} > l = list() > print using(l).append(1).append(2).unwrap() > # prints [1, 2] Cool! I never thought of that. That's a great snippet. I'll forward this to the python-ideas list. I don't think the python-dev people want this discussion to continue on their mailing list. From guido at python.org Thu May 5 19:00:54 2011 From: guido at python.org (Guido van Rossum) Date: Thu, 5 May 2011 10:00:54 -0700 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz> Message-ID: On Thu, May 5, 2011 at 3:38 AM, Amaury Forgeot d'Arc wrote: > Hi, > > Le jeudi 5 mai 2011, Greg Ewing a ?crit?: >> Amaury Forgeot d'Arc wrote: >> >> >> It's in the file Doc/data/refcounts.dat >> in some custom format. >> >> >> However, it doesn't seem to quite convey the same information. >> It lists the "refcount effect" on each parameter, but translating >> that into the notion of borrowed or stolen references seems >> to require knowledge of what the function does. >> >> For example, PyDict_SetItem has: >> >> PyDict_SetItem:PyObject*:p:0: >> PyDict_SetItem:PyObject*:key:+1: >> PyDict_SetItem:PyObject*:val:+1: >> >> All of these parameters take borrowed references, but the >> key and val get incremented because they're being stored >> in the dict. > > This is not always true, for example when the item is already present > in the dict. > It's not important to know what the function does to the object, > Only the action on the reference is relevant. > >> >> So this file appears to be of limited usefulness. Seems you're in agreement with this. IMO when references are borrowed it is not very interesting. The interesting thing is when calling a function *steals* a reference. The other important thing to know is whether the caller ends up owning the return value (if it is an object) or not. I *think* you can tell the latter from the +1 for the return value; but the former (whether it steals a reference) is unclear from the data given. There's even an XXX comment about this in the file: # XXX NOTE: the 0/+1/-1 refcount information for arguments is # confusing! Much more useful would be to indicate whether the # function "steals" a reference to the argument or not. Take for # example PyList_SetItem(list, i, item). This lists as a 0 change for # both the list and the item arguments. However, in fact it steals a # reference to the item argument! -- --Guido van Rossum (python.org/~guido) From amauryfa at gmail.com Thu May 5 19:17:30 2011 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Thu, 5 May 2011 19:17:30 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz> Message-ID: 2011/5/5 Guido van Rossum : > Seems you're in agreement with this. IMO when references are borrowed > it is not very interesting. The interesting thing is when calling a > function *steals* a reference. The other important thing to know is > whether the caller ends up owning the return value (if it is an > object) or not. I *think* you can tell the latter from the +1 for the > return value; but the former (whether it steals a reference) is > unclear from the data given. There's even an XXX comment about this in > the file: > > # XXX NOTE: the 0/+1/-1 refcount information for arguments is > # confusing! ?Much more useful would be to indicate whether the > # function "steals" a reference to the argument or not. ?Take for > # example PyList_SetItem(list, i, item). ?This lists as a 0 change for > # both the list and the item arguments. ?However, in fact it steals a > # reference to the item argument! Should we change this file then? And only list functions that don't follow the usual conventions. But I'm sure that there are external tools which already use refcounts.dat in its present format. -- Amaury Forgeot d'Arc From guido at python.org Thu May 5 19:18:54 2011 From: guido at python.org (Guido van Rossum) Date: Thu, 5 May 2011 10:18:54 -0700 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz>

Message-ID: On Thu, May 5, 2011 at 10:17 AM, Amaury Forgeot d'Arc wrote: > 2011/5/5 Guido van Rossum : >> Seems you're in agreement with this. IMO when references are borrowed >> it is not very interesting. The interesting thing is when calling a >> function *steals* a reference. The other important thing to know is >> whether the caller ends up owning the return value (if it is an >> object) or not. I *think* you can tell the latter from the +1 for the >> return value; but the former (whether it steals a reference) is >> unclear from the data given. There's even an XXX comment about this in >> the file: >> >> # XXX NOTE: the 0/+1/-1 refcount information for arguments is >> # confusing! ?Much more useful would be to indicate whether the >> # function "steals" a reference to the argument or not. ?Take for >> # example PyList_SetItem(list, i, item). ?This lists as a 0 change for >> # both the list and the item arguments. ?However, in fact it steals a >> # reference to the item argument! > > Should we change this file then? > And only list functions that don't follow the usual conventions. > > But I'm sure that there are external tools which already use refcounts.dat > in its present format. Maybe we can *add* a column with the desired information? -- --Guido van Rossum (python.org/~guido) From g.brandl at gmx.net Thu May 5 20:08:51 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 05 May 2011 20:08:51 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz> Message-ID: On 05.05.2011 19:00, Guido van Rossum wrote: > On Thu, May 5, 2011 at 3:38 AM, Amaury Forgeot d'Arc wrote: >> Hi, >> >> Le jeudi 5 mai 2011, Greg Ewing a ?crit : >>> Amaury Forgeot d'Arc wrote: >>> >>> >>> It's in the file Doc/data/refcounts.dat >>> in some custom format. >>> >>> >>> However, it doesn't seem to quite convey the same information. >>> It lists the "refcount effect" on each parameter, but translating >>> that into the notion of borrowed or stolen references seems >>> to require knowledge of what the function does. >>> >>> For example, PyDict_SetItem has: >>> >>> PyDict_SetItem:PyObject*:p:0: >>> PyDict_SetItem:PyObject*:key:+1: >>> PyDict_SetItem:PyObject*:val:+1: >>> >>> All of these parameters take borrowed references, but the >>> key and val get incremented because they're being stored >>> in the dict. >> >> This is not always true, for example when the item is already present >> in the dict. >> It's not important to know what the function does to the object, >> Only the action on the reference is relevant. >> >>> >>> So this file appears to be of limited usefulness. > > Seems you're in agreement with this. IMO when references are borrowed > it is not very interesting. The interesting thing is when calling a > function *steals* a reference. The other important thing to know is > whether the caller ends up owning the return value (if it is an > object) or not. I *think* you can tell the latter from the +1 for the > return value; but the former (whether it steals a reference) is > unclear from the data given. There's even an XXX comment about this in > the file: > > # XXX NOTE: the 0/+1/-1 refcount information for arguments is > # confusing! Much more useful would be to indicate whether the > # function "steals" a reference to the argument or not. Take for > # example PyList_SetItem(list, i, item). This lists as a 0 change for > # both the list and the item arguments. However, in fact it steals a > # reference to the item argument! We're not using the information about arguments anyway in the doc build. So we're free to change the file to list only return types, and parameters in the event of stolen references. Georg From solipsis at pitrou.net Thu May 5 20:09:30 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 5 May 2011 20:09:30 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz>

Message-ID: <20110505200930.0412d200@pitrou.net> On Thu, 5 May 2011 19:17:30 +0200 "Amaury Forgeot d'Arc" wrote: > 2011/5/5 Guido van Rossum : > > Seems you're in agreement with this. IMO when references are borrowed > > it is not very interesting. The interesting thing is when calling a > > function *steals* a reference. The other important thing to know is > > whether the caller ends up owning the return value (if it is an > > object) or not. I *think* you can tell the latter from the +1 for the > > return value; but the former (whether it steals a reference) is > > unclear from the data given. There's even an XXX comment about this in > > the file: > > > > # XXX NOTE: the 0/+1/-1 refcount information for arguments is > > # confusing! ?Much more useful would be to indicate whether the > > # function "steals" a reference to the argument or not. ?Take for > > # example PyList_SetItem(list, i, item). ?This lists as a 0 change for > > # both the list and the item arguments. ?However, in fact it steals a > > # reference to the item argument! > > Should we change this file then? > And only list functions that don't follow the usual conventions. +1 Regards Antoine. From raymond.hettinger at gmail.com Thu May 5 20:12:55 2011 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Thu, 5 May 2011 11:12:55 -0700 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz>

Message-ID: On May 5, 2011, at 10:18 AM, Guido van Rossum wrote: > On Thu, May 5, 2011 at 10:17 AM, Amaury Forgeot d'Arc > wrote: >> 2011/5/5 Guido van Rossum : >>> Seems you're in agreement with this. IMO when references are borrowed >>> it is not very interesting. The interesting thing is when calling a >>> function *steals* a reference. The other important thing to know is >>> whether the caller ends up owning the return value (if it is an >>> object) or not. I *think* you can tell the latter from the +1 for the >>> return value; but the former (whether it steals a reference) is >>> unclear from the data given. There's even an XXX comment about this in >>> the file: >>> >>> # XXX NOTE: the 0/+1/-1 refcount information for arguments is >>> # confusing! Much more useful would be to indicate whether the >>> # function "steals" a reference to the argument or not. Take for >>> # example PyList_SetItem(list, i, item). This lists as a 0 change for >>> # both the list and the item arguments. However, in fact it steals a >>> # reference to the item argument! >> >> Should we change this file then? >> And only list functions that don't follow the usual conventions. >> >> But I'm sure that there are external tools which already use refcounts.dat >> in its present format. > > Maybe we can *add* a column with the desired information? +1 Raymond From benjamin at python.org Thu May 5 20:41:50 2011 From: benjamin at python.org (Benjamin Peterson) Date: Thu, 5 May 2011 13:41:50 -0500 Subject: [Python-Dev] [Python-checkins] cpython (3.2): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: References: Message-ID: 2011/5/5 raymond.hettinger : > http://hg.python.org/cpython/rev/1a56775c6e54 > changeset: ? 69857:1a56775c6e54 > branch: ? ? ?3.2 > parent: ? ? ?69855:97a4855202b8 > user: ? ? ? ?Raymond Hettinger > date: ? ? ? ?Thu May 05 11:35:50 2011 -0700 > summary: > ?Avoid codec spelling issues by just using the utf-8 default. Out of curiosity, what is the issue? > > files: > ?Lib/random.py | ?2 +- > ?1 files changed, 1 insertions(+), 1 deletions(-) > > > diff --git a/Lib/random.py b/Lib/random.py > --- a/Lib/random.py > +++ b/Lib/random.py > @@ -114,7 +114,7 @@ > ? ? ? ? if version == 2: > ? ? ? ? ? ? if isinstance(a, (str, bytes, bytearray)): > ? ? ? ? ? ? ? ? if isinstance(a, str): > - ? ? ? ? ? ? ? ? ? ?a = a.encode("utf8") > + ? ? ? ? ? ? ? ? ? ?a = a.encode() -- Regards, Benjamin From solipsis at pitrou.net Thu May 5 20:44:04 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 5 May 2011 20:44:04 +0200 Subject: [Python-Dev] cpython (merge 3.2 -> default): Avoid codec spelling issues by just using the utf-8 default. References: Message-ID: <20110505204404.5cfa02f2@pitrou.net> On Thu, 05 May 2011 20:38:27 +0200 raymond.hettinger wrote: > http://hg.python.org/cpython/rev/2bc784057226 > changeset: 69858:2bc784057226 > parent: 69856:b06ad8458b32 > parent: 69857:1a56775c6e54 > user: Raymond Hettinger > date: Thu May 05 11:38:06 2011 -0700 > summary: > Avoid codec spelling issues by just using the utf-8 default. > > files: > Lib/random.py | 2 +- > 1 files changed, 1 insertions(+), 1 deletions(-) > > > diff --git a/Lib/random.py b/Lib/random.py > --- a/Lib/random.py > +++ b/Lib/random.py > @@ -114,7 +114,7 @@ > if version == 2: > if isinstance(a, (str, bytes, bytearray)): > if isinstance(a, str): > - a = a.encode("utf-8") > + a = a.encode() Isn't explicit better than implicit? By reading the new code it is not obvious that any thought was given to the choice of a codec, while stating "utf-8" explicitly hints that a decision was made. (also, I don't understand the spelling issue: "utf-8" just works) Regards Antoine. From alexander.belopolsky at gmail.com Thu May 5 21:01:29 2011 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Thu, 5 May 2011 15:01:29 -0400 Subject: [Python-Dev] cpython (merge 3.2 -> default): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: <20110505204404.5cfa02f2@pitrou.net> References: <20110505204404.5cfa02f2@pitrou.net> Message-ID: On Thu, May 5, 2011 at 2:44 PM, Antoine Pitrou wrote: .. > (also, I don't understand the spelling issue: "utf-8" just works) This is probably referring to the fact that while encode() accepts many spelling variants, some are short-circuited in C code while others require codec lookup implemented in python. From solipsis at pitrou.net Thu May 5 21:07:07 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Thu, 05 May 2011 21:07:07 +0200 Subject: [Python-Dev] cpython (merge 3.2 -> default): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: References: <20110505204404.5cfa02f2@pitrou.net> Message-ID: <1304622427.3564.12.camel@localhost.localdomain> Le jeudi 05 mai 2011 ? 15:01 -0400, Alexander Belopolsky a ?crit : > On Thu, May 5, 2011 at 2:44 PM, Antoine Pitrou wrote: > .. > > (also, I don't understand the spelling issue: "utf-8" just works) > > This is probably referring to the fact that while encode() accepts > many spelling variants, some are short-circuited in C code while > others require codec lookup implemented in python. This sounds like a bug to fix (isn't it fixed it already, btw?) rather than add hackish workarounds for in stdlib code. Regards Antoine. From benjamin at python.org Thu May 5 21:13:34 2011 From: benjamin at python.org (Benjamin Peterson) Date: Thu, 5 May 2011 14:13:34 -0500 Subject: [Python-Dev] cpython (merge 3.2 -> default): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: References: <20110505204404.5cfa02f2@pitrou.net> Message-ID: 2011/5/5 Alexander Belopolsky : > On Thu, May 5, 2011 at 2:44 PM, Antoine Pitrou wrote: > .. >> (also, I don't understand the spelling issue: "utf-8" just works) > > This is probably referring to the fact that while encode() accepts > many spelling variants, some are short-circuited in C code while > others require codec lookup implemented in python. Isn't it cached after the first run? If this is the reasoning, I find it hard to believe that seed() is a large bottleneck in random. -- Regards, Benjamin From g.brandl at gmx.net Thu May 5 22:45:13 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Thu, 05 May 2011 22:45:13 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz>

Message-ID: On 05.05.2011 19:17, Amaury Forgeot d'Arc wrote: > 2011/5/5 Guido van Rossum : >> Seems you're in agreement with this. IMO when references are borrowed >> it is not very interesting. The interesting thing is when calling a >> function *steals* a reference. The other important thing to know is >> whether the caller ends up owning the return value (if it is an >> object) or not. I *think* you can tell the latter from the +1 for the >> return value; but the former (whether it steals a reference) is >> unclear from the data given. There's even an XXX comment about this in >> the file: >> >> # XXX NOTE: the 0/+1/-1 refcount information for arguments is >> # confusing! Much more useful would be to indicate whether the >> # function "steals" a reference to the argument or not. Take for >> # example PyList_SetItem(list, i, item). This lists as a 0 change for >> # both the list and the item arguments. However, in fact it steals a >> # reference to the item argument! > > Should we change this file then? > And only list functions that don't follow the usual conventions. > > But I'm sure that there are external tools which already use refcounts.dat > in its present format. I doubt it. And even if there are, the information in there is in parts highly outdated (because the docs don't use parameter info), and large numbers of functions are missing. Let's remove the cruft, and only keep interesting info. This will also make the file much more manageable. Georg From raymond.hettinger at gmail.com Thu May 5 22:55:07 2011 From: raymond.hettinger at gmail.com (Raymond Hettinger) Date: Thu, 5 May 2011 13:55:07 -0700 Subject: [Python-Dev] [Python-checkins] cpython (3.2): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: References: Message-ID: <926F0913-8142-430A-8400-6E6F0CD5B8F1@gmail.com> On May 5, 2011, at 11:41 AM, Benjamin Peterson wrote: > 2011/5/5 raymond.hettinger : >> http://hg.python.org/cpython/rev/1a56775c6e54 >> changeset: 69857:1a56775c6e54 >> branch: 3.2 >> parent: 69855:97a4855202b8 >> user: Raymond Hettinger >> date: Thu May 05 11:35:50 2011 -0700 >> summary: >> Avoid codec spelling issues by just using the utf-8 default. > > Out of curiosity, what is the issue? IIRC, the performance depended on how your spelled-it. I believe that is why the spelling got changed in Py3.3. Either way, the code is simpler by just using the default. Raymond From mal at egenix.com Fri May 6 00:32:59 2011 From: mal at egenix.com (M.-A. Lemburg) Date: Fri, 06 May 2011 00:32:59 +0200 Subject: [Python-Dev] [Python-checkins] cpython (3.2): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: <926F0913-8142-430A-8400-6E6F0CD5B8F1@gmail.com> References: <926F0913-8142-430A-8400-6E6F0CD5B8F1@gmail.com> Message-ID: <4DC3259B.5020804@egenix.com> Raymond Hettinger wrote: > > On May 5, 2011, at 11:41 AM, Benjamin Peterson wrote: > >> 2011/5/5 raymond.hettinger : >>> http://hg.python.org/cpython/rev/1a56775c6e54 >>> changeset: 69857:1a56775c6e54 >>> branch: 3.2 >>> parent: 69855:97a4855202b8 >>> user: Raymond Hettinger >>> date: Thu May 05 11:35:50 2011 -0700 >>> summary: >>> Avoid codec spelling issues by just using the utf-8 default. >> >> Out of curiosity, what is the issue? > > IIRC, the performance depended on how your spelled-it. > I believe that is why the spelling got changed in Py3.3. Not really. It got changed because we have canonical names for the codecs which the stdlib should use rather than rely on aliases. Performance-wise it only makes a difference if you use it in tight loops. > Either way, the code is simpler by just using the default. ... as long as the casual reader knows what the default it :-) I think it's better to make the choice explicit, if the code relies on a particular non-ASCII encoding. If it doesn't, than the default is fine. -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, May 06 2011) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2011-06-20: EuroPython 2011, Florence, Italy 45 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From tjreedy at udel.edu Fri May 6 00:52:34 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 05 May 2011 18:52:34 -0400 Subject: [Python-Dev] cpython (3.2): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: <926F0913-8142-430A-8400-6E6F0CD5B8F1@gmail.com> References: <926F0913-8142-430A-8400-6E6F0CD5B8F1@gmail.com> Message-ID: On 5/5/2011 4:55 PM, Raymond Hettinger wrote: > Either way, the code is simpler by just using the default. I thought about this and decided that the purpose of having defaults is so one does not have to always spell it out. So use it. Readers can always look it up and learn. -- Terry Jan Reedy From alexander.belopolsky at gmail.com Fri May 6 00:54:11 2011 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Thu, 5 May 2011 18:54:11 -0400 Subject: [Python-Dev] [Python-checkins] cpython (3.2): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: <4DC3259B.5020804@egenix.com> References: <926F0913-8142-430A-8400-6E6F0CD5B8F1@gmail.com> <4DC3259B.5020804@egenix.com> Message-ID: On Thu, May 5, 2011 at 6:32 PM, M.-A. Lemburg wrote: .. >> Either way, the code is simpler by just using the default. > > ... as long as the casual reader knows what the default it :-) > .. or cares. I this particular case, it hardly matters how random bits are encoded. From victor.stinner at haypocalc.com Fri May 6 01:14:14 2011 From: victor.stinner at haypocalc.com (Victor Stinner) Date: Fri, 06 May 2011 01:14:14 +0200 Subject: [Python-Dev] [Python-checkins] cpython (3.2): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: References: <926F0913-8142-430A-8400-6E6F0CD5B8F1@gmail.com> <4DC3259B.5020804@egenix.com> Message-ID: <1304637254.12569.4.camel@marge> Le jeudi 05 mai 2011 ? 18:54 -0400, Alexander Belopolsky a ?crit : > On Thu, May 5, 2011 at 6:32 PM, M.-A. Lemburg wrote: > .. > >> Either way, the code is simpler by just using the default. > > > > ... as long as the casual reader knows what the default it :-) > > > > .. or cares. I this particular case, it hardly matters how random > bits are encoded. You don't get the same random number sequence if you use a different encoding. >>> r=random.Random() >>> r.seed('\xe9'.encode('iso-8859-1')); r.randint(0, 1000) 639 >>> r.seed('\xe9'.encode('utf-8')); r.randint(0, 1000) 992 So it is useful to know how the seed was computed. The real question is which encoding gives the most random numbers? :-) Victor From greg.ewing at canterbury.ac.nz Fri May 6 03:28:11 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Fri, 06 May 2011 13:28:11 +1200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz> Message-ID: <4DC34EAB.9050001@canterbury.ac.nz> Amaury Forgeot d'Arc wrote [concerning the Doc/data/refcounts.dat file]: > This is not always true, for example when the item is already present > in the dict. > It's not important to know what the function does to the object, > Only the action on the reference is relevant. Yes, that's the whole point. When using a functon, what you need to know is whether it borrows or steals a reference. But this file *doesn't tell* you that -- rather it assigns either 0 or +1 to a borrowed reference, apparently based on some notion of what the function "usually" does with that parameter. There does not seem to be enough information in that file to work out the borrowed/stolen statuses, which makes it seem rather useless. -- Greg From skip at pobox.com Fri May 6 03:52:08 2011 From: skip at pobox.com (skip at pobox.com) Date: Thu, 5 May 2011 20:52:08 -0500 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz>

Message-ID: <19907.21576.751581.958722@montanaro.dyndns.org> Georg> Let's remove the cruft, and only keep interesting info. This Georg> will also make the file much more manageable. If I was to do this from scratch I'd think hard about annotating the source code. No matter how hard you try, if you keep this information separate from the code and maintain it manually, it's going to get out-of-date. Skip From marks at dcs.gla.ac.uk Fri May 6 09:44:11 2011 From: marks at dcs.gla.ac.uk (Mark Shannon) Date: Fri, 06 May 2011 08:44:11 +0100 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: <19907.21576.751581.958722@montanaro.dyndns.org> References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz>

<19907.21576.751581.958722@montanaro.dyndns.org> Message-ID: <4DC3A6CB.5020809@dcs.gla.ac.uk> skip at pobox.com wrote: > Georg> Let's remove the cruft, and only keep interesting info. This > Georg> will also make the file much more manageable. > > If I was to do this from scratch I'd think hard about annotating the source > code. No matter how hard you try, if you keep this information separate > from the code and maintain it manually, it's going to get out-of-date. > What about #defining PY_STOLEN in some header? Then any stolen parameter can be prefixed with PY_STOLEN in signature. For return values, similarly #define PY_BORROWED. Cheers, Mark. From amauryfa at gmail.com Fri May 6 10:18:32 2011 From: amauryfa at gmail.com (Amaury Forgeot d'Arc) Date: Fri, 6 May 2011 10:18:32 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: <4DC3A6CB.5020809@dcs.gla.ac.uk> References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz>

<19907.21576.751581.958722@montanaro.dyndns.org> <4DC3A6CB.5020809@dcs.gla.ac.uk> Message-ID: Le vendredi 6 mai 2011, Mark Shannon a ?crit?: > What about #defining PY_STOLEN in some header? > > Then any stolen parameter can be prefixed with PY_STOLEN in signature. > > For return values, similarly #define PY_BORROWED. Header files are harder to parse, and I don't see how it would apply to macros. What about additional tags in the .rst files? -- Amaury -- Amaury Forgeot d'Arc From solipsis at pitrou.net Fri May 6 12:27:03 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 6 May 2011 12:27:03 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz> <4DC34EAB.9050001@canterbury.ac.nz> Message-ID: <20110506122703.17c4d889@pitrou.net> On Fri, 06 May 2011 13:28:11 +1200 Greg Ewing wrote: > Amaury Forgeot d'Arc wrote [concerning the Doc/data/refcounts.dat file]: > > > This is not always true, for example when the item is already present > > in the dict. > > It's not important to know what the function does to the object, > > Only the action on the reference is relevant. > > Yes, that's the whole point. When using a functon, > what you need to know is whether it borrows or steals > a reference. Doesn't "borrow" mean the same as "steal" in that context? If an API borrows a reference, I expect it to take it from me. Regards Antoine. From marks at dcs.gla.ac.uk Fri May 6 12:45:38 2011 From: marks at dcs.gla.ac.uk (Mark Shannon) Date: Fri, 06 May 2011 11:45:38 +0100 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: <20110506122703.17c4d889@pitrou.net> References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz> <4DC34EAB.9050001@canterbury.ac.nz> <20110506122703.17c4d889@pitrou.net> Message-ID: <4DC3D152.601@dcs.gla.ac.uk> Antoine Pitrou wrote: > On Fri, 06 May 2011 13:28:11 +1200 > Greg Ewing wrote: > >> Amaury Forgeot d'Arc wrote [concerning the Doc/data/refcounts.dat file]: >> >>> This is not always true, for example when the item is already present >>> in the dict. >>> It's not important to know what the function does to the object, >>> Only the action on the reference is relevant. >> Yes, that's the whole point. When using a functon, >> what you need to know is whether it borrows or steals >> a reference. > > Doesn't "borrow" mean the same as "steal" in that context? > If an API borrows a reference, I expect it to take it from me. "Stealing" takes the ownership. Borrowing does not. This explains it better: http://docs.python.org/py3k/c-api/intro.html#reference-count-details Cheers, Mark. From jimjjewett at gmail.com Fri May 6 15:49:19 2011 From: jimjjewett at gmail.com (Jim Jewett) Date: Fri, 6 May 2011 09:49:19 -0400 Subject: [Python-Dev] [Python-checkins] cpython: Userlist.copy() wasn't returning a UserList. In-Reply-To: References: Message-ID: Do you also want to assert that u is not v, or would that sort of "copy" be acceptable by some subclasses? On 5/5/11, raymond.hettinger wrote: > http://hg.python.org/cpython/rev/f20373fcdde5 > changeset: 69865:f20373fcdde5 > user: Raymond Hettinger > date: Thu May 05 14:34:35 2011 -0700 > summary: > Userlist.copy() wasn't returning a UserList. > > files: > Lib/collections/__init__.py | 2 +- > Lib/test/test_userlist.py | 6 ++++++ > 2 files changed, 7 insertions(+), 1 deletions(-) > > > diff --git a/Lib/collections/__init__.py b/Lib/collections/__init__.py > --- a/Lib/collections/__init__.py > +++ b/Lib/collections/__init__.py > @@ -887,7 +887,7 @@ > def pop(self, i=-1): return self.data.pop(i) > def remove(self, item): self.data.remove(item) > def clear(self): self.data.clear() > - def copy(self): return self.data.copy() > + def copy(self): return self.__class__(self) > def count(self, item): return self.data.count(item) > def index(self, item, *args): return self.data.index(item, *args) > def reverse(self): self.data.reverse() > diff --git a/Lib/test/test_userlist.py b/Lib/test/test_userlist.py > --- a/Lib/test/test_userlist.py > +++ b/Lib/test/test_userlist.py > @@ -52,6 +52,12 @@ > return str(key) + '!!!' > self.assertEqual(next(iter(T((1,2)))), "0!!!") > > + def test_userlist_copy(self): > + u = self.type2test([6, 8, 1, 9, 1]) > + v = u.copy() > + self.assertEqual(u, v) > + self.assertEqual(type(u), type(v)) > + > def test_main(): > support.run_unittest(UserListTest) > > > -- > Repository URL: http://hg.python.org/cpython > From ndbecker2 at gmail.com Fri May 6 16:04:09 2011 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 06 May 2011 10:04:09 -0400 Subject: [Python-Dev] Linus on garbage collection Message-ID: http://gcc.gnu.org/ml/gcc/2002-08/msg00552.html From solipsis at pitrou.net Fri May 6 16:12:33 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 6 May 2011 16:12:33 +0200 Subject: [Python-Dev] Linus on garbage collection References: Message-ID: <20110506161233.1ed647ec@pitrou.net> On Fri, 06 May 2011 10:04:09 -0400 Neal Becker wrote: > http://gcc.gnu.org/ml/gcc/2002-08/msg00552.html Since we're sharing links, here's Matt Mackall's take: http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html cheers Antoine. From marks at dcs.gla.ac.uk Fri May 6 16:46:08 2011 From: marks at dcs.gla.ac.uk (Mark Shannon) Date: Fri, 06 May 2011 15:46:08 +0100 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: References: Message-ID: <4DC409B0.60909@dcs.gla.ac.uk> Neal Becker wrote: > http://gcc.gnu.org/ml/gcc/2002-08/msg00552.html > Being famous does not necessarily make you right. OS kernels are pretty atypical software, even if Linus is right about Linux, it doesn't apply to Python. I have empirical evidence, not opinion, that PyPy and my own HotPy are a *lot* faster (x5 or better) on Unladen Swallow's gcbench benchmark (which stresses the memory management subsystem). (Note that gcbench does not introduce any cycles, so its being easy on CPython) In fact, for gcbench CPython spends over twice as long in the cycle-collector as HotPy takes in total! I don't have such detailed results for PyPy. For other benchmarks, the HotPy GC times are often smaller than the inter-run variations in runtime, for example: HotPy GC stats for pystones (on a slow machine with a small cache): Total memory allocated: 20 Mbytes. 20 minor collections, 0 major collections Max heap size 2.4 Mbytes. Total time spent in GC: 3.5 milliseconds. ( <1% of execution time) My GC is quick, but its not the fastest. Evidence trumps opinion IMHO ;) Cheers, Mark. > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/marks%40dcs.gla.ac.uk From solipsis at pitrou.net Fri May 6 17:33:51 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 6 May 2011 17:33:51 +0200 Subject: [Python-Dev] Linus on garbage collection References: <4DC409B0.60909@dcs.gla.ac.uk> Message-ID: <20110506173351.4aef8145@pitrou.net> On Fri, 06 May 2011 15:46:08 +0100 Mark Shannon wrote: > > Neal Becker wrote: > > http://gcc.gnu.org/ml/gcc/2002-08/msg00552.html > > > Being famous does not necessarily make you right. > > OS kernels are pretty atypical software, > even if Linus is right about Linux, it doesn't apply to Python. > > I have empirical evidence, not opinion, that PyPy and my own HotPy > are a *lot* faster (x5 or better) on Unladen Swallow's gcbench benchmark > (which stresses the memory management subsystem). > > (Note that gcbench does not introduce any cycles, so its being easy on > CPython) > > In fact, for gcbench CPython spends over twice as long in the > cycle-collector as HotPy takes in total! The thing is, it would be easy to change our collection heuristics so that the cycle collector gets called less often (actually, you can already do so using gc.set_threshold, IIRC). Something which is much more delicate for a "full" GC, where it would grow memory consumption a lot. Regards Antoine. From status at bugs.python.org Fri May 6 18:07:23 2011 From: status at bugs.python.org (Python tracker) Date: Fri, 6 May 2011 18:07:23 +0200 (CEST) Subject: [Python-Dev] Summary of Python tracker Issues Message-ID: <20110506160723.04A101CFD5@psf.upfronthosting.co.za> ACTIVITY SUMMARY (2011-04-29 - 2011-05-06) Python tracker at http://bugs.python.org/ To view or respond to any of the issues listed below, click on the issue. Do NOT respond to this message. Issues counts and deltas: open 2783 (+23) closed 21017 (+41) total 23800 (+64) Open issues with patches: 1201 Issues opened (47) ================== #11955: 3.3 : test_argparse.py fails 'make test' http://bugs.python.org/issue11955 opened by Jason.Vas.Dias #11956: 3.3 : test_import.py causes 'make test' to fail http://bugs.python.org/issue11956 opened by Jason.Vas.Dias #11957: re.sub confusion between count and flags args http://bugs.python.org/issue11957 opened by mindauga #11959: smtpd cannot be used without affecting global state http://bugs.python.org/issue11959 opened by vinay.sajip #11962: Buildbot reliability http://bugs.python.org/issue11962 opened by skrah #11963: Use real assert* for test_trigger_memory_error (test_parser) http://bugs.python.org/issue11963 opened by eric.araujo #11964: Undocumented change to indent param of json.dump in 3.2 http://bugs.python.org/issue11964 opened by eric.araujo #11965: Simplify context manager in os.popen http://bugs.python.org/issue11965 opened by eric.araujo #11968: wsgiref's wsgi application sample code does not work http://bugs.python.org/issue11968 opened by shimizukawa #11969: Can't launch Process on built-in static method http://bugs.python.org/issue11969 opened by cool-RR #11972: input does not strip a trailing newline correctly on Windows http://bugs.python.org/issue11972 opened by Michal.Molhanec #11973: kevent does not accept KQ_NOTE_EXIT (and other (f)flags) http://bugs.python.org/issue11973 opened by DragonSA #11974: Class definition gotcha.. should this be documented somewhere? http://bugs.python.org/issue11974 opened by sleepycal #11975: Fix referencing of built-in types (list, int, ...) http://bugs.python.org/issue11975 opened by jonash #11978: Report correct coverage.py data for tests that invoke subproce http://bugs.python.org/issue11978 opened by ncoghlan #11979: Minor improvements to the Sockets readme: typos, wording and s http://bugs.python.org/issue11979 opened by xmorel #11980: zipfile.ZipFile.write should accept fp as argument http://bugs.python.org/issue11980 opened by proppy #11981: dupe self.fp.tell() in zipfile.ZipFile.writestr http://bugs.python.org/issue11981 opened by proppy #11983: Inconsistent hash and comparison for code objects http://bugs.python.org/issue11983 opened by eltoder #11984: Wrong "See also" in symbol and token module docs http://bugs.python.org/issue11984 opened by davipo #11989: deprecate shutil.copy2 http://bugs.python.org/issue11989 opened by datamuc #11990: redirected output - stdout writes newline as \n in windows http://bugs.python.org/issue11990 opened by Jimbofbx #11992: sys.settrace doesn't disable tracing if a local trace function http://bugs.python.org/issue11992 opened by nedbat #11993: Use sub-second resolution to determine if a file is newer http://bugs.python.org/issue11993 opened by jsjgruber #11994: [2.7/gcc-4.4.3] Segfault under valgrind in string.split() http://bugs.python.org/issue11994 opened by skrah #11995: test_pydoc loads all Python modules http://bugs.python.org/issue11995 opened by haypo #11996: libpython.py: nicer py-bt output http://bugs.python.org/issue11996 opened by haypo #11998: test_signal cannot test blocked signals if _tkinter is loaded; http://bugs.python.org/issue11998 opened by haypo #11999: sporadic failure in test_mailbox on FreeBSD http://bugs.python.org/issue11999 opened by haypo #12001: Extend json.dumps to handle N-triples strings http://bugs.python.org/issue12001 opened by Glenn.Ammons #12002: ftplib.FTP.abort fails with TypeError on Python 3.x http://bugs.python.org/issue12002 opened by nneonneo #12003: documentation: alternate version of xrange seems to fail. http://bugs.python.org/issue12003 opened by tenuki #12004: PyZipFile.writepy gives internal error on syntax errors http://bugs.python.org/issue12004 opened by Ben.Morgan #12005: modulo result of Decimal differs from float/int http://bugs.python.org/issue12005 opened by Kotan #12006: strptime should implement %V or %u directive from libc http://bugs.python.org/issue12006 opened by Erik.Cederstrand #12007: Console commands won't work http://bugs.python.org/issue12007 opened by jake_mcaga #12008: HtmlParser non-strict goes wrong with unquoted attributes http://bugs.python.org/issue12008 opened by svilend #12009: netrc module crashes if netrc file has comment lines http://bugs.python.org/issue12009 opened by rmstoi #12010: Compile fails when sizeof(wchar_t) == 1 http://bugs.python.org/issue12010 opened by dcoles #12011: The signal module should raise OSError for OS-related exceptio http://bugs.python.org/issue12011 opened by pitrou #12012: _ssl module doesn't compile with OpenSSL 1.0.0d: SSLv2_method http://bugs.python.org/issue12012 opened by haypo #12013: file /usr/local/lib/python3.1/lib-dynload/_socket.so: symbol i http://bugs.python.org/issue12013 opened by alex_lai #12014: str.format parses replacement field incorrectly http://bugs.python.org/issue12014 opened by Ben.Wolfson #12015: possible characters in temporary file name is too few http://bugs.python.org/issue12015 opened by planet36 #12016: Wrong behavior for '\xff\n'.decode('gb2312', 'ignore') http://bugs.python.org/issue12016 opened by cdqzzy #12017: Decoding a highly-nested object with json (_speedups enabled) http://bugs.python.org/issue12017 opened by ivank #12018: No tests for ntpath.samefile, ntpath.sameopenfile http://bugs.python.org/issue12018 opened by ronaldoussoren Most recent 15 issues with no replies (15) ========================================== #12018: No tests for ntpath.samefile, ntpath.sameopenfile http://bugs.python.org/issue12018 #12016: Wrong behavior for '\xff\n'.decode('gb2312', 'ignore') http://bugs.python.org/issue12016 #12013: file /usr/local/lib/python3.1/lib-dynload/_socket.so: symbol i http://bugs.python.org/issue12013 #12009: netrc module crashes if netrc file has comment lines http://bugs.python.org/issue12009 #12003: documentation: alternate version of xrange seems to fail. http://bugs.python.org/issue12003 #12002: ftplib.FTP.abort fails with TypeError on Python 3.x http://bugs.python.org/issue12002 #12001: Extend json.dumps to handle N-triples strings http://bugs.python.org/issue12001 #11992: sys.settrace doesn't disable tracing if a local trace function http://bugs.python.org/issue11992 #11989: deprecate shutil.copy2 http://bugs.python.org/issue11989 #11984: Wrong "See also" in symbol and token module docs http://bugs.python.org/issue11984 #11983: Inconsistent hash and comparison for code objects http://bugs.python.org/issue11983 #11979: Minor improvements to the Sockets readme: typos, wording and s http://bugs.python.org/issue11979 #11973: kevent does not accept KQ_NOTE_EXIT (and other (f)flags) http://bugs.python.org/issue11973 #11969: Can't launch Process on built-in static method http://bugs.python.org/issue11969 #11968: wsgiref's wsgi application sample code does not work http://bugs.python.org/issue11968 Most recent 15 issues waiting for review (15) ============================================= #12015: possible characters in temporary file name is too few http://bugs.python.org/issue12015 #12012: _ssl module doesn't compile with OpenSSL 1.0.0d: SSLv2_method http://bugs.python.org/issue12012 #12008: HtmlParser non-strict goes wrong with unquoted attributes http://bugs.python.org/issue12008 #12004: PyZipFile.writepy gives internal error on syntax errors http://bugs.python.org/issue12004 #11999: sporadic failure in test_mailbox on FreeBSD http://bugs.python.org/issue11999 #11998: test_signal cannot test blocked signals if _tkinter is loaded; http://bugs.python.org/issue11998 #11996: libpython.py: nicer py-bt output http://bugs.python.org/issue11996 #11989: deprecate shutil.copy2 http://bugs.python.org/issue11989 #11981: dupe self.fp.tell() in zipfile.ZipFile.writestr http://bugs.python.org/issue11981 #11980: zipfile.ZipFile.write should accept fp as argument http://bugs.python.org/issue11980 #11973: kevent does not accept KQ_NOTE_EXIT (and other (f)flags) http://bugs.python.org/issue11973 #11963: Use real assert* for test_trigger_memory_error (test_parser) http://bugs.python.org/issue11963 #11956: 3.3 : test_import.py causes 'make test' to fail http://bugs.python.org/issue11956 #11949: Make float('nan') unorderable http://bugs.python.org/issue11949 #11948: Tutorial/Modules - small fix to better clarify the modules sea http://bugs.python.org/issue11948 Top 10 most discussed issues (10) ================================= #11277: Crash with mmap and sparse files on Mac OS X http://bugs.python.org/issue11277 19 msgs #8407: expose signalfd(2) and pthread_sigmask in the signal module http://bugs.python.org/issue8407 18 msgs #11935: MMDF/MBOX mailbox need utime http://bugs.python.org/issue11935 17 msgs #11999: sporadic failure in test_mailbox on FreeBSD http://bugs.python.org/issue11999 11 msgs #6721: Locks in python standard library should be sanitized on fork http://bugs.python.org/issue6721 10 msgs #9971: Optimize BufferedReader.readinto http://bugs.python.org/issue9971 9 msgs #3526: Customized malloc implementation on SunOS and AIX http://bugs.python.org/issue3526 8 msgs #11962: Buildbot reliability http://bugs.python.org/issue11962 8 msgs #11949: Make float('nan') unorderable http://bugs.python.org/issue11949 7 msgs #11954: 3.3 - 'make test' fails http://bugs.python.org/issue11954 7 msgs Issues closed (37) ================== #1856: shutdown (exit) can hang or segfault with daemon threads runni http://bugs.python.org/issue1856 closed by pitrou #7517: freeze.py not ported to python3 http://bugs.python.org/issue7517 closed by eric.araujo #8158: Docstring of optparse.OptionParser incomplete http://bugs.python.org/issue8158 closed by r.david.murray #9756: Crash with custom __getattribute__ http://bugs.python.org/issue9756 closed by haypo #10684: Folders get deleted when trying to change case with shutil.mov http://bugs.python.org/issue10684 closed by ronaldoussoren #10775: assertRaises as a context manager should accept a 'msg' keywor http://bugs.python.org/issue10775 closed by ezio.melotti #10922: Unexpected exception when calling function_proxy.__class__.__c http://bugs.python.org/issue10922 closed by haypo #11034: Build problem on Windows with MSVC++ Express 2008 http://bugs.python.org/issue11034 closed by loewis #11206: test_readline unconditionally calls clear_history() http://bugs.python.org/issue11206 closed by ned.deily #11247: Error sending packets to multicast IPV4 address http://bugs.python.org/issue11247 closed by neologix #11335: Memory leak after key function failure in sort http://bugs.python.org/issue11335 closed by stutzbach #11834: wrong module installation dir on Windows http://bugs.python.org/issue11834 closed by brian.curtin #11849: glibc allocator doesn't release all free()ed memory http://bugs.python.org/issue11849 closed by pitrou #11873: test_regexp() of test_compileall fails occassionally http://bugs.python.org/issue11873 closed by r.david.murray #11883: Call connect() before sending an email with smtplib http://bugs.python.org/issue11883 closed by r.david.murray #11887: unittest fails on comparing str with bytes if python has the - http://bugs.python.org/issue11887 closed by michael.foord #11898: Sending binary data with a POST request in httplib can cause U http://bugs.python.org/issue11898 closed by orsenthil #11912: PaX triggers a segfault in dlopen http://bugs.python.org/issue11912 closed by neologix #11930: Remove time.accept2dyear http://bugs.python.org/issue11930 closed by belopolsky #11950: logger use dict for loggers instead of WeakValueDictionary http://bugs.python.org/issue11950 closed by vinay.sajip #11958: test.test_ftplib.TestIPv6Environment failure http://bugs.python.org/issue11958 closed by python-dev #11960: Python crashes when running numpy test http://bugs.python.org/issue11960 closed by amaury.forgeotdarc #11961: Document STARTUPINFO and creationflags options for Windows http://bugs.python.org/issue11961 closed by brian.curtin #11966: Typo in PyModule_AddIntMacro's documentation http://bugs.python.org/issue11966 closed by python-dev #11967: Left shift and Right shift for floats http://bugs.python.org/issue11967 closed by loewis #11970: distutils command 'upload' crashes when --show-response is sel http://bugs.python.org/issue11970 closed by offby1 #11971: Wrong parameter -O0 instead of -OO in manpage http://bugs.python.org/issue11971 closed by r.david.murray #11976: Provide proper documentation for list data type http://bugs.python.org/issue11976 closed by georg.brandl #11977: Document int.conjugate, .denominator, ... http://bugs.python.org/issue11977 closed by python-dev #11982: json.loads() returns str instead of unicode for empty strings http://bugs.python.org/issue11982 closed by ezio.melotti #11985: Document that platform.python_implementation supports PyPy http://bugs.python.org/issue11985 closed by ezio.melotti #11986: Min/max not symmetric in presence of NaN http://bugs.python.org/issue11986 closed by rhettinger #11987: queue.Queue.put should acquire mutex for unfinished_tasks http://bugs.python.org/issue11987 closed by rhettinger #11988: special method lookup docs don't address some important detail http://bugs.python.org/issue11988 closed by r.david.murray #11991: test_distutils fails because of bad filename match http://bugs.python.org/issue11991 closed by eric.araujo #11997: One typo in Doc/c-api/init.rst http://bugs.python.org/issue11997 closed by ezio.melotti #12000: SSL certificate verification failed if no dNSName entry in sub http://bugs.python.org/issue12000 closed by pitrou From skip at pobox.com Fri May 6 18:18:51 2011 From: skip at pobox.com (skip at pobox.com) Date: Fri, 6 May 2011 11:18:51 -0500 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <20110506161233.1ed647ec@pitrou.net> References: <20110506161233.1ed647ec@pitrou.net> Message-ID: <19908.8043.8921.50222@montanaro.dyndns.org> Antoine> Since we're sharing links, here's Matt Mackall's take: Antoine> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html >From that note: 1: You can't have meaningful destructors, because when destruction happens is undefined. And going-out-of-scope destructors are extremely useful. Python is already a rather broken in this regard, so feel free to ignore this point. Given the presence of cyclic data I don't see how reference counting or garbage collection win. Ignoring the fact that in a pure reference counted system you won't even consider cycles for reclmation, would both RC and GC have to punt because they can't tell which object's destructor to call first? Skip From fuzzyman at voidspace.org.uk Fri May 6 18:31:44 2011 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Fri, 06 May 2011 17:31:44 +0100 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <19908.8043.8921.50222@montanaro.dyndns.org> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> Message-ID: <4DC42270.1000301@voidspace.org.uk> On 06/05/2011 17:18, skip at pobox.com wrote: > Antoine> Since we're sharing links, here's Matt Mackall's take: > Antoine> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html > > > From that note: > > 1: You can't have meaningful destructors, because when destruction > happens is undefined. And going-out-of-scope destructors are extremely > useful. Python is already a rather broken in this regard, so feel free > to ignore this point. > > Given the presence of cyclic data I don't see how reference counting or > garbage collection win. Ignoring the fact that in a pure reference counted > system you won't even consider cycles for reclmation, would both RC and GC > have to punt because they can't tell which object's destructor to call > first? pypy and .NET choose to arbitrarily break cycles rather than leave objects unfinalised and memory unreclaimed. Not sure what Java does. All the best, Michael Foord > Skip > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. -- the sqlite blessing http://www.sqlite.org/different.html From greg at krypto.org Fri May 6 18:32:51 2011 From: greg at krypto.org (Gregory P. Smith) Date: Fri, 6 May 2011 09:32:51 -0700 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <19908.8043.8921.50222@montanaro.dyndns.org> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> Message-ID: On Fri, May 6, 2011 at 9:18 AM, wrote: > > ? ?Antoine> Since we're sharing links, here's Matt Mackall's take: > ? ?Antoine> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html > > >From that note: > > ? ?1: You can't have meaningful destructors, because when destruction > ? ?happens is undefined. And going-out-of-scope destructors are extremely > ? ?useful. Python is already a rather broken in this regard, so feel free > ? ?to ignore this point. Python being "broken" in this regard is pretty much exactly why __enter__, __exit__ and with as context managers were added to the language. That gives the ability to have the equivalent of well defined nested scopes that destroy something (exit) deterministically much as it is easy to do in C++ with some {}s and a ~destructor(). It is not broken, just different. -gps From marks at dcs.gla.ac.uk Fri May 6 18:33:03 2011 From: marks at dcs.gla.ac.uk (Mark Shannon) Date: Fri, 06 May 2011 17:33:03 +0100 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <19908.8043.8921.50222@montanaro.dyndns.org> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> Message-ID: <4DC422BF.4010006@dcs.gla.ac.uk> skip at pobox.com wrote: > Antoine> Since we're sharing links, here's Matt Mackall's take: > Antoine> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html > >>From that note: > > 1: You can't have meaningful destructors, because when destruction > happens is undefined. And going-out-of-scope destructors are extremely > useful. Python is already a rather broken in this regard, so feel free > to ignore this point. > > Given the presence of cyclic data I don't see how reference counting or > garbage collection win. Ignoring the fact that in a pure reference counted > system you won't even consider cycles for reclmation, would both RC and GC > have to punt because they can't tell which object's destructor to call > first? It doesn't matter which is called first. In fact, the VM could call all the destructors at the same time if the machine has enough cores and there's no GIL. All objects are kept alive by the GC until after the destructors are called. Those that are still dead will have their memory reclaimed. > > Skip > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/marks%40dcs.gla.ac.uk From stefan_ml at behnel.de Fri May 6 18:51:37 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 06 May 2011 18:51:37 +0200 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <4DC422BF.4010006@dcs.gla.ac.uk> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC422BF.4010006@dcs.gla.ac.uk> Message-ID: Mark Shannon, 06.05.2011 18:33: > skip at pobox.com wrote: >> Antoine> Since we're sharing links, here's Matt Mackall's take: >> Antoine> >> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html >> >>> From that note: >> >> 1: You can't have meaningful destructors, because when destruction >> happens is undefined. And going-out-of-scope destructors are extremely >> useful. Python is already a rather broken in this regard, so feel free >> to ignore this point. >> >> Given the presence of cyclic data I don't see how reference counting or >> garbage collection win. Ignoring the fact that in a pure reference counted >> system you won't even consider cycles for reclmation, would both RC and GC >> have to punt because they can't tell which object's destructor to call >> first? > > It doesn't matter which is called first. May I quote you on that one the next time my software crashes? It may not make a difference for the runtime, but the difference for user software may be "dead" or "alive". Stefan From fuzzyman at voidspace.org.uk Fri May 6 19:04:53 2011 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Fri, 06 May 2011 18:04:53 +0100 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> Message-ID: <4DC42A35.6060303@voidspace.org.uk> On 06/05/2011 17:32, Gregory P. Smith wrote: > On Fri, May 6, 2011 at 9:18 AM, wrote: >> Antoine> Since we're sharing links, here's Matt Mackall's take: >> Antoine> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html >> >> > From that note: >> >> 1: You can't have meaningful destructors, because when destruction >> happens is undefined. And going-out-of-scope destructors are extremely >> useful. Python is already a rather broken in this regard, so feel free >> to ignore this point. > Python being "broken" in this regard is pretty much exactly why > __enter__, __exit__ and with as context managers were added to the > language. > How does that help with cycles? Sure it makes cleaning up some resources easier, but not at all this case. Explicit destruction is of course always an alternative to the runtime doing it for you, but it doesn't help with (for example) reclaiming memory. For long running processes memory leaks due to unreclaimable cycles can be a problem with CPython. > That gives the ability to have the equivalent of well defined nested > scopes that destroy something (exit) deterministically much as it is > easy to do in C++ with some {}s and a ~destructor(). > > It is not broken, just different. +1 QOTW ;-) Michael > -gps > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. -- the sqlite blessing http://www.sqlite.org/different.html From fuzzyman at voidspace.org.uk Fri May 6 19:06:35 2011 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Fri, 06 May 2011 18:06:35 +0100 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC422BF.4010006@dcs.gla.ac.uk> Message-ID: <4DC42A9B.6020000@voidspace.org.uk> On 06/05/2011 17:51, Stefan Behnel wrote: > Mark Shannon, 06.05.2011 18:33: >> skip at pobox.com wrote: >>> Antoine> Since we're sharing links, here's Matt Mackall's take: >>> Antoine> >>> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html >>> >>>> From that note: >>> >>> 1: You can't have meaningful destructors, because when destruction >>> happens is undefined. And going-out-of-scope destructors are extremely >>> useful. Python is already a rather broken in this regard, so feel free >>> to ignore this point. >>> >>> Given the presence of cyclic data I don't see how reference counting or >>> garbage collection win. Ignoring the fact that in a pure reference >>> counted >>> system you won't even consider cycles for reclmation, would both RC >>> and GC >>> have to punt because they can't tell which object's destructor to call >>> first? >> >> It doesn't matter which is called first. > > May I quote you on that one the next time my software crashes? > Arbitrarily breaking cycles *could* cause a problem if a destructor attempts to access an already collected object. Not breaking cycles *definitely* leaks memory and definitely doesn't call finalizers. Michael > It may not make a difference for the runtime, but the difference for > user software may be "dead" or "alive". > > Stefan > > _______________________________________________ > Python-Dev mailing list > Python-Dev at python.org > http://mail.python.org/mailman/listinfo/python-dev > Unsubscribe: > http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. -- the sqlite blessing http://www.sqlite.org/different.html From glyph at twistedmatrix.com Fri May 6 19:07:44 2011 From: glyph at twistedmatrix.com (Glyph Lefkowitz) Date: Fri, 6 May 2011 13:07:44 -0400 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <4DC42270.1000301@voidspace.org.uk> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC42270.1000301@voidspace.org.uk> Message-ID: <8F83194F-5A5C-496E-920A-A2488F9949E4@twistedmatrix.com> On May 6, 2011, at 12:31 PM, Michael Foord wrote: > pypy and .NET choose to arbitrarily break cycles rather than leave objects unfinalised and memory unreclaimed. Not sure what Java does. I think that's a mischaracterization of their respective collectors; "arbitrarily break cycles" implies that user code would see broken or incomplete objects, at least during finalization, which I'm fairly sure is not true on either .NET or PyPy. Java definitely has a collector that can handles cycles too. (None of these are reference counting.) -glyph -------------- next part -------------- An HTML attachment was scrubbed... URL: From stephen at xemacs.org Fri May 6 19:15:33 2011 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Sat, 07 May 2011 02:15:33 +0900 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <4DC409B0.60909@dcs.gla.ac.uk> References: <4DC409B0.60909@dcs.gla.ac.uk> Message-ID: <87y62jeone.fsf@uwakimon.sk.tsukuba.ac.jp> Mark Shannon writes: > > > Neal Becker wrote: > > http://gcc.gnu.org/ml/gcc/2002-08/msg00552.html > > > Being famous does not necessarily make you right. No, but being a genius sure helps you beat the odds. > OS kernels are pretty atypical software, > even if Linus is right about Linux, it doesn't apply to Python. Well, actually he was writing about GCC.... > I have empirical evidence, not opinion, that PyPy and my own HotPy > are a *lot* faster (x5 or better) on Unladen Swallow's gcbench benchmark > (which stresses the memory management subsystem). You're missing Linus's point, I think. Linus did *not* claim that it's impossible to write a fast *GC*. He claimed that it's hard to write a fast *program* that uses GC for memory management. A benchmark that stresses *only* the memory management system is unlikely to impress him. From fuzzyman at voidspace.org.uk Fri May 6 19:12:51 2011 From: fuzzyman at voidspace.org.uk (Michael Foord) Date: Fri, 06 May 2011 18:12:51 +0100 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <8F83194F-5A5C-496E-920A-A2488F9949E4@twistedmatrix.com> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC42270.1000301@voidspace.org.uk> <8F83194F-5A5C-496E-920A-A2488F9949E4@twistedmatrix.com> Message-ID: <4DC42C13.8070806@voidspace.org.uk> On 06/05/2011 18:07, Glyph Lefkowitz wrote: > On May 6, 2011, at 12:31 PM, Michael Foord wrote: > >> pypy and .NET choose to arbitrarily break cycles rather than leave >> objects unfinalised and memory unreclaimed. Not sure what Java does. > > I think that's a mischaracterization of their respective collectors; > "arbitrarily break cycles" implies that user code would see broken or > incomplete objects, at least during finalization, which I'm fairly > sure is not true on either .NET or PyPy. http://morepypy.blogspot.com/2008/02/python-finalizers-semantics-part-1.html "Therefore we decided to break such a cycle at an arbitrary place, which doesn't sound too insane." All the best, Michael Foord > > Java definitely has a collector that can handles cycles too. (None of > these are reference counting.) > > -glyph -- http://www.voidspace.org.uk/ May you do good and not evil May you find forgiveness for yourself and forgive others May you share freely, never taking more than you give. -- the sqlite blessing http://www.sqlite.org/different.html -------------- next part -------------- An HTML attachment was scrubbed... URL: From marks at dcs.gla.ac.uk Fri May 6 19:46:37 2011 From: marks at dcs.gla.ac.uk (Mark Shannon) Date: Fri, 06 May 2011 18:46:37 +0100 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <4DC4321F.3070206@voidspace.org.uk> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC422BF.4010006@dcs.gla.ac.uk> <4DC42A9B.6020000@voidspace.org.uk> <4DC42F4B.1050509@dcs.gla.ac.uk> <4DC4321F.3070206@voidspace.org.uk> Message-ID: <4DC433FD.6090803@dcs.gla.ac.uk> Michael Foord wrote: > On 06/05/2011 18:26, Mark Shannon wrote: >> Michael Foord wrote: >>> On 06/05/2011 17:51, Stefan Behnel wrote: >>>> Mark Shannon, 06.05.2011 18:33: >>>>> skip at pobox.com wrote: >>>>>> Antoine> Since we're sharing links, here's Matt Mackall's take: >>>>>> Antoine> >>>>>> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html >>>>>> >>>>>>> From that note: >>>>>> 1: You can't have meaningful destructors, because when destruction >>>>>> happens is undefined. And going-out-of-scope destructors are >>>>>> extremely >>>>>> useful. Python is already a rather broken in this regard, so feel >>>>>> free >>>>>> to ignore this point. >>>>>> >>>>>> Given the presence of cyclic data I don't see how reference >>>>>> counting or >>>>>> garbage collection win. Ignoring the fact that in a pure reference >>>>>> counted >>>>>> system you won't even consider cycles for reclmation, would both >>>>>> RC and GC >>>>>> have to punt because they can't tell which object's destructor to >>>>>> call >>>>>> first? >>>>> It doesn't matter which is called first. >>>> May I quote you on that one the next time my software crashes? >>>> >>> Arbitrarily breaking cycles *could* cause a problem if a destructor >>> attempts to access an already collected object. Not breaking cycles >>> *definitely* leaks memory and definitely doesn't call finalizers. >> You don't need to break the cycles to call the finalizers. Just call >> them, then collect the whole cycle (assuming it is still unreachable). >> >> The GC will *never* reclaim a reachable object. Objects awaiting >> finalization are reachable, by definition. >> > Well it was sloppily worded, so replace it with: > > if a finalizer attempts to access an already finalized object. A finalized object will still be a valid object. Python code cannot make an object unsafe. Obviously C code can make it unsafe, but that's true of C code anywhere. For example, a file object will close itself during finalization, but its still a valid object, just a closed file rather than an open one. > > Michael >>> Michael >>> >>>> It may not make a difference for the runtime, but the difference for >>>> user software may be "dead" or "alive". >>>> >>>> Stefan >>>> >>>> _______________________________________________ >>>> Python-Dev mailing list >>>> Python-Dev at python.org >>>> http://mail.python.org/mailman/listinfo/python-dev >>>> Unsubscribe: >>>> http://mail.python.org/mailman/options/python-dev/fuzzyman%40voidspace.org.uk >>> > > From merwok at netwok.org Fri May 6 19:42:11 2011 From: merwok at netwok.org (=?UTF-8?Q?=C3=89ric_Araujo?=) Date: Fri, 06 May 2011 19:42:11 +0200 Subject: [Python-Dev] cpython (3.2): Avoid codec spelling issues by just using the utf-8 default. In-Reply-To: References: "\"" " <926F0913-8142-430A-8400-6E6F0CD5B8F1@gmail.com> Message-ID: Le 06/05/2011 00:52, Terry Reedy a ?crit : > On 5/5/2011 4:55 PM, Raymond Hettinger wrote: >> Either way, the code is simpler by just using the default. > I thought about this and decided that the purpose of having defaults > is > so one does not have to always spell it out. So use it. Readers can > always look it up and learn. Agreed. I thought about something similar after Victor?s commit that changed open(mode='rU') to use just 'r': Why not remove the mode argument entirely when it is the default value? Regards From merwok at netwok.org Fri May 6 19:51:31 2011 From: merwok at netwok.org (=?UTF-8?Q?=C3=89ric_Araujo?=) Date: Fri, 06 May 2011 19:51:31 +0200 Subject: [Python-Dev] Problems with regrtest and with logging Message-ID: Hi, Sorry for quick email-battery dying. regrtest helpfully reports when a test leaves the environment unclean (sys.path, os.environ, logging._handlerList), but I think the implementation is buggy: it compares object identity and then value. Why is comparing identity useful? I?d just use ==. It makes writing cleanup code easier (just use addCleanup(setattr, obj, 'attr', copy(obj.attr))). Second: in packaging, we have two modules that create a logging handler. I?m not sure how if we should change the code or fix the tests to restore the _handlerList, or how. Thanks for advice. Regards From skip at pobox.com Fri May 6 19:58:34 2011 From: skip at pobox.com (skip at pobox.com) Date: Fri, 6 May 2011 12:58:34 -0500 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <4DC42C13.8070806@voidspace.org.uk> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC42270.1000301@voidspace.org.uk> <8F83194F-5A5C-496E-920A-A2488F9949E4@twistedmatrix.com> <4DC42C13.8070806@voidspace.org.uk> Message-ID: <19908.14026.312182.540486@montanaro.dyndns.org> Michael> "Therefore we decided to break such a cycle at an arbitrary Michael> place, which doesn't sound too insane." I trust "arbitrary" != "random"? Skip From stefan_ml at behnel.de Fri May 6 20:06:12 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 06 May 2011 20:06:12 +0200 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <4DC42A9B.6020000@voidspace.org.uk> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC422BF.4010006@dcs.gla.ac.uk> <4DC42A9B.6020000@voidspace.org.uk> Message-ID: Michael Foord, 06.05.2011 19:06: > On 06/05/2011 17:51, Stefan Behnel wrote: >> Mark Shannon, 06.05.2011 18:33: >>> skip at pobox.com wrote: >>>> Antoine> Since we're sharing links, here's Matt Mackall's take: >>>> Antoine> >>>> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html >>>> >>>>> From that note: >>>> >>>> 1: You can't have meaningful destructors, because when destruction >>>> happens is undefined. And going-out-of-scope destructors are extremely >>>> useful. Python is already a rather broken in this regard, so feel free >>>> to ignore this point. >>>> >>>> Given the presence of cyclic data I don't see how reference counting or >>>> garbage collection win. Ignoring the fact that in a pure reference counted >>>> system you won't even consider cycles for reclmation, would both RC and GC >>>> have to punt because they can't tell which object's destructor to call >>>> first? >>> >>> It doesn't matter which is called first. >> >> May I quote you on that one the next time my software crashes? > > Arbitrarily breaking cycles *could* cause a problem if a destructor > attempts to access an already collected object. This is more real than the "could" suggests. Remember that CPython includes a lot of C code, and is commonly used to interface with C libraries. While you will simply get an exception when cycles are broken in Python code, cycles that involve C code can suffer quite badly from this problem. There was a bug in the lxml.etree XML library a while ago that could let it crash hard when its Element objects participated in a reference cycle. It's based on libxml2, so there's an underlying C tree that potentially involves disconnected subtrees, and a Python space representation using Element proxies, with at least one Element for each disconnected subtree. Basically, Elements reference their Document (not the other way round) even if they are disconnected from the main C document tree. The Document needs to do some final cleanup in the end, whereas the Elements require the Document to be alive to do their own subtree cleanup, if only to know what exactly to clean up, as the subtrees share some C state through the document. Now, if any of the Elements ends up in a reference cycle for some reason, the GC will throw its dices and may decide to call the Document destructor first. Then the Element destructors are bound to crash, trying to access dead memory of the Document. This was easy to fix in CPython's refcounting environment. A double INCREF on the Document for each Element does the trick, as it effectively removes the Document from the collectable cycle and lets the Element destructors decide when to let the Document refcount go down to 0. A fix in a pure GC system is substantially harder to make efficient. Stefan From g.brandl at gmx.net Fri May 6 20:14:28 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Fri, 06 May 2011 20:14:28 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz>

<19907.21576.751581.958722@montanaro.dyndns.org> <4DC3A6CB.5020809@dcs.gla.ac.uk> Message-ID: On 06.05.2011 10:18, Amaury Forgeot d'Arc wrote: > Le vendredi 6 mai 2011, Mark Shannon a ?crit : >> What about #defining PY_STOLEN in some header? >> >> Then any stolen parameter can be prefixed with PY_STOLEN in signature. >> >> For return values, similarly #define PY_BORROWED. > > Header files are harder to parse, and I don't see how it would apply to macros. > What about additional tags in the .rst files? Possible, of course, and even easier to implement. Georg From g.brandl at gmx.net Fri May 6 20:16:20 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Fri, 06 May 2011 20:16:20 +0200 Subject: [Python-Dev] Borrowed and Stolen References in API In-Reply-To: <20110506122703.17c4d889@pitrou.net> References: <1304499523.15694.11.camel@marge> <4DC11791.2000109@dcs.gla.ac.uk> <4DC1D1C5.9010507@canterbury.ac.nz> <4DC34EAB.9050001@canterbury.ac.nz> <20110506122703.17c4d889@pitrou.net> Message-ID: On 06.05.2011 12:27, Antoine Pitrou wrote: > On Fri, 06 May 2011 13:28:11 +1200 > Greg Ewing wrote: > >> Amaury Forgeot d'Arc wrote [concerning the Doc/data/refcounts.dat file]: >> >> > This is not always true, for example when the item is already present >> > in the dict. >> > It's not important to know what the function does to the object, >> > Only the action on the reference is relevant. >> >> Yes, that's the whole point. When using a functon, >> what you need to know is whether it borrows or steals >> a reference. > > Doesn't "borrow" mean the same as "steal" in that context? > If an API borrows a reference, I expect it to take it from me. Basically, "borrow" is applied to return values (or, more generally, "out" parameters), and means that *you* borrowed the reference. "steal", OTOH, is applied to (and the exception for) "in" parameters. Georg From marks at dcs.gla.ac.uk Fri May 6 20:45:41 2011 From: marks at dcs.gla.ac.uk (Mark Shannon) Date: Fri, 06 May 2011 19:45:41 +0100 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC422BF.4010006@dcs.gla.ac.uk> <4DC42A9B.6020000@voidspace.org.uk> Message-ID: <4DC441D5.2070102@dcs.gla.ac.uk> Stefan Behnel wrote: > Michael Foord, 06.05.2011 19:06: >> On 06/05/2011 17:51, Stefan Behnel wrote: >>> Mark Shannon, 06.05.2011 18:33: >>>> skip at pobox.com wrote: >>>>> Antoine> Since we're sharing links, here's Matt Mackall's take: >>>>> Antoine> >>>>> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html >>>>> >>>>>> From that note: >>>>> 1: You can't have meaningful destructors, because when destruction >>>>> happens is undefined. And going-out-of-scope destructors are extremely >>>>> useful. Python is already a rather broken in this regard, so feel free >>>>> to ignore this point. >>>>> >>>>> Given the presence of cyclic data I don't see how reference counting or >>>>> garbage collection win. Ignoring the fact that in a pure reference counted >>>>> system you won't even consider cycles for reclmation, would both RC and GC >>>>> have to punt because they can't tell which object's destructor to call >>>>> first? >>>> It doesn't matter which is called first. >>> May I quote you on that one the next time my software crashes? >> Arbitrarily breaking cycles *could* cause a problem if a destructor >> attempts to access an already collected object. > > This is more real than the "could" suggests. Remember that CPython includes > a lot of C code, and is commonly used to interface with C libraries. While > you will simply get an exception when cycles are broken in Python code, > cycles that involve C code can suffer quite badly from this problem. > > There was a bug in the lxml.etree XML library a while ago that could let it > crash hard when its Element objects participated in a reference cycle. It's > based on libxml2, so there's an underlying C tree that potentially involves > disconnected subtrees, and a Python space representation using Element > proxies, with at least one Element for each disconnected subtree. > > Basically, Elements reference their Document (not the other way round) even > if they are disconnected from the main C document tree. The Document needs > to do some final cleanup in the end, whereas the Elements require the > Document to be alive to do their own subtree cleanup, if only to know what > exactly to clean up, as the subtrees share some C state through the > document. Now, if any of the Elements ends up in a reference cycle for some > reason, the GC will throw its dices and may decide to call the Document > destructor first. Then the Element destructors are bound to crash, trying > to access dead memory of the Document. With a tracing collector it is *impossible* to access dead memory, ever. If it can be reached the GC will *not* collect it. This should be a fundamental invariant of *all* GCs. If an object is finalizable or reachable from any finalizable objects then it is reachable and its memory should not be reclaimed until it is truly unreachable. Finalization and reclamation are separate phases. > > This was easy to fix in CPython's refcounting environment. A double INCREF > on the Document for each Element does the trick, as it effectively removes > the Document from the collectable cycle and lets the Element destructors > decide when to let the Document refcount go down to 0. A fix in a pure GC > system is substantially harder to make efficient. With a tracing GC: While the Elements are finalized, the Document is still alive. While the Document is finalized, the Elements are still alive. Then, and only then, is the whole lot reclaimed. Mark. From vinay_sajip at yahoo.co.uk Fri May 6 20:57:24 2011 From: vinay_sajip at yahoo.co.uk (Vinay Sajip) Date: Fri, 6 May 2011 18:57:24 +0000 (UTC) Subject: [Python-Dev] Problems with regrtest and with logging References: Message-ID: ?ric Araujo netwok.org> writes: > Second: in packaging, we have two modules that create a logging > handler. I?m not sure how if we should change the code or fix the tests > to restore the _handlerList, or how. If you are saying this happens in your unit tests for packaging, then you can either restore the _handlerList using the approach in test_logging, or else you can just close the handlers when you've done with them. If you point me at the relevant code (is it on bitbucket or on hg.python.org?) I can perhaps take a look and advise. Regards, Vinay Sajip From stefan_ml at behnel.de Fri May 6 21:10:30 2011 From: stefan_ml at behnel.de (Stefan Behnel) Date: Fri, 06 May 2011 21:10:30 +0200 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <4DC441D5.2070102@dcs.gla.ac.uk> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC422BF.4010006@dcs.gla.ac.uk> <4DC42A9B.6020000@voidspace.org.uk> <4DC441D5.2070102@dcs.gla.ac.uk> Message-ID: Mark Shannon, 06.05.2011 20:45: > Stefan Behnel wrote: >> Michael Foord, 06.05.2011 19:06: >>> On 06/05/2011 17:51, Stefan Behnel wrote: >>>> Mark Shannon, 06.05.2011 18:33: >>>>> skip at pobox.com wrote: >>>>>> Antoine> Since we're sharing links, here's Matt Mackall's take: >>>>>> Antoine> >>>>>> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html >>>>>> >>>>>>> From that note: >>>>>> 1: You can't have meaningful destructors, because when destruction >>>>>> happens is undefined. And going-out-of-scope destructors are extremely >>>>>> useful. Python is already a rather broken in this regard, so feel free >>>>>> to ignore this point. >>>>>> >>>>>> Given the presence of cyclic data I don't see how reference counting or >>>>>> garbage collection win. Ignoring the fact that in a pure reference >>>>>> counted >>>>>> system you won't even consider cycles for reclmation, would both RC >>>>>> and GC >>>>>> have to punt because they can't tell which object's destructor to call >>>>>> first? >>>>> It doesn't matter which is called first. >>>> May I quote you on that one the next time my software crashes? >>> Arbitrarily breaking cycles *could* cause a problem if a destructor >>> attempts to access an already collected object. >> >> This is more real than the "could" suggests. Remember that CPython >> includes a lot of C code, and is commonly used to interface with C >> libraries. While you will simply get an exception when cycles are broken >> in Python code, cycles that involve C code can suffer quite badly from >> this problem. >> >> There was a bug in the lxml.etree XML library a while ago that could let >> it crash hard when its Element objects participated in a reference cycle. >> It's based on libxml2, so there's an underlying C tree that potentially >> involves disconnected subtrees, and a Python space representation using >> Element proxies, with at least one Element for each disconnected subtree. >> >> Basically, Elements reference their Document (not the other way round) >> even if they are disconnected from the main C document tree. The Document >> needs to do some final cleanup in the end, whereas the Elements require >> the Document to be alive to do their own subtree cleanup, if only to know >> what exactly to clean up, as the subtrees share some C state through the >> document. Now, if any of the Elements ends up in a reference cycle for >> some reason, the GC will throw its dices and may decide to call the >> Document destructor first. Then the Element destructors are bound to >> crash, trying to access dead memory of the Document. > > With a tracing collector it is *impossible* to access dead memory, ever. > If it can be reached the GC will *not* collect it. > This should be a fundamental invariant of *all* GCs. > > If an object is finalizable or reachable from any finalizable objects > then it is reachable and its memory should not be reclaimed until it is > truly unreachable. > > Finalization and reclamation are separate phases. Sure. However, I'm talking about Python types and C memory here. Even if the Python objects are still alive, they may already have freed the underlying C memory during their *finalisation*. When an Element goes out of scope, it must free its C subtree if it is disconnected, even if the Document stays alive. So that's what Elements do in their destructor, and they need the Document's C memory for that, which the Document frees during its own finalisation. I do agree that CPython's destructor call algorithms could have been smarter in this case. After all, the described crash case indicates that the Document destructor was called before all of the Element destructors had been called, although all Elements reference their Document, but the Document does not refer to any of the Elements, so it's basically a dead end. That would have provided a detectable hint to call the Document destructor last, after the ones of all objects that reference it. Apparently, this hint did not lead to an appropriate action, possibly because it's an unimplemented special case and there are enough cases where multiple objects with destructors are actually part of the 'real' cycle. Stefan From drsalists at gmail.com Fri May 6 21:59:30 2011 From: drsalists at gmail.com (Dan Stromberg) Date: Fri, 6 May 2011 12:59:30 -0700 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: References: Message-ID: On Fri, May 6, 2011 at 7:04 AM, Neal Becker wrote: > http://gcc.gnu.org/ml/gcc/2002-08/msg00552.html > Of course, a generational GC improves locality of reference. -------------- next part -------------- An HTML attachment was scrubbed... URL: From rdmurray at bitdance.com Fri May 6 22:07:30 2011 From: rdmurray at bitdance.com (R. David Murray) Date: Fri, 06 May 2011 16:07:30 -0400 Subject: [Python-Dev] Problems with regrtest and with logging In-Reply-To: References: Message-ID: <20110506200734.049872500DF@webabinitio.net> On Fri, 06 May 2011 19:51:31 +0200, =?UTF-8?Q?=C3=89ric_Araujo?= wrote: > regrtest helpfully reports when a test leaves the environment unclean > (sys.path, os.environ, logging._handlerList), but I think the > implementation is buggy: it compares object identity and then value. > Why is comparing identity useful? I???d just use ==. It makes writing > cleanup code easier (just use addCleanup(setattr, obj, 'attr', > copy(obj.attr))). Well, the implementation is intentional. Nick (I think) added the identity check, and he had a reason at the time. I don't remember what it was, though. -- R. David Murray http://www.bitdance.com From greg.ewing at canterbury.ac.nz Sat May 7 01:25:09 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 11:25:09 +1200 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: References: Message-ID: <4DC48355.2050509@canterbury.ac.nz> Neal Becker wrote: > http://gcc.gnu.org/ml/gcc/2002-08/msg00552.html There, Linus says > For example, if you have an _explicit_ refcounting system, then it is > quite natural to have operations like ... > > note_t *node = *np; > if (node->count > 1) > newnode = copy_alloc(node); It's interesting to note that, even though you *can* get reference count information in CPython, it's not all that useful for doing things like that, because it's hard to be sure how many incidental references have been created on the way to the code concerned. So tricks like this at the Python level aren't really feasible in any robust way. -- Greg From greg.ewing at canterbury.ac.nz Sat May 7 01:43:16 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 11:43:16 +1200 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <19908.8043.8921.50222@montanaro.dyndns.org> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> Message-ID: <4DC48794.5070808@canterbury.ac.nz> Antoine> http://www.selenic.com/pipermail/mercurial-devel/2011-May/031055.html > >>From that note: > > 1: You can't have meaningful destructors, because when destruction > happens is undefined. And going-out-of-scope destructors are extremely > useful. Python is already a rather broken in this regard, so feel free > to ignore this point. It's only broken if you regard RAII as the One True Way to implement scoped resource management. Python has other approaches to that, such as the with-statement. Also, you *can* have destructors that work for objects in cycles, as long as you don't insist on the destructor having access to the object that's being destroyed. Weakref callbacks provide a way of implementing this in CPython. -- Greg From greg.ewing at canterbury.ac.nz Sat May 7 01:53:39 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 11:53:39 +1200 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: <4DC433FD.6090803@dcs.gla.ac.uk> References: <20110506161233.1ed647ec@pitrou.net> <19908.8043.8921.50222@montanaro.dyndns.org> <4DC422BF.4010006@dcs.gla.ac.uk> <4DC42A9B.6020000@voidspace.org.uk> <4DC42F4B.1050509@dcs.gla.ac.uk> <4DC4321F.3070206@voidspace.org.uk> <4DC433FD.6090803@dcs.gla.ac.uk> Message-ID: <4DC48A03.2020800@canterbury.ac.nz> Mark Shannon wrote: > For example, a file object will close itself during finalization, > but its still a valid object, just a closed file rather than an open one. It might be valid in the sense that you won't get a segfault. But the point is that the destructors of some objects may be relying on other objects still being in a certain state, e.g. a file still being open. One would have to adopt a highly defensive coding style in destructors, verging on paranoia, to be sure that one's destructor code was completely immune to this kind of problem. All of this worry goes away if the destructor is not a method of the object being destroyed, but something external that runs *after* the object has disappeared. -- Greg From ncoghlan at gmail.com Sat May 7 02:12:33 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sat, 7 May 2011 10:12:33 +1000 Subject: [Python-Dev] Linus on garbage collection In-Reply-To: