From python-checkins at python.org Thu Dec 1 00:22:04 2011 From: python-checkins at python.org (nadeem.vawda) Date: Thu, 01 Dec 2011 00:22:04 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Credit_Per_=C3=98yvind_Karl?= =?utf8?q?sen_for_the_initial_implementation_of_the_lzma_module?= Message-ID: http://hg.python.org/cpython/rev/6cde416ef03b changeset: 73798:6cde416ef03b user: Nadeem Vawda date: Thu Dec 01 01:18:27 2011 +0200 summary: Credit Per ?yvind Karlsen for the initial implementation of the lzma module (issue #6715). files: Misc/ACKS | 1 + Misc/NEWS | 1 + Modules/_lzmamodule.c | 7 ++++++- 3 files changed, 8 insertions(+), 1 deletions(-) diff --git a/Misc/ACKS b/Misc/ACKS --- a/Misc/ACKS +++ b/Misc/ACKS @@ -502,6 +502,7 @@ Peter van Kampen Rafe Kaplan Jacob Kaplan-Moss +Per ?yvind Karlsen Lou Kates Hiroaki Kawai Sebastien Keim diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -400,6 +400,7 @@ ------- - Issue #6715: Add a module 'lzma' for compression using the LZMA algorithm. + Thanks to Per ?yvind Karlsen for the initial implementation. - Issue #13487: Make inspect.getmodule robust against changes done to sys.modules while it is iterating over it. diff --git a/Modules/_lzmamodule.c b/Modules/_lzmamodule.c --- a/Modules/_lzmamodule.c +++ b/Modules/_lzmamodule.c @@ -1,4 +1,9 @@ -/* _lzma - Low-level Python interface to liblzma. */ +/* _lzma - Low-level Python interface to liblzma. + + Initial implementation by Per ?yvind Karlsen. + Rewritten by Nadeem Vawda. + +*/ #define PY_SSIZE_T_CLEAN -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 1 02:52:11 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 01 Dec 2011 02:52:11 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_PyObject=5FRepr=28=29_ensur?= =?utf8?q?es_that_the_result_is_a_ready_Unicode_string?= Message-ID: http://hg.python.org/cpython/rev/c5d2ce38b0d3 changeset: 73799:c5d2ce38b0d3 user: Victor Stinner date: Thu Dec 01 02:15:00 2011 +0100 summary: PyObject_Repr() ensures that the result is a ready Unicode string And PyObject_Str() and PyObject_Repr() don't make strings ready in debug mode to ensure that the caller makes the string ready before using it. files: Objects/object.c | 8 ++++++++ 1 files changed, 8 insertions(+), 0 deletions(-) diff --git a/Objects/object.c b/Objects/object.c --- a/Objects/object.c +++ b/Objects/object.c @@ -385,6 +385,10 @@ Py_DECREF(res); return NULL; } +#ifndef Py_DEBUG + if (PyUnicode_READY(res) < 0) + return NULL; +#endif return res; } @@ -403,8 +407,10 @@ if (v == NULL) return PyUnicode_FromString(""); if (PyUnicode_CheckExact(v)) { +#ifndef Py_DEBUG if (PyUnicode_READY(v) < 0) return NULL; +#endif Py_INCREF(v); return v; } @@ -426,8 +432,10 @@ Py_DECREF(res); return NULL; } +#ifndef Py_DEBUG if (PyUnicode_READY(res) < 0) return NULL; +#endif assert(_PyUnicode_CheckConsistency(res, 1)); return res; } -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 1 02:52:12 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 01 Dec 2011 02:52:12 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_PyCodec=5FIgnoreErrors=28?= =?utf8?q?=29_avoids_the_deprecated_=22u=23=22_format?= Message-ID: http://hg.python.org/cpython/rev/6407294eb3c6 changeset: 73800:6407294eb3c6 user: Victor Stinner date: Thu Dec 01 02:52:11 2011 +0100 summary: PyCodec_IgnoreErrors() avoids the deprecated "u#" format files: Python/codecs.c | 3 +-- 1 files changed, 1 insertions(+), 2 deletions(-) diff --git a/Python/codecs.c b/Python/codecs.c --- a/Python/codecs.c +++ b/Python/codecs.c @@ -510,8 +510,7 @@ wrong_exception_type(exc); return NULL; } - /* ouch: passing NULL, 0, pos gives None instead of u'' */ - return Py_BuildValue("(u#n)", &end, 0, end); + return Py_BuildValue("(Nn)", PyUnicode_New(0, 0), end); } -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 1 02:52:12 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 01 Dec 2011 02:52:12 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_c-api=3A_Replace_PyUnicodeO?= =?utf8?q?bject*_by_PyObject*_in_=22U=22_format_doc?= Message-ID: http://hg.python.org/cpython/rev/ba8e7886fdd7 changeset: 73801:ba8e7886fdd7 user: Victor Stinner date: Thu Dec 01 02:52:55 2011 +0100 summary: c-api: Replace PyUnicodeObject* by PyObject* in "U" format doc files: Doc/c-api/arg.rst | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/Doc/c-api/arg.rst b/Doc/c-api/arg.rst --- a/Doc/c-api/arg.rst +++ b/Doc/c-api/arg.rst @@ -146,7 +146,7 @@ Like ``u#``, but the Python object may also be ``None``, in which case the :c:type:`Py_UNICODE` pointer is set to *NULL*. -``U`` (:class:`str`) [PyUnicodeObject \*] +``U`` (:class:`str`) [PyObject \*] Requires that the Python object is a Unicode object, without attempting any conversion. Raises :exc:`TypeError` if the object is not a Unicode object. The C variable may also be declared as :c:type:`PyObject\*`. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 1 03:16:37 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 01 Dec 2011 03:16:37 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_MultibyteCodec=5FDecode=28?= =?utf8?q?=29_catchs_PyUnicode=5FAS=5FUNICODE=28=29_failures?= Message-ID: http://hg.python.org/cpython/rev/7216cf767993 changeset: 73802:7216cf767993 user: Victor Stinner date: Thu Dec 01 03:18:30 2011 +0100 summary: MultibyteCodec_Decode() catchs PyUnicode_AS_UNICODE() failures files: Modules/cjkcodecs/multibytecodec.c | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) diff --git a/Modules/cjkcodecs/multibytecodec.c b/Modules/cjkcodecs/multibytecodec.c --- a/Modules/cjkcodecs/multibytecodec.c +++ b/Modules/cjkcodecs/multibytecodec.c @@ -643,6 +643,8 @@ if (buf.outobj == NULL) goto errorexit; buf.outbuf = PyUnicode_AS_UNICODE(buf.outobj); + if (buf.outbuf == NULL) + goto errorexit; buf.outbuf_end = buf.outbuf + PyUnicode_GET_SIZE(buf.outobj); if (self->codec->decinit != NULL && -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 1 03:16:38 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 01 Dec 2011 03:16:38 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Replace_PyUnicode=5FFromUni?= =?utf8?b?Y29kZShOVUxMLCAwKSBieSBQeVVuaWNvZGVfTmV3KDAsIDAp?= Message-ID: http://hg.python.org/cpython/rev/d855329d0f72 changeset: 73803:d855329d0f72 user: Victor Stinner date: Thu Dec 01 03:18:59 2011 +0100 summary: Replace PyUnicode_FromUnicode(NULL, 0) by PyUnicode_New(0, 0) Create an empty string with the new Unicode API. files: Modules/cjkcodecs/multibytecodec.c | 4 ++-- Objects/stringlib/unicode_format.h | 2 +- 2 files changed, 3 insertions(+), 3 deletions(-) diff --git a/Modules/cjkcodecs/multibytecodec.c b/Modules/cjkcodecs/multibytecodec.c --- a/Modules/cjkcodecs/multibytecodec.c +++ b/Modules/cjkcodecs/multibytecodec.c @@ -633,7 +633,7 @@ if (datalen == 0) { PyBuffer_Release(&pdata); ERROR_DECREF(errorcb); - return make_tuple(PyUnicode_FromUnicode(NULL, 0), 0); + return make_tuple(PyUnicode_New(0, 0), 0); } buf.excobj = NULL; @@ -1265,7 +1265,7 @@ Py_ssize_t rsize, finalsize = 0; if (sizehint == 0) - return PyUnicode_FromUnicode(NULL, 0); + return PyUnicode_New(0, 0); buf.outobj = buf.excobj = NULL; cres = NULL; diff --git a/Objects/stringlib/unicode_format.h b/Objects/stringlib/unicode_format.h --- a/Objects/stringlib/unicode_format.h +++ b/Objects/stringlib/unicode_format.h @@ -79,7 +79,7 @@ SubString_new_object_or_empty(SubString *str) { if (str->str == NULL) { - return PyUnicode_FromUnicode(NULL, 0); + return PyUnicode_New(0, 0); } return SubString_new_object(str); } -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 1 03:20:10 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 01 Dec 2011 03:20:10 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Fix_PyObject=5FRepr=28=29?= =?utf8?q?=3A_don=27t_call_PyUnicode=5FREADY=28=29_if_res_is_NULL?= Message-ID: http://hg.python.org/cpython/rev/0d1536ec44e8 changeset: 73804:0d1536ec44e8 user: Victor Stinner date: Thu Dec 01 03:22:44 2011 +0100 summary: Fix PyObject_Repr(): don't call PyUnicode_READY() if res is NULL files: Objects/object.c | 4 +++- 1 files changed, 3 insertions(+), 1 deletions(-) diff --git a/Objects/object.c b/Objects/object.c --- a/Objects/object.c +++ b/Objects/object.c @@ -378,7 +378,9 @@ return PyUnicode_FromFormat("<%s object at %p>", v->ob_type->tp_name, v); res = (*v->ob_type->tp_repr)(v); - if (res != NULL && !PyUnicode_Check(res)) { + if (res == NULL) + return NULL; + if (!PyUnicode_Check(res)) { PyErr_Format(PyExc_TypeError, "__repr__ returned non-string (type %.200s)", res->ob_type->tp_name); -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Thu Dec 1 05:36:16 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Thu, 01 Dec 2011 05:36:16 +0100 Subject: [Python-checkins] Daily reference leaks (0d1536ec44e8): sum=0 Message-ID: results for 0d1536ec44e8 on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/reflogonBh4G', '-x'] From python-checkins at python.org Thu Dec 1 16:27:09 2011 From: python-checkins at python.org (mark.dickinson) Date: Thu, 01 Dec 2011 16:27:09 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=239530=3A_Fix_undefi?= =?utf8?q?ned_behaviour_due_to_signed_overflow_in?= Message-ID: http://hg.python.org/cpython/rev/7e37598a25a6 changeset: 73805:7e37598a25a6 user: Mark Dickinson date: Thu Dec 01 15:27:04 2011 +0000 summary: Issue #9530: Fix undefined behaviour due to signed overflow in Python/formatter_unicode.c. files: Python/formatter_unicode.c | 16 +++++++--------- 1 files changed, 7 insertions(+), 9 deletions(-) diff --git a/Python/formatter_unicode.c b/Python/formatter_unicode.c --- a/Python/formatter_unicode.c +++ b/Python/formatter_unicode.c @@ -51,7 +51,7 @@ get_integer(PyObject *str, Py_ssize_t *pos, Py_ssize_t end, Py_ssize_t *result) { - Py_ssize_t accumulator, digitval, oldaccumulator; + Py_ssize_t accumulator, digitval; int numdigits; accumulator = numdigits = 0; for (;;(*pos)++, numdigits++) { @@ -61,19 +61,17 @@ if (digitval < 0) break; /* - This trick was copied from old Unicode format code. It's cute, - but would really suck on an old machine with a slow divide - implementation. Fortunately, in the normal case we do not - expect too many digits. + Detect possible overflow before it happens: + + accumulator * 10 + digitval > PY_SSIZE_T_MAX if and only if + accumulator > (PY_SSIZE_T_MAX - digitval) / 10. */ - oldaccumulator = accumulator; - accumulator *= 10; - if ((accumulator+10)/10 != oldaccumulator+1) { + if (accumulator > (PY_SSIZE_T_MAX - digitval) / 10) { PyErr_Format(PyExc_ValueError, "Too many decimal digits in format string"); return -1; } - accumulator += digitval; + accumulator = accumulator * 10 + digitval; } *result = accumulator; return numdigits; -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Fri Dec 2 05:37:16 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Fri, 02 Dec 2011 05:37:16 +0100 Subject: [Python-checkins] Daily reference leaks (7e37598a25a6): sum=0 Message-ID: results for 7e37598a25a6 on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/refloglWXr6j', '-x'] From python-checkins at python.org Fri Dec 2 17:24:09 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 17:24:09 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogIzg0MTQ6IGFkZCBt?= =?utf8?q?ore_tests_for_=22assert=22=2E__Initial_patch_by_Gregory_Nofi=2E?= Message-ID: http://hg.python.org/cpython/rev/bcfb499338c1 changeset: 73806:bcfb499338c1 branch: 2.7 parent: 73783:3ecddf168f1f user: Ezio Melotti date: Fri Dec 02 18:17:30 2011 +0200 summary: #8414: add more tests for "assert". Initial patch by Gregory Nofi. files: Lib/test/test_grammar.py | 26 ++++++++++++++++++++++++-- 1 files changed, 24 insertions(+), 2 deletions(-) diff --git a/Lib/test/test_grammar.py b/Lib/test/test_grammar.py --- a/Lib/test/test_grammar.py +++ b/Lib/test/test_grammar.py @@ -551,13 +551,35 @@ assert 1, 1 assert lambda x:x assert 1, lambda x:x+1 + + try: + assert True + except AssertionError as e: + self.fail("'assert True' should not have raised an AssertionError") + + try: + assert True, 'this should always pass' + except AssertionError as e: + self.fail("'assert True, msg' should not have " + "raised an AssertionError") + + # these tests fail if python is run with -O, so check __debug__ + @unittest.skipUnless(__debug__, "Won't work if __debug__ is False") + def testAssert2(self): try: assert 0, "msg" except AssertionError, e: self.assertEqual(e.args[0], "msg") else: - if __debug__: - self.fail("AssertionError not raised by assert 0") + self.fail("AssertionError not raised by assert 0") + + try: + assert False + except AssertionError as e: + self.assertEqual(len(e.args), 0) + else: + self.fail("AssertionError not raised by 'assert False'") + ### compound_stmt: if_stmt | while_stmt | for_stmt | try_stmt | funcdef | classdef # Tested below -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 17:24:11 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 17:24:11 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogIzg0MTQ6IGFkZCBt?= =?utf8?q?ore_tests_for_=22assert=22=2E__Initial_patch_by_Gregory_Nofi=2E?= Message-ID: http://hg.python.org/cpython/rev/1efefeda00a7 changeset: 73807:1efefeda00a7 branch: 3.2 parent: 73796:2c05b8a6cdd1 user: Ezio Melotti date: Fri Dec 02 18:22:52 2011 +0200 summary: #8414: add more tests for "assert". Initial patch by Gregory Nofi. files: Lib/test/test_grammar.py | 26 ++++++++++++++++++++++++-- 1 files changed, 24 insertions(+), 2 deletions(-) diff --git a/Lib/test/test_grammar.py b/Lib/test/test_grammar.py --- a/Lib/test/test_grammar.py +++ b/Lib/test/test_grammar.py @@ -493,13 +493,35 @@ assert 1, 1 assert lambda x:x assert 1, lambda x:x+1 + + try: + assert True + except AssertionError as e: + self.fail("'assert True' should not have raised an AssertionError") + + try: + assert True, 'this should always pass' + except AssertionError as e: + self.fail("'assert True, msg' should not have " + "raised an AssertionError") + + # these tests fail if python is run with -O, so check __debug__ + @unittest.skipUnless(__debug__, "Won't work if __debug__ is False") + def testAssert2(self): try: assert 0, "msg" except AssertionError as e: self.assertEqual(e.args[0], "msg") else: - if __debug__: - self.fail("AssertionError not raised by assert 0") + self.fail("AssertionError not raised by assert 0") + + try: + assert False + except AssertionError as e: + self.assertEqual(len(e.args), 0) + else: + self.fail("AssertionError not raised by 'assert False'") + ### compound_stmt: if_stmt | while_stmt | for_stmt | try_stmt | funcdef | classdef # Tested below -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 17:24:12 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 17:24:12 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?b?OiAjODQxNDogbWVyZ2Ugd2l0aCAzLjIu?= Message-ID: http://hg.python.org/cpython/rev/47afbb2033aa changeset: 73808:47afbb2033aa parent: 73805:7e37598a25a6 parent: 73807:1efefeda00a7 user: Ezio Melotti date: Fri Dec 02 18:23:54 2011 +0200 summary: #8414: merge with 3.2. files: Lib/test/test_grammar.py | 26 ++++++++++++++++++++++++-- 1 files changed, 24 insertions(+), 2 deletions(-) diff --git a/Lib/test/test_grammar.py b/Lib/test/test_grammar.py --- a/Lib/test/test_grammar.py +++ b/Lib/test/test_grammar.py @@ -500,13 +500,35 @@ assert 1, 1 assert lambda x:x assert 1, lambda x:x+1 + + try: + assert True + except AssertionError as e: + self.fail("'assert True' should not have raised an AssertionError") + + try: + assert True, 'this should always pass' + except AssertionError as e: + self.fail("'assert True, msg' should not have " + "raised an AssertionError") + + # these tests fail if python is run with -O, so check __debug__ + @unittest.skipUnless(__debug__, "Won't work if __debug__ is False") + def testAssert2(self): try: assert 0, "msg" except AssertionError as e: self.assertEqual(e.args[0], "msg") else: - if __debug__: - self.fail("AssertionError not raised by assert 0") + self.fail("AssertionError not raised by assert 0") + + try: + assert False + except AssertionError as e: + self.assertEqual(len(e.args), 0) + else: + self.fail("AssertionError not raised by 'assert False'") + ### compound_stmt: if_stmt | while_stmt | for_stmt | try_stmt | funcdef | classdef # Tested below -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 18:29:17 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 18:29:17 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogIzEzNDk5OiBmaXgg?= =?utf8?q?example_adding_=3E=3E=3E_before_the_comments=2E?= Message-ID: http://hg.python.org/cpython/rev/d9e918c8d9d6 changeset: 73809:d9e918c8d9d6 branch: 2.7 parent: 73806:bcfb499338c1 user: Ezio Melotti date: Fri Dec 02 19:26:48 2011 +0200 summary: #13499: fix example adding >>> before the comments. files: Doc/library/uuid.rst | 16 ++++++++-------- 1 files changed, 8 insertions(+), 8 deletions(-) diff --git a/Doc/library/uuid.rst b/Doc/library/uuid.rst --- a/Doc/library/uuid.rst +++ b/Doc/library/uuid.rst @@ -225,34 +225,34 @@ >>> import uuid - # make a UUID based on the host ID and current time + >>> # make a UUID based on the host ID and current time >>> uuid.uuid1() UUID('a8098c1a-f86e-11da-bd1a-00112444be1e') - # make a UUID using an MD5 hash of a namespace UUID and a name + >>> # make a UUID using an MD5 hash of a namespace UUID and a name >>> uuid.uuid3(uuid.NAMESPACE_DNS, 'python.org') UUID('6fa459ea-ee8a-3ca4-894e-db77e160355e') - # make a random UUID + >>> # make a random UUID >>> uuid.uuid4() UUID('16fd2706-8baf-433b-82eb-8c7fada847da') - # make a UUID using a SHA-1 hash of a namespace UUID and a name + >>> # make a UUID using a SHA-1 hash of a namespace UUID and a name >>> uuid.uuid5(uuid.NAMESPACE_DNS, 'python.org') UUID('886313e1-3b8a-5372-9b90-0c9aee199e5d') - # make a UUID from a string of hex digits (braces and hyphens ignored) + >>> # make a UUID from a string of hex digits (braces and hyphens ignored) >>> x = uuid.UUID('{00010203-0405-0607-0809-0a0b0c0d0e0f}') - # convert a UUID to a string of hex digits in standard form + >>> # convert a UUID to a string of hex digits in standard form >>> str(x) '00010203-0405-0607-0809-0a0b0c0d0e0f' - # get the raw 16 bytes of the UUID + >>> # get the raw 16 bytes of the UUID >>> x.bytes '\x00\x01\x02\x03\x04\x05\x06\x07\x08\t\n\x0b\x0c\r\x0e\x0f' - # make a UUID from a 16-byte string + >>> # make a UUID from a 16-byte string >>> uuid.UUID(bytes=x.bytes) UUID('00010203-0405-0607-0809-0a0b0c0d0e0f') -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 18:29:19 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 18:29:19 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogIzEzNDk5OiBmaXgg?= =?utf8?q?example_adding_=3E=3E=3E_before_the_comments=2E?= Message-ID: http://hg.python.org/cpython/rev/9e7728dc35e7 changeset: 73810:9e7728dc35e7 branch: 3.2 parent: 73807:1efefeda00a7 user: Ezio Melotti date: Fri Dec 02 19:28:36 2011 +0200 summary: #13499: fix example adding >>> before the comments. files: Doc/library/uuid.rst | 16 ++++++++-------- 1 files changed, 8 insertions(+), 8 deletions(-) diff --git a/Doc/library/uuid.rst b/Doc/library/uuid.rst --- a/Doc/library/uuid.rst +++ b/Doc/library/uuid.rst @@ -222,34 +222,34 @@ >>> import uuid - # make a UUID based on the host ID and current time + >>> # make a UUID based on the host ID and current time >>> uuid.uuid1() UUID('a8098c1a-f86e-11da-bd1a-00112444be1e') - # make a UUID using an MD5 hash of a namespace UUID and a name + >>> # make a UUID using an MD5 hash of a namespace UUID and a name >>> uuid.uuid3(uuid.NAMESPACE_DNS, 'python.org') UUID('6fa459ea-ee8a-3ca4-894e-db77e160355e') - # make a random UUID + >>> # make a random UUID >>> uuid.uuid4() UUID('16fd2706-8baf-433b-82eb-8c7fada847da') - # make a UUID using a SHA-1 hash of a namespace UUID and a name + >>> # make a UUID using a SHA-1 hash of a namespace UUID and a name >>> uuid.uuid5(uuid.NAMESPACE_DNS, 'python.org') UUID('886313e1-3b8a-5372-9b90-0c9aee199e5d') - # make a UUID from a string of hex digits (braces and hyphens ignored) + >>> # make a UUID from a string of hex digits (braces and hyphens ignored) >>> x = uuid.UUID('{00010203-0405-0607-0809-0a0b0c0d0e0f}') - # convert a UUID to a string of hex digits in standard form + >>> # convert a UUID to a string of hex digits in standard form >>> str(x) '00010203-0405-0607-0809-0a0b0c0d0e0f' - # get the raw 16 bytes of the UUID + >>> # get the raw 16 bytes of the UUID >>> x.bytes b'\x00\x01\x02\x03\x04\x05\x06\x07\x08\t\n\x0b\x0c\r\x0e\x0f' - # make a UUID from a 16-byte string + >>> # make a UUID from a 16-byte string >>> uuid.UUID(bytes=x.bytes) UUID('00010203-0405-0607-0809-0a0b0c0d0e0f') -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 18:29:22 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 18:29:22 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_=2313499=3A_merge_with_3=2E2=2E?= Message-ID: http://hg.python.org/cpython/rev/060c7093a81f changeset: 73811:060c7093a81f parent: 73808:47afbb2033aa parent: 73810:9e7728dc35e7 user: Ezio Melotti date: Fri Dec 02 19:29:10 2011 +0200 summary: #13499: merge with 3.2. files: Doc/library/uuid.rst | 16 ++++++++-------- 1 files changed, 8 insertions(+), 8 deletions(-) diff --git a/Doc/library/uuid.rst b/Doc/library/uuid.rst --- a/Doc/library/uuid.rst +++ b/Doc/library/uuid.rst @@ -222,34 +222,34 @@ >>> import uuid - # make a UUID based on the host ID and current time + >>> # make a UUID based on the host ID and current time >>> uuid.uuid1() UUID('a8098c1a-f86e-11da-bd1a-00112444be1e') - # make a UUID using an MD5 hash of a namespace UUID and a name + >>> # make a UUID using an MD5 hash of a namespace UUID and a name >>> uuid.uuid3(uuid.NAMESPACE_DNS, 'python.org') UUID('6fa459ea-ee8a-3ca4-894e-db77e160355e') - # make a random UUID + >>> # make a random UUID >>> uuid.uuid4() UUID('16fd2706-8baf-433b-82eb-8c7fada847da') - # make a UUID using a SHA-1 hash of a namespace UUID and a name + >>> # make a UUID using a SHA-1 hash of a namespace UUID and a name >>> uuid.uuid5(uuid.NAMESPACE_DNS, 'python.org') UUID('886313e1-3b8a-5372-9b90-0c9aee199e5d') - # make a UUID from a string of hex digits (braces and hyphens ignored) + >>> # make a UUID from a string of hex digits (braces and hyphens ignored) >>> x = uuid.UUID('{00010203-0405-0607-0809-0a0b0c0d0e0f}') - # convert a UUID to a string of hex digits in standard form + >>> # convert a UUID to a string of hex digits in standard form >>> str(x) '00010203-0405-0607-0809-0a0b0c0d0e0f' - # get the raw 16 bytes of the UUID + >>> # get the raw 16 bytes of the UUID >>> x.bytes b'\x00\x01\x02\x03\x04\x05\x06\x07\x08\t\n\x0b\x0c\r\x0e\x0f' - # make a UUID from a 16-byte string + >>> # make a UUID from a 16-byte string >>> uuid.UUID(bytes=x.bytes) UUID('00010203-0405-0607-0809-0a0b0c0d0e0f') -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 18:49:12 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 18:49:12 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogIzEzNDk0OiBzL2Nh?= =?utf8?q?st/convert/=2E__Also_add_a_link=2E?= Message-ID: http://hg.python.org/cpython/rev/2f9c986b46cd changeset: 73812:2f9c986b46cd branch: 2.7 parent: 73809:d9e918c8d9d6 user: Ezio Melotti date: Fri Dec 02 19:47:24 2011 +0200 summary: #13494: s/cast/convert/. Also add a link. files: Doc/library/stdtypes.rst | 6 +++--- 1 files changed, 3 insertions(+), 3 deletions(-) diff --git a/Doc/library/stdtypes.rst b/Doc/library/stdtypes.rst --- a/Doc/library/stdtypes.rst +++ b/Doc/library/stdtypes.rst @@ -2955,9 +2955,9 @@ used to represent truth values (although other values can also be considered false or true). In numeric contexts (for example when used as the argument to an arithmetic operator), they behave like the integers 0 and 1, respectively. -The built-in function :func:`bool` can be used to cast any value to a Boolean, -if the value can be interpreted as a truth value (see section Truth Value -Testing above). +The built-in function :func:`bool` can be used to convert any value to a +Boolean, if the value can be interpreted as a truth value (see section +:ref:`truth` above). .. index:: single: False -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 18:49:13 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 18:49:13 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogIzEzNDk0OiBzL2Nh?= =?utf8?q?st/convert/=2E__Also_add_a_link=2E?= Message-ID: http://hg.python.org/cpython/rev/69369fd3514b changeset: 73813:69369fd3514b branch: 3.2 parent: 73810:9e7728dc35e7 user: Ezio Melotti date: Fri Dec 02 19:47:24 2011 +0200 summary: #13494: s/cast/convert/. Also add a link. files: Doc/library/stdtypes.rst | 6 +++--- 1 files changed, 3 insertions(+), 3 deletions(-) diff --git a/Doc/library/stdtypes.rst b/Doc/library/stdtypes.rst --- a/Doc/library/stdtypes.rst +++ b/Doc/library/stdtypes.rst @@ -2736,9 +2736,9 @@ used to represent truth values (although other values can also be considered false or true). In numeric contexts (for example when used as the argument to an arithmetic operator), they behave like the integers 0 and 1, respectively. -The built-in function :func:`bool` can be used to cast any value to a Boolean, -if the value can be interpreted as a truth value (see section Truth Value -Testing above). +The built-in function :func:`bool` can be used to convert any value to a +Boolean, if the value can be interpreted as a truth value (see section +:ref:`truth` above). .. index:: single: False -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 18:49:14 2011 From: python-checkins at python.org (ezio.melotti) Date: Fri, 02 Dec 2011 18:49:14 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_=2313494=3A_merge_with_3=2E2=2E?= Message-ID: http://hg.python.org/cpython/rev/454b97887c5a changeset: 73814:454b97887c5a parent: 73811:060c7093a81f parent: 73813:69369fd3514b user: Ezio Melotti date: Fri Dec 02 19:49:02 2011 +0200 summary: #13494: merge with 3.2. files: Doc/library/stdtypes.rst | 6 +++--- 1 files changed, 3 insertions(+), 3 deletions(-) diff --git a/Doc/library/stdtypes.rst b/Doc/library/stdtypes.rst --- a/Doc/library/stdtypes.rst +++ b/Doc/library/stdtypes.rst @@ -2772,9 +2772,9 @@ used to represent truth values (although other values can also be considered false or true). In numeric contexts (for example when used as the argument to an arithmetic operator), they behave like the integers 0 and 1, respectively. -The built-in function :func:`bool` can be used to cast any value to a Boolean, -if the value can be interpreted as a truth value (see section Truth Value -Testing above). +The built-in function :func:`bool` can be used to convert any value to a +Boolean, if the value can be interpreted as a truth value (see section +:ref:`truth` above). .. index:: single: False -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 20:25:16 2011 From: python-checkins at python.org (antoine.pitrou) Date: Fri, 02 Dec 2011 20:25:16 +0100 Subject: [Python-checkins] =?utf8?q?peps=3A_Update_PEP_3154_after_PEP_3155?= =?utf8?q?_has_been_accepted=2E?= Message-ID: http://hg.python.org/peps/rev/d54015aaa32b changeset: 4006:d54015aaa32b user: Antoine Pitrou date: Fri Dec 02 20:19:29 2011 +0100 summary: Update PEP 3154 after PEP 3155 has been accepted. files: pep-3154.txt | 37 ++++++++++++++++--------------------- 1 files changed, 16 insertions(+), 21 deletions(-) diff --git a/pep-3154.txt b/pep-3154.txt --- a/pep-3154.txt +++ b/pep-3154.txt @@ -71,27 +71,20 @@ special method (``__getnewargs_ex__`` ?) and a new opcode (NEWOBJEX ?) are needed. -Serializing more callable objects ---------------------------------- +Serializing more "lookupable" objects +------------------------------------- -Currently, only module-global functions are serializable. -Multiprocessing has custom support for pickling other callables such -as bound methods [4]_. This support could be folded in the protocol, -and made more efficient through a new GETATTR opcode. +For some kinds of objects, it only makes sense to serialize them by name +(for example classes and functions). By default, pickle is only able to +serialize module-global functions and classes by name. Supporting other +kinds of objects, such as unbound methods [4]_, is a common request. +Actually, third-party support for some of them, such as bound methods, +is implemented in the multiprocessing module [5]_. -Serializing "pseudo-global" objects ------------------------------------ - -Objects which are not module-global, but should be treated in a -similar fashion -- such as unbound methods [5]_ or nested classes -- -cannot currently be pickled (or, rather, unpickled) because the pickle -protocol does not correctly specify how to retrieve them. One -solution would be through the adjunction of a ``__namespace__`` (or -``__qualname__``) to all class and function objects, specifying the -full "path" by which they can be retrieved. For globals, this would -generally be ``"{}.{}".format(obj.__module__, obj.__name__)``. Then a -new opcode can resolve that path and push the object on the stack, -similarly to the GLOBAL opcode. +:pep:`3155` now makes it possible to lookup many more objects by name. +Generalizing the GLOBAL opcode to accept dot-separated names, or adding +a special GETATTR opcode, would allow the standard pickle implementation +to support, in an efficient way, all those kinds of objects. Binary encoding for all opcodes ------------------------------- @@ -131,12 +124,12 @@ .. [3] "pickle/copyreg doesn't support keyword only arguments in __new__": http://bugs.python.org/issue4727 -.. [4] Lib/multiprocessing/forking.py: +.. [4] "pickle should support methods": + http://bugs.python.org/issue9276 + +.. [5] Lib/multiprocessing/forking.py: http://hg.python.org/cpython/file/baea9f5f973c/Lib/multiprocessing/forking.py#l54 -.. [5] "pickle should support methods": - http://bugs.python.org/issue9276 - Copyright ========= -- Repository URL: http://hg.python.org/peps From python-checkins at python.org Fri Dec 2 20:25:17 2011 From: python-checkins at python.org (antoine.pitrou) Date: Fri, 02 Dec 2011 20:25:17 +0100 Subject: [Python-checkins] =?utf8?q?peps=3A_Mark_PEP_3155_final=2E?= Message-ID: http://hg.python.org/peps/rev/f41beb5dcdaa changeset: 4007:f41beb5dcdaa user: Antoine Pitrou date: Fri Dec 02 20:20:06 2011 +0100 summary: Mark PEP 3155 final. files: pep-3155.txt | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/pep-3155.txt b/pep-3155.txt --- a/pep-3155.txt +++ b/pep-3155.txt @@ -3,7 +3,7 @@ Version: $Revision$ Last-Modified: $Date$ Author: Antoine Pitrou -Status: Accepted +Status: Final Type: Standards Track Content-Type: text/x-rst Created: 2011-10-29 -- Repository URL: http://hg.python.org/peps From python-checkins at python.org Fri Dec 2 20:28:35 2011 From: python-checkins at python.org (petri.lehtinen) Date: Fri, 02 Dec 2011 20:28:35 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogSXNzdWUgIzEzNDM5?= =?utf8?q?=3A_Fix_many_errors_in_turtle_docstrings=2E?= Message-ID: http://hg.python.org/cpython/rev/6e03ab9950f6 changeset: 73815:6e03ab9950f6 branch: 2.7 parent: 73812:2f9c986b46cd user: Petri Lehtinen date: Fri Dec 02 21:09:30 2011 +0200 summary: Issue #13439: Fix many errors in turtle docstrings. files: Lib/lib-tk/turtle.py | 104 ++++++++++++++---------------- Misc/NEWS | 2 + 2 files changed, 52 insertions(+), 54 deletions(-) diff --git a/Lib/lib-tk/turtle.py b/Lib/lib-tk/turtle.py --- a/Lib/lib-tk/turtle.py +++ b/Lib/lib-tk/turtle.py @@ -859,7 +859,7 @@ >>> poly = ((0,0),(10,-5),(0,10),(-10,-5)) >>> s = Shape("compound") >>> s.addcomponent(poly, "red", "blue") - ### .. add more components and then use register_shape() + >>> # .. add more components and then use register_shape() """ if self._type != "compound": raise TurtleGraphicsError("Cannot add component to %s Shape" @@ -958,7 +958,7 @@ No argument. Example (for a TurtleScreen instance named screen): - screen.clear() + >>> screen.clear() Note: this method is not available as function. """ @@ -1030,8 +1030,8 @@ Example (for a TurtleScreen instance named screen): >>> screen.setworldcoordinates(-10,-0.5,50,1.5) >>> for _ in range(36): - left(10) - forward(0.5) + ... left(10) + ... forward(0.5) """ if self.mode() != "world": self.mode("world") @@ -1136,7 +1136,7 @@ >>> screen.colormode() 1.0 >>> screen.colormode(255) - >>> turtle.pencolor(240,160,80) + >>> pencolor(240,160,80) """ if cmode is None: return self._colormode @@ -1204,9 +1204,9 @@ >>> screen.tracer(8, 25) >>> dist = 2 >>> for i in range(200): - fd(dist) - rt(90) - dist += 2 + ... fd(dist) + ... rt(90) + ... dist += 2 """ if n is None: return self._tracing @@ -1233,7 +1233,7 @@ self._delayvalue = int(delay) def _incrementudc(self): - "Increment upadate counter.""" + """Increment upadate counter.""" if not TurtleScreen._RUNNING: TurtleScreen._RUNNNING = True raise Terminator @@ -1304,13 +1304,10 @@ Example (for a TurtleScreen instance named screen and a Turtle instance named turtle): - >>> screen.onclick(turtle.goto) - - ### Subsequently clicking into the TurtleScreen will - ### make the turtle move to the clicked point. + >>> screen.onclick(goto) + >>> # Subsequently clicking into the TurtleScreen will + >>> # make the turtle move to the clicked point. >>> screen.onclick(None) - - ### event-binding will be removed """ self._onscreenclick(fun, btn, add) @@ -1324,20 +1321,18 @@ In order to be able to register key-events, TurtleScreen must have focus. (See method listen.) - Example (for a TurtleScreen instance named screen - and a Turtle instance named turtle): + Example (for a TurtleScreen instance named screen): >>> def f(): - fd(50) - lt(60) - - + ... fd(50) + ... lt(60) + ... >>> screen.onkey(f, "Up") >>> screen.listen() - ### Subsequently the turtle can be moved by - ### repeatedly pressing the up-arrow key, - ### consequently drawing a hexagon + Subsequently the turtle can be moved by repeatedly pressing + the up-arrow key, consequently drawing a hexagon + """ if fun is None: if key in self._keys: @@ -1369,12 +1364,12 @@ >>> running = True >>> def f(): - if running: - fd(50) - lt(60) - screen.ontimer(f, 250) - - >>> f() ### makes the turtle marching around + ... if running: + ... fd(50) + ... lt(60) + ... screen.ontimer(f, 250) + ... + >>> f() # makes the turtle marching around >>> running = False """ self._ontimer(fun, t) @@ -1418,7 +1413,7 @@ Example (for a Turtle instance named turtle): >>> turtle.screensize(2000,1500) - ### e. g. to search for an erroneously escaped turtle ;-) + >>> # e. g. to search for an erroneously escaped turtle ;-) """ return self._resize(canvwidth, canvheight, bg) @@ -2004,7 +1999,7 @@ Example (for a Turtle instance named turtle): >>> turtle.pensize() 1 - turtle.pensize(10) # from here on lines of width 10 are drawn + >>> turtle.pensize(10) # from here on lines of width 10 are drawn """ if width is None: return self._pensize @@ -2516,7 +2511,7 @@ Example (for a Turtle instance named turtle): >>> while undobufferentries(): - undo() + ... undo() """ if self.undobuffer is None: return 0 @@ -2592,9 +2587,9 @@ >>> turtle.tracer(8, 25) >>> dist = 2 >>> for i in range(200): - turtle.fd(dist) - turtle.rt(90) - dist += 2 + ... turtle.fd(dist) + ... turtle.rt(90) + ... dist += 2 """ return self.screen.tracer(flag, delay) @@ -2763,7 +2758,6 @@ >>> turtle.shapesize(5,2) >>> turtle.tilt(45) >>> turtle.tiltangle() - >>> """ tilt = -self._tilt * (180.0/math.pi) * self._angleOrient return (tilt / self._degreesPerAU) % self._fullcircle @@ -2963,7 +2957,7 @@ Example (for a Turtle instance named turtle): >>> for i in range(8): - turtle.stamp(); turtle.fd(30) + ... turtle.stamp(); turtle.fd(30) ... >>> turtle.clearstamps(2) >>> turtle.clearstamps(-2) @@ -3430,9 +3424,9 @@ Example for the anonymous turtle, i. e. the procedural way: >>> def turn(x, y): - left(360) - - >>> onclick(turn) # Now clicking into the turtle will turn it. + ... left(360) + ... + >>> onclick(turn) # Now clicking into the turtle will turn it. >>> onclick(None) # event-binding will be removed """ self.screen._onclick(self.turtle._item, fun, btn, add) @@ -3448,16 +3442,17 @@ Example (for a MyTurtle instance named joe): >>> class MyTurtle(Turtle): - def glow(self,x,y): - self.fillcolor("red") - def unglow(self,x,y): - self.fillcolor("") - + ... def glow(self,x,y): + ... self.fillcolor("red") + ... def unglow(self,x,y): + ... self.fillcolor("") + ... >>> joe = MyTurtle() >>> joe.onclick(joe.glow) >>> joe.onrelease(joe.unglow) - ### clicking on joe turns fillcolor red, - ### unclicking turns it to transparent. + + Clicking on joe turns fillcolor red, unclicking turns it to + transparent. """ self.screen._onrelease(self.turtle._item, fun, btn, add) self._update() @@ -3476,9 +3471,9 @@ Example (for a Turtle instance named turtle): >>> turtle.ondrag(turtle.goto) - ### Subsequently clicking and dragging a Turtle will - ### move it across the screen thereby producing handdrawings - ### (if pen is down). + Subsequently clicking and dragging a Turtle will move it + across the screen thereby producing handdrawings (if pen is + down). """ self.screen._ondrag(self.turtle._item, fun, btn, add) @@ -3525,10 +3520,11 @@ Example (for a Turtle instance named turtle): >>> for i in range(4): - turtle.fd(50); turtle.lt(80) - + ... turtle.fd(50); turtle.lt(80) + ... >>> for i in range(8): - turtle.undo() + ... turtle.undo() + ... """ if self.undobuffer is None: return diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -79,6 +79,8 @@ Library ------- +- Issue #13439: Fix many errors in turtle docstrings. + - Issue #12856: Ensure child processes do not inherit the parent's random seed for filename generation in the tempfile module. Patch by Brian Harring. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 20:28:36 2011 From: python-checkins at python.org (petri.lehtinen) Date: Fri, 02 Dec 2011 20:28:36 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEzNDM5?= =?utf8?q?=3A_Fix_many_errors_in_turtle_docstrings=2E?= Message-ID: http://hg.python.org/cpython/rev/cc559e1e3bd8 changeset: 73816:cc559e1e3bd8 branch: 3.2 parent: 73813:69369fd3514b user: Petri Lehtinen date: Fri Dec 02 21:24:14 2011 +0200 summary: Issue #13439: Fix many errors in turtle docstrings. files: Lib/turtle.py | 134 ++++++++++++++++++------------------- Misc/NEWS | 2 + 2 files changed, 66 insertions(+), 70 deletions(-) diff --git a/Lib/turtle.py b/Lib/turtle.py --- a/Lib/turtle.py +++ b/Lib/turtle.py @@ -905,7 +905,7 @@ >>> poly = ((0,0),(10,-5),(0,10),(-10,-5)) >>> s = Shape("compound") >>> s.addcomponent(poly, "red", "blue") - ### .. add more components and then use register_shape() + >>> # .. add more components and then use register_shape() """ if self._type != "compound": raise TurtleGraphicsError("Cannot add component to %s Shape" @@ -1003,7 +1003,7 @@ no backgroundimage, no eventbindings and tracing on. Example (for a TurtleScreen instance named screen): - screen.clear() + >>> screen.clear() Note: this method is not available as function. """ @@ -1077,8 +1077,8 @@ Example (for a TurtleScreen instance named screen): >>> screen.setworldcoordinates(-10,-0.5,50,1.5) >>> for _ in range(36): - left(10) - forward(0.5) + ... left(10) + ... forward(0.5) """ if self.mode() != "world": self.mode("world") @@ -1182,7 +1182,7 @@ >>> screen.colormode() 1.0 >>> screen.colormode(255) - >>> turtle.pencolor(240,160,80) + >>> pencolor(240,160,80) """ if cmode is None: return self._colormode @@ -1250,9 +1250,9 @@ >>> screen.tracer(8, 25) >>> dist = 2 >>> for i in range(200): - fd(dist) - rt(90) - dist += 2 + ... fd(dist) + ... rt(90) + ... dist += 2 """ if n is None: return self._tracing @@ -1279,7 +1279,7 @@ self._delayvalue = int(delay) def _incrementudc(self): - "Increment upadate counter.""" + """Increment upadate counter.""" if not TurtleScreen._RUNNING: TurtleScreen._RUNNNING = True raise Terminator @@ -1347,16 +1347,12 @@ clicked point on the canvas. num -- the number of the mouse-button, defaults to 1 - Example (for a TurtleScreen instance named screen - and a Turtle instance named turtle): - - >>> screen.onclick(turtle.goto) - - ### Subsequently clicking into the TurtleScreen will - ### make the turtle move to the clicked point. + Example (for a TurtleScreen instance named screen) + + >>> screen.onclick(goto) + >>> # Subsequently clicking into the TurtleScreen will + >>> # make the turtle move to the clicked point. >>> screen.onclick(None) - - ### event-binding will be removed """ self._onscreenclick(fun, btn, add) @@ -1370,20 +1366,18 @@ In order to be able to register key-events, TurtleScreen must have focus. (See method listen.) - Example (for a TurtleScreen instance named screen - and a Turtle instance named turtle): + Example (for a TurtleScreen instance named screen): >>> def f(): - fd(50) - lt(60) - - + ... fd(50) + ... lt(60) + ... >>> screen.onkey(f, "Up") >>> screen.listen() - ### Subsequently the turtle can be moved by - ### repeatedly pressing the up-arrow key, - ### consequently drawing a hexagon + Subsequently the turtle can be moved by repeatedly pressing + the up-arrow key, consequently drawing a hexagon + """ if fun is None: if key in self._keys: @@ -1407,16 +1401,15 @@ and a Turtle instance named turtle): >>> def f(): - fd(50) - - - >>> screen.onkey(f, "Up") + ... fd(50) + ... lt(60) + ... + >>> screen.onkeypress(f, "Up") >>> screen.listen() - ### Subsequently the turtle can be moved by - ### repeatedly pressing the up-arrow key, - ### or by keeping pressed the up-arrow key. - ### consequently drawing a hexagon. + Subsequently the turtle can be moved by repeatedly pressing + the up-arrow key, or by keeping pressed the up-arrow key. + consequently drawing a hexagon. """ if fun is None: if key in self._keys: @@ -1448,12 +1441,12 @@ >>> running = True >>> def f(): - if running: - fd(50) - lt(60) - screen.ontimer(f, 250) - - >>> f() ### makes the turtle marching around + ... if running: + ... fd(50) + ... lt(60) + ... screen.ontimer(f, 250) + ... + >>> f() # makes the turtle marching around >>> running = False """ self._ontimer(fun, t) @@ -1497,7 +1490,7 @@ Example (for a Turtle instance named turtle): >>> turtle.screensize(2000,1500) - ### e. g. to search for an erroneously escaped turtle ;-) + >>> # e.g. to search for an erroneously escaped turtle ;-) """ return self._resize(canvwidth, canvheight, bg) @@ -2085,7 +2078,7 @@ Example (for a Turtle instance named turtle): >>> turtle.pensize() 1 - turtle.pensize(10) # from here on lines of width 10 are drawn + >>> turtle.pensize(10) # from here on lines of width 10 are drawn """ if width is None: return self._pensize @@ -2560,7 +2553,7 @@ """Delete the turtle's drawings and restore its default values. No argument. -, + Delete the turtle's drawings from the screen, re-center the turtle and set variables to the default values. @@ -2607,7 +2600,7 @@ Example (for a Turtle instance named turtle): >>> while undobufferentries(): - undo() + ... undo() """ if self.undobuffer is None: return 0 @@ -2683,9 +2676,9 @@ >>> turtle.tracer(8, 25) >>> dist = 2 >>> for i in range(200): - turtle.fd(dist) - turtle.rt(90) - dist += 2 + ... turtle.fd(dist) + ... turtle.rt(90) + ... dist += 2 """ return self.screen.tracer(flag, delay) @@ -2883,7 +2876,6 @@ >>> turtle.shapesize(5,2) >>> turtle.tilt(45) >>> turtle.tiltangle() - >>> """ if angle is None: tilt = -self._tilt * (180.0/math.pi) * self._angleOrient @@ -2928,7 +2920,7 @@ >>> turtle.shapesize(4,2) >>> turtle.shearfactor(-0.5) >>> turtle.shapetransform() - >>> (4.0, -1.0, -0.0, 2.0) + (4.0, -1.0, -0.0, 2.0) """ if t11 is t12 is t21 is t22 is None: return self._shapetrafo @@ -3126,7 +3118,7 @@ Example (for a Turtle instance named turtle): >>> for i in range(8): - turtle.stamp(); turtle.fd(30) + ... turtle.stamp(); turtle.fd(30) ... >>> turtle.clearstamps(2) >>> turtle.clearstamps(-2) @@ -3302,9 +3294,9 @@ Example (for a Turtle instance named turtle): >>> turtle.begin_fill() >>> if turtle.filling(): - turtle.pensize(5) - else: - turtle.pensize(3) + ... turtle.pensize(5) + ... else: + ... turtle.pensize(3) """ return isinstance(self._fillpath, list) @@ -3534,9 +3526,9 @@ Example for the anonymous turtle, i. e. the procedural way: >>> def turn(x, y): - left(360) - - >>> onclick(turn) # Now clicking into the turtle will turn it. + ... left(360) + ... + >>> onclick(turn) # Now clicking into the turtle will turn it. >>> onclick(None) # event-binding will be removed """ self.screen._onclick(self.turtle._item, fun, btn, add) @@ -3552,16 +3544,17 @@ Example (for a MyTurtle instance named joe): >>> class MyTurtle(Turtle): - def glow(self,x,y): - self.fillcolor("red") - def unglow(self,x,y): - self.fillcolor("") - + ... def glow(self,x,y): + ... self.fillcolor("red") + ... def unglow(self,x,y): + ... self.fillcolor("") + ... >>> joe = MyTurtle() >>> joe.onclick(joe.glow) >>> joe.onrelease(joe.unglow) - ### clicking on joe turns fillcolor red, - ### unclicking turns it to transparent. + + Clicking on joe turns fillcolor red, unclicking turns it to + transparent. """ self.screen._onrelease(self.turtle._item, fun, btn, add) self._update() @@ -3580,9 +3573,9 @@ Example (for a Turtle instance named turtle): >>> turtle.ondrag(turtle.goto) - ### Subsequently clicking and dragging a Turtle will - ### move it across the screen thereby producing handdrawings - ### (if pen is down). + Subsequently clicking and dragging a Turtle will move it + across the screen thereby producing handdrawings (if pen is + down). """ self.screen._ondrag(self.turtle._item, fun, btn, add) @@ -3630,10 +3623,11 @@ Example (for a Turtle instance named turtle): >>> for i in range(4): - turtle.fd(50); turtle.lt(80) - + ... turtle.fd(50); turtle.lt(80) + ... >>> for i in range(8): - turtle.undo() + ... turtle.undo() + ... """ if self.undobuffer is None: return diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -87,6 +87,8 @@ Library ------- +- Issue #13439: Fix many errors in turtle docstrings. + - Issue #13487: Make inspect.getmodule robust against changes done to sys.modules while it is iterating over it. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 2 20:28:37 2011 From: python-checkins at python.org (petri.lehtinen) Date: Fri, 02 Dec 2011 20:28:37 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Issue_=2313439=3A_Merge_branch_3=2E2?= Message-ID: http://hg.python.org/cpython/rev/8d60c1c89105 changeset: 73817:8d60c1c89105 parent: 73814:454b97887c5a parent: 73816:cc559e1e3bd8 user: Petri Lehtinen date: Fri Dec 02 21:25:39 2011 +0200 summary: Issue #13439: Merge branch 3.2 files: Lib/turtle.py | 134 ++++++++++++++++++------------------- Misc/NEWS | 2 + 2 files changed, 66 insertions(+), 70 deletions(-) diff --git a/Lib/turtle.py b/Lib/turtle.py --- a/Lib/turtle.py +++ b/Lib/turtle.py @@ -904,7 +904,7 @@ >>> poly = ((0,0),(10,-5),(0,10),(-10,-5)) >>> s = Shape("compound") >>> s.addcomponent(poly, "red", "blue") - ### .. add more components and then use register_shape() + >>> # .. add more components and then use register_shape() """ if self._type != "compound": raise TurtleGraphicsError("Cannot add component to %s Shape" @@ -1002,7 +1002,7 @@ no backgroundimage, no eventbindings and tracing on. Example (for a TurtleScreen instance named screen): - screen.clear() + >>> screen.clear() Note: this method is not available as function. """ @@ -1076,8 +1076,8 @@ Example (for a TurtleScreen instance named screen): >>> screen.setworldcoordinates(-10,-0.5,50,1.5) >>> for _ in range(36): - left(10) - forward(0.5) + ... left(10) + ... forward(0.5) """ if self.mode() != "world": self.mode("world") @@ -1181,7 +1181,7 @@ >>> screen.colormode() 1.0 >>> screen.colormode(255) - >>> turtle.pencolor(240,160,80) + >>> pencolor(240,160,80) """ if cmode is None: return self._colormode @@ -1249,9 +1249,9 @@ >>> screen.tracer(8, 25) >>> dist = 2 >>> for i in range(200): - fd(dist) - rt(90) - dist += 2 + ... fd(dist) + ... rt(90) + ... dist += 2 """ if n is None: return self._tracing @@ -1278,7 +1278,7 @@ self._delayvalue = int(delay) def _incrementudc(self): - "Increment upadate counter.""" + """Increment upadate counter.""" if not TurtleScreen._RUNNING: TurtleScreen._RUNNNING = True raise Terminator @@ -1346,16 +1346,12 @@ clicked point on the canvas. num -- the number of the mouse-button, defaults to 1 - Example (for a TurtleScreen instance named screen - and a Turtle instance named turtle): - - >>> screen.onclick(turtle.goto) - - ### Subsequently clicking into the TurtleScreen will - ### make the turtle move to the clicked point. + Example (for a TurtleScreen instance named screen) + + >>> screen.onclick(goto) + >>> # Subsequently clicking into the TurtleScreen will + >>> # make the turtle move to the clicked point. >>> screen.onclick(None) - - ### event-binding will be removed """ self._onscreenclick(fun, btn, add) @@ -1369,20 +1365,18 @@ In order to be able to register key-events, TurtleScreen must have focus. (See method listen.) - Example (for a TurtleScreen instance named screen - and a Turtle instance named turtle): + Example (for a TurtleScreen instance named screen): >>> def f(): - fd(50) - lt(60) - - + ... fd(50) + ... lt(60) + ... >>> screen.onkey(f, "Up") >>> screen.listen() - ### Subsequently the turtle can be moved by - ### repeatedly pressing the up-arrow key, - ### consequently drawing a hexagon + Subsequently the turtle can be moved by repeatedly pressing + the up-arrow key, consequently drawing a hexagon + """ if fun is None: if key in self._keys: @@ -1406,16 +1400,15 @@ and a Turtle instance named turtle): >>> def f(): - fd(50) - - - >>> screen.onkey(f, "Up") + ... fd(50) + ... lt(60) + ... + >>> screen.onkeypress(f, "Up") >>> screen.listen() - ### Subsequently the turtle can be moved by - ### repeatedly pressing the up-arrow key, - ### or by keeping pressed the up-arrow key. - ### consequently drawing a hexagon. + Subsequently the turtle can be moved by repeatedly pressing + the up-arrow key, or by keeping pressed the up-arrow key. + consequently drawing a hexagon. """ if fun is None: if key in self._keys: @@ -1447,12 +1440,12 @@ >>> running = True >>> def f(): - if running: - fd(50) - lt(60) - screen.ontimer(f, 250) - - >>> f() ### makes the turtle marching around + ... if running: + ... fd(50) + ... lt(60) + ... screen.ontimer(f, 250) + ... + >>> f() # makes the turtle marching around >>> running = False """ self._ontimer(fun, t) @@ -1496,7 +1489,7 @@ Example (for a Turtle instance named turtle): >>> turtle.screensize(2000,1500) - ### e. g. to search for an erroneously escaped turtle ;-) + >>> # e.g. to search for an erroneously escaped turtle ;-) """ return self._resize(canvwidth, canvheight, bg) @@ -2084,7 +2077,7 @@ Example (for a Turtle instance named turtle): >>> turtle.pensize() 1 - turtle.pensize(10) # from here on lines of width 10 are drawn + >>> turtle.pensize(10) # from here on lines of width 10 are drawn """ if width is None: return self._pensize @@ -2559,7 +2552,7 @@ """Delete the turtle's drawings and restore its default values. No argument. -, + Delete the turtle's drawings from the screen, re-center the turtle and set variables to the default values. @@ -2606,7 +2599,7 @@ Example (for a Turtle instance named turtle): >>> while undobufferentries(): - undo() + ... undo() """ if self.undobuffer is None: return 0 @@ -2682,9 +2675,9 @@ >>> turtle.tracer(8, 25) >>> dist = 2 >>> for i in range(200): - turtle.fd(dist) - turtle.rt(90) - dist += 2 + ... turtle.fd(dist) + ... turtle.rt(90) + ... dist += 2 """ return self.screen.tracer(flag, delay) @@ -2882,7 +2875,6 @@ >>> turtle.shapesize(5,2) >>> turtle.tilt(45) >>> turtle.tiltangle() - >>> """ if angle is None: tilt = -self._tilt * (180.0/math.pi) * self._angleOrient @@ -2927,7 +2919,7 @@ >>> turtle.shapesize(4,2) >>> turtle.shearfactor(-0.5) >>> turtle.shapetransform() - >>> (4.0, -1.0, -0.0, 2.0) + (4.0, -1.0, -0.0, 2.0) """ if t11 is t12 is t21 is t22 is None: return self._shapetrafo @@ -3125,7 +3117,7 @@ Example (for a Turtle instance named turtle): >>> for i in range(8): - turtle.stamp(); turtle.fd(30) + ... turtle.stamp(); turtle.fd(30) ... >>> turtle.clearstamps(2) >>> turtle.clearstamps(-2) @@ -3301,9 +3293,9 @@ Example (for a Turtle instance named turtle): >>> turtle.begin_fill() >>> if turtle.filling(): - turtle.pensize(5) - else: - turtle.pensize(3) + ... turtle.pensize(5) + ... else: + ... turtle.pensize(3) """ return isinstance(self._fillpath, list) @@ -3533,9 +3525,9 @@ Example for the anonymous turtle, i. e. the procedural way: >>> def turn(x, y): - left(360) - - >>> onclick(turn) # Now clicking into the turtle will turn it. + ... left(360) + ... + >>> onclick(turn) # Now clicking into the turtle will turn it. >>> onclick(None) # event-binding will be removed """ self.screen._onclick(self.turtle._item, fun, btn, add) @@ -3551,16 +3543,17 @@ Example (for a MyTurtle instance named joe): >>> class MyTurtle(Turtle): - def glow(self,x,y): - self.fillcolor("red") - def unglow(self,x,y): - self.fillcolor("") - + ... def glow(self,x,y): + ... self.fillcolor("red") + ... def unglow(self,x,y): + ... self.fillcolor("") + ... >>> joe = MyTurtle() >>> joe.onclick(joe.glow) >>> joe.onrelease(joe.unglow) - ### clicking on joe turns fillcolor red, - ### unclicking turns it to transparent. + + Clicking on joe turns fillcolor red, unclicking turns it to + transparent. """ self.screen._onrelease(self.turtle._item, fun, btn, add) self._update() @@ -3579,9 +3572,9 @@ Example (for a Turtle instance named turtle): >>> turtle.ondrag(turtle.goto) - ### Subsequently clicking and dragging a Turtle will - ### move it across the screen thereby producing handdrawings - ### (if pen is down). + Subsequently clicking and dragging a Turtle will move it + across the screen thereby producing handdrawings (if pen is + down). """ self.screen._ondrag(self.turtle._item, fun, btn, add) @@ -3629,10 +3622,11 @@ Example (for a Turtle instance named turtle): >>> for i in range(4): - turtle.fd(50); turtle.lt(80) - + ... turtle.fd(50); turtle.lt(80) + ... >>> for i in range(8): - turtle.undo() + ... turtle.undo() + ... """ if self.undobuffer is None: return diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -399,6 +399,8 @@ Library ------- +- Issue #13439: Fix many errors in turtle docstrings. + - Issue #6715: Add a module 'lzma' for compression using the LZMA algorithm. Thanks to Per ?yvind Karlsen for the initial implementation. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 00:31:18 2011 From: python-checkins at python.org (brett.cannon) Date: Sat, 03 Dec 2011 00:31:18 +0100 Subject: [Python-checkins] =?utf8?q?peps=3A_Fix_a_spelling_error=2E?= Message-ID: http://hg.python.org/peps/rev/22c2c36d0072 changeset: 4008:22c2c36d0072 user: Brett Cannon date: Fri Dec 02 18:31:13 2011 -0500 summary: Fix a spelling error. files: pep-0362.txt | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/pep-0362.txt b/pep-0362.txt --- a/pep-0362.txt +++ b/pep-0362.txt @@ -44,7 +44,7 @@ representation affecting the function it represents (but this is an `Open Issues`_). -Indirecation of signature introspection can also occur. If a +Indirection of signature introspection can also occur. If a decorator took a decorated function's signature object and set it on the decorating function then introspection could be redirected to what is actually expected instead of the typical ``*args, **kwargs`` -- Repository URL: http://hg.python.org/peps From solipsis at pitrou.net Sat Dec 3 05:37:27 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Sat, 03 Dec 2011 05:37:27 +0100 Subject: [Python-checkins] Daily reference leaks (8d60c1c89105): sum=0 Message-ID: results for 8d60c1c89105 on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/reflogxX2Nwm', '-x'] From python-checkins at python.org Sat Dec 3 14:44:39 2011 From: python-checkins at python.org (charles-francois.natali) Date: Sat, 03 Dec 2011 14:44:39 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2312612=3A_Add_some_?= =?utf8?q?Valgrind_suppressions_for_64-bit_machines=2E_Patch_by_Paul?= Message-ID: http://hg.python.org/cpython/rev/3eb73f45a614 changeset: 73818:3eb73f45a614 user: Charles-Fran?ois Natali date: Sat Dec 03 14:43:57 2011 +0100 summary: Issue #12612: Add some Valgrind suppressions for 64-bit machines. Patch by Paul Price. files: Misc/ACKS | 1 + Misc/valgrind-python.supp | 24 ++++++++++++++++++++++++ 2 files changed, 25 insertions(+), 0 deletions(-) diff --git a/Misc/ACKS b/Misc/ACKS --- a/Misc/ACKS +++ b/Misc/ACKS @@ -777,6 +777,7 @@ Amrit Prem Paul Prescod Donovan Preston +Paul Price Jyrki Pulliainen Steve Purcell Eduardo P?rez diff --git a/Misc/valgrind-python.supp b/Misc/valgrind-python.supp --- a/Misc/valgrind-python.supp +++ b/Misc/valgrind-python.supp @@ -137,6 +137,18 @@ ###} ### ###{ +### ADDRESS_IN_RANGE/Use of uninitialised value of size 8 +### Memcheck:Addr8 +### fun:PyObject_Free +###} +### +###{ +### ADDRESS_IN_RANGE/Use of uninitialised value of size 8 +### Memcheck:Value8 +### fun:PyObject_Free +###} +### +###{ ### ADDRESS_IN_RANGE/Conditional jump or move depends on uninitialised value ### Memcheck:Cond ### fun:PyObject_Free @@ -155,6 +167,18 @@ ###} ### ###{ +### ADDRESS_IN_RANGE/Use of uninitialised value of size 8 +### Memcheck:Addr8 +### fun:PyObject_Realloc +###} +### +###{ +### ADDRESS_IN_RANGE/Use of uninitialised value of size 8 +### Memcheck:Value8 +### fun:PyObject_Realloc +###} +### +###{ ### ADDRESS_IN_RANGE/Conditional jump or move depends on uninitialised value ### Memcheck:Cond ### fun:PyObject_Realloc -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 15:00:32 2011 From: python-checkins at python.org (jason.coombs) Date: Sat, 03 Dec 2011 15:00:32 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEyNjY2?= =?utf8?q?=3A_Clarifying_changes_in_map_for_Python_3?= Message-ID: http://hg.python.org/cpython/rev/3b505df38fd8 changeset: 73819:3b505df38fd8 branch: 3.2 parent: 73816:cc559e1e3bd8 user: Jason R. Coombs date: Mon Aug 01 17:51:34 2011 -0400 summary: Issue #12666: Clarifying changes in map for Python 3 files: Doc/whatsnew/3.0.rst | 10 +++++++++- 1 files changed, 9 insertions(+), 1 deletions(-) diff --git a/Doc/whatsnew/3.0.rst b/Doc/whatsnew/3.0.rst --- a/Doc/whatsnew/3.0.rst +++ b/Doc/whatsnew/3.0.rst @@ -154,7 +154,9 @@ :meth:`dict.itervalues` methods are no longer supported. * :func:`map` and :func:`filter` return iterators. If you really need - a list, a quick fix is e.g. ``list(map(...))``, but a better fix is + a list and the input sequences are all of equal length, a quick + fix is to wrap :func:`map` in :func:`list`, e.g. ``list(map(...))``, + but a better fix is often to use a list comprehension (especially when the original code uses :keyword:`lambda`), or rewriting the code so it doesn't need a list at all. Particularly tricky is :func:`map` invoked for the @@ -162,6 +164,12 @@ regular :keyword:`for` loop (since creating a list would just be wasteful). + If the input sequences are not of equal length, :func:`map` will + stop at the termination of the shortest of the sequences. For full + compatibility with `map` from Python 2.x, also wrap the sequences in + :func:`itertools.zip_longest`, e.g. ``map(func, *sequences)`` becomes + ``list(map(func, itertools.zip_longest(*sequences)))``. + * :func:`range` now behaves like :func:`xrange` used to behave, except it works with values of arbitrary size. The latter no longer exists. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 15:00:33 2011 From: python-checkins at python.org (jason.coombs) Date: Sat, 03 Dec 2011 15:00:33 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEyNjY2?= =?utf8?q?=3A_Added_section_about_map_changes=2E?= Message-ID: http://hg.python.org/cpython/rev/0e2812b16f5f changeset: 73820:0e2812b16f5f branch: 3.2 user: Jason R. Coombs date: Sat Dec 03 08:24:21 2011 -0500 summary: Issue #12666: Added section about map changes. files: Doc/howto/pyporting.rst | 12 ++++++++++++ 1 files changed, 12 insertions(+), 0 deletions(-) diff --git a/Doc/howto/pyporting.rst b/Doc/howto/pyporting.rst --- a/Doc/howto/pyporting.rst +++ b/Doc/howto/pyporting.rst @@ -505,6 +505,18 @@ to :mod:`unittest`. +Update `map` for imbalanced input sequences +''''''''''''''''''''''''''''''''''''''''''' + +With Python 2, `map` would pad input sequences of unequal length with +`None` values, returning a sequence as long as the longest input sequence. + +With Python 3, if the input sequences to `map` are of unequal length, `map` +will stop at the termination of the shortest of the sequences. For full +compatibility with `map` from Python 2.x, also wrap the sequences in +:func:`itertools.zip_longest`, e.g. ``map(func, *sequences)`` becomes +``list(map(func, itertools.zip_longest(*sequences)))``. + Eliminate ``-3`` Warnings ------------------------- -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 15:00:36 2011 From: python-checkins at python.org (jason.coombs) Date: Sat, 03 Dec 2011 15:00:36 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_fix_for_Issue_=2312666_from_3=2E2?= Message-ID: http://hg.python.org/cpython/rev/51af35bd46f7 changeset: 73821:51af35bd46f7 parent: 73818:3eb73f45a614 parent: 73820:0e2812b16f5f user: Jason R. Coombs date: Sat Dec 03 08:59:53 2011 -0500 summary: Merge fix for Issue #12666 from 3.2 files: Doc/howto/pyporting.rst | 12 ++++++++++++ Doc/whatsnew/3.0.rst | 10 +++++++++- 2 files changed, 21 insertions(+), 1 deletions(-) diff --git a/Doc/howto/pyporting.rst b/Doc/howto/pyporting.rst --- a/Doc/howto/pyporting.rst +++ b/Doc/howto/pyporting.rst @@ -505,6 +505,18 @@ to :mod:`unittest`. +Update `map` for imbalanced input sequences +''''''''''''''''''''''''''''''''''''''''''' + +With Python 2, `map` would pad input sequences of unequal length with +`None` values, returning a sequence as long as the longest input sequence. + +With Python 3, if the input sequences to `map` are of unequal length, `map` +will stop at the termination of the shortest of the sequences. For full +compatibility with `map` from Python 2.x, also wrap the sequences in +:func:`itertools.zip_longest`, e.g. ``map(func, *sequences)`` becomes +``list(map(func, itertools.zip_longest(*sequences)))``. + Eliminate ``-3`` Warnings ------------------------- diff --git a/Doc/whatsnew/3.0.rst b/Doc/whatsnew/3.0.rst --- a/Doc/whatsnew/3.0.rst +++ b/Doc/whatsnew/3.0.rst @@ -154,7 +154,9 @@ :meth:`dict.itervalues` methods are no longer supported. * :func:`map` and :func:`filter` return iterators. If you really need - a list, a quick fix is e.g. ``list(map(...))``, but a better fix is + a list and the input sequences are all of equal length, a quick + fix is to wrap :func:`map` in :func:`list`, e.g. ``list(map(...))``, + but a better fix is often to use a list comprehension (especially when the original code uses :keyword:`lambda`), or rewriting the code so it doesn't need a list at all. Particularly tricky is :func:`map` invoked for the @@ -162,6 +164,12 @@ regular :keyword:`for` loop (since creating a list would just be wasteful). + If the input sequences are not of equal length, :func:`map` will + stop at the termination of the shortest of the sequences. For full + compatibility with `map` from Python 2.x, also wrap the sequences in + :func:`itertools.zip_longest`, e.g. ``map(func, *sequences)`` becomes + ``list(map(func, itertools.zip_longest(*sequences)))``. + * :func:`range` now behaves like :func:`xrange` used to behave, except it works with values of arbitrary size. The latter no longer exists. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 15:46:11 2011 From: python-checkins at python.org (jason.coombs) Date: Sat, 03 Dec 2011 15:46:11 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogSXNzdWUgIzEzMjEx?= =?utf8?q?=3A_Add_=2Ereason_attribute_to_HTTPError_to_implement_parent_cla?= =?utf8?q?ss?= Message-ID: http://hg.python.org/cpython/rev/ee94b89f65ab changeset: 73822:ee94b89f65ab branch: 2.7 parent: 73815:6e03ab9950f6 user: Jason R. Coombs date: Mon Nov 07 10:44:25 2011 -0500 summary: Issue #13211: Add .reason attribute to HTTPError to implement parent class (URLError) interface. files: Lib/test/test_urllib2.py | 11 +++++++++++ Lib/urllib2.py | 6 ++++++ 2 files changed, 17 insertions(+), 0 deletions(-) diff --git a/Lib/test/test_urllib2.py b/Lib/test/test_urllib2.py --- a/Lib/test/test_urllib2.py +++ b/Lib/test/test_urllib2.py @@ -1318,6 +1318,17 @@ req = Request(url) self.assertEqual(req.get_full_url(), url) +def test_HTTPError_interface(): + """ + Issue 13211 reveals that HTTPError didn't implement the URLError + interface even though HTTPError is a subclass of URLError. + + >>> err = urllib2.HTTPError(msg='something bad happened', url=None, code=None, hdrs=None, fp=None) + >>> assert hasattr(err, 'reason') + >>> err.reason + 'something bad happened' + """ + def test_main(verbose=None): from test import test_urllib2 test_support.run_doctest(test_urllib2, verbose) diff --git a/Lib/urllib2.py b/Lib/urllib2.py --- a/Lib/urllib2.py +++ b/Lib/urllib2.py @@ -166,6 +166,12 @@ def __str__(self): return 'HTTP Error %s: %s' % (self.code, self.msg) + # since URLError specifies a .reason attribute, HTTPError should also + # provide this attribute. See issue13211 fo discussion. + @property + def reason(self): + return self.msg + # copied from cookielib.py _cut_port_re = re.compile(r":\d+$") def request_host(request): -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 15:46:15 2011 From: python-checkins at python.org (jason.coombs) Date: Sat, 03 Dec 2011 15:46:15 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEzMjEx?= =?utf8?q?=3A_Add_=2Ereason_attribute_to_HTTPError_to_implement_parent_cla?= =?utf8?q?ss?= Message-ID: http://hg.python.org/cpython/rev/abfe76a19f63 changeset: 73823:abfe76a19f63 branch: 3.2 parent: 73820:0e2812b16f5f user: Jason R. Coombs date: Mon Nov 07 10:50:32 2011 -0500 summary: Issue #13211: Add .reason attribute to HTTPError to implement parent class (URLError) interface. files: Lib/test/test_urllib2.py | 11 +++++++++++ Lib/urllib/error.py | 6 ++++++ 2 files changed, 17 insertions(+), 0 deletions(-) diff --git a/Lib/test/test_urllib2.py b/Lib/test/test_urllib2.py --- a/Lib/test/test_urllib2.py +++ b/Lib/test/test_urllib2.py @@ -1409,6 +1409,17 @@ req = Request(url) self.assertEqual(req.get_full_url(), url) +def test_HTTPError_interface(): + """ + Issue 13211 reveals that HTTPError didn't implement the URLError + interface even though HTTPError is a subclass of URLError. + + >>> err = urllib.error.HTTPError(msg='something bad happened', url=None, code=None, hdrs=None, fp=None) + >>> assert hasattr(err, 'reason') + >>> err.reason + 'something bad happened' + """ + def test_main(verbose=None): from test import test_urllib2 support.run_doctest(test_urllib2, verbose) diff --git a/Lib/urllib/error.py b/Lib/urllib/error.py --- a/Lib/urllib/error.py +++ b/Lib/urllib/error.py @@ -52,6 +52,12 @@ def __str__(self): return 'HTTP Error %s: %s' % (self.code, self.msg) + # since URLError specifies a .reason attribute, HTTPError should also + # provide this attribute. See issue13211 for discussion. + @property + def reason(self): + return self.msg + # exception raised when downloaded size does not match content-length class ContentTooShortError(URLError): def __init__(self, message, content): -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 15:46:18 2011 From: python-checkins at python.org (jason.coombs) Date: Sat, 03 Dec 2011 15:46:18 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merged_fix_for_=2313211_from_3=2E2?= Message-ID: http://hg.python.org/cpython/rev/deb60efd32eb changeset: 73824:deb60efd32eb parent: 73821:51af35bd46f7 parent: 73823:abfe76a19f63 user: Jason R. Coombs date: Sat Dec 03 09:39:58 2011 -0500 summary: Merged fix for #13211 from 3.2 files: Lib/test/test_urllib2.py | 11 +++++++++++ Lib/urllib/error.py | 6 ++++++ 2 files changed, 17 insertions(+), 0 deletions(-) diff --git a/Lib/test/test_urllib2.py b/Lib/test/test_urllib2.py --- a/Lib/test/test_urllib2.py +++ b/Lib/test/test_urllib2.py @@ -1449,6 +1449,17 @@ req = Request(url) self.assertEqual(req.get_full_url(), url) +def test_HTTPError_interface(): + """ + Issue 13211 reveals that HTTPError didn't implement the URLError + interface even though HTTPError is a subclass of URLError. + + >>> err = urllib.error.HTTPError(msg='something bad happened', url=None, code=None, hdrs=None, fp=None) + >>> assert hasattr(err, 'reason') + >>> err.reason + 'something bad happened' + """ + def test_main(verbose=None): from test import test_urllib2 support.run_doctest(test_urllib2, verbose) diff --git a/Lib/urllib/error.py b/Lib/urllib/error.py --- a/Lib/urllib/error.py +++ b/Lib/urllib/error.py @@ -55,6 +55,12 @@ def __str__(self): return 'HTTP Error %s: %s' % (self.code, self.msg) + # since URLError specifies a .reason attribute, HTTPError should also + # provide this attribute. See issue13211 for discussion. + @property + def reason(self): + return self.msg + # exception raised when downloaded size does not match content-length class ContentTooShortError(URLError): def __init__(self, message, content): -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 16:01:29 2011 From: python-checkins at python.org (eric.araujo) Date: Sat, 03 Dec 2011 16:01:29 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Fix_glitches_in_email=2Epol?= =?utf8?q?icy_docs_=28=2312208=29?= Message-ID: http://hg.python.org/cpython/rev/9ffb00748a47 changeset: 73825:9ffb00748a47 user: ?ric Araujo date: Sat Dec 03 16:00:56 2011 +0100 summary: Fix glitches in email.policy docs (#12208) files: Doc/library/email.policy.rst | 6 ++++-- 1 files changed, 4 insertions(+), 2 deletions(-) diff --git a/Doc/library/email.policy.rst b/Doc/library/email.policy.rst --- a/Doc/library/email.policy.rst +++ b/Doc/library/email.policy.rst @@ -48,16 +48,18 @@ >>> import email.policy >>> from subprocess import Popen, PIPE >>> with open('mymsg.txt', 'b') as f: - ... Msg = msg_from_binary_file(f, policy=email.policy.mbox) + ... msg = msg_from_binary_file(f, policy=email.policy.mbox) >>> p = Popen(['sendmail', msg['To'][0].address], stdin=PIPE) >>> g = BytesGenerator(p.stdin, policy=email.policy.SMTP) >>> g.flatten(msg) >>> p.stdin.close() >>> rc = p.wait() +.. XXX email.policy.mbox/MBOX does not exist yet + Some email package methods accept a *policy* keyword argument, allowing the policy to be overridden for that method. For example, the following code uses -the :meth:`email.message.Message.as_string` method of the *msg* object from the +the :meth:`~email.message.Message.as_string` method of the *msg* object from the previous example and re-write it to a file using the native line separators for the platform on which it is running:: -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 19:51:40 2011 From: python-checkins at python.org (meador.inge) Date: Sat, 03 Dec 2011 19:51:40 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogSXNzdWUgIzEzNTEz?= =?utf8?q?=3A_IOBase_docs_incorrectly_link_to_the_readline_module?= Message-ID: http://hg.python.org/cpython/rev/fb8b6d310fb8 changeset: 73826:fb8b6d310fb8 branch: 2.7 parent: 73822:ee94b89f65ab user: Meador Inge date: Sat Dec 03 12:13:42 2011 -0600 summary: Issue #13513: IOBase docs incorrectly link to the readline module files: Doc/library/io.rst | 2 +- Misc/NEWS | 3 +++ 2 files changed, 4 insertions(+), 1 deletions(-) diff --git a/Doc/library/io.rst b/Doc/library/io.rst --- a/Doc/library/io.rst +++ b/Doc/library/io.rst @@ -233,7 +233,7 @@ :class:`IOBase` object can be iterated over yielding the lines in a stream. Lines are defined slightly differently depending on whether the stream is a binary stream (yielding :class:`bytes`), or a text stream (yielding - :class:`unicode` strings). See :meth:`readline` below. + :class:`unicode` strings). See :meth:`~IOBase.readline` below. IOBase is also a context manager and therefore supports the :keyword:`with` statement. In this example, *file* is closed after the diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -430,6 +430,9 @@ Documentation ------------- +- Issue #13513: Fix io.IOBase documentation to correctly link to the + io.IOBase.readline method instead of the readline module. + - Issue #13237: Reorganise subprocess documentation to emphasise convenience functions and the most commonly needed arguments to Popen. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 19:51:45 2011 From: python-checkins at python.org (meador.inge) Date: Sat, 03 Dec 2011 19:51:45 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEzNTEz?= =?utf8?q?=3A_IOBase_docs_incorrectly_link_to_the_readline_module?= Message-ID: http://hg.python.org/cpython/rev/9792e812198f changeset: 73827:9792e812198f branch: 3.2 parent: 73823:abfe76a19f63 user: Meador Inge date: Sat Dec 03 12:29:54 2011 -0600 summary: Issue #13513: IOBase docs incorrectly link to the readline module files: Doc/library/io.rst | 2 +- Misc/NEWS | 3 +++ 2 files changed, 4 insertions(+), 1 deletions(-) diff --git a/Doc/library/io.rst b/Doc/library/io.rst --- a/Doc/library/io.rst +++ b/Doc/library/io.rst @@ -217,7 +217,7 @@ :class:`IOBase` object can be iterated over yielding the lines in a stream. Lines are defined slightly differently depending on whether the stream is a binary stream (yielding bytes), or a text stream (yielding character - strings). See :meth:`readline` below. + strings). See :meth:`~IOBase.readline` below. IOBase is also a context manager and therefore supports the :keyword:`with` statement. In this example, *file* is closed after the diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -316,6 +316,9 @@ Documentation ------------- +- Issue #13513: Fix io.IOBase documentation to correctly link to the + io.IOBase.readline method instead of the readline module. + - Issue #13237: Reorganise subprocess documentation to emphasise convenience functions and the most commonly needed arguments to Popen. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 19:51:48 2011 From: python-checkins at python.org (meador.inge) Date: Sat, 03 Dec 2011 19:51:48 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Issue_=2313513=3A_IOBase_docs_incorrectly_link_to_the_readli?= =?utf8?q?ne_module?= Message-ID: http://hg.python.org/cpython/rev/ab5bc05ac223 changeset: 73828:ab5bc05ac223 parent: 73825:9ffb00748a47 parent: 73827:9792e812198f user: Meador Inge date: Sat Dec 03 12:50:18 2011 -0600 summary: Issue #13513: IOBase docs incorrectly link to the readline module files: Doc/library/io.rst | 2 +- Misc/NEWS | 3 +++ 2 files changed, 4 insertions(+), 1 deletions(-) diff --git a/Doc/library/io.rst b/Doc/library/io.rst --- a/Doc/library/io.rst +++ b/Doc/library/io.rst @@ -213,7 +213,7 @@ :class:`IOBase` object can be iterated over yielding the lines in a stream. Lines are defined slightly differently depending on whether the stream is a binary stream (yielding bytes), or a text stream (yielding character - strings). See :meth:`readline` below. + strings). See :meth:`~IOBase.readline` below. IOBase is also a context manager and therefore supports the :keyword:`with` statement. In this example, *file* is closed after the diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -1885,6 +1885,9 @@ Documentation ------------- +- Issue #13513: Fix io.IOBase documentation to correctly link to the + io.IOBase.readline method instead of the readline module. + - Issue #13237: Reorganise subprocess documentation to emphasise convenience functions and the most commonly needed arguments to Popen. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 21:13:06 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 21:13:06 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Mention_the_new?= =?utf8?q?_GTK+_3_bindings=2E?= Message-ID: http://hg.python.org/cpython/rev/1e3f09a896da changeset: 73829:1e3f09a896da branch: 3.2 parent: 73827:9792e812198f user: Antoine Pitrou date: Sat Dec 03 20:59:24 2011 +0100 summary: Mention the new GTK+ 3 bindings. files: Doc/faq/gui.rst | 8 ++++++-- Doc/library/othergui.rst | 16 +++++++++++----- 2 files changed, 17 insertions(+), 7 deletions(-) diff --git a/Doc/faq/gui.rst b/Doc/faq/gui.rst --- a/Doc/faq/gui.rst +++ b/Doc/faq/gui.rst @@ -68,8 +68,12 @@ Gtk+ ---- -PyGtk bindings for the `Gtk+ toolkit `_ have been -implemented by James Henstridge; see . +The `GObject introspection bindings `_ +for Python allow you to write GTK+ 3 applications. There is also a +`Python GTK+ 3 Tutorial `_. + +The older PyGtk bindings for the `Gtk+ 2 toolkit `_ have +been implemented by James Henstridge; see . FLTK ---- diff --git a/Doc/library/othergui.rst b/Doc/library/othergui.rst --- a/Doc/library/othergui.rst +++ b/Doc/library/othergui.rst @@ -34,11 +34,17 @@ .. seealso:: - `PyGTK `_ - is a set of bindings for the `GTK `_ widget set. It - provides an object oriented interface that is slightly higher level than - the C one. It comes with many more widgets than Tkinter provides, and has - good Python-specific reference documentation. There are also bindings to + `PyGObject `_ + provides introspection bindings for C libraries using + `GObject `_. One of + these libraries is the `GTK+ 3 `_ widget set. + GTK+ comes with many more widgets than Tkinter provides. An online + `Python GTK+ 3 Tutorial `_ + is available. + + `PyGTK `_ provides bindings for an older version + of the library, GTK+ 2. It provides an object oriented interface that + is slightly higher level than the C one. There are also bindings to `GNOME `_. One well known PyGTK application is `PythonCAD `_. An online `tutorial `_ is available. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 21:13:07 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 21:13:07 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Mention_PySide_?= =?utf8?q?in_the_=22other_GUIs=22_page=2E?= Message-ID: http://hg.python.org/cpython/rev/bc3f1629d825 changeset: 73830:bc3f1629d825 branch: 3.2 user: Antoine Pitrou date: Sat Dec 03 21:07:01 2011 +0100 summary: Mention PySide in the "other GUIs" page. files: Doc/library/othergui.rst | 5 +++++ 1 files changed, 5 insertions(+), 0 deletions(-) diff --git a/Doc/library/othergui.rst b/Doc/library/othergui.rst --- a/Doc/library/othergui.rst +++ b/Doc/library/othergui.rst @@ -61,6 +61,11 @@ with Python and Qt `_, by Mark Summerfield. + `PySide `_ + is a newer binding to the Qt toolkit, provided by Nokia. + Compared to PyQt, its licensing scheme is friendlier to non-open source + applications. + `wxPython `_ wxPython is a cross-platform GUI toolkit for Python that is built around the popular `wxWidgets `_ (formerly wxWindows) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 21:13:07 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 21:13:07 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_doc_fixes_from_3=2E2?= Message-ID: http://hg.python.org/cpython/rev/309f12301de2 changeset: 73831:309f12301de2 parent: 73828:ab5bc05ac223 parent: 73830:bc3f1629d825 user: Antoine Pitrou date: Sat Dec 03 21:07:50 2011 +0100 summary: Merge doc fixes from 3.2 files: Doc/faq/gui.rst | 8 ++++++-- Doc/library/othergui.rst | 21 ++++++++++++++++----- 2 files changed, 22 insertions(+), 7 deletions(-) diff --git a/Doc/faq/gui.rst b/Doc/faq/gui.rst --- a/Doc/faq/gui.rst +++ b/Doc/faq/gui.rst @@ -68,8 +68,12 @@ Gtk+ ---- -PyGtk bindings for the `Gtk+ toolkit `_ have been -implemented by James Henstridge; see . +The `GObject introspection bindings `_ +for Python allow you to write GTK+ 3 applications. There is also a +`Python GTK+ 3 Tutorial `_. + +The older PyGtk bindings for the `Gtk+ 2 toolkit `_ have +been implemented by James Henstridge; see . FLTK ---- diff --git a/Doc/library/othergui.rst b/Doc/library/othergui.rst --- a/Doc/library/othergui.rst +++ b/Doc/library/othergui.rst @@ -34,11 +34,17 @@ .. seealso:: - `PyGTK `_ - is a set of bindings for the `GTK `_ widget set. It - provides an object oriented interface that is slightly higher level than - the C one. It comes with many more widgets than Tkinter provides, and has - good Python-specific reference documentation. There are also bindings to + `PyGObject `_ + provides introspection bindings for C libraries using + `GObject `_. One of + these libraries is the `GTK+ 3 `_ widget set. + GTK+ comes with many more widgets than Tkinter provides. An online + `Python GTK+ 3 Tutorial `_ + is available. + + `PyGTK `_ provides bindings for an older version + of the library, GTK+ 2. It provides an object oriented interface that + is slightly higher level than the C one. There are also bindings to `GNOME `_. One well known PyGTK application is `PythonCAD `_. An online `tutorial `_ is available. @@ -55,6 +61,11 @@ with Python and Qt `_, by Mark Summerfield. + `PySide `_ + is a newer binding to the Qt toolkit, provided by Nokia. + Compared to PyQt, its licensing scheme is friendlier to non-open source + applications. + `wxPython `_ wxPython is a cross-platform GUI toolkit for Python that is built around the popular `wxWidgets `_ (formerly wxWindows) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 21:30:38 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 21:30:38 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Remove_referenc?= =?utf8?q?es_to_psyco=2C_which_is_mostly_unmaintained_and_doesn=27t_work_w?= =?utf8?q?ith?= Message-ID: http://hg.python.org/cpython/rev/48a723092e1e changeset: 73832:48a723092e1e branch: 3.2 parent: 73830:bc3f1629d825 user: Antoine Pitrou date: Sat Dec 03 21:21:36 2011 +0100 summary: Remove references to psyco, which is mostly unmaintained and doesn't work with Python 3. files: Doc/faq/extending.rst | 8 +------- Doc/faq/programming.rst | 14 +++----------- 2 files changed, 4 insertions(+), 18 deletions(-) diff --git a/Doc/faq/extending.rst b/Doc/faq/extending.rst --- a/Doc/faq/extending.rst +++ b/Doc/faq/extending.rst @@ -37,13 +37,7 @@ There are a number of alternatives to writing your own C extensions, depending on what you're trying to do. -.. XXX make sure these all work; mention Cython - -If you need more speed, `Psyco `_ generates x86 -assembly code from Python bytecode. You can use Psyco to compile the most -time-critical functions in your code, and gain a significant improvement with -very little effort, as long as you're running on a machine with an -x86-compatible processor. +.. XXX make sure these all work `Cython `_ and its relative `Pyrex `_ are compilers diff --git a/Doc/faq/programming.rst b/Doc/faq/programming.rst --- a/Doc/faq/programming.rst +++ b/Doc/faq/programming.rst @@ -121,19 +121,11 @@ That's a tough one, in general. There are many tricks to speed up Python code; consider rewriting parts in C as a last resort. -In some cases it's possible to automatically translate Python to C or x86 -assembly language, meaning that you don't have to modify your code to gain -increased speed. - -.. XXX seems to have overlap with other questions! - `Cython `_ and `Pyrex `_ can compile a slightly modified version of Python code into a C extension, and -can be used on many different platforms. - -`Psyco `_ is a just-in-time compiler that -translates Python code into x86 assembly language. If you can use it, Psyco can -provide dramatic speedups for critical functions. +can be used on many different platforms. Depending on your code, Cython +may be able to make it significantly faster than when run by the Python +interpreter. The rest of this answer will discuss various tricks for squeezing a bit more speed out of Python code. *Never* apply any optimization tricks unless you know -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 21:30:39 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 21:30:39 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Remove_references_to_psyco=2C_which_is_mostly_unmaintained_a?= =?utf8?q?nd_doesn=27t_work_with?= Message-ID: http://hg.python.org/cpython/rev/06087f6890af changeset: 73833:06087f6890af parent: 73831:309f12301de2 parent: 73832:48a723092e1e user: Antoine Pitrou date: Sat Dec 03 21:25:17 2011 +0100 summary: Remove references to psyco, which is mostly unmaintained and doesn't work with Python 3. files: Doc/faq/extending.rst | 8 +------- Doc/faq/programming.rst | 14 +++----------- 2 files changed, 4 insertions(+), 18 deletions(-) diff --git a/Doc/faq/extending.rst b/Doc/faq/extending.rst --- a/Doc/faq/extending.rst +++ b/Doc/faq/extending.rst @@ -37,13 +37,7 @@ There are a number of alternatives to writing your own C extensions, depending on what you're trying to do. -.. XXX make sure these all work; mention Cython - -If you need more speed, `Psyco `_ generates x86 -assembly code from Python bytecode. You can use Psyco to compile the most -time-critical functions in your code, and gain a significant improvement with -very little effort, as long as you're running on a machine with an -x86-compatible processor. +.. XXX make sure these all work `Cython `_ and its relative `Pyrex `_ are compilers diff --git a/Doc/faq/programming.rst b/Doc/faq/programming.rst --- a/Doc/faq/programming.rst +++ b/Doc/faq/programming.rst @@ -121,19 +121,11 @@ That's a tough one, in general. There are many tricks to speed up Python code; consider rewriting parts in C as a last resort. -In some cases it's possible to automatically translate Python to C or x86 -assembly language, meaning that you don't have to modify your code to gain -increased speed. - -.. XXX seems to have overlap with other questions! - `Cython `_ and `Pyrex `_ can compile a slightly modified version of Python code into a C extension, and -can be used on many different platforms. - -`Psyco `_ is a just-in-time compiler that -translates Python code into x86 assembly language. If you can use it, Psyco can -provide dramatic speedups for critical functions. +can be used on many different platforms. Depending on your code, Cython +may be able to make it significantly faster than when run by the Python +interpreter. The rest of this answer will discuss various tricks for squeezing a bit more speed out of Python code. *Never* apply any optimization tricks unless you know -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:17:02 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:17:02 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Streamline_FAQ_?= =?utf8?q?entry_about_the_ternary_operator=2C_and_suggest_using_io=2EStrin?= =?utf8?q?gIO?= Message-ID: http://hg.python.org/cpython/rev/140b8c98c7b9 changeset: 73834:140b8c98c7b9 branch: 3.2 parent: 73832:48a723092e1e user: Antoine Pitrou date: Sat Dec 03 22:11:11 2011 +0100 summary: Streamline FAQ entry about the ternary operator, and suggest using io.StringIO for a mutable unicode container. files: Doc/faq/programming.rst | 74 +++++++--------------------- 1 files changed, 20 insertions(+), 54 deletions(-) diff --git a/Doc/faq/programming.rst b/Doc/faq/programming.rst --- a/Doc/faq/programming.rst +++ b/Doc/faq/programming.rst @@ -679,61 +679,21 @@ Is there an equivalent of C's "?:" ternary operator? ---------------------------------------------------- -Yes, this feature was added in Python 2.5. The syntax would be as follows:: +Yes, there is. The syntax is as follows:: [on_true] if [expression] else [on_false] x, y = 50, 25 - small = x if x < y else y -For versions previous to 2.5 the answer would be 'No'. +Before this syntax was introduced in Python 2.5, a common idiom was to use +logical operators:: -.. XXX remove rest? + [expression] and [on_true] or [on_false] -In many cases you can mimic ``a ? b : c`` with ``a and b or c``, but there's a -flaw: if *b* is zero (or empty, or ``None`` -- anything that tests false) then -*c* will be selected instead. In many cases you can prove by looking at the -code that this can't happen (e.g. because *b* is a constant or has a type that -can never be false), but in general this can be a problem. - -Tim Peters (who wishes it was Steve Majewski) suggested the following solution: -``(a and [b] or [c])[0]``. Because ``[b]`` is a singleton list it is never -false, so the wrong path is never taken; then applying ``[0]`` to the whole -thing gets the *b* or *c* that you really wanted. Ugly, but it gets you there -in the rare cases where it is really inconvenient to rewrite your code using -'if'. - -The best course is usually to write a simple ``if...else`` statement. Another -solution is to implement the ``?:`` operator as a function:: - - def q(cond, on_true, on_false): - if cond: - if not isfunction(on_true): - return on_true - else: - return on_true() - else: - if not isfunction(on_false): - return on_false - else: - return on_false() - -In most cases you'll pass b and c directly: ``q(a, b, c)``. To avoid evaluating -b or c when they shouldn't be, encapsulate them within a lambda function, e.g.: -``q(a, lambda: b, lambda: c)``. - -It has been asked *why* Python has no if-then-else expression. There are -several answers: many languages do just fine without one; it can easily lead to -less readable code; no sufficiently "Pythonic" syntax has been discovered; a -search of the standard library found remarkably few places where using an -if-then-else expression would make the code more understandable. - -In 2002, :pep:`308` was written proposing several possible syntaxes and the -community was asked to vote on the issue. The vote was inconclusive. Most -people liked one of the syntaxes, but also hated other syntaxes; many votes -implied that people preferred no ternary operator rather than having a syntax -they hated. +However, this idiom is unsafe, as it can give wrong results when *on_true* +has a false boolean value. Therefore, it is always better to use +the ``... if ... else ...`` form. Is it possible to write obfuscated one-liners in Python? @@ -852,15 +812,21 @@ How do I modify a string in place? ---------------------------------- -You can't, because strings are immutable. If you need an object with this -ability, try converting the string to a list or use the array module:: +You can't, because strings are immutable. In most situations, you should +simply construct a new string from the various parts you want to assemble +it from. However, if you need an object with the ability to modify in-place +unicode data, try using a :class:`io.StringIO` object or the :mod:`array` +module:: >>> s = "Hello, world" - >>> a = list(s) - >>> print(a) - ['H', 'e', 'l', 'l', 'o', ',', ' ', 'w', 'o', 'r', 'l', 'd'] - >>> a[7:] = list("there!") - >>> ''.join(a) + >>> sio = io.StringIO(s) + >>> sio.getvalue() + 'Hello, world' + >>> sio.seek(7) + 7 + >>> sio.write("there!") + 6 + >>> sio.getvalue() 'Hello, there!' >>> import array -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:17:02 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:17:02 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Streamline_FAQ_entry_about_the_ternary_operator=2C_and_sugge?= =?utf8?q?st_using_io=2EStringIO?= Message-ID: http://hg.python.org/cpython/rev/10440e132cfb changeset: 73835:10440e132cfb parent: 73833:06087f6890af parent: 73834:140b8c98c7b9 user: Antoine Pitrou date: Sat Dec 03 22:11:45 2011 +0100 summary: Streamline FAQ entry about the ternary operator, and suggest using io.StringIO for a mutable unicode container. files: Doc/faq/programming.rst | 74 +++++++--------------------- 1 files changed, 20 insertions(+), 54 deletions(-) diff --git a/Doc/faq/programming.rst b/Doc/faq/programming.rst --- a/Doc/faq/programming.rst +++ b/Doc/faq/programming.rst @@ -679,61 +679,21 @@ Is there an equivalent of C's "?:" ternary operator? ---------------------------------------------------- -Yes, this feature was added in Python 2.5. The syntax would be as follows:: +Yes, there is. The syntax is as follows:: [on_true] if [expression] else [on_false] x, y = 50, 25 - small = x if x < y else y -For versions previous to 2.5 the answer would be 'No'. +Before this syntax was introduced in Python 2.5, a common idiom was to use +logical operators:: -.. XXX remove rest? + [expression] and [on_true] or [on_false] -In many cases you can mimic ``a ? b : c`` with ``a and b or c``, but there's a -flaw: if *b* is zero (or empty, or ``None`` -- anything that tests false) then -*c* will be selected instead. In many cases you can prove by looking at the -code that this can't happen (e.g. because *b* is a constant or has a type that -can never be false), but in general this can be a problem. - -Tim Peters (who wishes it was Steve Majewski) suggested the following solution: -``(a and [b] or [c])[0]``. Because ``[b]`` is a singleton list it is never -false, so the wrong path is never taken; then applying ``[0]`` to the whole -thing gets the *b* or *c* that you really wanted. Ugly, but it gets you there -in the rare cases where it is really inconvenient to rewrite your code using -'if'. - -The best course is usually to write a simple ``if...else`` statement. Another -solution is to implement the ``?:`` operator as a function:: - - def q(cond, on_true, on_false): - if cond: - if not isfunction(on_true): - return on_true - else: - return on_true() - else: - if not isfunction(on_false): - return on_false - else: - return on_false() - -In most cases you'll pass b and c directly: ``q(a, b, c)``. To avoid evaluating -b or c when they shouldn't be, encapsulate them within a lambda function, e.g.: -``q(a, lambda: b, lambda: c)``. - -It has been asked *why* Python has no if-then-else expression. There are -several answers: many languages do just fine without one; it can easily lead to -less readable code; no sufficiently "Pythonic" syntax has been discovered; a -search of the standard library found remarkably few places where using an -if-then-else expression would make the code more understandable. - -In 2002, :pep:`308` was written proposing several possible syntaxes and the -community was asked to vote on the issue. The vote was inconclusive. Most -people liked one of the syntaxes, but also hated other syntaxes; many votes -implied that people preferred no ternary operator rather than having a syntax -they hated. +However, this idiom is unsafe, as it can give wrong results when *on_true* +has a false boolean value. Therefore, it is always better to use +the ``... if ... else ...`` form. Is it possible to write obfuscated one-liners in Python? @@ -852,15 +812,21 @@ How do I modify a string in place? ---------------------------------- -You can't, because strings are immutable. If you need an object with this -ability, try converting the string to a list or use the array module:: +You can't, because strings are immutable. In most situations, you should +simply construct a new string from the various parts you want to assemble +it from. However, if you need an object with the ability to modify in-place +unicode data, try using a :class:`io.StringIO` object or the :mod:`array` +module:: >>> s = "Hello, world" - >>> a = list(s) - >>> print(a) - ['H', 'e', 'l', 'l', 'o', ',', ' ', 'w', 'o', 'r', 'l', 'd'] - >>> a[7:] = list("there!") - >>> ''.join(a) + >>> sio = io.StringIO(s) + >>> sio.getvalue() + 'Hello, world' + >>> sio.seek(7) + 7 + >>> sio.write("there!") + 6 + >>> sio.getvalue() 'Hello, there!' >>> import array -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:31:15 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:31:15 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Remove_outdate_?= =?utf8?q?FAQ_content?= Message-ID: http://hg.python.org/cpython/rev/93f64ae51fff changeset: 73836:93f64ae51fff branch: 3.2 parent: 73834:140b8c98c7b9 user: Antoine Pitrou date: Sat Dec 03 22:19:55 2011 +0100 summary: Remove outdate FAQ content files: Doc/faq/programming.rst | 40 +++++----------------------- 1 files changed, 8 insertions(+), 32 deletions(-) diff --git a/Doc/faq/programming.rst b/Doc/faq/programming.rst --- a/Doc/faq/programming.rst +++ b/Doc/faq/programming.rst @@ -901,11 +901,11 @@ Is there an equivalent to Perl's chomp() for removing trailing newlines from strings? ------------------------------------------------------------------------------------- -Starting with Python 2.2, you can use ``S.rstrip("\r\n")`` to remove all -occurrences of any line terminator from the end of the string ``S`` without -removing other trailing whitespace. If the string ``S`` represents more than -one line, with several empty lines at the end, the line terminators for all the -blank lines will be removed:: +You can use ``S.rstrip("\r\n")`` to remove all occurrences of any line +terminator from the end of the string ``S`` without removing other trailing +whitespace. If the string ``S`` represents more than one line, with several +empty lines at the end, the line terminators for all the blank lines will +be removed:: >>> lines = ("line 1 \r\n" ... "\r\n" @@ -916,15 +916,6 @@ Since this is typically only desired when reading text one line at a time, using ``S.rstrip()`` this way works well. -For older versions of Python, there are two partial substitutes: - -- If you want to remove all trailing whitespace, use the ``rstrip()`` method of - string objects. This removes all trailing whitespace, not just a single - newline. - -- Otherwise, if there is only one line in the string ``S``, use - ``S.splitlines()[0]``. - Is there a scanf() or sscanf() equivalent? ------------------------------------------ @@ -1042,15 +1033,8 @@ else: last = mylist[i] -If all elements of the list may be used as dictionary keys (i.e. they are all -hashable) this is often faster :: - - d = {} - for x in mylist: - d[x] = 1 - mylist = list(d.keys()) - -In Python 2.5 and later, the following is possible instead:: +If all elements of the list may be used as set keys (i.e. they are all +:term:`hashable`) this is often faster :: mylist = list(set(mylist)) @@ -1420,15 +1404,7 @@ C.count = 314 -Static methods are possible since Python 2.2:: - - class C: - def static(arg1, arg2, arg3): - # No 'self' parameter! - ... - static = staticmethod(static) - -With Python 2.4's decorators, this can also be written as :: +Static methods are possible:: class C: @staticmethod -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:31:16 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:31:16 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Remove_outdated_FAQ_content?= Message-ID: http://hg.python.org/cpython/rev/3d96c2dbec13 changeset: 73837:3d96c2dbec13 parent: 73835:10440e132cfb parent: 73836:93f64ae51fff user: Antoine Pitrou date: Sat Dec 03 22:26:01 2011 +0100 summary: Remove outdated FAQ content files: Doc/faq/programming.rst | 40 +++++----------------------- 1 files changed, 8 insertions(+), 32 deletions(-) diff --git a/Doc/faq/programming.rst b/Doc/faq/programming.rst --- a/Doc/faq/programming.rst +++ b/Doc/faq/programming.rst @@ -901,11 +901,11 @@ Is there an equivalent to Perl's chomp() for removing trailing newlines from strings? ------------------------------------------------------------------------------------- -Starting with Python 2.2, you can use ``S.rstrip("\r\n")`` to remove all -occurrences of any line terminator from the end of the string ``S`` without -removing other trailing whitespace. If the string ``S`` represents more than -one line, with several empty lines at the end, the line terminators for all the -blank lines will be removed:: +You can use ``S.rstrip("\r\n")`` to remove all occurrences of any line +terminator from the end of the string ``S`` without removing other trailing +whitespace. If the string ``S`` represents more than one line, with several +empty lines at the end, the line terminators for all the blank lines will +be removed:: >>> lines = ("line 1 \r\n" ... "\r\n" @@ -916,15 +916,6 @@ Since this is typically only desired when reading text one line at a time, using ``S.rstrip()`` this way works well. -For older versions of Python, there are two partial substitutes: - -- If you want to remove all trailing whitespace, use the ``rstrip()`` method of - string objects. This removes all trailing whitespace, not just a single - newline. - -- Otherwise, if there is only one line in the string ``S``, use - ``S.splitlines()[0]``. - Is there a scanf() or sscanf() equivalent? ------------------------------------------ @@ -1042,15 +1033,8 @@ else: last = mylist[i] -If all elements of the list may be used as dictionary keys (i.e. they are all -hashable) this is often faster :: - - d = {} - for x in mylist: - d[x] = 1 - mylist = list(d.keys()) - -In Python 2.5 and later, the following is possible instead:: +If all elements of the list may be used as set keys (i.e. they are all +:term:`hashable`) this is often faster :: mylist = list(set(mylist)) @@ -1420,15 +1404,7 @@ C.count = 314 -Static methods are possible since Python 2.2:: - - class C: - def static(arg1, arg2, arg3): - # No 'self' parameter! - ... - static = staticmethod(static) - -With Python 2.4's decorators, this can also be written as :: +Static methods are possible:: class C: @staticmethod -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:45:37 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:45:37 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Advocate_PyTupl?= =?utf8?q?e=5FPack_instead_of_manual_building_of_tuples?= Message-ID: http://hg.python.org/cpython/rev/8267af6f370d changeset: 73838:8267af6f370d branch: 3.2 parent: 73836:93f64ae51fff user: Antoine Pitrou date: Sat Dec 03 22:30:19 2011 +0100 summary: Advocate PyTuple_Pack instead of manual building of tuples files: Doc/faq/extending.rst | 7 +------ 1 files changed, 1 insertions(+), 6 deletions(-) diff --git a/Doc/faq/extending.rst b/Doc/faq/extending.rst --- a/Doc/faq/extending.rst +++ b/Doc/faq/extending.rst @@ -99,12 +99,7 @@ How do I use Py_BuildValue() to create a tuple of arbitrary length? ------------------------------------------------------------------- -You can't. Use ``t = PyTuple_New(n)`` instead, and fill it with objects using -``PyTuple_SetItem(t, i, o)`` -- note that this "eats" a reference count of -``o``, so you have to :c:func:`Py_INCREF` it. Lists have similar functions -``PyList_New(n)`` and ``PyList_SetItem(l, i, o)``. Note that you *must* set all -the tuple items to some value before you pass the tuple to Python code -- -``PyTuple_New(n)`` initializes them to NULL, which isn't a valid Python value. +You can't. Use :c:func:`PyTuple_Pack` instead. How do I call an object's method from C? -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:45:37 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:45:37 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Refine_FAQ_entr?= =?utf8?q?y_for_catching_stdout?= Message-ID: http://hg.python.org/cpython/rev/c82492fc9943 changeset: 73839:c82492fc9943 branch: 3.2 user: Antoine Pitrou date: Sat Dec 03 22:35:31 2011 +0100 summary: Refine FAQ entry for catching stdout files: Doc/faq/extending.rst | 21 +++++++++++++++------ 1 files changed, 15 insertions(+), 6 deletions(-) diff --git a/Doc/faq/extending.rst b/Doc/faq/extending.rst --- a/Doc/faq/extending.rst +++ b/Doc/faq/extending.rst @@ -142,21 +142,30 @@ just allow the standard traceback mechanism to work. Then, the output will go wherever your ``write()`` method sends it. -The easiest way to do this is to use the StringIO class in the standard library. +The easiest way to do this is to use the :class:`io.StringIO` class:: -Sample code and use for catching stdout: + >>> import io, sys + >>> sys.stdout = io.StringIO() + >>> print('foo') + >>> print('hello world!') + >>> sys.stderr.write(sys.stdout.getvalue()) + foo + hello world! - >>> class StdoutCatcher: +A custom object to do the same would look like this:: + + >>> import io, sys + >>> class StdoutCatcher(io.TextIOBase): ... def __init__(self): - ... self.data = '' + ... self.data = [] ... def write(self, stuff): - ... self.data = self.data + stuff + ... self.data.append(stuff) ... >>> import sys >>> sys.stdout = StdoutCatcher() >>> print('foo') >>> print('hello world!') - >>> sys.stderr.write(sys.stdout.data) + >>> sys.stderr.write(''.join(sys.stdout.data)) foo hello world! -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:45:38 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:45:38 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Remove_outdated?= =?utf8?q?_question_=28the_bsddb_module_isn=27t_included_anymore=29?= Message-ID: http://hg.python.org/cpython/rev/040c751e5497 changeset: 73840:040c751e5497 branch: 3.2 user: Antoine Pitrou date: Sat Dec 03 22:37:14 2011 +0100 summary: Remove outdated question (the bsddb module isn't included anymore) files: Doc/faq/general.rst | 34 --------------------------------- 1 files changed, 0 insertions(+), 34 deletions(-) diff --git a/Doc/faq/general.rst b/Doc/faq/general.rst --- a/Doc/faq/general.rst +++ b/Doc/faq/general.rst @@ -469,38 +469,3 @@ If you want to discuss Python's use in education, you may be interested in joining `the edu-sig mailing list `_. - - -Upgrading Python -================ - -What is this bsddb185 module my application keeps complaining about? --------------------------------------------------------------------- - -.. XXX remove this question? - -Starting with Python2.3, the distribution includes the `PyBSDDB package -` as a replacement for the old bsddb module. It -includes functions which provide backward compatibility at the API level, but -requires a newer version of the underlying `Berkeley DB -`_ library. Files created with the older bsddb module -can't be opened directly using the new module. - -Using your old version of Python and a pair of scripts which are part of Python -2.3 (db2pickle.py and pickle2db.py, in the Tools/scripts directory) you can -convert your old database files to the new format. Using your old Python -version, run the db2pickle.py script to convert it to a pickle, e.g.:: - - python2.2 /db2pickley.py database.db database.pck - -Rename your database file:: - - mv database.db olddatabase.db - -Now convert the pickle file to a new format database:: - - python /pickle2db.py database.db database.pck - -The precise commands you use will vary depending on the particulars of your -installation. For full details about operation of these two scripts check the -doc string at the start of each one. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:45:39 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:45:39 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Fix_compatibili?= =?utf8?q?ty_statement=2E?= Message-ID: http://hg.python.org/cpython/rev/01292dd6dadd changeset: 73841:01292dd6dadd branch: 3.2 user: Antoine Pitrou date: Sat Dec 03 22:39:13 2011 +0100 summary: Fix compatibility statement. files: Doc/faq/general.rst | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/Doc/faq/general.rst b/Doc/faq/general.rst --- a/Doc/faq/general.rst +++ b/Doc/faq/general.rst @@ -19,7 +19,7 @@ window systems, and is extensible in C or C++. It is also usable as an extension language for applications that need a programmable interface. Finally, Python is portable: it runs on many Unix variants, on the Mac, and on -PCs under MS-DOS, Windows, Windows NT, and OS/2. +Windows 2000 and later. To find out more, start with :ref:`tutorial-index`. The `Beginner's Guide to Python `_ links to other -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 22:45:40 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 22:45:40 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_from_3=2E2?= Message-ID: http://hg.python.org/cpython/rev/abd2adbfceb7 changeset: 73842:abd2adbfceb7 parent: 73837:3d96c2dbec13 parent: 73841:01292dd6dadd user: Antoine Pitrou date: Sat Dec 03 22:40:23 2011 +0100 summary: Merge from 3.2 files: Doc/faq/extending.rst | 28 +++++++++++++---------- Doc/faq/general.rst | 36 +------------------------------ 2 files changed, 17 insertions(+), 47 deletions(-) diff --git a/Doc/faq/extending.rst b/Doc/faq/extending.rst --- a/Doc/faq/extending.rst +++ b/Doc/faq/extending.rst @@ -99,12 +99,7 @@ How do I use Py_BuildValue() to create a tuple of arbitrary length? ------------------------------------------------------------------- -You can't. Use ``t = PyTuple_New(n)`` instead, and fill it with objects using -``PyTuple_SetItem(t, i, o)`` -- note that this "eats" a reference count of -``o``, so you have to :c:func:`Py_INCREF` it. Lists have similar functions -``PyList_New(n)`` and ``PyList_SetItem(l, i, o)``. Note that you *must* set all -the tuple items to some value before you pass the tuple to Python code -- -``PyTuple_New(n)`` initializes them to NULL, which isn't a valid Python value. +You can't. Use :c:func:`PyTuple_Pack` instead. How do I call an object's method from C? @@ -147,21 +142,30 @@ just allow the standard traceback mechanism to work. Then, the output will go wherever your ``write()`` method sends it. -The easiest way to do this is to use the StringIO class in the standard library. +The easiest way to do this is to use the :class:`io.StringIO` class:: -Sample code and use for catching stdout: + >>> import io, sys + >>> sys.stdout = io.StringIO() + >>> print('foo') + >>> print('hello world!') + >>> sys.stderr.write(sys.stdout.getvalue()) + foo + hello world! - >>> class StdoutCatcher: +A custom object to do the same would look like this:: + + >>> import io, sys + >>> class StdoutCatcher(io.TextIOBase): ... def __init__(self): - ... self.data = '' + ... self.data = [] ... def write(self, stuff): - ... self.data = self.data + stuff + ... self.data.append(stuff) ... >>> import sys >>> sys.stdout = StdoutCatcher() >>> print('foo') >>> print('hello world!') - >>> sys.stderr.write(sys.stdout.data) + >>> sys.stderr.write(''.join(sys.stdout.data)) foo hello world! diff --git a/Doc/faq/general.rst b/Doc/faq/general.rst --- a/Doc/faq/general.rst +++ b/Doc/faq/general.rst @@ -19,7 +19,7 @@ window systems, and is extensible in C or C++. It is also usable as an extension language for applications that need a programmable interface. Finally, Python is portable: it runs on many Unix variants, on the Mac, and on -PCs under MS-DOS, Windows, Windows NT, and OS/2. +Windows 2000 and later. To find out more, start with :ref:`tutorial-index`. The `Beginner's Guide to Python `_ links to other @@ -469,38 +469,3 @@ If you want to discuss Python's use in education, you may be interested in joining `the edu-sig mailing list `_. - - -Upgrading Python -================ - -What is this bsddb185 module my application keeps complaining about? --------------------------------------------------------------------- - -.. XXX remove this question? - -Starting with Python2.3, the distribution includes the `PyBSDDB package -` as a replacement for the old bsddb module. It -includes functions which provide backward compatibility at the API level, but -requires a newer version of the underlying `Berkeley DB -`_ library. Files created with the older bsddb module -can't be opened directly using the new module. - -Using your old version of Python and a pair of scripts which are part of Python -2.3 (db2pickle.py and pickle2db.py, in the Tools/scripts directory) you can -convert your old database files to the new format. Using your old Python -version, run the db2pickle.py script to convert it to a pickle, e.g.:: - - python2.2 /db2pickley.py database.db database.pck - -Rename your database file:: - - mv database.db olddatabase.db - -Now convert the pickle file to a new format database:: - - python /pickle2db.py database.db database.pck - -The precise commands you use will vary depending on the particulars of your -installation. For full details about operation of these two scripts check the -doc string at the start of each one. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 23:15:25 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 23:15:25 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Reformulate_ans?= =?utf8?q?wer=2C_and_remove_off-topic_discussion_of_bytecode_in_a_question?= Message-ID: http://hg.python.org/cpython/rev/00fdd8b1e17b changeset: 73843:00fdd8b1e17b branch: 3.2 parent: 73841:01292dd6dadd user: Antoine Pitrou date: Sat Dec 03 22:56:02 2011 +0100 summary: Reformulate answer, and remove off-topic discussion of bytecode in a question about compiling to C. files: Doc/faq/design.rst | 49 +++++++++++---------------------- 1 files changed, 17 insertions(+), 32 deletions(-) diff --git a/Doc/faq/design.rst b/Doc/faq/design.rst --- a/Doc/faq/design.rst +++ b/Doc/faq/design.rst @@ -380,11 +380,24 @@ Can Python be compiled to machine code, C or some other language? ----------------------------------------------------------------- -Not easily. Python's high level data types, dynamic typing of objects and +Practical answer: + +`Cython `_ and `Pyrex `_ +compile a modified version of Python with optional annotations into C +extensions. `Weave `_ makes it easy to +intermingle Python and C code in various ways to increase performance. +`Nuitka `_ is an up-and-coming compiler of Python +into C++ code, aiming to support the full Python language. + +Theoretical answer: + + .. XXX not sure what to make of this + +Not trivially. Python's high level data types, dynamic typing of objects and run-time invocation of the interpreter (using :func:`eval` or :func:`exec`) -together mean that a "compiled" Python program would probably consist mostly of -calls into the Python run-time system, even for seemingly simple operations like -``x+1``. +together mean that a na?vely "compiled" Python program would probably consist +mostly of calls into the Python run-time system, even for seemingly simple +operations like ``x+1``. Several projects described in the Python newsgroup or at past `Python conferences `_ have shown that this @@ -395,34 +408,6 @@ from the `1997 Python conference `_ for more information.) -Internally, Python source code is always translated into a bytecode -representation, and this bytecode is then executed by the Python virtual -machine. In order to avoid the overhead of repeatedly parsing and translating -modules that rarely change, this byte code is written into a file whose name -ends in ".pyc" whenever a module is parsed. When the corresponding .py file is -changed, it is parsed and translated again and the .pyc file is rewritten. - -There is no performance difference once the .pyc file has been loaded, as the -bytecode read from the .pyc file is exactly the same as the bytecode created by -direct translation. The only difference is that loading code from a .pyc file -is faster than parsing and translating a .py file, so the presence of -precompiled .pyc files improves the start-up time of Python scripts. If -desired, the Lib/compileall.py module can be used to create valid .pyc files for -a given set of modules. - -Note that the main script executed by Python, even if its filename ends in .py, -is not compiled to a .pyc file. It is compiled to bytecode, but the bytecode is -not saved to a file. Usually main scripts are quite short, so this doesn't cost -much speed. - -.. XXX check which of these projects are still alive - -There are also several programs which make it easier to intermingle Python and C -code in various ways to increase performance. See, for example, `Cython -`_, `Pyrex -`_ and `Weave -`_. - How does Python manage memory? ------------------------------ -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 23:15:26 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 23:15:26 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Break_down_and_?= =?utf8?q?refine_memory_management_question?= Message-ID: http://hg.python.org/cpython/rev/29fc3c23c73d changeset: 73844:29fc3c23c73d branch: 3.2 user: Antoine Pitrou date: Sat Dec 03 23:06:50 2011 +0100 summary: Break down and refine memory management question files: Doc/faq/design.rst | 74 +++++++++++++++------------------ 1 files changed, 34 insertions(+), 40 deletions(-) diff --git a/Doc/faq/design.rst b/Doc/faq/design.rst --- a/Doc/faq/design.rst +++ b/Doc/faq/design.rst @@ -413,66 +413,59 @@ ------------------------------ The details of Python memory management depend on the implementation. The -standard C implementation of Python uses reference counting to detect -inaccessible objects, and another mechanism to collect reference cycles, +standard implementation of Python, :term:`CPython`, uses reference counting to +detect inaccessible objects, and another mechanism to collect reference cycles, periodically executing a cycle detection algorithm which looks for inaccessible cycles and deletes the objects involved. The :mod:`gc` module provides functions to perform a garbage collection, obtain debugging statistics, and tune the collector's parameters. -Jython relies on the Java runtime so the JVM's garbage collector is used. This -difference can cause some subtle porting problems if your Python code depends on -the behavior of the reference counting implementation. +Other implementations (such as `Jython `_ or +`PyPy `_), however, can rely on a different mechanism +such as a full-blown garbage collector. This difference can cause some +subtle porting problems if your Python code depends on the behavior of the +reference counting implementation. -.. XXX relevant for Python 3? +In some Python implementations, the following code (which is fine in CPython) +will probably run out of file descriptors:: - Sometimes objects get stuck in traceback temporarily and hence are not - deallocated when you might expect. Clear the traceback with:: + for file in very_long_list_of_files: + f = open(file) + c = f.read(1) - import sys - sys.last_traceback = None +Indeed, using CPython's reference counting and destructor scheme, each new +assignment to *f* closes the previous file. With a traditional GC, however, +those file objects will only get collected (and closed) at varying and possibly +long intervals. - Tracebacks are used for reporting errors, implementing debuggers and related - things. They contain a portion of the program state extracted during the - handling of an exception (usually the most recent exception). +If you want to write code that will work with any Python implementation, +you should explicitly close the file or use the :keyword:`with` statement; +this will work regardless of memory management scheme:: -In the absence of circularities, Python programs do not need to manage memory -explicitly. + for file in very_long_list_of_files: + with open(file) as f: + c = f.read(1) -Why doesn't Python use a more traditional garbage collection scheme? For one -thing, this is not a C standard feature and hence it's not portable. (Yes, we -know about the Boehm GC library. It has bits of assembler code for *most* -common platforms, not for all of them, and although it is mostly transparent, it -isn't completely transparent; patches are required to get Python to work with -it.) + +Why doesn't CPython use a more traditional garbage collection scheme? +--------------------------------------------------------------------- + +For one thing, this is not a C standard feature and hence it's not portable. +(Yes, we know about the Boehm GC library. It has bits of assembler code for +*most* common platforms, not for all of them, and although it is mostly +transparent, it isn't completely transparent; patches are required to get +Python to work with it.) Traditional GC also becomes a problem when Python is embedded into other applications. While in a standalone Python it's fine to replace the standard malloc() and free() with versions provided by the GC library, an application embedding Python may want to have its *own* substitute for malloc() and free(), -and may not want Python's. Right now, Python works with anything that +and may not want Python's. Right now, CPython works with anything that implements malloc() and free() properly. -In Jython, the following code (which is fine in CPython) will probably run out -of file descriptors long before it runs out of memory:: - for file in very_long_list_of_files: - f = open(file) - c = f.read(1) - -Using the current reference counting and destructor scheme, each new assignment -to f closes the previous file. Using GC, this is not guaranteed. If you want -to write code that will work with any Python implementation, you should -explicitly close the file or use the :keyword:`with` statement; this will work -regardless of GC:: - - for file in very_long_list_of_files: - with open(file) as f: - c = f.read(1) - - -Why isn't all memory freed when Python exits? ---------------------------------------------- +Why isn't all memory freed when CPython exits? +---------------------------------------------- Objects referenced from the global namespaces of Python modules are not always deallocated when Python exits. This may happen if there are circular -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 23:15:26 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 23:15:26 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Streamline_ment?= =?utf8?q?ion_of_sorted=28=29?= Message-ID: http://hg.python.org/cpython/rev/bf0c5cd303e5 changeset: 73845:bf0c5cd303e5 branch: 3.2 user: Antoine Pitrou date: Sat Dec 03 23:08:57 2011 +0100 summary: Streamline mention of sorted() files: Doc/faq/design.rst | 8 ++++---- 1 files changed, 4 insertions(+), 4 deletions(-) diff --git a/Doc/faq/design.rst b/Doc/faq/design.rst --- a/Doc/faq/design.rst +++ b/Doc/faq/design.rst @@ -625,10 +625,10 @@ you won't be fooled into accidentally overwriting a list when you need a sorted copy but also need to keep the unsorted version around. -In Python 2.4 a new built-in function -- :func:`sorted` -- has been added. -This function creates a new list from a provided iterable, sorts it and returns -it. For example, here's how to iterate over the keys of a dictionary in sorted -order:: +If you want to return a new list, use the built-in :func:`sorted` function +instead. This function creates a new list from a provided iterable, sorts +it and returns it. For example, here's how to iterate over the keys of a +dictionary in sorted order:: for key in sorted(mydict): ... # do whatever with mydict[key]... -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 3 23:15:27 2011 From: python-checkins at python.org (antoine.pitrou) Date: Sat, 03 Dec 2011 23:15:27 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_from_3=2E2?= Message-ID: http://hg.python.org/cpython/rev/f09477693434 changeset: 73846:f09477693434 parent: 73842:abd2adbfceb7 parent: 73845:bf0c5cd303e5 user: Antoine Pitrou date: Sat Dec 03 23:10:12 2011 +0100 summary: Merge from 3.2 files: Doc/faq/design.rst | 131 +++++++++++++------------------- 1 files changed, 55 insertions(+), 76 deletions(-) diff --git a/Doc/faq/design.rst b/Doc/faq/design.rst --- a/Doc/faq/design.rst +++ b/Doc/faq/design.rst @@ -380,11 +380,24 @@ Can Python be compiled to machine code, C or some other language? ----------------------------------------------------------------- -Not easily. Python's high level data types, dynamic typing of objects and +Practical answer: + +`Cython `_ and `Pyrex `_ +compile a modified version of Python with optional annotations into C +extensions. `Weave `_ makes it easy to +intermingle Python and C code in various ways to increase performance. +`Nuitka `_ is an up-and-coming compiler of Python +into C++ code, aiming to support the full Python language. + +Theoretical answer: + + .. XXX not sure what to make of this + +Not trivially. Python's high level data types, dynamic typing of objects and run-time invocation of the interpreter (using :func:`eval` or :func:`exec`) -together mean that a "compiled" Python program would probably consist mostly of -calls into the Python run-time system, even for seemingly simple operations like -``x+1``. +together mean that a na?vely "compiled" Python program would probably consist +mostly of calls into the Python run-time system, even for seemingly simple +operations like ``x+1``. Several projects described in the Python newsgroup or at past `Python conferences `_ have shown that this @@ -395,99 +408,64 @@ from the `1997 Python conference `_ for more information.) -Internally, Python source code is always translated into a bytecode -representation, and this bytecode is then executed by the Python virtual -machine. In order to avoid the overhead of repeatedly parsing and translating -modules that rarely change, this byte code is written into a file whose name -ends in ".pyc" whenever a module is parsed. When the corresponding .py file is -changed, it is parsed and translated again and the .pyc file is rewritten. - -There is no performance difference once the .pyc file has been loaded, as the -bytecode read from the .pyc file is exactly the same as the bytecode created by -direct translation. The only difference is that loading code from a .pyc file -is faster than parsing and translating a .py file, so the presence of -precompiled .pyc files improves the start-up time of Python scripts. If -desired, the Lib/compileall.py module can be used to create valid .pyc files for -a given set of modules. - -Note that the main script executed by Python, even if its filename ends in .py, -is not compiled to a .pyc file. It is compiled to bytecode, but the bytecode is -not saved to a file. Usually main scripts are quite short, so this doesn't cost -much speed. - -.. XXX check which of these projects are still alive - -There are also several programs which make it easier to intermingle Python and C -code in various ways to increase performance. See, for example, `Cython -`_, `Pyrex -`_ and `Weave -`_. - How does Python manage memory? ------------------------------ The details of Python memory management depend on the implementation. The -standard C implementation of Python uses reference counting to detect -inaccessible objects, and another mechanism to collect reference cycles, +standard implementation of Python, :term:`CPython`, uses reference counting to +detect inaccessible objects, and another mechanism to collect reference cycles, periodically executing a cycle detection algorithm which looks for inaccessible cycles and deletes the objects involved. The :mod:`gc` module provides functions to perform a garbage collection, obtain debugging statistics, and tune the collector's parameters. -Jython relies on the Java runtime so the JVM's garbage collector is used. This -difference can cause some subtle porting problems if your Python code depends on -the behavior of the reference counting implementation. +Other implementations (such as `Jython `_ or +`PyPy `_), however, can rely on a different mechanism +such as a full-blown garbage collector. This difference can cause some +subtle porting problems if your Python code depends on the behavior of the +reference counting implementation. -.. XXX relevant for Python 3? +In some Python implementations, the following code (which is fine in CPython) +will probably run out of file descriptors:: - Sometimes objects get stuck in traceback temporarily and hence are not - deallocated when you might expect. Clear the traceback with:: + for file in very_long_list_of_files: + f = open(file) + c = f.read(1) - import sys - sys.last_traceback = None +Indeed, using CPython's reference counting and destructor scheme, each new +assignment to *f* closes the previous file. With a traditional GC, however, +those file objects will only get collected (and closed) at varying and possibly +long intervals. - Tracebacks are used for reporting errors, implementing debuggers and related - things. They contain a portion of the program state extracted during the - handling of an exception (usually the most recent exception). +If you want to write code that will work with any Python implementation, +you should explicitly close the file or use the :keyword:`with` statement; +this will work regardless of memory management scheme:: -In the absence of circularities, Python programs do not need to manage memory -explicitly. + for file in very_long_list_of_files: + with open(file) as f: + c = f.read(1) -Why doesn't Python use a more traditional garbage collection scheme? For one -thing, this is not a C standard feature and hence it's not portable. (Yes, we -know about the Boehm GC library. It has bits of assembler code for *most* -common platforms, not for all of them, and although it is mostly transparent, it -isn't completely transparent; patches are required to get Python to work with -it.) + +Why doesn't CPython use a more traditional garbage collection scheme? +--------------------------------------------------------------------- + +For one thing, this is not a C standard feature and hence it's not portable. +(Yes, we know about the Boehm GC library. It has bits of assembler code for +*most* common platforms, not for all of them, and although it is mostly +transparent, it isn't completely transparent; patches are required to get +Python to work with it.) Traditional GC also becomes a problem when Python is embedded into other applications. While in a standalone Python it's fine to replace the standard malloc() and free() with versions provided by the GC library, an application embedding Python may want to have its *own* substitute for malloc() and free(), -and may not want Python's. Right now, Python works with anything that +and may not want Python's. Right now, CPython works with anything that implements malloc() and free() properly. -In Jython, the following code (which is fine in CPython) will probably run out -of file descriptors long before it runs out of memory:: - for file in very_long_list_of_files: - f = open(file) - c = f.read(1) - -Using the current reference counting and destructor scheme, each new assignment -to f closes the previous file. Using GC, this is not guaranteed. If you want -to write code that will work with any Python implementation, you should -explicitly close the file or use the :keyword:`with` statement; this will work -regardless of GC:: - - for file in very_long_list_of_files: - with open(file) as f: - c = f.read(1) - - -Why isn't all memory freed when Python exits? ---------------------------------------------- +Why isn't all memory freed when CPython exits? +---------------------------------------------- Objects referenced from the global namespaces of Python modules are not always deallocated when Python exits. This may happen if there are circular @@ -647,10 +625,10 @@ you won't be fooled into accidentally overwriting a list when you need a sorted copy but also need to keep the unsorted version around. -In Python 2.4 a new built-in function -- :func:`sorted` -- has been added. -This function creates a new list from a provided iterable, sorts it and returns -it. For example, here's how to iterate over the keys of a dictionary in sorted -order:: +If you want to return a new list, use the built-in :func:`sorted` function +instead. This function creates a new list from a provided iterable, sorts +it and returns it. For example, here's how to iterate over the keys of a +dictionary in sorted order:: for key in sorted(mydict): ... # do whatever with mydict[key]... -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sun Dec 4 03:05:11 2011 From: python-checkins at python.org (nick.coghlan) Date: Sun, 04 Dec 2011 03:05:11 +0100 Subject: [Python-checkins] =?utf8?q?peps=3A_PEP_406_=28ImportEngine_API=29?= =?utf8?q?=3A_defer_this_until_3=2E4_at_the_earliest=2C_don=27t_propose?= Message-ID: http://hg.python.org/peps/rev/e2ccffa4d3b5 changeset: 4009:e2ccffa4d3b5 user: Nick Coghlan date: Sun Dec 04 12:05:01 2011 +1000 summary: PEP 406 (ImportEngine API): defer this until 3.4 at the earliest, don't propose altering the PEP 302 APIs files: pep-0406.txt | 127 +++++++++++++++++++++++--------------- 1 files changed, 77 insertions(+), 50 deletions(-) diff --git a/pep-0406.txt b/pep-0406.txt --- a/pep-0406.txt +++ b/pep-0406.txt @@ -3,11 +3,12 @@ Version: $Revision$ Last-Modified: $Date$ Author: Nick Coghlan , Greg Slodkowicz -Status: Draft +Status: Deferred Type: Standards Track Content-Type: text/x-rst Created: 4-Jul-2011 -Post-History: 31-Jul-2011, 13-Nov-2011 +Python-Version: 3.4 +Post-History: 31-Jul-2011, 13-Nov-2011, 4-Dec-2011 Abstract ======== @@ -18,17 +19,27 @@ an alternative to completely replacing the built-in implementation of the import statement, by overriding the ``__import__()`` function. To work with the builtin import functionality and importing via import engine objects, -module importers and loaders must accept an optional ``engine`` parameter. In -that sense, this PEP constitutes a revision of finder and loader interfaces -described in PEP 302 [1]_. However, the standard import process will not -supply the additional argument, so this proposal remains fully backwards -compatible. +this PEP proposes a context management based approach to temporarily replacing +the global import state. The PEP also proposes inclusion of a ``GlobalImportEngine`` subclass and a globally accessible instance of that class, which "writes through" to the -process global state and invokes importers and loaders without the additional -``engine`` argument. This provides a backwards compatible bridge between the -proposed encapsulated API and the legacy process global state. +process global state. This provides a backwards compatible bridge between the +proposed encapsulated API and the legacy process global state, and allows +straightforward support for related state updates (e.g. selectively +invalidating path cache entries when ``sys.path`` is modified). + + +PEP Deferral +============ + +The import system is already seeing substantial changes in Python 3.3, to +natively handle packages split across multiple directories (PEP 382) and +(potentially) to make the import semantics in the main module better match +those in other modules (PEP 395). + +Accordingly, the proposal in this PEP will not be seriously considered until +Python 3.4 at the earliest. Rationale @@ -58,6 +69,10 @@ *additional* process global state, in order to correctly update package paths as ``sys.path`` is modified. +Finally, providing a coherent object for all this state makes it feasible to +also provide context management features that allow the import state to be +temporarily substituted. + Proposal ======== @@ -68,11 +83,10 @@ desired and also an ``import_module()`` method, equivalent to ``importlib.import_module()`` [3]_. -Since the new style finders and loaders should also have the option to -modify the global import state, we introduce a ``GlobalImportState`` -class with an interface identical to ``ImportEngine`` but taking -advantage of the global state. This can be easily implemented using -class properties. +Since there are global import state invariants that are assumed and should be +maintained, we introduce a ``GlobalImportState`` class with an interface +identical to ``ImportEngine`` but directly accessing the current global import +state. This can be easily implemented using class properties. Specification @@ -121,6 +135,14 @@ methods like ``ImportEngine`` but writes through to the global state in ``sys``. +To support various namespace package mechanisms, when ``sys.path`` is altered, +tools like ``pkgutil.extend_path`` should be used to also modify other parts +of the import state (in this case, package ``__path__`` attributes). The path +importer cache should also be invalidated when a variety of changes are made. + +The ``ImportEngine`` API will provide convenience methods that automatically +make related import state updates as part of a single operation. + Global variables ~~~~~~~~~~~~~~~~ @@ -133,24 +155,26 @@ a copy of the process global import state. -Necessary changes to finder/loader interfaces: -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ +No changes to finder/loader interfaces +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ -``find_module (cls, fullname, path=None, engine=None)`` +Rather than attempting to update the PEP 302 APIs to accept additional state, +this PEP proposes that ``ImportEngine`` support the content management +protocol (similar to the context substitution mechanisms in the ``decimal`` +module). -``load_module (cls, fullname, path=None, engine=None)`` +The context management mechanism for ``ImportEngine`` would: -The only difference between engine compatible and PEP 302 compatible -finders/loaders is the presence of an additional ``engine`` parameter. -This is intended to specify an ImportEngine instance or subclass thereof. -This parameter is optional so that engine compatible finders and -loaders can be made backwards compatible with PEP 302 calling conventions by -falling back on ``engine.sysengine`` with the following simple pattern:: +* On entry: + * Acquire the import lock + * Substitute the global import state with the import engine's own state +* On exit: + * Restore the previous global import state + * Release the import lock - def find_module(cls, fullname, path=None, engine=None): - if not engine: - engine = importlib.engine.sysengine - ... +The precise API for this is TBD (but will probably use a distinct context +management object, along the lines of that created by +``decimal.localcontext``). Open Issues @@ -185,35 +209,38 @@ cache - it's only loading them directly which causes problems. -Nested imports -~~~~~~~~~~~~~~ +Scope of substitution +~~~~~~~~~~~~~~~~~~~~~ -The reference implementation currently applies only to the outermost import. -Any imports by the module being imported will be handled using the standard -import machinery. +Related to the previous open issue is the question of what state to substitute +when using the context management API. It is currently the case that replacing +``sys.modules`` can be unreliable due to cached references and there's the +underlying fact that having independent copies of some modules is simply +impossible due to platform limitations. -One way to handle this is to place the burden on the implementation of module -loaders to set ``module.__dict__["__import__"] = engine.__import__`` before -running the module's code. The ``importlib`` design facilities this by -allowing the change to be made in one place (``_LoaderBasics._load_module``). +As part of this PEP, it will be necessary to document explicitly: - -Scope of API updates -~~~~~~~~~~~~~~~~~~~~ - -The reference implementation focuses on finding and loading modules. There -may be other PEP 302 APIs that should also be updated to accept an optional -``engine`` parameter. +* Which parts of the global import state can be substituted (and declare code + which caches references to that state without dealing with the substitution + case buggy) +* Which parts must be modified in-place (and hence are not substituted by the + ``ImportEngine`` context management API, or otherwise scoped to + ``ImportEngine`` instances) Reference Implementation ======================== -A reference implementation [4]_ based on Brett Cannon's importlib has been -developed by Greg Slodkowicz as part of the 2011 Google Summer of Code. Note -that the current implementation avoids modifying existing code, and hence -duplicates a lot of things unnecessarily. An actual implementation would just -modify any such affected code in place. +A reference implementation [4]_ for an earlier draft of this PEP, based on +Brett Cannon's importlib has been developed by Greg Slodkowicz as part of the +2011 Google Summer of Code. Note that the current implementation avoids +modifying existing code, and hence duplicates a lot of things unnecessarily. +An actual implementation would just modify any such affected code in place. + +That earlier draft of the PEP proposed change the PEP 302 APIs to support passing +in an optional engine instance. This had the (serious) downside of not correctly +affecting further imports from the imported module, hence the change to the +context management based proposal for substituting the global state. References -- Repository URL: http://hg.python.org/peps From ncoghlan at gmail.com Sun Dec 4 05:11:58 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Sun, 4 Dec 2011 14:11:58 +1000 Subject: [Python-checkins] cpython (3.2): Issue #13211: Add .reason attribute to HTTPError to implement parent class In-Reply-To: References: Message-ID: On Sun, Dec 4, 2011 at 12:46 AM, jason.coombs wrote: > +def test_HTTPError_interface(): > + ? ?""" > + ? ?Issue 13211 reveals that HTTPError didn't implement the URLError > + ? ?interface even though HTTPError is a subclass of URLError. > + > + ? ?>>> err = urllib.error.HTTPError(msg='something bad happened', url=None, code=None, hdrs=None, fp=None) > + ? ?>>> assert hasattr(err, 'reason') > + ? ?>>> err.reason > + ? ?'something bad happened' > + ? ?""" > + Did you re-run the test suite after forward-porting to 3.3? I'm consistently getting failures: $ ./python -m test test_urllib2 [1/1] test_urllib2 ********************************************************************** File "/home/ncoghlan/devel/py3k/Lib/test/test_urllib2.py", line 1457, in test.test_urllib2.test_HTTPError_interface Failed example: err = urllib.error.HTTPError(msg='something bad happened', url=None, code=None, hdrs=None, fp=None) Exception raised: Traceback (most recent call last): File "/home/ncoghlan/devel/py3k/Lib/doctest.py", line 1253, in __run compileflags, 1), test.globs) File "", line 1, in err = urllib.error.HTTPError(msg='something bad happened', url=None, code=None, hdrs=None, fp=None) TypeError: HTTPError does not take keyword arguments ********************************************************************** File "/home/ncoghlan/devel/py3k/Lib/test/test_urllib2.py", line 1458, in test.test_urllib2.test_HTTPError_interface Failed example: assert hasattr(err, 'reason') Exception raised: Traceback (most recent call last): File "/home/ncoghlan/devel/py3k/Lib/doctest.py", line 1253, in __run compileflags, 1), test.globs) File "", line 1, in assert hasattr(err, 'reason') NameError: name 'err' is not defined ********************************************************************** File "/home/ncoghlan/devel/py3k/Lib/test/test_urllib2.py", line 1459, in test.test_urllib2.test_HTTPError_interface Failed example: err.reason Exception raised: Traceback (most recent call last): File "/home/ncoghlan/devel/py3k/Lib/doctest.py", line 1253, in __run compileflags, 1), test.globs) File "", line 1, in err.reason NameError: name 'err' is not defined ********************************************************************** 1 items had failures: 3 of 3 in test.test_urllib2.test_HTTPError_interface ***Test Failed*** 3 failures. test test_urllib2 failed -- 3 of 65 doctests failed 1 test failed: test_urllib2 [142313 refs] Now, this failure is quite possibly due to a flaw in the PEP 3151 implementation (see http://bugs.python.org/issue12555), but picking up this kind of thing is the reason we say to always run the tests before committing, even for a simple merge. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From python-checkins at python.org Sun Dec 4 05:20:40 2011 From: python-checkins at python.org (jason.coombs) Date: Sun, 04 Dec 2011 05:20:40 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Pass_positional_arguments_-?= =?utf8?q?_HTTPError_is_not_accepting_keyword_arguments=2E?= Message-ID: http://hg.python.org/cpython/rev/a3ddee916808 changeset: 73847:a3ddee916808 user: Jason R. Coombs date: Sat Dec 03 23:18:11 2011 -0500 summary: Pass positional arguments - HTTPError is not accepting keyword arguments. Reference #13211 and #12555. files: Lib/test/test_urllib2.py | 4 +++- 1 files changed, 3 insertions(+), 1 deletions(-) diff --git a/Lib/test/test_urllib2.py b/Lib/test/test_urllib2.py --- a/Lib/test/test_urllib2.py +++ b/Lib/test/test_urllib2.py @@ -1454,7 +1454,9 @@ Issue 13211 reveals that HTTPError didn't implement the URLError interface even though HTTPError is a subclass of URLError. - >>> err = urllib.error.HTTPError(msg='something bad happened', url=None, code=None, hdrs=None, fp=None) + >>> msg = 'something bad happened' + >>> url = code = hdrs = fp = None + >>> err = urllib.error.HTTPError(msg, url, code, hdrs, fp) >>> assert hasattr(err, 'reason') >>> err.reason 'something bad happened' -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Sun Dec 4 05:38:45 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Sun, 04 Dec 2011 05:38:45 +0100 Subject: [Python-checkins] Daily reference leaks (f09477693434): sum=-417 Message-ID: results for f09477693434 on branch "default" -------------------------------------------- test_urllib2net leaked [1405, -1822, 0] references, sum=-417 Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/reflogF7_xgV', '-x'] From python-checkins at python.org Sun Dec 4 11:51:38 2011 From: python-checkins at python.org (georg.brandl) Date: Sun, 04 Dec 2011 11:51:38 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogRml4IHR5cG8u?= Message-ID: http://hg.python.org/cpython/rev/0436ef8be253 changeset: 73848:0436ef8be253 branch: 3.2 parent: 73845:bf0c5cd303e5 user: Georg Brandl date: Sun Dec 04 11:51:21 2011 +0100 summary: Fix typo. files: Doc/tools/sphinxext/static/copybutton.js | 4 ++-- 1 files changed, 2 insertions(+), 2 deletions(-) diff --git a/Doc/tools/sphinxext/static/copybutton.js b/Doc/tools/sphinxext/static/copybutton.js --- a/Doc/tools/sphinxext/static/copybutton.js +++ b/Doc/tools/sphinxext/static/copybutton.js @@ -8,8 +8,8 @@ // get the styles from the current theme pre.parent().parent().css('position', 'relative'); - var hide_text = 'Hide the prompts and ouput'; - var show_text = 'Show the prompts and ouput'; + var hide_text = 'Hide the prompts and output'; + var show_text = 'Show the prompts and output'; var border_width = pre.css('border-top-width'); var border_style = pre.css('border-top-style'); var border_color = pre.css('border-top-color'); -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sun Dec 4 11:51:38 2011 From: python-checkins at python.org (georg.brandl) Date: Sun, 04 Dec 2011 11:51:38 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_with_3=2E2=2E?= Message-ID: http://hg.python.org/cpython/rev/2e4fd707201c changeset: 73849:2e4fd707201c parent: 73847:a3ddee916808 parent: 73848:0436ef8be253 user: Georg Brandl date: Sun Dec 04 11:51:33 2011 +0100 summary: Merge with 3.2. files: Doc/tools/sphinxext/static/copybutton.js | 4 ++-- 1 files changed, 2 insertions(+), 2 deletions(-) diff --git a/Doc/tools/sphinxext/static/copybutton.js b/Doc/tools/sphinxext/static/copybutton.js --- a/Doc/tools/sphinxext/static/copybutton.js +++ b/Doc/tools/sphinxext/static/copybutton.js @@ -8,8 +8,8 @@ // get the styles from the current theme pre.parent().parent().css('position', 'relative'); - var hide_text = 'Hide the prompts and ouput'; - var show_text = 'Show the prompts and ouput'; + var hide_text = 'Hide the prompts and output'; + var show_text = 'Show the prompts and output'; var border_width = pre.css('border-top-width'); var border_style = pre.css('border-top-style'); var border_color = pre.css('border-top-color'); -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sun Dec 4 14:26:02 2011 From: python-checkins at python.org (jason.coombs) Date: Sun, 04 Dec 2011 14:26:02 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Corrected_order_of_paramete?= =?utf8?q?rs_to_HTTPError_in_test=5Furllib2=2Epy=2E?= Message-ID: http://hg.python.org/cpython/rev/8fa1dc66de5d changeset: 73850:8fa1dc66de5d user: Jason R. Coombs date: Sun Dec 04 08:14:18 2011 -0500 summary: Corrected order of parameters to HTTPError in test_urllib2.py. files: Lib/test/test_urllib2.py | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/Lib/test/test_urllib2.py b/Lib/test/test_urllib2.py --- a/Lib/test/test_urllib2.py +++ b/Lib/test/test_urllib2.py @@ -1456,7 +1456,7 @@ >>> msg = 'something bad happened' >>> url = code = hdrs = fp = None - >>> err = urllib.error.HTTPError(msg, url, code, hdrs, fp) + >>> err = urllib.error.HTTPError(url, code, msg, hdrs, fp) >>> assert hasattr(err, 'reason') >>> err.reason 'something bad happened' -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 00:05:07 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 00:05:07 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEzNTI3?= =?utf8?q?=3A_remove_mention_of_Python_megawidgets_and_Tkinter3000_WCK?= Message-ID: http://hg.python.org/cpython/rev/2111bf7e5bca changeset: 73851:2111bf7e5bca branch: 3.2 parent: 73848:0436ef8be253 user: Antoine Pitrou date: Sun Dec 04 23:56:30 2011 +0100 summary: Issue #13527: remove mention of Python megawidgets and Tkinter3000 WCK from the doc. These two projects appear dead. files: Doc/library/othergui.rst | 30 +-------------------------- 1 files changed, 2 insertions(+), 28 deletions(-) diff --git a/Doc/library/othergui.rst b/Doc/library/othergui.rst --- a/Doc/library/othergui.rst +++ b/Doc/library/othergui.rst @@ -3,34 +3,8 @@ Other Graphical User Interface Packages ======================================= -There are an number of extension widget sets to :mod:`tkinter`. - -.. seealso:: - - `Python megawidgets `_ - is a toolkit for building high-level compound widgets in Python using the - :mod:`tkinter` package. It consists of a set of base classes and a library of - flexible and extensible megawidgets built on this foundation. These megawidgets - include notebooks, comboboxes, selection widgets, paned widgets, scrolled - widgets, dialog windows, etc. Also, with the Pmw.Blt interface to BLT, the - busy, graph, stripchart, tabset and vector commands are be available. - - The initial ideas for Pmw were taken from the Tk ``itcl`` extensions ``[incr - Tk]`` by Michael McLennan and ``[incr Widgets]`` by Mark Ulferts. Several of the - megawidgets are direct translations from the itcl to Python. It offers most of - the range of widgets that ``[incr Widgets]`` does, and is almost as complete as - Tix, lacking however Tix's fast :class:`HList` widget for drawing trees. - - `Tkinter3000 Widget Construction Kit (WCK) `_ - is a library that allows you to write new Tkinter widgets in pure Python. The - WCK framework gives you full control over widget creation, configuration, screen - appearance, and event handling. WCK widgets can be very fast and light-weight, - since they can operate directly on Python data structures, without having to - transfer data through the Tk/Tcl layer. - - -The major cross-platform (Windows, Mac OS X, Unix-like) GUI toolkits that are -also available for Python: +Major cross-platform (Windows, Mac OS X, Unix-like) GUI toolkits are +available for Python: .. seealso:: -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 00:05:08 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 00:05:08 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Issue_=2313527=3A_remove_mention_of_Python_megawidgets_and_T?= =?utf8?q?kinter3000_WCK?= Message-ID: http://hg.python.org/cpython/rev/f0008683585c changeset: 73852:f0008683585c parent: 73850:8fa1dc66de5d parent: 73851:2111bf7e5bca user: Antoine Pitrou date: Sun Dec 04 23:57:55 2011 +0100 summary: Issue #13527: remove mention of Python megawidgets and Tkinter3000 WCK from the doc. These two projects appear dead. files: Doc/library/othergui.rst | 30 +-------------------------- 1 files changed, 2 insertions(+), 28 deletions(-) diff --git a/Doc/library/othergui.rst b/Doc/library/othergui.rst --- a/Doc/library/othergui.rst +++ b/Doc/library/othergui.rst @@ -3,34 +3,8 @@ Other Graphical User Interface Packages ======================================= -There are an number of extension widget sets to :mod:`tkinter`. - -.. seealso:: - - `Python megawidgets `_ - is a toolkit for building high-level compound widgets in Python using the - :mod:`tkinter` package. It consists of a set of base classes and a library of - flexible and extensible megawidgets built on this foundation. These megawidgets - include notebooks, comboboxes, selection widgets, paned widgets, scrolled - widgets, dialog windows, etc. Also, with the Pmw.Blt interface to BLT, the - busy, graph, stripchart, tabset and vector commands are be available. - - The initial ideas for Pmw were taken from the Tk ``itcl`` extensions ``[incr - Tk]`` by Michael McLennan and ``[incr Widgets]`` by Mark Ulferts. Several of the - megawidgets are direct translations from the itcl to Python. It offers most of - the range of widgets that ``[incr Widgets]`` does, and is almost as complete as - Tix, lacking however Tix's fast :class:`HList` widget for drawing trees. - - `Tkinter3000 Widget Construction Kit (WCK) `_ - is a library that allows you to write new Tkinter widgets in pure Python. The - WCK framework gives you full control over widget creation, configuration, screen - appearance, and event handling. WCK widgets can be very fast and light-weight, - since they can operate directly on Python data structures, without having to - transfer data through the Tk/Tcl layer. - - -The major cross-platform (Windows, Mac OS X, Unix-like) GUI toolkits that are -also available for Python: +Major cross-platform (Windows, Mac OS X, Unix-like) GUI toolkits are +available for Python: .. seealso:: -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 00:05:08 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 00:05:08 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogSXNzdWUgIzEzNTI3?= =?utf8?q?=3A_remove_mention_of_Python_megawidgets_and_Tkinter3000_WCK?= Message-ID: http://hg.python.org/cpython/rev/478b4e9551fa changeset: 73853:478b4e9551fa branch: 2.7 parent: 73826:fb8b6d310fb8 user: Antoine Pitrou date: Sun Dec 04 23:56:30 2011 +0100 summary: Issue #13527: remove mention of Python megawidgets and Tkinter3000 WCK from the doc. These two projects appear dead. files: Doc/library/othergui.rst | 30 +-------------------------- 1 files changed, 2 insertions(+), 28 deletions(-) diff --git a/Doc/library/othergui.rst b/Doc/library/othergui.rst --- a/Doc/library/othergui.rst +++ b/Doc/library/othergui.rst @@ -3,34 +3,8 @@ Other Graphical User Interface Packages ======================================= -There are an number of extension widget sets to :mod:`Tkinter`. - -.. seealso:: - - `Python megawidgets `_ - is a toolkit for building high-level compound widgets in Python using the - :mod:`Tkinter` module. It consists of a set of base classes and a library of - flexible and extensible megawidgets built on this foundation. These megawidgets - include notebooks, comboboxes, selection widgets, paned widgets, scrolled - widgets, dialog windows, etc. Also, with the Pmw.Blt interface to BLT, the - busy, graph, stripchart, tabset and vector commands are be available. - - The initial ideas for Pmw were taken from the Tk ``itcl`` extensions ``[incr - Tk]`` by Michael McLennan and ``[incr Widgets]`` by Mark Ulferts. Several of the - megawidgets are direct translations from the itcl to Python. It offers most of - the range of widgets that ``[incr Widgets]`` does, and is almost as complete as - Tix, lacking however Tix's fast :class:`HList` widget for drawing trees. - - `Tkinter3000 Widget Construction Kit (WCK) `_ - is a library that allows you to write new Tkinter widgets in pure Python. The - WCK framework gives you full control over widget creation, configuration, screen - appearance, and event handling. WCK widgets can be very fast and light-weight, - since they can operate directly on Python data structures, without having to - transfer data through the Tk/Tcl layer. - - -The major cross-platform (Windows, Mac OS X, Unix-like) GUI toolkits that are -also available for Python: +Major cross-platform (Windows, Mac OS X, Unix-like) GUI toolkits are +available for Python: .. seealso:: -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 00:47:08 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 00:47:08 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Remove_obsolete?= =?utf8?q?_references_to_bsddb?= Message-ID: http://hg.python.org/cpython/rev/1494e021cb9f changeset: 73854:1494e021cb9f branch: 3.2 parent: 73851:2111bf7e5bca user: Antoine Pitrou date: Mon Dec 05 00:41:19 2011 +0100 summary: Remove obsolete references to bsddb files: Doc/faq/library.rst | 44 --------------------------------- 1 files changed, 0 insertions(+), 44 deletions(-) diff --git a/Doc/faq/library.rst b/Doc/faq/library.rst --- a/Doc/faq/library.rst +++ b/Doc/faq/library.rst @@ -814,52 +814,6 @@ general such as using gdbm with pickle/shelve. -If my program crashes with a bsddb (or anydbm) database open, it gets corrupted. How come? ------------------------------------------------------------------------------------------- - -.. XXX move this FAQ entry elsewhere? - -.. note:: - - The bsddb module is now available as a standalone package `pybsddb - `_. - -Databases opened for write access with the bsddb module (and often by the anydbm -module, since it will preferentially use bsddb) must explicitly be closed using -the ``.close()`` method of the database. The underlying library caches database -contents which need to be converted to on-disk form and written. - -If you have initialized a new bsddb database but not written anything to it -before the program crashes, you will often wind up with a zero-length file and -encounter an exception the next time the file is opened. - - -I tried to open Berkeley DB file, but bsddb produces bsddb.error: (22, 'Invalid argument'). Help! How can I restore my data? ----------------------------------------------------------------------------------------------------------------------------- - -.. XXX move this FAQ entry elsewhere? - -.. note:: - - The bsddb module is now available as a standalone package `pybsddb - `_. - -Don't panic! Your data is probably intact. The most frequent cause for the error -is that you tried to open an earlier Berkeley DB file with a later version of -the Berkeley DB library. - -Many Linux systems now have all three versions of Berkeley DB available. If you -are migrating from version 1 to a newer version use db_dump185 to dump a plain -text version of the database. If you are migrating from version 2 to version 3 -use db2_dump to create a plain text version of the database. In either case, -use db_load to create a new native database for the latest version installed on -your computer. If you have version 3 of Berkeley DB installed, you should be -able to use db2_load to create a native version 2 database. - -You should move away from Berkeley DB version 1 files because the hash file code -contains known bugs that can corrupt your data. - - Mathematics and Numerics ======================== -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 00:47:09 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 00:47:09 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Remove_obsolete_references_to_bsddb?= Message-ID: http://hg.python.org/cpython/rev/e80a4e3b799b changeset: 73855:e80a4e3b799b parent: 73852:f0008683585c parent: 73854:1494e021cb9f user: Antoine Pitrou date: Mon Dec 05 00:41:51 2011 +0100 summary: Remove obsolete references to bsddb files: Doc/faq/library.rst | 44 --------------------------------- 1 files changed, 0 insertions(+), 44 deletions(-) diff --git a/Doc/faq/library.rst b/Doc/faq/library.rst --- a/Doc/faq/library.rst +++ b/Doc/faq/library.rst @@ -814,52 +814,6 @@ general such as using gdbm with pickle/shelve. -If my program crashes with a bsddb (or anydbm) database open, it gets corrupted. How come? ------------------------------------------------------------------------------------------- - -.. XXX move this FAQ entry elsewhere? - -.. note:: - - The bsddb module is now available as a standalone package `pybsddb - `_. - -Databases opened for write access with the bsddb module (and often by the anydbm -module, since it will preferentially use bsddb) must explicitly be closed using -the ``.close()`` method of the database. The underlying library caches database -contents which need to be converted to on-disk form and written. - -If you have initialized a new bsddb database but not written anything to it -before the program crashes, you will often wind up with a zero-length file and -encounter an exception the next time the file is opened. - - -I tried to open Berkeley DB file, but bsddb produces bsddb.error: (22, 'Invalid argument'). Help! How can I restore my data? ----------------------------------------------------------------------------------------------------------------------------- - -.. XXX move this FAQ entry elsewhere? - -.. note:: - - The bsddb module is now available as a standalone package `pybsddb - `_. - -Don't panic! Your data is probably intact. The most frequent cause for the error -is that you tried to open an earlier Berkeley DB file with a later version of -the Berkeley DB library. - -Many Linux systems now have all three versions of Berkeley DB available. If you -are migrating from version 1 to a newer version use db_dump185 to dump a plain -text version of the database. If you are migrating from version 2 to version 3 -use db2_dump to create a plain text version of the database. In either case, -use db_load to create a new native database for the latest version installed on -your computer. If you have version 3 of Berkeley DB installed, you should be -able to use db2_load to create a native version 2 database. - -You should move away from Berkeley DB version 1 files because the hash file code -contains known bugs that can corrupt your data. - - Mathematics and Numerics ======================== -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:11:11 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:11:11 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_The_functional_?= =?utf8?q?module_hasn=27t_been_maintained_since_2006_and_doesn=27t_work_wi?= =?utf8?q?th?= Message-ID: http://hg.python.org/cpython/rev/2aeef275bec8 changeset: 73856:2aeef275bec8 branch: 3.2 parent: 73854:1494e021cb9f user: Antoine Pitrou date: Mon Dec 05 01:05:32 2011 +0100 summary: The functional module hasn't been maintained since 2006 and doesn't work with Python 3. Remove section about it from the functional programming FAQ. files: Doc/howto/functional.rst | 128 --------------------------- 1 files changed, 0 insertions(+), 128 deletions(-) diff --git a/Doc/howto/functional.rst b/Doc/howto/functional.rst --- a/Doc/howto/functional.rst +++ b/Doc/howto/functional.rst @@ -1010,135 +1010,6 @@ Consult the operator module's documentation for a complete list. - -The functional module ---------------------- - -Collin Winter's `functional module `__ -provides a number of more advanced tools for functional programming. It also -reimplements several Python built-ins, trying to make them more intuitive to -those used to functional programming in other languages. - -This section contains an introduction to some of the most important functions in -``functional``; full documentation can be found at `the project's website -`__. - -``compose(outer, inner, unpack=False)`` - -The ``compose()`` function implements function composition. In other words, it -returns a wrapper around the ``outer`` and ``inner`` callables, such that the -return value from ``inner`` is fed directly to ``outer``. That is, :: - - >>> def add(a, b): - ... return a + b - ... - >>> def double(a): - ... return 2 * a - ... - >>> compose(double, add)(5, 6) - 22 - -is equivalent to :: - - >>> double(add(5, 6)) - 22 - -The ``unpack`` keyword is provided to work around the fact that Python functions -are not always `fully curried `__. By -default, it is expected that the ``inner`` function will return a single object -and that the ``outer`` function will take a single argument. Setting the -``unpack`` argument causes ``compose`` to expect a tuple from ``inner`` which -will be expanded before being passed to ``outer``. Put simply, :: - - compose(f, g)(5, 6) - -is equivalent to:: - - f(g(5, 6)) - -while :: - - compose(f, g, unpack=True)(5, 6) - -is equivalent to:: - - f(*g(5, 6)) - -Even though ``compose()`` only accepts two functions, it's trivial to build up a -version that will compose any number of functions. We'll use -:func:`functools.reduce`, ``compose()`` and ``partial()`` (the last of which is -provided by both ``functional`` and ``functools``). :: - - from functional import compose, partial - import functools - - - multi_compose = partial(functools.reduce, compose) - - -We can also use ``map()``, ``compose()`` and ``partial()`` to craft a version of -``"".join(...)`` that converts its arguments to string:: - - from functional import compose, partial - - join = compose("".join, partial(map, str)) - - -``flip(func)`` - -``flip()`` wraps the callable in ``func`` and causes it to receive its -non-keyword arguments in reverse order. :: - - >>> def triple(a, b, c): - ... return (a, b, c) - ... - >>> triple(5, 6, 7) - (5, 6, 7) - >>> - >>> flipped_triple = flip(triple) - >>> flipped_triple(5, 6, 7) - (7, 6, 5) - -``foldl(func, start, iterable)`` - -``foldl()`` takes a binary function, a starting value (usually some kind of -'zero'), and an iterable. The function is applied to the starting value and the -first element of the list, then the result of that and the second element of the -list, then the result of that and the third element of the list, and so on. - -This means that a call such as:: - - foldl(f, 0, [1, 2, 3]) - -is equivalent to:: - - f(f(f(0, 1), 2), 3) - - -``foldl()`` is roughly equivalent to the following recursive function:: - - def foldl(func, start, seq): - if len(seq) == 0: - return start - - return foldl(func, func(start, seq[0]), seq[1:]) - -Speaking of equivalence, the above ``foldl`` call can be expressed in terms of -the built-in :func:`functools.reduce` like so:: - - import functools - functools.reduce(f, [1, 2, 3], 0) - - -We can use ``foldl()``, ``operator.concat()`` and ``partial()`` to write a -cleaner, more aesthetically-pleasing version of Python's ``"".join(...)`` -idiom:: - - from functional import foldl, partial from operator import concat - - join = partial(foldl, concat, "") - - Small functions and the lambda expression ========================================= -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:11:12 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:11:12 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_The_functional_module_hasn=27t_been_maintained_since_2006_an?= =?utf8?q?d_doesn=27t_work_with?= Message-ID: http://hg.python.org/cpython/rev/3828f81a64e7 changeset: 73857:3828f81a64e7 parent: 73855:e80a4e3b799b parent: 73856:2aeef275bec8 user: Antoine Pitrou date: Mon Dec 05 01:05:55 2011 +0100 summary: The functional module hasn't been maintained since 2006 and doesn't work with Python 3. Remove section about it from the functional programming FAQ. files: Doc/howto/functional.rst | 128 --------------------------- 1 files changed, 0 insertions(+), 128 deletions(-) diff --git a/Doc/howto/functional.rst b/Doc/howto/functional.rst --- a/Doc/howto/functional.rst +++ b/Doc/howto/functional.rst @@ -1010,135 +1010,6 @@ Consult the operator module's documentation for a complete list. - -The functional module ---------------------- - -Collin Winter's `functional module `__ -provides a number of more advanced tools for functional programming. It also -reimplements several Python built-ins, trying to make them more intuitive to -those used to functional programming in other languages. - -This section contains an introduction to some of the most important functions in -``functional``; full documentation can be found at `the project's website -`__. - -``compose(outer, inner, unpack=False)`` - -The ``compose()`` function implements function composition. In other words, it -returns a wrapper around the ``outer`` and ``inner`` callables, such that the -return value from ``inner`` is fed directly to ``outer``. That is, :: - - >>> def add(a, b): - ... return a + b - ... - >>> def double(a): - ... return 2 * a - ... - >>> compose(double, add)(5, 6) - 22 - -is equivalent to :: - - >>> double(add(5, 6)) - 22 - -The ``unpack`` keyword is provided to work around the fact that Python functions -are not always `fully curried `__. By -default, it is expected that the ``inner`` function will return a single object -and that the ``outer`` function will take a single argument. Setting the -``unpack`` argument causes ``compose`` to expect a tuple from ``inner`` which -will be expanded before being passed to ``outer``. Put simply, :: - - compose(f, g)(5, 6) - -is equivalent to:: - - f(g(5, 6)) - -while :: - - compose(f, g, unpack=True)(5, 6) - -is equivalent to:: - - f(*g(5, 6)) - -Even though ``compose()`` only accepts two functions, it's trivial to build up a -version that will compose any number of functions. We'll use -:func:`functools.reduce`, ``compose()`` and ``partial()`` (the last of which is -provided by both ``functional`` and ``functools``). :: - - from functional import compose, partial - import functools - - - multi_compose = partial(functools.reduce, compose) - - -We can also use ``map()``, ``compose()`` and ``partial()`` to craft a version of -``"".join(...)`` that converts its arguments to string:: - - from functional import compose, partial - - join = compose("".join, partial(map, str)) - - -``flip(func)`` - -``flip()`` wraps the callable in ``func`` and causes it to receive its -non-keyword arguments in reverse order. :: - - >>> def triple(a, b, c): - ... return (a, b, c) - ... - >>> triple(5, 6, 7) - (5, 6, 7) - >>> - >>> flipped_triple = flip(triple) - >>> flipped_triple(5, 6, 7) - (7, 6, 5) - -``foldl(func, start, iterable)`` - -``foldl()`` takes a binary function, a starting value (usually some kind of -'zero'), and an iterable. The function is applied to the starting value and the -first element of the list, then the result of that and the second element of the -list, then the result of that and the third element of the list, and so on. - -This means that a call such as:: - - foldl(f, 0, [1, 2, 3]) - -is equivalent to:: - - f(f(f(0, 1), 2), 3) - - -``foldl()`` is roughly equivalent to the following recursive function:: - - def foldl(func, start, seq): - if len(seq) == 0: - return start - - return foldl(func, func(start, seq[0]), seq[1:]) - -Speaking of equivalence, the above ``foldl`` call can be expressed in terms of -the built-in :func:`functools.reduce` like so:: - - import functools - functools.reduce(f, [1, 2, 3], 0) - - -We can use ``foldl()``, ``operator.concat()`` and ``partial()`` to write a -cleaner, more aesthetically-pleasing version of Python's ``"".join(...)`` -idiom:: - - from functional import foldl, partial from operator import concat - - join = partial(foldl, concat, "") - - Small functions and the lambda expression ========================================= -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:27:26 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:27:26 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Remove_referenc?= =?utf8?q?e_to_the_base64_encoding=2E?= Message-ID: http://hg.python.org/cpython/rev/427b9dae1ae3 changeset: 73858:427b9dae1ae3 branch: 3.2 parent: 73856:2aeef275bec8 user: Antoine Pitrou date: Mon Dec 05 01:21:46 2011 +0100 summary: Remove reference to the base64 encoding. files: Doc/howto/unicode.rst | 27 +++++---------------------- 1 files changed, 5 insertions(+), 22 deletions(-) diff --git a/Doc/howto/unicode.rst b/Doc/howto/unicode.rst --- a/Doc/howto/unicode.rst +++ b/Doc/howto/unicode.rst @@ -552,7 +552,6 @@ i.e. Unix systems. - Tips for Writing Unicode-aware Programs --------------------------------------- @@ -572,28 +571,12 @@ When using data coming from a web browser or some other untrusted source, a common technique is to check for illegal characters in a string before using the string in a generated command line or storing it in a database. If you're doing -this, be careful to check the string once it's in the form that will be used or -stored; it's possible for encodings to be used to disguise characters. This is -especially true if the input data also specifies the encoding; many encodings -leave the commonly checked-for characters alone, but Python includes some -encodings such as ``'base64'`` that modify every single character. +this, be careful to check the decoded string, not the encoded bytes data; +some encodings may have interesting properties, such as not being bijective +or not being fully ASCII-compatible. This is especially true if the input +data also specifies the encoding, since the attacker can then choose a +clever way to hide malicious text in the encoded bytestream. -For example, let's say you have a content management system that takes a Unicode -filename, and you want to disallow paths with a '/' character. You might write -this code:: - - def read_file(filename, encoding): - if '/' in filename: - raise ValueError("'/' not allowed in filenames") - unicode_name = filename.decode(encoding) - with open(unicode_name, 'r') as f: - # ... return contents of file ... - -However, if an attacker could specify the ``'base64'`` encoding, they could pass -``'L2V0Yy9wYXNzd2Q='``, which is the base-64 encoded form of the string -``'/etc/passwd'``, to read a system file. The above code looks for ``'/'`` -characters in the encoded form and misses the dangerous character in the -resulting decoded form. References ---------- -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:27:26 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:27:26 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Remove_reference_to_the_base64_encoding=2E?= Message-ID: http://hg.python.org/cpython/rev/8701f6373d0b changeset: 73859:8701f6373d0b parent: 73857:3828f81a64e7 parent: 73858:427b9dae1ae3 user: Antoine Pitrou date: Mon Dec 05 01:22:03 2011 +0100 summary: Remove reference to the base64 encoding. files: Doc/howto/unicode.rst | 27 +++++---------------------- 1 files changed, 5 insertions(+), 22 deletions(-) diff --git a/Doc/howto/unicode.rst b/Doc/howto/unicode.rst --- a/Doc/howto/unicode.rst +++ b/Doc/howto/unicode.rst @@ -552,7 +552,6 @@ i.e. Unix systems. - Tips for Writing Unicode-aware Programs --------------------------------------- @@ -572,28 +571,12 @@ When using data coming from a web browser or some other untrusted source, a common technique is to check for illegal characters in a string before using the string in a generated command line or storing it in a database. If you're doing -this, be careful to check the string once it's in the form that will be used or -stored; it's possible for encodings to be used to disguise characters. This is -especially true if the input data also specifies the encoding; many encodings -leave the commonly checked-for characters alone, but Python includes some -encodings such as ``'base64'`` that modify every single character. +this, be careful to check the decoded string, not the encoded bytes data; +some encodings may have interesting properties, such as not being bijective +or not being fully ASCII-compatible. This is especially true if the input +data also specifies the encoding, since the attacker can then choose a +clever way to hide malicious text in the encoded bytestream. -For example, let's say you have a content management system that takes a Unicode -filename, and you want to disallow paths with a '/' character. You might write -this code:: - - def read_file(filename, encoding): - if '/' in filename: - raise ValueError("'/' not allowed in filenames") - unicode_name = filename.decode(encoding) - with open(unicode_name, 'r') as f: - # ... return contents of file ... - -However, if an attacker could specify the ``'base64'`` encoding, they could pass -``'L2V0Yy9wYXNzd2Q='``, which is the base-64 encoded form of the string -``'/etc/passwd'``, to read a system file. The above code looks for ``'/'`` -characters in the encoded form and misses the dangerous character in the -resulting decoded form. References ---------- -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:53:05 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:53:05 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Remove_the_outd?= =?utf8?q?ated_notion_that_multithreading_doesn=27t_work_well_on_Unices=2E?= Message-ID: http://hg.python.org/cpython/rev/64d980770571 changeset: 73860:64d980770571 branch: 3.2 parent: 73858:427b9dae1ae3 user: Antoine Pitrou date: Mon Dec 05 01:32:29 2011 +0100 summary: Remove the outdated notion that multithreading doesn't work well on Unices. files: Doc/howto/sockets.rst | 18 ++++++------------ 1 files changed, 6 insertions(+), 12 deletions(-) diff --git a/Doc/howto/sockets.rst b/Doc/howto/sockets.rst --- a/Doc/howto/sockets.rst +++ b/Doc/howto/sockets.rst @@ -395,19 +395,13 @@ There's no question that the fastest sockets code uses non-blocking sockets and select to multiplex them. You can put together something that will saturate a -LAN connection without putting any strain on the CPU. The trouble is that an app -written this way can't do much of anything else - it needs to be ready to -shuffle bytes around at all times. +LAN connection without putting any strain on the CPU. -Assuming that your app is actually supposed to do something more than that, -threading is the optimal solution, (and using non-blocking sockets will be -faster than using blocking sockets). Unfortunately, threading support in Unixes -varies both in API and quality. So the normal Unix solution is to fork a -subprocess to deal with each connection. The overhead for this is significant -(and don't do this on Windows - the overhead of process creation is enormous -there). It also means that unless each subprocess is completely independent, -you'll need to use another form of IPC, say a pipe, or shared memory and -semaphores, to communicate between the parent and child processes. +The trouble is that an app written this way can't do much of anything else - +it needs to be ready to shuffle bytes around at all times. Assuming that your +app is actually supposed to do something more than that, threading is the +optimal solution, (and using non-blocking sockets will be faster than using +blocking sockets). Finally, remember that even though blocking sockets are somewhat slower than non-blocking, in many cases they are the "right" solution. After all, if your -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:53:06 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:53:06 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Use_www=2Epytho?= =?utf8?q?n=2Eorg_instead_of_a_hostname_pointing_to_a_parked_=28or_squatte?= =?utf8?q?d=29?= Message-ID: http://hg.python.org/cpython/rev/7e310c9cf46e changeset: 73861:7e310c9cf46e branch: 3.2 user: Antoine Pitrou date: Mon Dec 05 01:37:34 2011 +0100 summary: Use www.python.org instead of a hostname pointing to a parked (or squatted) domain. Also, reformat a bit. files: Doc/howto/sockets.rst | 23 ++++++++++------------- 1 files changed, 10 insertions(+), 13 deletions(-) diff --git a/Doc/howto/sockets.rst b/Doc/howto/sockets.rst --- a/Doc/howto/sockets.rst +++ b/Doc/howto/sockets.rst @@ -60,11 +60,10 @@ Roughly speaking, when you clicked on the link that brought you to this page, your browser did something like the following:: - #create an INET, STREAMing socket + # create an INET, STREAMing socket s = socket.socket(socket.AF_INET, socket.SOCK_STREAM) - #now connect to the web server on port 80 - # - the normal http port - s.connect(("www.mcmillan-inc.com", 80)) + # now connect to the web server on port 80 - the normal http port + s.connect(("www.python.org", 80)) When the ``connect`` completes, the socket ``s`` can be used to send in a request for the text of the page. The same socket will read the @@ -75,13 +74,11 @@ What happens in the web server is a bit more complex. First, the web server creates a "server socket":: - #create an INET, STREAMing socket - serversocket = socket.socket( - socket.AF_INET, socket.SOCK_STREAM) - #bind the socket to a public host, - # and a well-known port + # create an INET, STREAMing socket + serversocket = socket.socket(socket.AF_INET, socket.SOCK_STREAM) + # bind the socket to a public host, and a well-known port serversocket.bind((socket.gethostname(), 80)) - #become a server socket + # become a server socket serversocket.listen(5) A couple things to notice: we used ``socket.gethostname()`` so that the socket @@ -101,10 +98,10 @@ mainloop of the web server:: while True: - #accept connections from outside + # accept connections from outside (clientsocket, address) = serversocket.accept() - #now do something with the clientsocket - #in this case, we'll pretend this is a threaded server + # now do something with the clientsocket + # in this case, we'll pretend this is a threaded server ct = client_thread(clientsocket) ct.run() -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:53:07 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:53:07 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Reword_IPC_sect?= =?utf8?q?ion?= Message-ID: http://hg.python.org/cpython/rev/9d8a0cfcd6d9 changeset: 73862:9d8a0cfcd6d9 branch: 3.2 user: Antoine Pitrou date: Mon Dec 05 01:43:32 2011 +0100 summary: Reword IPC section files: Doc/howto/sockets.rst | 11 ++++++----- 1 files changed, 6 insertions(+), 5 deletions(-) diff --git a/Doc/howto/sockets.rst b/Doc/howto/sockets.rst --- a/Doc/howto/sockets.rst +++ b/Doc/howto/sockets.rst @@ -123,12 +123,13 @@ --- If you need fast IPC between two processes on one machine, you should look into -whatever form of shared memory the platform offers. A simple protocol based -around shared memory and locks or semaphores is by far the fastest technique. +pipes or shared memory. If you do decide to use AF_INET sockets, bind the +"server" socket to ``'localhost'``. On most platforms, this will take a +shortcut around a couple of layers of network code and be quite a bit faster. -If you do decide to use sockets, bind the "server" socket to ``'localhost'``. On -most platforms, this will take a shortcut around a couple of layers of network -code and be quite a bit faster. +.. seealso:: + The :mod:`multiprocessing` integrates cross-platform IPC into a higher-level + API. Using a Socket -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:53:07 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:53:07 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_s/SOCKSTREAM/TC?= =?utf8?q?P/?= Message-ID: http://hg.python.org/cpython/rev/c34188efb965 changeset: 73863:c34188efb965 branch: 3.2 user: Antoine Pitrou date: Mon Dec 05 01:46:35 2011 +0100 summary: s/SOCKSTREAM/TCP/ files: Doc/howto/sockets.rst | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/Doc/howto/sockets.rst b/Doc/howto/sockets.rst --- a/Doc/howto/sockets.rst +++ b/Doc/howto/sockets.rst @@ -298,7 +298,7 @@ Probably the worst thing about using blocking sockets is what happens when the other side comes down hard (without doing a ``close``). Your socket is likely to -hang. SOCKSTREAM is a reliable protocol, and it will wait a long, long time +hang. TCP is a reliable protocol, and it will wait a long, long time before giving up on a connection. If you're using threads, the entire thread is essentially dead. There's not much you can do about it. As long as you aren't doing something dumb, like holding a lock while doing a blocking read, the -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 01:53:08 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 01:53:08 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_assorted_fixes_from_3=2E2?= Message-ID: http://hg.python.org/cpython/rev/f2be1e010cc8 changeset: 73864:f2be1e010cc8 parent: 73859:8701f6373d0b parent: 73863:c34188efb965 user: Antoine Pitrou date: Mon Dec 05 01:47:40 2011 +0100 summary: Merge assorted fixes from 3.2 files: Doc/howto/sockets.rst | 36 ++++++++++++++---------------- 1 files changed, 17 insertions(+), 19 deletions(-) diff --git a/Doc/howto/sockets.rst b/Doc/howto/sockets.rst --- a/Doc/howto/sockets.rst +++ b/Doc/howto/sockets.rst @@ -60,11 +60,10 @@ Roughly speaking, when you clicked on the link that brought you to this page, your browser did something like the following:: - #create an INET, STREAMing socket + # create an INET, STREAMing socket s = socket.socket(socket.AF_INET, socket.SOCK_STREAM) - #now connect to the web server on port 80 - # - the normal http port - s.connect(("www.mcmillan-inc.com", 80)) + # now connect to the web server on port 80 - the normal http port + s.connect(("www.python.org", 80)) When the ``connect`` completes, the socket ``s`` can be used to send in a request for the text of the page. The same socket will read the @@ -75,13 +74,11 @@ What happens in the web server is a bit more complex. First, the web server creates a "server socket":: - #create an INET, STREAMing socket - serversocket = socket.socket( - socket.AF_INET, socket.SOCK_STREAM) - #bind the socket to a public host, - # and a well-known port + # create an INET, STREAMing socket + serversocket = socket.socket(socket.AF_INET, socket.SOCK_STREAM) + # bind the socket to a public host, and a well-known port serversocket.bind((socket.gethostname(), 80)) - #become a server socket + # become a server socket serversocket.listen(5) A couple things to notice: we used ``socket.gethostname()`` so that the socket @@ -101,10 +98,10 @@ mainloop of the web server:: while True: - #accept connections from outside + # accept connections from outside (clientsocket, address) = serversocket.accept() - #now do something with the clientsocket - #in this case, we'll pretend this is a threaded server + # now do something with the clientsocket + # in this case, we'll pretend this is a threaded server ct = client_thread(clientsocket) ct.run() @@ -126,12 +123,13 @@ --- If you need fast IPC between two processes on one machine, you should look into -whatever form of shared memory the platform offers. A simple protocol based -around shared memory and locks or semaphores is by far the fastest technique. +pipes or shared memory. If you do decide to use AF_INET sockets, bind the +"server" socket to ``'localhost'``. On most platforms, this will take a +shortcut around a couple of layers of network code and be quite a bit faster. -If you do decide to use sockets, bind the "server" socket to ``'localhost'``. On -most platforms, this will take a shortcut around a couple of layers of network -code and be quite a bit faster. +.. seealso:: + The :mod:`multiprocessing` integrates cross-platform IPC into a higher-level + API. Using a Socket @@ -300,7 +298,7 @@ Probably the worst thing about using blocking sockets is what happens when the other side comes down hard (without doing a ``close``). Your socket is likely to -hang. SOCKSTREAM is a reliable protocol, and it will wait a long, long time +hang. TCP is a reliable protocol, and it will wait a long, long time before giving up on a connection. If you're using threads, the entire thread is essentially dead. There's not much you can do about it. As long as you aren't doing something dumb, like holding a lock while doing a blocking read, the -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Mon Dec 5 05:32:51 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Mon, 05 Dec 2011 05:32:51 +0100 Subject: [Python-checkins] Daily reference leaks (f2be1e010cc8): sum=0 Message-ID: results for f2be1e010cc8 on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/refloglEA4bV', '-x'] From python-checkins at python.org Mon Dec 5 20:45:48 2011 From: python-checkins at python.org (antoine.pitrou) Date: Mon, 05 Dec 2011 20:45:48 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2313503=3A_Use_a_mor?= =?utf8?q?e_efficient_reduction_format_for_bytearrays_with?= Message-ID: http://hg.python.org/cpython/rev/e2959a6a1440 changeset: 73865:e2959a6a1440 user: Antoine Pitrou date: Mon Dec 05 20:40:08 2011 +0100 summary: Issue #13503: Use a more efficient reduction format for bytearrays with pickle protocol >= 3. The old reduction format is kept with older protocols in order to allow unpickling under Python 2. Patch by Irmen de Jong. files: Misc/NEWS | 4 ++ Objects/bytearrayobject.c | 52 +++++++++++++++++++++----- 2 files changed, 46 insertions(+), 10 deletions(-) diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -10,6 +10,10 @@ Core and Builtins ----------------- +- Issue #13503: Use a more efficient reduction format for bytearrays with + pickle protocol >= 3. The old reduction format is kept with older protocols + in order to allow unpickling under Python 2. Patch by Irmen de Jong. + - Issue #7111: Python can now be run without a stdin, stdout or stderr stream. It was already the case with Python 2. However, the corresponding sys module entries are now set to None (instead of an unusable file object). diff --git a/Objects/bytearrayobject.c b/Objects/bytearrayobject.c --- a/Objects/bytearrayobject.c +++ b/Objects/bytearrayobject.c @@ -2725,20 +2725,13 @@ return NULL; } -PyDoc_STRVAR(reduce_doc, "Return state information for pickling."); static PyObject * -bytearray_reduce(PyByteArrayObject *self) +_common_reduce(PyByteArrayObject *self, int proto) { - PyObject *latin1, *dict; + PyObject *dict; _Py_IDENTIFIER(__dict__); - if (self->ob_bytes) - latin1 = PyUnicode_DecodeLatin1(self->ob_bytes, - Py_SIZE(self), NULL); - else - latin1 = PyUnicode_FromString(""); - dict = _PyObject_GetAttrId((PyObject *)self, &PyId___dict__); if (dict == NULL) { PyErr_Clear(); @@ -2746,7 +2739,45 @@ Py_INCREF(dict); } - return Py_BuildValue("(O(Ns)N)", Py_TYPE(self), latin1, "latin-1", dict); + if (proto < 3) { + /* use str based reduction for backwards compatibility with Python 2.x */ + PyObject *latin1; + if (self->ob_bytes) + latin1 = PyUnicode_DecodeLatin1(self->ob_bytes, Py_SIZE(self), NULL); + else + latin1 = PyUnicode_FromString(""); + return Py_BuildValue("(O(Ns)N)", Py_TYPE(self), latin1, "latin-1", dict); + } + else { + /* use more efficient byte based reduction */ + if (self->ob_bytes) { + return Py_BuildValue("(O(y#)N)", Py_TYPE(self), self->ob_bytes, Py_SIZE(self), dict); + } + else { + return Py_BuildValue("(O()N)", Py_TYPE(self), dict); + } + } +} + +PyDoc_STRVAR(reduce_doc, "Return state information for pickling."); + +static PyObject * +bytearray_reduce(PyByteArrayObject *self) +{ + return _common_reduce(self, 2); +} + +PyDoc_STRVAR(reduce_ex_doc, "Return state information for pickling."); + +static PyObject * +bytearray_reduce_ex(PyByteArrayObject *self, PyObject *args) +{ + int proto = 0; + + if (!PyArg_ParseTuple(args, "|i:__reduce_ex__", &proto)) + return NULL; + + return _common_reduce(self, proto); } PyDoc_STRVAR(sizeof_doc, @@ -2790,6 +2821,7 @@ bytearray_methods[] = { {"__alloc__", (PyCFunction)bytearray_alloc, METH_NOARGS, alloc_doc}, {"__reduce__", (PyCFunction)bytearray_reduce, METH_NOARGS, reduce_doc}, + {"__reduce_ex__", (PyCFunction)bytearray_reduce_ex, METH_VARARGS, reduce_ex_doc}, {"__sizeof__", (PyCFunction)bytearray_sizeof, METH_NOARGS, sizeof_doc}, {"append", (PyCFunction)bytearray_append, METH_O, append__doc__}, {"capitalize", (PyCFunction)stringlib_capitalize, METH_NOARGS, -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 22:50:49 2011 From: python-checkins at python.org (barry.warsaw) Date: Mon, 05 Dec 2011 22:50:49 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_-_Issue_=231114?= =?utf8?q?7=3A_Fix_an_unused_argument_in_=5FPy=5FANNOTATE=5FMEMORY=5FORDER?= =?utf8?q?=2E__=28Fix?= Message-ID: http://hg.python.org/cpython/rev/4579cd952156 changeset: 73866:4579cd952156 branch: 3.2 parent: 73863:c34188efb965 user: Barry Warsaw date: Mon Dec 05 16:45:02 2011 -0500 summary: - Issue #11147: Fix an unused argument in _Py_ANNOTATE_MEMORY_ORDER. (Fix given by Campbell Barton). files: Include/pyatomic.h | 1 + Misc/NEWS | 3 +++ 2 files changed, 4 insertions(+), 0 deletions(-) diff --git a/Include/pyatomic.h b/Include/pyatomic.h --- a/Include/pyatomic.h +++ b/Include/pyatomic.h @@ -58,6 +58,7 @@ static __inline__ void _Py_ANNOTATE_MEMORY_ORDER(const volatile void *address, _Py_memory_order order) { + (void)address; /* shut up -Wunused-parameter */ switch(order) { case _Py_memory_order_release: case _Py_memory_order_acq_rel: diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -10,6 +10,9 @@ Core and Builtins ----------------- +- Issue #11147: Fix an unused argument in _Py_ANNOTATE_MEMORY_ORDER. (Fix + given by Campbell Barton). + - Issue #7111: Python can now be run without a stdin, stdout or stderr stream. It was already the case with Python 2. However, the corresponding sys module entries are now set to None (instead of an unusable file object). -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Mon Dec 5 22:50:50 2011 From: python-checkins at python.org (barry.warsaw) Date: Mon, 05 Dec 2011 22:50:50 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_-_Issue_=2311147=3A_Fix_an_unused_argument_in_=5FPy=5FANNOTA?= =?utf8?q?TE=5FMEMORY=5FORDER=2E__=28Fix?= Message-ID: http://hg.python.org/cpython/rev/6b6c79eba944 changeset: 73867:6b6c79eba944 parent: 73865:e2959a6a1440 parent: 73866:4579cd952156 user: Barry Warsaw date: Mon Dec 05 16:50:41 2011 -0500 summary: - Issue #11147: Fix an unused argument in _Py_ANNOTATE_MEMORY_ORDER. (Fix given by Campbell Barton). files: Include/pyatomic.h | 1 + Misc/NEWS | 3 +++ 2 files changed, 4 insertions(+), 0 deletions(-) diff --git a/Include/pyatomic.h b/Include/pyatomic.h --- a/Include/pyatomic.h +++ b/Include/pyatomic.h @@ -58,6 +58,7 @@ static __inline__ void _Py_ANNOTATE_MEMORY_ORDER(const volatile void *address, _Py_memory_order order) { + (void)address; /* shut up -Wunused-parameter */ switch(order) { case _Py_memory_order_release: case _Py_memory_order_acq_rel: diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -10,6 +10,9 @@ Core and Builtins ----------------- +- Issue #11147: Fix an unused argument in _Py_ANNOTATE_MEMORY_ORDER. (Fix + given by Campbell Barton). + - Issue #13503: Use a more efficient reduction format for bytearrays with pickle protocol >= 3. The old reduction format is kept with older protocols in order to allow unpickling under Python 2. Patch by Irmen de Jong. -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Tue Dec 6 05:33:11 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Tue, 06 Dec 2011 05:33:11 +0100 Subject: [Python-checkins] Daily reference leaks (6b6c79eba944): sum=0 Message-ID: results for 6b6c79eba944 on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/reflognNz12r', '-x'] From python-checkins at python.org Tue Dec 6 13:10:22 2011 From: python-checkins at python.org (lars.gustaebel) Date: Tue, 06 Dec 2011 13:10:22 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Correctly_detec?= =?utf8?q?t_bzip2_compressed_streams_with_blocksizes_other_than_900k=2E?= Message-ID: http://hg.python.org/cpython/rev/80876df8adce changeset: 73868:80876df8adce branch: 3.2 parent: 73866:4579cd952156 user: Lars Gust?bel date: Tue Dec 06 12:56:38 2011 +0100 summary: Correctly detect bzip2 compressed streams with blocksizes other than 900k. files: Lib/tarfile.py | 2 +- Lib/test/test_tarfile.py | 22 ++++++++++++++++++---- Misc/NEWS | 3 +++ 3 files changed, 22 insertions(+), 5 deletions(-) diff --git a/Lib/tarfile.py b/Lib/tarfile.py --- a/Lib/tarfile.py +++ b/Lib/tarfile.py @@ -627,7 +627,7 @@ def getcomptype(self): if self.buf.startswith(b"\037\213\010"): return "gz" - if self.buf.startswith(b"BZh91"): + if self.buf[0:3] == b"BZh" and self.buf[4:10] == b"1AY&SY": return "bz2" return "tar" diff --git a/Lib/test/test_tarfile.py b/Lib/test/test_tarfile.py --- a/Lib/test/test_tarfile.py +++ b/Lib/test/test_tarfile.py @@ -529,6 +529,23 @@ def test_detect_fileobj(self): self._test_modes(self._testfunc_fileobj) + def test_detect_stream_bz2(self): + # Originally, tarfile's stream detection looked for the string + # "BZh91" at the start of the file. This is incorrect because + # the '9' represents the blocksize (900kB). If the file was + # compressed using another blocksize autodetection fails. + if not bz2: + return + + with open(tarname, "rb") as fobj: + data = fobj.read() + + # Compress with blocksize 100kB, the file starts with "BZh11". + with bz2.BZ2File(tmpname, "wb", compresslevel=1) as fobj: + fobj.write(data) + + self._testfunc_file(tmpname, "r|*") + class MemberReadTest(ReadTest): @@ -1818,11 +1835,8 @@ if bz2: # Create testtar.tar.bz2 and add bz2-specific tests. support.unlink(bz2name) - tar = bz2.BZ2File(bz2name, "wb") - try: + with bz2.BZ2File(bz2name, "wb") as tar: tar.write(data) - finally: - tar.close() tests += [ Bz2MiscReadTest, diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -90,6 +90,9 @@ Library ------- +- tarfile.py: Correctly detect bzip2 compressed streams with blocksizes + other than 900k. + - Issue #13439: Fix many errors in turtle docstrings. - Issue #13487: Make inspect.getmodule robust against changes done to -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Tue Dec 6 13:10:30 2011 From: python-checkins at python.org (lars.gustaebel) Date: Tue, 06 Dec 2011 13:10:30 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_with_3=2E2=3A_Correctly_detect_bzip2_compressed_stream?= =?utf8?q?s_with_blocksizes_other?= Message-ID: http://hg.python.org/cpython/rev/9149aefff883 changeset: 73869:9149aefff883 parent: 73867:6b6c79eba944 parent: 73868:80876df8adce user: Lars Gust?bel date: Tue Dec 06 13:00:58 2011 +0100 summary: Merge with 3.2: Correctly detect bzip2 compressed streams with blocksizes other than 900k. files: Lib/tarfile.py | 2 +- Lib/test/test_tarfile.py | 22 ++++++++++++++++++---- Misc/NEWS | 3 +++ 3 files changed, 22 insertions(+), 5 deletions(-) diff --git a/Lib/tarfile.py b/Lib/tarfile.py --- a/Lib/tarfile.py +++ b/Lib/tarfile.py @@ -624,7 +624,7 @@ def getcomptype(self): if self.buf.startswith(b"\037\213\010"): return "gz" - if self.buf.startswith(b"BZh91"): + if self.buf[0:3] == b"BZh" and self.buf[4:10] == b"1AY&SY": return "bz2" return "tar" diff --git a/Lib/test/test_tarfile.py b/Lib/test/test_tarfile.py --- a/Lib/test/test_tarfile.py +++ b/Lib/test/test_tarfile.py @@ -529,6 +529,23 @@ def test_detect_fileobj(self): self._test_modes(self._testfunc_fileobj) + def test_detect_stream_bz2(self): + # Originally, tarfile's stream detection looked for the string + # "BZh91" at the start of the file. This is incorrect because + # the '9' represents the blocksize (900kB). If the file was + # compressed using another blocksize autodetection fails. + if not bz2: + return + + with open(tarname, "rb") as fobj: + data = fobj.read() + + # Compress with blocksize 100kB, the file starts with "BZh11". + with bz2.BZ2File(tmpname, "wb", compresslevel=1) as fobj: + fobj.write(data) + + self._testfunc_file(tmpname, "r|*") + class MemberReadTest(ReadTest): @@ -1818,11 +1835,8 @@ if bz2: # Create testtar.tar.bz2 and add bz2-specific tests. support.unlink(bz2name) - tar = bz2.BZ2File(bz2name, "wb") - try: + with bz2.BZ2File(bz2name, "wb") as tar: tar.write(data) - finally: - tar.close() tests += [ Bz2MiscReadTest, diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -406,6 +406,9 @@ Library ------- +- tarfile.py: Correctly detect bzip2 compressed streams with blocksizes + other than 900k. + - Issue #13439: Fix many errors in turtle docstrings. - Issue #6715: Add a module 'lzma' for compression using the LZMA algorithm. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Tue Dec 6 13:10:37 2011 From: python-checkins at python.org (lars.gustaebel) Date: Tue, 06 Dec 2011 13:10:37 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=282=2E7=29=3A_Correctly_detec?= =?utf8?q?t_bzip2_compressed_streams_with_blocksizes_other_than_900k=2E?= Message-ID: http://hg.python.org/cpython/rev/6d1a91e9f506 changeset: 73870:6d1a91e9f506 branch: 2.7 parent: 73853:478b4e9551fa user: Lars Gust?bel date: Tue Dec 06 13:07:09 2011 +0100 summary: Correctly detect bzip2 compressed streams with blocksizes other than 900k. files: Lib/tarfile.py | 2 +- Lib/test/test_tarfile.py | 17 +++++++++++++++++ Misc/NEWS | 3 +++ 3 files changed, 21 insertions(+), 1 deletions(-) diff --git a/Lib/tarfile.py b/Lib/tarfile.py --- a/Lib/tarfile.py +++ b/Lib/tarfile.py @@ -627,7 +627,7 @@ def getcomptype(self): if self.buf.startswith("\037\213\010"): return "gz" - if self.buf.startswith("BZh91"): + if self.buf[0:3] == "BZh" and self.buf[4:10] == "1AY&SY": return "bz2" return "tar" diff --git a/Lib/test/test_tarfile.py b/Lib/test/test_tarfile.py --- a/Lib/test/test_tarfile.py +++ b/Lib/test/test_tarfile.py @@ -440,6 +440,23 @@ def test_detect_fileobj(self): self._test_modes(self._testfunc_fileobj) + def test_detect_stream_bz2(self): + # Originally, tarfile's stream detection looked for the string + # "BZh91" at the start of the file. This is incorrect because + # the '9' represents the blocksize (900kB). If the file was + # compressed using another blocksize autodetection fails. + if not bz2: + return + + with open(tarname, "rb") as fobj: + data = fobj.read() + + # Compress with blocksize 100kB, the file starts with "BZh11". + with bz2.BZ2File(tmpname, "wb", compresslevel=1) as fobj: + fobj.write(data) + + self._testfunc_file(tmpname, "r|*") + class MemberReadTest(ReadTest): diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -79,6 +79,9 @@ Library ------- +- tarfile.py: Correctly detect bzip2 compressed streams with blocksizes + other than 900k. + - Issue #13439: Fix many errors in turtle docstrings. - Issue #12856: Ensure child processes do not inherit the parent's random -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Tue Dec 6 13:49:05 2011 From: python-checkins at python.org (lars.gustaebel) Date: Tue, 06 Dec 2011 13:49:05 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Remove_no_longer_needed_wor?= =?utf8?q?k-around_for_bz2_file_object_support=2E?= Message-ID: http://hg.python.org/cpython/rev/56b7b1ecc240 changeset: 73871:56b7b1ecc240 parent: 73869:9149aefff883 user: Lars Gust?bel date: Tue Dec 06 13:44:10 2011 +0100 summary: Remove no longer needed work-around for bz2 file object support. files: Lib/tarfile.py | 66 +--------------------------- Lib/test/test_tarfile.py | 3 + 2 files changed, 5 insertions(+), 64 deletions(-) diff --git a/Lib/tarfile.py b/Lib/tarfile.py --- a/Lib/tarfile.py +++ b/Lib/tarfile.py @@ -632,66 +632,6 @@ self.fileobj.close() # class StreamProxy -class _BZ2Proxy(object): - """Small proxy class that enables external file object - support for "r:bz2" and "w:bz2" modes. This is actually - a workaround for a limitation in bz2 module's BZ2File - class which (unlike gzip.GzipFile) has no support for - a file object argument. - """ - - blocksize = 16 * 1024 - - def __init__(self, fileobj, mode): - self.fileobj = fileobj - self.mode = mode - self.name = getattr(self.fileobj, "name", None) - self.init() - - def init(self): - import bz2 - self.pos = 0 - if self.mode == "r": - self.bz2obj = bz2.BZ2Decompressor() - self.fileobj.seek(0) - self.buf = b"" - else: - self.bz2obj = bz2.BZ2Compressor() - - def read(self, size): - x = len(self.buf) - while x < size: - raw = self.fileobj.read(self.blocksize) - if not raw: - break - data = self.bz2obj.decompress(raw) - self.buf += data - x += len(data) - - buf = self.buf[:size] - self.buf = self.buf[size:] - self.pos += len(buf) - return buf - - def seek(self, pos): - if pos < self.pos: - self.init() - self.read(pos - self.pos) - - def tell(self): - return self.pos - - def write(self, data): - self.pos += len(data) - raw = self.bz2obj.compress(data) - self.fileobj.write(raw) - - def close(self): - if self.mode == "w": - raw = self.bz2obj.flush() - self.fileobj.write(raw) -# class _BZ2Proxy - #------------------------ # Extraction file object #------------------------ @@ -1829,10 +1769,8 @@ except ImportError: raise CompressionError("bz2 module is not available") - if fileobj is not None: - fileobj = _BZ2Proxy(fileobj, mode) - else: - fileobj = bz2.BZ2File(name, mode, compresslevel=compresslevel) + fileobj = bz2.BZ2File(filename=name if fileobj is None else None, + mode=mode, fileobj=fileobj, compresslevel=compresslevel) try: t = cls.taropen(name, mode, fileobj, **kwargs) diff --git a/Lib/test/test_tarfile.py b/Lib/test/test_tarfile.py --- a/Lib/test/test_tarfile.py +++ b/Lib/test/test_tarfile.py @@ -222,6 +222,9 @@ class MiscReadTest(CommonReadTest): def test_no_name_argument(self): + if self.mode.endswith("bz2"): + # BZ2File has no name attribute. + return with open(self.tarname, "rb") as fobj: tar = tarfile.open(fileobj=fobj, mode=self.mode) self.assertEqual(tar.name, os.path.abspath(fobj.name)) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Tue Dec 6 20:47:51 2011 From: python-checkins at python.org (jesus.cea) Date: Tue, 06 Dec 2011 20:47:51 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogQ2xvc2UgIzEzNTAw?= =?utf8?q?=3A_Hitting_EOF_gets_cmd=2Epy_into_a_infinite_EOF_on_return_loop?= Message-ID: http://hg.python.org/cpython/rev/5910c385fab6 changeset: 73872:5910c385fab6 branch: 2.7 parent: 73870:6d1a91e9f506 user: Jesus Cea date: Tue Dec 06 20:46:04 2011 +0100 summary: Close #13500: Hitting EOF gets cmd.py into a infinite EOF on return loop files: Lib/cmd.py | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) diff --git a/Lib/cmd.py b/Lib/cmd.py --- a/Lib/cmd.py +++ b/Lib/cmd.py @@ -209,6 +209,8 @@ if cmd is None: return self.default(line) self.lastcmd = line + if line == 'EOF' : + self.lastcmd = '' if cmd == '': return self.default(line) else: -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Tue Dec 6 20:47:53 2011 From: python-checkins at python.org (jesus.cea) Date: Tue, 06 Dec 2011 20:47:53 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogQ2xvc2UgIzEzNTAw?= =?utf8?q?=3A_Hitting_EOF_gets_cmd=2Epy_into_a_infinite_EOF_on_return_loop?= Message-ID: http://hg.python.org/cpython/rev/b6b4d74b8d42 changeset: 73873:b6b4d74b8d42 branch: 3.2 parent: 73868:80876df8adce user: Jesus Cea date: Tue Dec 06 20:46:57 2011 +0100 summary: Close #13500: Hitting EOF gets cmd.py into a infinite EOF on return loop files: Lib/cmd.py | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) diff --git a/Lib/cmd.py b/Lib/cmd.py --- a/Lib/cmd.py +++ b/Lib/cmd.py @@ -205,6 +205,8 @@ if cmd is None: return self.default(line) self.lastcmd = line + if line == 'EOF' : + self.lastcmd = '' if cmd == '': return self.default(line) else: -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Tue Dec 6 20:47:53 2011 From: python-checkins at python.org (jesus.cea) Date: Tue, 06 Dec 2011 20:47:53 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_MERGE=3A_Close_=2313500=3A_Hitting_EOF_gets_cmd=2Epy_into_a_?= =?utf8?q?infinite_EOF_on_return_loop?= Message-ID: http://hg.python.org/cpython/rev/70ba352f9586 changeset: 73874:70ba352f9586 parent: 73871:56b7b1ecc240 parent: 73873:b6b4d74b8d42 user: Jesus Cea date: Tue Dec 06 20:47:38 2011 +0100 summary: MERGE: Close #13500: Hitting EOF gets cmd.py into a infinite EOF on return loop files: Lib/cmd.py | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) diff --git a/Lib/cmd.py b/Lib/cmd.py --- a/Lib/cmd.py +++ b/Lib/cmd.py @@ -205,6 +205,8 @@ if cmd is None: return self.default(line) self.lastcmd = line + if line == 'EOF' : + self.lastcmd = '' if cmd == '': return self.default(line) else: -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Tue Dec 6 22:39:53 2011 From: python-checkins at python.org (antoine.pitrou) Date: Tue, 06 Dec 2011 22:39:53 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2313464=3A_Add_a_rea?= =?utf8?q?dinto=28=29_method_to_http=2Eclient=2EHTTPResponse=2E?= Message-ID: http://hg.python.org/cpython/rev/806cfe39f729 changeset: 73875:806cfe39f729 user: Antoine Pitrou date: Tue Dec 06 22:33:57 2011 +0100 summary: Issue #13464: Add a readinto() method to http.client.HTTPResponse. Patch by Jon Kuhn. files: Doc/library/http.client.rst | 6 + Lib/http/client.py | 167 +++++++++++++++++------ Lib/test/test_httplib.py | 86 ++++++++++++ Misc/ACKS | 1 + Misc/NEWS | 3 + 5 files changed, 215 insertions(+), 48 deletions(-) diff --git a/Doc/library/http.client.rst b/Doc/library/http.client.rst --- a/Doc/library/http.client.rst +++ b/Doc/library/http.client.rst @@ -502,6 +502,12 @@ Reads and returns the response body, or up to the next *amt* bytes. +.. method:: HTTPResponse.readinto(b) + + Reads up to the next len(b) bytes of the response body into the buffer *b*. + Returns the number of bytes read. + + .. versionadded:: 3.3 .. method:: HTTPResponse.getheader(name, default=None) diff --git a/Lib/http/client.py b/Lib/http/client.py --- a/Lib/http/client.py +++ b/Lib/http/client.py @@ -485,11 +485,17 @@ self.close() return b"" - if self.chunked: - return self._read_chunked(amt) + if amt is not None: + # Amount is given, so call base class version + # (which is implemented in terms of self.readinto) + return super(HTTPResponse, self).read(amt) + else: + # Amount is not given (unbounded read) so we must check self.length + # and self.chunked - if amt is None: - # unbounded read + if self.chunked: + return self._readall_chunked() + if self.length is None: s = self.fp.read() else: @@ -498,61 +504,49 @@ self.close() # we read everything return s + def readinto(self, b): + if self.fp is None: + return 0 + + if self._method == "HEAD": + self.close() + return 0 + + if self.chunked: + return self._readinto_chunked(b) + if self.length is not None: - if amt > self.length: + if len(b) > self.length: # clip the read to the "end of response" - amt = self.length + b = memoryview(b)[0:self.length] # we do not use _safe_read() here because this may be a .will_close # connection, and the user is reading more bytes than will be provided # (for example, reading in 1k chunks) - s = self.fp.read(amt) + n = self.fp.readinto(b) if self.length is not None: - self.length -= len(s) + self.length -= n if not self.length: self.close() - return s + return n - def _read_chunked(self, amt): - assert self.chunked != _UNKNOWN - chunk_left = self.chunk_left - value = [] - while True: - if chunk_left is None: - line = self.fp.readline(_MAXLINE + 1) - if len(line) > _MAXLINE: - raise LineTooLong("chunk size") - i = line.find(b";") - if i >= 0: - line = line[:i] # strip chunk-extensions - try: - chunk_left = int(line, 16) - except ValueError: - # close the connection as protocol synchronisation is - # probably lost - self.close() - raise IncompleteRead(b''.join(value)) - if chunk_left == 0: - break - if amt is None: - value.append(self._safe_read(chunk_left)) - elif amt < chunk_left: - value.append(self._safe_read(amt)) - self.chunk_left = chunk_left - amt - return b''.join(value) - elif amt == chunk_left: - value.append(self._safe_read(amt)) - self._safe_read(2) # toss the CRLF at the end of the chunk - self.chunk_left = None - return b''.join(value) - else: - value.append(self._safe_read(chunk_left)) - amt -= chunk_left + def _read_next_chunk_size(self): + # Read the next chunk size from the file + line = self.fp.readline(_MAXLINE + 1) + if len(line) > _MAXLINE: + raise LineTooLong("chunk size") + i = line.find(b";") + if i >= 0: + line = line[:i] # strip chunk-extensions + try: + return int(line, 16) + except ValueError: + # close the connection as protocol synchronisation is + # probably lost + self.close() + raise - # we read the whole chunk, get another - self._safe_read(2) # toss the CRLF at the end of the chunk - chunk_left = None - + def _read_and_discard_trailer(self): # read and discard trailer up to the CRLF terminator ### note: we shouldn't have any trailers! while True: @@ -566,11 +560,72 @@ if line == b"\r\n": break + def _readall_chunked(self): + assert self.chunked != _UNKNOWN + chunk_left = self.chunk_left + value = [] + while True: + if chunk_left is None: + try: + chunk_left = self._read_next_chunk_size() + if chunk_left == 0: + break + except ValueError: + raise IncompleteRead(b''.join(value)) + value.append(self._safe_read(chunk_left)) + + # we read the whole chunk, get another + self._safe_read(2) # toss the CRLF at the end of the chunk + chunk_left = None + + self._read_and_discard_trailer() + # we read everything; close the "file" self.close() return b''.join(value) + def _readinto_chunked(self, b): + assert self.chunked != _UNKNOWN + chunk_left = self.chunk_left + + total_bytes = 0 + mvb = memoryview(b) + while True: + if chunk_left is None: + try: + chunk_left = self._read_next_chunk_size() + if chunk_left == 0: + break + except ValueError: + raise IncompleteRead(bytes(b[0:total_bytes])) + + if len(mvb) < chunk_left: + n = self._safe_readinto(mvb) + self.chunk_left = chunk_left - n + return n + elif len(mvb) == chunk_left: + n = self._safe_readinto(mvb) + self._safe_read(2) # toss the CRLF at the end of the chunk + self.chunk_left = None + return n + else: + temp_mvb = mvb[0:chunk_left] + n = self._safe_readinto(temp_mvb) + mvb = mvb[n:] + total_bytes += n + + # we read the whole chunk, get another + self._safe_read(2) # toss the CRLF at the end of the chunk + chunk_left = None + + self._read_and_discard_trailer() + + # we read everything; close the "file" + self.close() + + return total_bytes + def _safe_read(self, amt): """Read the number of bytes requested, compensating for partial reads. @@ -594,6 +649,22 @@ amt -= len(chunk) return b"".join(s) + def _safe_readinto(self, b): + """Same as _safe_read, but for reading into a buffer.""" + total_bytes = 0 + mvb = memoryview(b) + while total_bytes < len(b): + if MAXAMOUNT < len(mvb): + temp_mvb = mvb[0:MAXAMOUNT] + n = self.fp.readinto(temp_mvb) + else: + n = self.fp.readinto(mvb) + if not n: + raise IncompleteRead(bytes(mvb[0:total_bytes]), len(b)) + mvb = mvb[n:] + total_bytes += n + return total_bytes + def fileno(self): return self.fp.fileno() diff --git a/Lib/test/test_httplib.py b/Lib/test/test_httplib.py --- a/Lib/test/test_httplib.py +++ b/Lib/test/test_httplib.py @@ -158,6 +158,23 @@ self.assertEqual(resp.read(2), b'xt') self.assertTrue(resp.isclosed()) + def test_partial_readintos(self): + # if we have a lenght, the system knows when to close itself + # same behaviour than when we read the whole thing with read() + body = "HTTP/1.1 200 Ok\r\nContent-Length: 4\r\n\r\nText" + sock = FakeSocket(body) + resp = client.HTTPResponse(sock) + resp.begin() + b = bytearray(2) + n = resp.readinto(b) + self.assertEqual(n, 2) + self.assertEqual(bytes(b), b'Te') + self.assertFalse(resp.isclosed()) + n = resp.readinto(b) + self.assertEqual(n, 2) + self.assertEqual(bytes(b), b'xt') + self.assertTrue(resp.isclosed()) + def test_host_port(self): # Check invalid host_port @@ -206,6 +223,21 @@ if resp.read(): self.fail("Did not expect response from HEAD request") + def test_readinto_head(self): + # Test that the library doesn't attempt to read any data + # from a HEAD request. (Tickles SF bug #622042.) + sock = FakeSocket( + 'HTTP/1.1 200 OK\r\n' + 'Content-Length: 14432\r\n' + '\r\n', + NoEOFStringIO) + resp = client.HTTPResponse(sock, method="HEAD") + resp.begin() + b = bytearray(5) + if resp.readinto(b) != 0: + self.fail("Did not expect response from HEAD request") + self.assertEqual(bytes(b), b'\x00'*5) + def test_send_file(self): expected = (b'GET /foo HTTP/1.1\r\nHost: example.com\r\n' b'Accept-Encoding: identity\r\nContent-Length:') @@ -285,6 +317,40 @@ finally: resp.close() + def test_readinto_chunked(self): + chunked_start = ( + 'HTTP/1.1 200 OK\r\n' + 'Transfer-Encoding: chunked\r\n\r\n' + 'a\r\n' + 'hello worl\r\n' + '1\r\n' + 'd\r\n' + ) + sock = FakeSocket(chunked_start + '0\r\n') + resp = client.HTTPResponse(sock, method="GET") + resp.begin() + b = bytearray(16) + n = resp.readinto(b) + self.assertEqual(b[:11], b'hello world') + self.assertEqual(n, 11) + resp.close() + + for x in ('', 'foo\r\n'): + sock = FakeSocket(chunked_start + x) + resp = client.HTTPResponse(sock, method="GET") + resp.begin() + try: + b = bytearray(16) + n = resp.readinto(b) + except client.IncompleteRead as i: + self.assertEqual(i.partial, b'hello world') + self.assertEqual(repr(i),'IncompleteRead(11 bytes read)') + self.assertEqual(str(i),'IncompleteRead(11 bytes read)') + else: + self.fail('IncompleteRead expected') + finally: + resp.close() + def test_chunked_head(self): chunked_start = ( 'HTTP/1.1 200 OK\r\n' @@ -302,6 +368,26 @@ self.assertEqual(resp.reason, 'OK') self.assertTrue(resp.isclosed()) + def test_readinto_chunked_head(self): + chunked_start = ( + 'HTTP/1.1 200 OK\r\n' + 'Transfer-Encoding: chunked\r\n\r\n' + 'a\r\n' + 'hello world\r\n' + '1\r\n' + 'd\r\n' + ) + sock = FakeSocket(chunked_start + '0\r\n') + resp = client.HTTPResponse(sock, method="HEAD") + resp.begin() + b = bytearray(5) + n = resp.readinto(b) + self.assertEqual(n, 0) + self.assertEqual(bytes(b), b'\x00'*5) + self.assertEqual(resp.status, 200) + self.assertEqual(resp.reason, 'OK') + self.assertTrue(resp.isclosed()) + def test_negative_content_length(self): sock = FakeSocket( 'HTTP/1.1 200 OK\r\nContent-Length: -1\r\n\r\nHello\r\n') diff --git a/Misc/ACKS b/Misc/ACKS --- a/Misc/ACKS +++ b/Misc/ACKS @@ -547,6 +547,7 @@ Andrej Krpic Ivan Krsti? Andrew Kuchling +Jon Kuhn Vladimir Kushnir Ross Lagerwall Cameron Laird diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -406,6 +406,9 @@ Library ------- +- Issue #13464: Add a readinto() method to http.client.HTTPResponse. Patch + by Jon Kuhn. + - tarfile.py: Correctly detect bzip2 compressed streams with blocksizes other than 900k. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Tue Dec 6 22:39:54 2011 From: python-checkins at python.org (antoine.pitrou) Date: Tue, 06 Dec 2011 22:39:54 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Fix_dangling_whitespace?= Message-ID: http://hg.python.org/cpython/rev/daaacc0ec584 changeset: 73876:daaacc0ec584 user: Antoine Pitrou date: Tue Dec 06 22:34:36 2011 +0100 summary: Fix dangling whitespace files: Lib/http/client.py | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/Lib/http/client.py b/Lib/http/client.py --- a/Lib/http/client.py +++ b/Lib/http/client.py @@ -599,7 +599,7 @@ break except ValueError: raise IncompleteRead(bytes(b[0:total_bytes])) - + if len(mvb) < chunk_left: n = self._safe_readinto(mvb) self.chunk_left = chunk_left - n -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Wed Dec 7 05:34:00 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Wed, 07 Dec 2011 05:34:00 +0100 Subject: [Python-checkins] Daily reference leaks (daaacc0ec584): sum=0 Message-ID: results for daaacc0ec584 on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/refloglUfbkU', '-x'] From python-checkins at python.org Wed Dec 7 10:14:52 2011 From: python-checkins at python.org (ned.deily) Date: Wed, 07 Dec 2011 10:14:52 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzg2NDE6?= =?utf8?q?_Update_IDLE_3_syntax_coloring_to_recognize_b=22=2E=2E=22_and_no?= =?utf8?b?dCB1Ii4uIi4=?= Message-ID: http://hg.python.org/cpython/rev/3822c8087d70 changeset: 73877:3822c8087d70 branch: 3.2 parent: 73873:b6b4d74b8d42 user: Ned Deily date: Wed Dec 07 01:08:35 2011 -0800 summary: Issue #8641: Update IDLE 3 syntax coloring to recognize b".." and not u"..". (Patch by Tal Einat) files: Lib/idlelib/ColorDelegator.py | 8 ++++---- Misc/ACKS | 1 + Misc/NEWS | 3 +++ 3 files changed, 8 insertions(+), 4 deletions(-) diff --git a/Lib/idlelib/ColorDelegator.py b/Lib/idlelib/ColorDelegator.py --- a/Lib/idlelib/ColorDelegator.py +++ b/Lib/idlelib/ColorDelegator.py @@ -20,10 +20,10 @@ # 1st 'file' colorized normal, 2nd as builtin, 3rd as string builtin = r"([^.'\"\\#]\b|^)" + any("BUILTIN", builtinlist) + r"\b" comment = any("COMMENT", [r"#[^\n]*"]) - sqstring = r"(\b[rRuU])?'[^'\\\n]*(\\.[^'\\\n]*)*'?" - dqstring = r'(\b[rRuU])?"[^"\\\n]*(\\.[^"\\\n]*)*"?' - sq3string = r"(\b[rRuU])?'''[^'\\]*((\\.|'(?!''))[^'\\]*)*(''')?" - dq3string = r'(\b[rRuU])?"""[^"\\]*((\\.|"(?!""))[^"\\]*)*(""")?' + sqstring = r"(\b[rRbB])?'[^'\\\n]*(\\.[^'\\\n]*)*'?" + dqstring = r'(\b[rRbB])?"[^"\\\n]*(\\.[^"\\\n]*)*"?' + sq3string = r"(\b[rRbB])?'''[^'\\]*((\\.|'(?!''))[^'\\]*)*(''')?" + dq3string = r'(\b[rRbB])?"""[^"\\]*((\\.|"(?!""))[^"\\]*)*(""")?' string = any("STRING", [sq3string, dq3string, sqstring, dqstring]) return kw + "|" + builtin + "|" + comment + "|" + string +\ "|" + any("SYNC", [r"\n"]) diff --git a/Misc/ACKS b/Misc/ACKS --- a/Misc/ACKS +++ b/Misc/ACKS @@ -256,6 +256,7 @@ Rodolpho Eckhardt Grant Edwards John Ehresman +Tal Einat Eric Eisner Andrew Eland Julien ?lie diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -90,6 +90,9 @@ Library ------- +- Issue #8641: Update IDLE 3 syntax coloring to recognize b".." and not u"..". + Patch by Tal Einat. + - tarfile.py: Correctly detect bzip2 compressed streams with blocksizes other than 900k. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Wed Dec 7 10:14:55 2011 From: python-checkins at python.org (ned.deily) Date: Wed, 07 Dec 2011 10:14:55 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Issue_=238641=3A_Update_IDLE_3_syntax_coloring_to_recognize_?= =?utf8?b?YiIuLiIgYW5kIG5vdCB1Ii4uIi4=?= Message-ID: http://hg.python.org/cpython/rev/e49220f4c31f changeset: 73878:e49220f4c31f parent: 73876:daaacc0ec584 parent: 73877:3822c8087d70 user: Ned Deily date: Wed Dec 07 01:12:50 2011 -0800 summary: Issue #8641: Update IDLE 3 syntax coloring to recognize b".." and not u"..". (Patch by Tal Einat) files: Lib/idlelib/ColorDelegator.py | 8 ++++---- Misc/ACKS | 1 + Misc/NEWS | 3 +++ 3 files changed, 8 insertions(+), 4 deletions(-) diff --git a/Lib/idlelib/ColorDelegator.py b/Lib/idlelib/ColorDelegator.py --- a/Lib/idlelib/ColorDelegator.py +++ b/Lib/idlelib/ColorDelegator.py @@ -20,10 +20,10 @@ # 1st 'file' colorized normal, 2nd as builtin, 3rd as string builtin = r"([^.'\"\\#]\b|^)" + any("BUILTIN", builtinlist) + r"\b" comment = any("COMMENT", [r"#[^\n]*"]) - sqstring = r"(\b[rRuU])?'[^'\\\n]*(\\.[^'\\\n]*)*'?" - dqstring = r'(\b[rRuU])?"[^"\\\n]*(\\.[^"\\\n]*)*"?' - sq3string = r"(\b[rRuU])?'''[^'\\]*((\\.|'(?!''))[^'\\]*)*(''')?" - dq3string = r'(\b[rRuU])?"""[^"\\]*((\\.|"(?!""))[^"\\]*)*(""")?' + sqstring = r"(\b[rRbB])?'[^'\\\n]*(\\.[^'\\\n]*)*'?" + dqstring = r'(\b[rRbB])?"[^"\\\n]*(\\.[^"\\\n]*)*"?' + sq3string = r"(\b[rRbB])?'''[^'\\]*((\\.|'(?!''))[^'\\]*)*(''')?" + dq3string = r'(\b[rRbB])?"""[^"\\]*((\\.|"(?!""))[^"\\]*)*(""")?' string = any("STRING", [sq3string, dq3string, sqstring, dqstring]) return kw + "|" + builtin + "|" + comment + "|" + string +\ "|" + any("SYNC", [r"\n"]) diff --git a/Misc/ACKS b/Misc/ACKS --- a/Misc/ACKS +++ b/Misc/ACKS @@ -275,6 +275,7 @@ John Edmonds Grant Edwards John Ehresman +Tal Einat Eric Eisner Andrew Eland Julien ?lie diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -406,6 +406,9 @@ Library ------- +- Issue #8641: Update IDLE 3 syntax coloring to recognize b".." and not u"..". + Patch by Tal Einat. + - Issue #13464: Add a readinto() method to http.client.HTTPResponse. Patch by Jon Kuhn. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Wed Dec 7 19:16:31 2011 From: python-checkins at python.org (charles-francois.natali) Date: Wed, 07 Dec 2011 19:16:31 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2311051=3A_Reduce_th?= =?utf8?q?e_number_of_syscalls_per_import=2E?= Message-ID: http://hg.python.org/cpython/rev/a541bda2f5e2 changeset: 73879:a541bda2f5e2 user: Charles-Fran?ois Natali date: Wed Dec 07 19:16:01 2011 +0100 summary: Issue #11051: Reduce the number of syscalls per import. files: Python/import.c | 3 +-- 1 files changed, 1 insertions(+), 2 deletions(-) diff --git a/Python/import.c b/Python/import.c --- a/Python/import.c +++ b/Python/import.c @@ -1944,8 +1944,7 @@ if (Py_VerboseFlag > 1) PySys_FormatStderr("# trying %R\n", filename); - if (_Py_stat(filename, &statbuf) == 0 && /* it exists */ - S_ISDIR(statbuf.st_mode)) /* it's a directory */ + if (_Py_stat(filename, &statbuf) != 0 || S_ISDIR(statbuf.st_mode)) { Py_DECREF(filename); continue; -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Wed Dec 7 21:51:15 2011 From: python-checkins at python.org (amaury.forgeotdarc) Date: Wed, 07 Dec 2011 21:51:15 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogSXNzdWUgIzEzNTQ2?= =?utf8?q?=3A_Fixed_an_overflow_issue_that_could_crash_the_intepreter_when?= Message-ID: http://hg.python.org/cpython/rev/57de1ad15c54 changeset: 73880:57de1ad15c54 branch: 2.7 parent: 73872:5910c385fab6 user: Amaury Forgeot d'Arc date: Wed Dec 07 21:46:48 2011 +0100 summary: Issue #13546: Fixed an overflow issue that could crash the intepreter when calling sys.setrecursionlimit((1<<31)-1). 2.7 only. files: Lib/test/test_sys.py | 12 ++++++++++++ Misc/NEWS | 3 +++ Python/errors.c | 6 ++++-- 3 files changed, 19 insertions(+), 2 deletions(-) diff --git a/Lib/test/test_sys.py b/Lib/test/test_sys.py --- a/Lib/test/test_sys.py +++ b/Lib/test/test_sys.py @@ -224,6 +224,18 @@ self.assertEqual(sys.getrecursionlimit(), 10000) sys.setrecursionlimit(oldlimit) + self.assertRaises(OverflowError, sys.setrecursionlimit, 1 << 31) + try: + sys.setrecursionlimit((1 << 31) - 5) + try: + # issue13546: isinstance(e, ValueError) used to fail + # when the recursion limit is close to 1<<31 + raise ValueError() + except ValueError, e: + pass + finally: + sys.setrecursionlimit(oldlimit) + def test_getwindowsversion(self): # Raise SkipTest if sys doesn't have getwindowsversion attribute test.test_support.get_attribute(sys, "getwindowsversion") diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -9,6 +9,9 @@ Core and Builtins ----------------- +- Issue #13546: Fixed an overflow issue that could crash the intepreter when + calling sys.setrecursionlimit((1<<31)-1). + - Issue #13333: The UTF-7 decoder now accepts lone surrogates (the encoder already accepts them). diff --git a/Python/errors.c b/Python/errors.c --- a/Python/errors.c +++ b/Python/errors.c @@ -111,9 +111,11 @@ PyErr_Fetch(&exception, &value, &tb); /* Temporarily bump the recursion limit, so that in the most common case PyObject_IsSubclass will not raise a recursion - error we have to ignore anyway. */ + error we have to ignore anyway. Don't do it when the limit + is already insanely high, to avoid overflow */ reclimit = Py_GetRecursionLimit(); - Py_SetRecursionLimit(reclimit + 5); + if (reclimit < (1 << 30)) + Py_SetRecursionLimit(reclimit + 5); res = PyObject_IsSubclass(err, exc); Py_SetRecursionLimit(reclimit); /* This function must not fail, so print the error here */ -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Wed Dec 7 23:04:09 2011 From: python-checkins at python.org (ezio.melotti) Date: Wed, 07 Dec 2011 23:04:09 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogIzEzNTMxOiBhZGQg?= =?utf8?q?a_test_for_defaultdict_with_a_non-callable_arg=2E__Patch_by_Mike?= Message-ID: http://hg.python.org/cpython/rev/a8deeb549e1a changeset: 73881:a8deeb549e1a branch: 2.7 user: Ezio Melotti date: Thu Dec 08 00:00:49 2011 +0200 summary: #13531: add a test for defaultdict with a non-callable arg. Patch by Mike Cheng. files: Lib/test/test_defaultdict.py | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) diff --git a/Lib/test/test_defaultdict.py b/Lib/test/test_defaultdict.py --- a/Lib/test/test_defaultdict.py +++ b/Lib/test/test_defaultdict.py @@ -171,6 +171,8 @@ finally: os.remove(tfn) + def test_callable_arg(self): + self.assertRaises(TypeError, defaultdict, {}) def test_main(): test_support.run_unittest(TestDefaultDict) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Wed Dec 7 23:04:10 2011 From: python-checkins at python.org (ezio.melotti) Date: Wed, 07 Dec 2011 23:04:10 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogIzEzNTMxOiBhZGQg?= =?utf8?q?a_test_for_defaultdict_with_a_non-callable_arg=2E__Patch_by_Mike?= Message-ID: http://hg.python.org/cpython/rev/17ceebc61b65 changeset: 73882:17ceebc61b65 branch: 3.2 parent: 73877:3822c8087d70 user: Ezio Melotti date: Thu Dec 08 00:02:00 2011 +0200 summary: #13531: add a test for defaultdict with a non-callable arg. Patch by Mike Cheng. files: Lib/test/test_defaultdict.py | 3 +++ 1 files changed, 3 insertions(+), 0 deletions(-) diff --git a/Lib/test/test_defaultdict.py b/Lib/test/test_defaultdict.py --- a/Lib/test/test_defaultdict.py +++ b/Lib/test/test_defaultdict.py @@ -172,6 +172,9 @@ finally: os.remove(tfn) + def test_callable_arg(self): + self.assertRaises(TypeError, defaultdict, {}) + def test_pickleing(self): d = defaultdict(int) d[1] -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Wed Dec 7 23:04:10 2011 From: python-checkins at python.org (ezio.melotti) Date: Wed, 07 Dec 2011 23:04:10 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_=2313531=3A_merge_with_3=2E2=2E?= Message-ID: http://hg.python.org/cpython/rev/4180308547d9 changeset: 73883:4180308547d9 parent: 73879:a541bda2f5e2 parent: 73882:17ceebc61b65 user: Ezio Melotti date: Thu Dec 08 00:03:59 2011 +0200 summary: #13531: merge with 3.2. files: Lib/test/test_defaultdict.py | 3 +++ 1 files changed, 3 insertions(+), 0 deletions(-) diff --git a/Lib/test/test_defaultdict.py b/Lib/test/test_defaultdict.py --- a/Lib/test/test_defaultdict.py +++ b/Lib/test/test_defaultdict.py @@ -172,6 +172,9 @@ finally: os.remove(tfn) + def test_callable_arg(self): + self.assertRaises(TypeError, defaultdict, {}) + def test_pickleing(self): d = defaultdict(int) d[1] -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Wed Dec 7 23:20:03 2011 From: python-checkins at python.org (charles-francois.natali) Date: Wed, 07 Dec 2011 23:20:03 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Followup_to_a541bda2f5e2=3A?= =?utf8?q?_Add_a_short_comment=2E?= Message-ID: http://hg.python.org/cpython/rev/174fbbed8747 changeset: 73884:174fbbed8747 user: Charles-Fran?ois Natali date: Wed Dec 07 23:17:58 2011 +0100 summary: Followup to a541bda2f5e2: Add a short comment. files: Python/import.c | 1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/Python/import.c b/Python/import.c --- a/Python/import.c +++ b/Python/import.c @@ -1946,6 +1946,7 @@ if (_Py_stat(filename, &statbuf) != 0 || S_ISDIR(statbuf.st_mode)) { + /* it doesn't exist, or it's a directory */ Py_DECREF(filename); continue; } -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 00:06:28 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 08 Dec 2011 00:06:28 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_libpython=2Epy=3A_defer_cal?= =?utf8?q?l_to_gdb=2Elookup=5Ftype=28=27PyUnicodeObject=27=29?= Message-ID: http://hg.python.org/cpython/rev/50c7ac2fe13c changeset: 73885:50c7ac2fe13c user: Victor Stinner date: Thu Dec 08 00:08:22 2011 +0100 summary: libpython.py: defer call to gdb.lookup_type('PyUnicodeObject') The lookup fails at startup if Python is linked to a shared library. files: Tools/gdb/libpython.py | 7 ++++++- 1 files changed, 6 insertions(+), 1 deletions(-) diff --git a/Tools/gdb/libpython.py b/Tools/gdb/libpython.py --- a/Tools/gdb/libpython.py +++ b/Tools/gdb/libpython.py @@ -53,7 +53,8 @@ _type_unsigned_short_ptr = gdb.lookup_type('unsigned short').pointer() _type_unsigned_int_ptr = gdb.lookup_type('unsigned int').pointer() -_is_pep393 = 'data' in [f.name for f in gdb.lookup_type('PyUnicodeObject').target().fields()] +# value computed later, see PyUnicodeObjectPtr.proxy() +_is_pep393 = None SIZEOF_VOID_P = _type_void_ptr.sizeof @@ -1123,6 +1124,10 @@ return _type_Py_UNICODE.sizeof def proxyval(self, visited): + global _is_pep393 + if _is_pep393 is None: + fields = gdb.lookup_type('PyUnicodeObject').target().fields() + _is_pep393 = 'data' in [f.name for f in fields] if _is_pep393: # Python 3.3 and newer may_have_surrogates = False -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 00:32:58 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 08 Dec 2011 00:32:58 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzExODg2?= =?utf8?q?=3A_workaround_an_OS_bug_=28time_zone_data=29_in_test=5Ftime?= Message-ID: http://hg.python.org/cpython/rev/c143e66e5efe changeset: 73886:c143e66e5efe branch: 3.2 parent: 73882:17ceebc61b65 user: Victor Stinner date: Thu Dec 08 00:32:51 2011 +0100 summary: Issue #11886: workaround an OS bug (time zone data) in test_time Australian Eastern Standard Time (UTC+10) is called "EST" (as Eastern Standard Time, UTC-5) instead of "AEST" on some operating systems (e.g. FreeBSD), which is wrong. See for example this bug: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=93810 files: Lib/test/test_time.py | 7 ++++++- 1 files changed, 6 insertions(+), 1 deletions(-) diff --git a/Lib/test/test_time.py b/Lib/test/test_time.py --- a/Lib/test/test_time.py +++ b/Lib/test/test_time.py @@ -206,7 +206,12 @@ environ['TZ'] = victoria time.tzset() self.assertNotEqual(time.gmtime(xmas2002), time.localtime(xmas2002)) - self.assertTrue(time.tzname[0] == 'AEST', str(time.tzname[0])) + + # Issue #11886: Australian Eastern Standard Time (UTC+10) is called + # "EST" (as Eastern Standard Time, UTC-5) instead of "AEST" on some + # operating systems (e.g. FreeBSD), which is wrong. See for example + # this bug: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=93810 + self.assertIn(time.tzname[0], ('AEST' 'EST'), time.tzname[0]) self.assertTrue(time.tzname[1] == 'AEDT', str(time.tzname[1])) self.assertEqual(len(time.tzname), 2) self.assertEqual(time.daylight, 1) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 00:32:59 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 08 Dec 2011 00:32:59 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_=28Merge_3=2E2=29_Issue_=2311886=3A_workaround_an_OS_bug_=28?= =?utf8?q?time_zone_data=29_in_test=5Ftime?= Message-ID: http://hg.python.org/cpython/rev/c7638be1e430 changeset: 73887:c7638be1e430 parent: 73885:50c7ac2fe13c parent: 73886:c143e66e5efe user: Victor Stinner date: Thu Dec 08 00:33:14 2011 +0100 summary: (Merge 3.2) Issue #11886: workaround an OS bug (time zone data) in test_time Australian Eastern Standard Time (UTC+10) is called "EST" (as Eastern Standard Time, UTC-5) instead of "AEST" on some operating systems (e.g. FreeBSD), which is wrong. See for example this bug: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=93810 files: Lib/test/test_time.py | 7 ++++++- 1 files changed, 6 insertions(+), 1 deletions(-) diff --git a/Lib/test/test_time.py b/Lib/test/test_time.py --- a/Lib/test/test_time.py +++ b/Lib/test/test_time.py @@ -250,7 +250,12 @@ environ['TZ'] = victoria time.tzset() self.assertNotEqual(time.gmtime(xmas2002), time.localtime(xmas2002)) - self.assertTrue(time.tzname[0] == 'AEST', str(time.tzname[0])) + + # Issue #11886: Australian Eastern Standard Time (UTC+10) is called + # "EST" (as Eastern Standard Time, UTC-5) instead of "AEST" on some + # operating systems (e.g. FreeBSD), which is wrong. See for example + # this bug: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=93810 + self.assertIn(time.tzname[0], ('AEST' 'EST'), time.tzname[0]) self.assertTrue(time.tzname[1] == 'AEDT', str(time.tzname[1])) self.assertEqual(len(time.tzname), 2) self.assertEqual(time.daylight, 1) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 00:33:03 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 08 Dec 2011 00:33:03 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogSXNzdWUgIzExODg2?= =?utf8?q?=3A_workaround_an_OS_bug_=28time_zone_data=29_in_test=5Ftime?= Message-ID: http://hg.python.org/cpython/rev/2bca2cee79a1 changeset: 73888:2bca2cee79a1 branch: 2.7 parent: 73881:a8deeb549e1a user: Victor Stinner date: Thu Dec 08 00:32:51 2011 +0100 summary: Issue #11886: workaround an OS bug (time zone data) in test_time Australian Eastern Standard Time (UTC+10) is called "EST" (as Eastern Standard Time, UTC-5) instead of "AEST" on some operating systems (e.g. FreeBSD), which is wrong. See for example this bug: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=93810 files: Lib/test/test_time.py | 7 ++++++- 1 files changed, 6 insertions(+), 1 deletions(-) diff --git a/Lib/test/test_time.py b/Lib/test/test_time.py --- a/Lib/test/test_time.py +++ b/Lib/test/test_time.py @@ -184,7 +184,12 @@ environ['TZ'] = victoria time.tzset() self.assertNotEqual(time.gmtime(xmas2002), time.localtime(xmas2002)) - self.assertTrue(time.tzname[0] == 'AEST', str(time.tzname[0])) + + # Issue #11886: Australian Eastern Standard Time (UTC+10) is called + # "EST" (as Eastern Standard Time, UTC-5) instead of "AEST" on some + # operating systems (e.g. FreeBSD), which is wrong. See for example + # this bug: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=93810 + self.assertIn(time.tzname[0], ('AEST' 'EST'), time.tzname[0]) self.assertTrue(time.tzname[1] == 'AEDT', str(time.tzname[1])) self.assertEqual(len(time.tzname), 2) self.assertEqual(time.daylight, 1) -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Thu Dec 8 05:36:56 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Thu, 08 Dec 2011 05:36:56 +0100 Subject: [Python-checkins] Daily reference leaks (c7638be1e430): sum=0 Message-ID: results for c7638be1e430 on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/reflog2HGBjZ', '-x'] From python-checkins at python.org Thu Dec 8 22:12:09 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 08 Dec 2011 22:12:09 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_PyUnicode=5FFromWideChar=28?= =?utf8?q?=29_and_PyUnicode=5FFromUnicode=28=29_raise_a_ValueError_if_a?= Message-ID: http://hg.python.org/cpython/rev/489ea02ed351 changeset: 73889:489ea02ed351 parent: 73887:c7638be1e430 user: Victor Stinner date: Thu Dec 08 22:14:11 2011 +0100 summary: PyUnicode_FromWideChar() and PyUnicode_FromUnicode() raise a ValueError if a character in not in range [U+0000; U+10ffff]. files: Objects/unicodeobject.c | 67 ++++++++++++++-------------- 1 files changed, 34 insertions(+), 33 deletions(-) diff --git a/Objects/unicodeobject.c b/Objects/unicodeobject.c --- a/Objects/unicodeobject.c +++ b/Objects/unicodeobject.c @@ -66,6 +66,9 @@ extern "C" { #endif +/* Maximum code point of Unicode 6.0: 0x10ffff (1,114,111) */ +#define MAX_UNICODE 0x10ffff + #ifdef Py_DEBUG # define _PyUnicode_CHECK(op) _PyUnicode_CheckConsistency(op, 0) #else @@ -393,9 +396,7 @@ } else { assert(maxchar >= 0x10000); - /* FIXME: Issue #13441: on Solaris, localeconv() and strxfrm() - return characters outside the range U+0000-U+10FFFF. */ - /* assert(maxchar <= 0x10FFFF); */ + assert(maxchar <= MAX_UNICODE); } } return 1; @@ -1295,36 +1296,37 @@ Py_UCS4 *maxchar, Py_ssize_t *num_surrogates) { const wchar_t *iter; + Py_UCS4 ch; assert(num_surrogates != NULL && maxchar != NULL); *num_surrogates = 0; *maxchar = 0; for (iter = begin; iter < end; ) { - if (*iter > *maxchar) { - *maxchar = *iter; -#if SIZEOF_WCHAR_T != 2 - if (*maxchar >= 0x10000) - return 0; -#endif - } #if SIZEOF_WCHAR_T == 2 if (Py_UNICODE_IS_HIGH_SURROGATE(iter[0]) && (iter+1) < end && Py_UNICODE_IS_LOW_SURROGATE(iter[1])) { - Py_UCS4 surrogate_val; - surrogate_val = Py_UNICODE_JOIN_SURROGATES(iter[0], iter[1]); + ch = Py_UNICODE_JOIN_SURROGATES(iter[0], iter[1]); ++(*num_surrogates); - if (surrogate_val > *maxchar) - *maxchar = surrogate_val; iter += 2; } else +#endif + { + ch = *iter; iter++; -#else - iter++; -#endif + } + if (ch > *maxchar) { + *maxchar = ch; + if (*maxchar > MAX_UNICODE) { + PyErr_Format(PyExc_ValueError, + "character U+%x is not in range [U+0000; U+10ffff]", + ch); + return -1; + } + } } return 0; } @@ -1669,8 +1671,7 @@ &maxchar, &num_surrogates) == -1) return NULL; - unicode = PyUnicode_New(size - num_surrogates, - maxchar); + unicode = PyUnicode_New(size - num_surrogates, maxchar); if (!unicode) return NULL; @@ -1808,7 +1809,7 @@ return 0x10000; default: assert(0 && "invalid kind"); - return 0x10ffff; + return MAX_UNICODE; } } @@ -2796,7 +2797,7 @@ PyUnicode_FromOrdinal(int ordinal) { PyObject *v; - if (ordinal < 0 || ordinal > 0x10ffff) { + if (ordinal < 0 || ordinal > MAX_UNICODE) { PyErr_SetString(PyExc_ValueError, "chr() arg not in range(0x110000)"); return NULL; @@ -3472,7 +3473,7 @@ four_bytes = PyUnicode_4BYTE_DATA(unicode); for (; four_bytes < ucs4_end; ++four_bytes, ++w) { if (*four_bytes > 0xFFFF) { - assert(*four_bytes <= 0x10FFFF); + assert(*four_bytes <= MAX_UNICODE); /* encode surrogate pair in this case */ *w++ = Py_UNICODE_HIGH_SURROGATE(*four_bytes); *w = Py_UNICODE_LOW_SURROGATE(*four_bytes); @@ -4118,7 +4119,7 @@ continue; encode_char: if (ch >= 0x10000) { - assert(ch <= 0x10FFFF); + assert(ch <= MAX_UNICODE); /* code first surrogate */ base64bits += 16; @@ -4577,7 +4578,7 @@ } ch = ((s[0] & 0x7) << 18) + ((s[1] & 0x3f) << 12) + ((s[2] & 0x3f) << 6) + (s[3] & 0x3f); - assert ((ch > 0xFFFF) && (ch <= 0x10ffff)); + assert ((ch > 0xFFFF) && (ch <= MAX_UNICODE)); WRITE_MAYBE_FAIL(i++, ch); break; @@ -4714,7 +4715,7 @@ } ch = ((s[0] & 0x7) << 18) + ((s[1] & 0x3f) << 12) + ((s[2] & 0x3f) << 6) + (s[3] & 0x3f); - assert ((ch > 0xFFFF) && (ch <= 0x10ffff)); + assert ((ch > 0xFFFF) && (ch <= MAX_UNICODE)); #if SIZEOF_WCHAR_T == 4 *p++ = (wchar_t)ch; @@ -4884,7 +4885,7 @@ *p++ = (char)(0x80 | ((ch >> 6) & 0x3f)); *p++ = (char)(0x80 | (ch & 0x3f)); } else /* ch >= 0x10000 */ { - assert(ch <= 0x10FFFF); + assert(ch <= MAX_UNICODE); /* Encode UCS4 Unicode ordinals */ *p++ = (char)(0xf0 | (ch >> 18)); *p++ = (char)(0x80 | ((ch >> 12) & 0x3f)); @@ -5792,7 +5793,7 @@ break; store: /* when we get here, chr is a 32-bit unicode character */ - if (chr <= 0x10ffff) { + if (chr <= MAX_UNICODE) { WRITECHAR(chr); } else { endinpos = s-starts; @@ -5957,7 +5958,7 @@ /* Map 21-bit characters to '\U00xxxxxx' */ else if (ch >= 0x10000) { - assert(ch <= 0x10FFFF); + assert(ch <= MAX_UNICODE); *p++ = '\\'; *p++ = 'U'; *p++ = Py_hexdigits[(ch >> 28) & 0x0000000F]; @@ -6108,7 +6109,7 @@ else x += 10 + c - 'A'; } - if (x <= 0x10ffff) { + if (x <= MAX_UNICODE) { if (unicode_putchar(&v, &outpos, x) < 0) goto onError; } else { @@ -6175,7 +6176,7 @@ Py_UCS4 ch = PyUnicode_READ(kind, data, pos); /* Map 32-bit characters to '\Uxxxxxxxx' */ if (ch >= 0x10000) { - assert(ch <= 0x10FFFF); + assert(ch <= MAX_UNICODE); *p++ = '\\'; *p++ = 'U'; *p++ = Py_hexdigits[(ch >> 28) & 0xf]; @@ -6536,7 +6537,7 @@ else if (ch < 1000000) repsize += 2+6+1; else { - assert(ch <= 0x10FFFF); + assert(ch <= MAX_UNICODE); repsize += 2+7+1; } } @@ -9275,7 +9276,7 @@ else if (maxchar_new <= 65535) maxchar_new = 65535; else - maxchar_new = 1114111; /* 0x10ffff */ + maxchar_new = MAX_UNICODE; if (!maxchar_new && PyUnicode_CheckExact(self)) { /* fixfct should return TRUE if it modified the buffer. If @@ -13059,7 +13060,7 @@ if (x == -1 && PyErr_Occurred()) goto onError; - if (x < 0 || x > 0x10ffff) { + if (x < 0 || x > MAX_UNICODE) { PyErr_SetString(PyExc_OverflowError, "%c arg not in range(0x110000)"); return (Py_UCS4) -1; -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 22:31:15 2011 From: python-checkins at python.org (stefan.krah) Date: Thu, 08 Dec 2011 22:31:15 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzExMTQ5?= =?utf8?q?=3A_Also_enable_-fwrapv_if_=24CC_is_a_full_path?= Message-ID: http://hg.python.org/cpython/rev/7efad6256e58 changeset: 73890:7efad6256e58 branch: 3.2 parent: 73886:c143e66e5efe user: Stefan Krah date: Thu Dec 08 22:20:31 2011 +0100 summary: Issue #11149: Also enable -fwrapv if $CC is a full path or has a trailing version number. files: configure | 7 ++++--- configure.in | 7 ++++--- 2 files changed, 8 insertions(+), 6 deletions(-) diff --git a/configure b/configure --- a/configure +++ b/configure @@ -5498,9 +5498,10 @@ fi # Clang also needs -fwrapv - if test "$CC" = "clang" ; then - WRAP="-fwrapv" - fi + case $CC in + *clang*) WRAP="-fwrapv" + ;; + esac case $ac_cv_prog_cc_g in yes) diff --git a/configure.in b/configure.in --- a/configure.in +++ b/configure.in @@ -928,9 +928,10 @@ fi # Clang also needs -fwrapv - if test "$CC" = "clang" ; then - WRAP="-fwrapv" - fi + case $CC in + *clang*) WRAP="-fwrapv" + ;; + esac case $ac_cv_prog_cc_g in yes) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 22:31:16 2011 From: python-checkins at python.org (stefan.krah) Date: Thu, 08 Dec 2011 22:31:16 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_second_fix_for_issue_=2311149=2E?= Message-ID: http://hg.python.org/cpython/rev/e48df59af394 changeset: 73891:e48df59af394 parent: 73887:c7638be1e430 parent: 73890:7efad6256e58 user: Stefan Krah date: Thu Dec 08 22:22:58 2011 +0100 summary: Merge second fix for issue #11149. files: configure | 7 ++++--- configure.in | 7 ++++--- 2 files changed, 8 insertions(+), 6 deletions(-) diff --git a/configure b/configure --- a/configure +++ b/configure @@ -5450,9 +5450,10 @@ fi # Clang also needs -fwrapv - if test "$CC" = "clang" ; then - WRAP="-fwrapv" - fi + case $CC in + *clang*) WRAP="-fwrapv" + ;; + esac case $ac_cv_prog_cc_g in yes) diff --git a/configure.in b/configure.in --- a/configure.in +++ b/configure.in @@ -907,9 +907,10 @@ fi # Clang also needs -fwrapv - if test "$CC" = "clang" ; then - WRAP="-fwrapv" - fi + case $CC in + *clang*) WRAP="-fwrapv" + ;; + esac case $ac_cv_prog_cc_g in yes) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 22:31:17 2011 From: python-checkins at python.org (stefan.krah) Date: Thu, 08 Dec 2011 22:31:17 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=282=2E7=29=3A_Backport_second?= =?utf8?q?_fix_for_issue_=2311149=2E?= Message-ID: http://hg.python.org/cpython/rev/9d329adbbb01 changeset: 73892:9d329adbbb01 branch: 2.7 parent: 73888:2bca2cee79a1 user: Stefan Krah date: Thu Dec 08 22:26:06 2011 +0100 summary: Backport second fix for issue #11149. files: configure | 7 ++++--- configure.in | 7 ++++--- 2 files changed, 8 insertions(+), 6 deletions(-) diff --git a/configure b/configure --- a/configure +++ b/configure @@ -5413,9 +5413,10 @@ fi # Clang also needs -fwrapv - if test "$CC" = "clang" ; then - WRAP="-fwrapv" - fi + case $CC in + *clang*) WRAP="-fwrapv" + ;; + esac case $ac_cv_prog_cc_g in yes) diff --git a/configure.in b/configure.in --- a/configure.in +++ b/configure.in @@ -934,9 +934,10 @@ fi # Clang also needs -fwrapv - if test "$CC" = "clang" ; then - WRAP="-fwrapv" - fi + case $CC in + *clang*) WRAP="-fwrapv" + ;; + esac case $ac_cv_prog_cc_g in yes) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 22:31:27 2011 From: python-checkins at python.org (stefan.krah) Date: Thu, 08 Dec 2011 22:31:27 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_default_-=3E_default?= =?utf8?b?KTogTWVyZ2Uu?= Message-ID: http://hg.python.org/cpython/rev/090574ed8db1 changeset: 73893:090574ed8db1 parent: 73889:489ea02ed351 parent: 73891:e48df59af394 user: Stefan Krah date: Thu Dec 08 22:30:18 2011 +0100 summary: Merge. files: configure | 7 ++++--- configure.in | 7 ++++--- 2 files changed, 8 insertions(+), 6 deletions(-) diff --git a/configure b/configure --- a/configure +++ b/configure @@ -5450,9 +5450,10 @@ fi # Clang also needs -fwrapv - if test "$CC" = "clang" ; then - WRAP="-fwrapv" - fi + case $CC in + *clang*) WRAP="-fwrapv" + ;; + esac case $ac_cv_prog_cc_g in yes) diff --git a/configure.in b/configure.in --- a/configure.in +++ b/configure.in @@ -907,9 +907,10 @@ fi # Clang also needs -fwrapv - if test "$CC" = "clang" ; then - WRAP="-fwrapv" - fi + case $CC in + *clang*) WRAP="-fwrapv" + ;; + esac case $ac_cv_prog_cc_g in yes) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 23:34:27 2011 From: python-checkins at python.org (stefan.krah) Date: Thu, 08 Dec 2011 23:34:27 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEzNTQ3?= =?utf8?q?=3A_clean_Lib/=5Fsysconfigdata=2Epy_and_Modules/=5Ftestembed?= Message-ID: http://hg.python.org/cpython/rev/8ed2c4d4df62 changeset: 73894:8ed2c4d4df62 branch: 3.2 parent: 73890:7efad6256e58 user: Stefan Krah date: Thu Dec 08 23:25:15 2011 +0100 summary: Issue #13547: clean Lib/_sysconfigdata.py and Modules/_testembed files: Makefile.pre.in | 1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/Makefile.pre.in b/Makefile.pre.in --- a/Makefile.pre.in +++ b/Makefile.pre.in @@ -1252,6 +1252,7 @@ find build -name 'fficonfig.h' -exec rm -f {} ';' || true find build -name 'fficonfig.py' -exec rm -f {} ';' || true -rm -f Lib/lib2to3/*Grammar*.pickle + -rm -f Modules/_testembed profile-removal: find . -name '*.gc??' -exec rm -f {} ';' -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 23:34:28 2011 From: python-checkins at python.org (stefan.krah) Date: Thu, 08 Dec 2011 23:34:28 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_fix_for_issue_=2313547=2E?= Message-ID: http://hg.python.org/cpython/rev/053c95ad09cf changeset: 73895:053c95ad09cf parent: 73893:090574ed8db1 parent: 73894:8ed2c4d4df62 user: Stefan Krah date: Thu Dec 08 23:31:40 2011 +0100 summary: Merge fix for issue #13547. files: Makefile.pre.in | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) diff --git a/Makefile.pre.in b/Makefile.pre.in --- a/Makefile.pre.in +++ b/Makefile.pre.in @@ -1315,6 +1315,8 @@ find build -name 'fficonfig.h' -exec rm -f {} ';' || true find build -name 'fficonfig.py' -exec rm -f {} ';' || true -rm -f Lib/lib2to3/*Grammar*.pickle + -rm -f Lib/_sysconfigdata.py + -rm -f Modules/_testembed profile-removal: find . -name '*.gc??' -exec rm -f {} ';' -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Thu Dec 8 23:41:00 2011 From: python-checkins at python.org (victor.stinner) Date: Thu, 08 Dec 2011 23:41:00 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2313441=3A_Log_the_l?= =?utf8?q?ocale_when_localeconv=28=29_fails?= Message-ID: http://hg.python.org/cpython/rev/93bab8400ca5 changeset: 73896:93bab8400ca5 user: Victor Stinner date: Thu Dec 08 23:42:52 2011 +0100 summary: Issue #13441: Log the locale when localeconv() fails files: Lib/test/test__locale.py | 6 +++++- 1 files changed, 5 insertions(+), 1 deletions(-) diff --git a/Lib/test/test__locale.py b/Lib/test/test__locale.py --- a/Lib/test/test__locale.py +++ b/Lib/test/test__locale.py @@ -86,9 +86,13 @@ setlocale(LC_CTYPE, loc) except Error: continue + try: + formatting = localeconv() + except Exception as err: + self.fail("localeconv() failed with %s locale: %s" % (loc, err)) for lc in ("decimal_point", "thousands_sep"): - self.numeric_tester('localeconv', localeconv()[lc], lc, loc) + self.numeric_tester('localeconv', formatting[lc], lc, loc) @unittest.skipUnless(nl_langinfo, "nl_langinfo is not available") def test_lc_numeric_basic(self): -- Repository URL: http://hg.python.org/cpython From tjreedy at udel.edu Thu Dec 8 22:50:35 2011 From: tjreedy at udel.edu (Terry Reedy) Date: Thu, 08 Dec 2011 16:50:35 -0500 Subject: [Python-checkins] cpython: PyUnicode_FromWideChar() and PyUnicode_FromUnicode() raise a ValueError if a In-Reply-To: References: Message-ID: <4EE1312B.4030904@udel.edu> On 12/8/2011 4:12 PM, victor.stinner wrote: > http://hg.python.org/cpython/rev/489ea02ed351 > changeset: 73889:489ea02ed351 > parent: 73887:c7638be1e430 > user: Victor Stinner > date: Thu Dec 08 22:14:11 2011 +0100 > summary: > PyUnicode_FromWideChar() and PyUnicode_FromUnicode() raise a ValueError if a > character in not in range [U+0000; U+10ffff]. > > files: > Objects/unicodeobject.c | 67 ++++++++++++++-------------- > 1 files changed, 34 insertions(+), 33 deletions(-) > > > diff --git a/Objects/unicodeobject.c b/Objects/unicodeobject.c > +/* Maximum code point of Unicode 6.0: 0x10ffff (1,114,111) */ > +#define MAX_UNICODE 0x10ffff Isn't this the value assigned, on all systems, to sys.maxunicode, in 3.3? If so, it must already be defined somewhere else. From python-checkins at python.org Fri Dec 9 00:08:38 2011 From: python-checkins at python.org (victor.stinner) Date: Fri, 09 Dec 2011 00:08:38 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_What=27s_New_in_Python_3=2E?= =?utf8?q?3=3A_Add_a_Deprecated_section?= Message-ID: http://hg.python.org/cpython/rev/0846fc6bf6a8 changeset: 73897:0846fc6bf6a8 user: Victor Stinner date: Fri Dec 09 00:10:41 2011 +0100 summary: What's New in Python 3.3: Add a Deprecated section files: Doc/whatsnew/3.3.rst | 9 ++++++--- 1 files changed, 6 insertions(+), 3 deletions(-) diff --git a/Doc/whatsnew/3.3.rst b/Doc/whatsnew/3.3.rst --- a/Doc/whatsnew/3.3.rst +++ b/Doc/whatsnew/3.3.rst @@ -711,8 +711,11 @@ +Deprecated +========== + Unsupported Operating Systems -============================= +----------------------------- OS/2 and VMS are no longer supported due to the lack of a maintainer. @@ -721,7 +724,7 @@ Deprecated Python modules, functions and methods -================================================ +------------------------------------------------ * The :mod:`packaging` module replaces the :mod:`distutils` module * The ``unicode_internal`` codec has been deprecated because of the @@ -737,7 +740,7 @@ Deprecated functions and types of the C API -=========================================== +------------------------------------------- The :c:type:`Py_UNICODE` has been deprecated by the :pep:`393` and will be removed in Python 4. All functions using this type are deprecated: -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 00:16:02 2011 From: python-checkins at python.org (victor.stinner) Date: Fri, 09 Dec 2011 00:16:02 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Document_PyUnicode=5FCopy?= =?utf8?q?=28=29_and_PyUnicode=5FEncodeCodePage=28=29?= Message-ID: http://hg.python.org/cpython/rev/e7d94d35506b changeset: 73898:e7d94d35506b user: Victor Stinner date: Fri Dec 09 00:18:11 2011 +0100 summary: Document PyUnicode_Copy() and PyUnicode_EncodeCodePage() files: Doc/c-api/unicode.rst | 18 +++++++++++++++++- 1 files changed, 17 insertions(+), 1 deletions(-) diff --git a/Doc/c-api/unicode.rst b/Doc/c-api/unicode.rst --- a/Doc/c-api/unicode.rst +++ b/Doc/c-api/unicode.rst @@ -386,6 +386,13 @@ .. versionadded:: 3.3 +.. c:function:: PyObject* PyUnicode_Copy(PyObject *unicode) + + Get a new copy of a Unicode object. + + .. versionadded:: 3.3 + + .. c:function:: PyObject* PyUnicode_FromKindAndData(int kind, const void *buffer, \ Py_ssize_t size) @@ -1379,6 +1386,15 @@ raised by the codec. +.. c:function:: PyObject* PyUnicode_EncodeCodePage(int code_page, PyObject *unicode, const char *errors) + + Encode the Unicode object using the specified code page and return a Python + bytes object. Return *NULL* if an exception was raised by the codec. Use + :c:data:`CP_ACP` code page to get the MBCS encoder. + + .. versionadded:: 3.3 + + .. c:function:: PyObject* PyUnicode_EncodeMBCS(const Py_UNICODE *s, Py_ssize_t size, const char *errors) Encode the :c:type:`Py_UNICODE` buffer of the given *size* using MBCS and return @@ -1387,7 +1403,7 @@ .. deprecated-removed:: 3.3 4.0 Part of the old-style :c:type:`Py_UNICODE` API; please migrate to using - :c:func:`PyUnicode_AsMBCSString`. + :c:func:`PyUnicode_AsMBCSString` or :c:func:`PyUnicode_EncodeCodePage`. Methods & Slots -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 00:34:38 2011 From: python-checkins at python.org (nadeem.vawda) Date: Fri, 09 Dec 2011 00:34:38 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_What=27s_New_in_Python_3=2E?= =?utf8?q?3=3A_Add_entry_for_lzma_module_=28issue_=236715=29=2E?= Message-ID: http://hg.python.org/cpython/rev/66df5ace0eee changeset: 73899:66df5ace0eee user: Nadeem Vawda date: Fri Dec 09 01:32:46 2011 +0200 summary: What's New in Python 3.3: Add entry for lzma module (issue #6715). files: Doc/whatsnew/3.3.rst | 10 ++++++++++ 1 files changed, 10 insertions(+), 0 deletions(-) diff --git a/Doc/whatsnew/3.3.rst b/Doc/whatsnew/3.3.rst --- a/Doc/whatsnew/3.3.rst +++ b/Doc/whatsnew/3.3.rst @@ -391,6 +391,16 @@ (Contributed by Sijin Joseph in :issue:`8808`) +lzma +---- + +The newly-added :mod:`lzma` module provides data compression and decompression +using the LZMA algorithm, including support for the ``.xz`` and ``.lzma`` +file formats. + +(Contributed by Nadeem Vawda and Per ?yvind Karlsen in :issue:`6715`) + + math ---- -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 01:18:19 2011 From: python-checkins at python.org (victor.stinner) Date: Fri, 09 Dec 2011 01:18:19 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2313441=3A_Don=27t_t?= =?utf8?q?est_the_hu=5FHU_locale_on_Solaris_to_workaround_a_mbstowcs=28=29?= Message-ID: http://hg.python.org/cpython/rev/87c6be1e393a changeset: 73900:87c6be1e393a user: Victor Stinner date: Fri Dec 09 01:20:03 2011 +0100 summary: Issue #13441: Don't test the hu_HU locale on Solaris to workaround a mbstowcs() bug. On Solaris, if the locale is hu_HU (and if the locale encoding is not UTF-8), the thousauds separator is b'\xA0' which is decoded as U+30000020 instead of U+0020 by mbstowcs(). files: Lib/test/test__locale.py | 9 ++++++++- 1 files changed, 8 insertions(+), 1 deletions(-) diff --git a/Lib/test/test__locale.py b/Lib/test/test__locale.py --- a/Lib/test/test__locale.py +++ b/Lib/test/test__locale.py @@ -17,7 +17,7 @@ candidate_locales = ['es_UY', 'fr_FR', 'fi_FI', 'es_CO', 'pt_PT', 'it_IT', 'et_EE', 'es_PY', 'no_NO', 'nl_NL', 'lv_LV', 'el_GR', 'be_BY', 'fr_BE', 'ro_RO', 'ru_UA', 'ru_RU', 'es_VE', 'ca_ES', 'se_NO', 'es_EC', 'id_ID', - 'ka_GE', 'es_CL', 'hu_HU', 'wa_BE', 'lt_LT', 'sl_SI', 'hr_HR', 'es_AR', + 'ka_GE', 'es_CL', 'wa_BE', 'lt_LT', 'sl_SI', 'hr_HR', 'es_AR', 'es_ES', 'oc_FR', 'gl_ES', 'bg_BG', 'is_IS', 'mk_MK', 'de_AT', 'pt_BR', 'da_DK', 'nn_NO', 'cs_CZ', 'de_LU', 'es_BO', 'sq_AL', 'sk_SK', 'fr_CH', 'de_DE', 'sr_YU', 'br_FR', 'nl_BE', 'sv_FI', 'pl_PL', 'fr_CA', 'fo_FO', @@ -25,6 +25,13 @@ 'eu_ES', 'vi_VN', 'af_ZA', 'nb_NO', 'en_DK', 'tg_TJ', 'en_US', 'es_ES.ISO8859-1', 'fr_FR.ISO8859-15', 'ru_RU.KOI8-R', 'ko_KR.eucKR'] +# Issue #13441: Don't test the hu_HU locale on Solaris to workaround a +# mbstowcs() bug. On Solaris, if the locale is hu_HU (and if the locale +# encoding is not UTF-8), the thousauds separator is b'\xA0' which is decoded +# as U+30000020 instead of U+0020 by mbstowcs(). +if sys.platform != 'sunos5': + candidate_locales.append('hu_HU') + # Workaround for MSVC6(debug) crash bug if "MSC v.1200" in sys.version: def accept(loc): -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 04:25:19 2011 From: python-checkins at python.org (jason.coombs) Date: Fri, 09 Dec 2011 04:25:19 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=282=2E7=29=3A_Implemented_sug?= =?utf8?q?gested_improvements_for_pdb_test_by_=C3=89ric_Araujo?= Message-ID: http://hg.python.org/cpython/rev/70337a6d5dde changeset: 73901:70337a6d5dde branch: 2.7 parent: 73892:9d329adbbb01 user: Jason R. Coombs date: Thu Dec 08 22:14:56 2011 -0500 summary: Implemented suggested improvements for pdb test by ?ric Araujo files: Lib/test/test_pdb.py | 35 ++++++++++++++++--------------- 1 files changed, 18 insertions(+), 17 deletions(-) diff --git a/Lib/test/test_pdb.py b/Lib/test/test_pdb.py --- a/Lib/test/test_pdb.py +++ b/Lib/test/test_pdb.py @@ -280,35 +280,36 @@ 4 """ -class Tester7750(unittest.TestCase): - # if the filename has something that resolves to a python - # escape character (such as \t), it will fail - test_fn = '.\\test7750.py' +class ModuleInitTester(unittest.TestCase): - msg = "issue7750 only applies when os.sep is a backslash" - @unittest.skipUnless(os.path.sep == '\\', msg) - def test_issue7750(self): - with open(self.test_fn, 'w') as f: - f.write('print("hello world")') - cmd = [sys.executable, '-m', 'pdb', self.test_fn,] + def test_filename_correct(self): + """ + In issue 7750, it was found that if the filename has a sequence that + resolves to an escape character in a Python string (such as \t), it + will be treated as the escaped character. + """ + # the test_fn must contain something like \t + # on Windows, this will create 'test_mod.py' in the current directory. + # on Unix, this will create '.\test_mod.py' in the current directory. + test_fn = '.\\test_mod.py' + code = 'print("testing pdb")' + with open(test_fn, 'w') as f: + f.write(code) + self.addCleanup(os.remove, test_fn) + cmd = [sys.executable, '-m', 'pdb', test_fn,] proc = subprocess.Popen(cmd, stdout=subprocess.PIPE, stdin=subprocess.PIPE, stderr=subprocess.STDOUT, ) stdout, stderr = proc.communicate('quit\n') - self.assertNotIn('IOError', stdout, "pdb munged the filename") - - def tearDown(self): - if os.path.isfile(self.test_fn): - os.remove(self.test_fn) + self.assertIn(code, stdout, "pdb munged the filename") def test_main(): from test import test_pdb test_support.run_doctest(test_pdb, verbosity=True) - + test_support.run_unittest(ModuleInitTester) if __name__ == '__main__': test_main() - unittest.main() -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Fri Dec 9 05:36:03 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Fri, 09 Dec 2011 05:36:03 +0100 Subject: [Python-checkins] Daily reference leaks (87c6be1e393a): sum=0 Message-ID: results for 87c6be1e393a on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/reflogf1EUl2', '-x'] From python-checkins at python.org Fri Dec 9 10:28:40 2011 From: python-checkins at python.org (victor.stinner) Date: Fri, 09 Dec 2011 10:28:40 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2313441=3A_Skip_some?= =?utf8?q?_locales_=28e=2Eg=2E_cs=5FCZ_and_hu=5FHU=29_on_Solaris_to_workar?= =?utf8?q?ound?= Message-ID: http://hg.python.org/cpython/rev/2a2d0872d993 changeset: 73902:2a2d0872d993 parent: 73900:87c6be1e393a user: Victor Stinner date: Fri Dec 09 10:28:45 2011 +0100 summary: Issue #13441: Skip some locales (e.g. cs_CZ and hu_HU) on Solaris to workaround a mbstowcs() bug. For example, on Solaris, the hu_HU locale uses the locale encoding ISO-8859-2, the thousauds separator is b'\xA0' and it is decoded as U+30000020 (an invalid character) by mbstowcs(). The workaround is not enabled yet (commented): I would like first to get more information about the failing locales. files: Lib/test/test__locale.py | 43 +++++++++++++++++++-------- 1 files changed, 30 insertions(+), 13 deletions(-) diff --git a/Lib/test/test__locale.py b/Lib/test/test__locale.py --- a/Lib/test/test__locale.py +++ b/Lib/test/test__locale.py @@ -1,13 +1,15 @@ -from test.support import run_unittest from _locale import (setlocale, LC_ALL, LC_CTYPE, LC_NUMERIC, localeconv, Error) try: from _locale import (RADIXCHAR, THOUSEP, nl_langinfo) except ImportError: nl_langinfo = None +import codecs +import locale +import sys import unittest -import sys from platform import uname +from test.support import run_unittest if uname()[0] == "Darwin": maj, min, mic = [int(part) for part in uname()[2].split(".")] @@ -17,7 +19,7 @@ candidate_locales = ['es_UY', 'fr_FR', 'fi_FI', 'es_CO', 'pt_PT', 'it_IT', 'et_EE', 'es_PY', 'no_NO', 'nl_NL', 'lv_LV', 'el_GR', 'be_BY', 'fr_BE', 'ro_RO', 'ru_UA', 'ru_RU', 'es_VE', 'ca_ES', 'se_NO', 'es_EC', 'id_ID', - 'ka_GE', 'es_CL', 'wa_BE', 'lt_LT', 'sl_SI', 'hr_HR', 'es_AR', + 'ka_GE', 'es_CL', 'wa_BE', 'hu_HU', 'lt_LT', 'sl_SI', 'hr_HR', 'es_AR', 'es_ES', 'oc_FR', 'gl_ES', 'bg_BG', 'is_IS', 'mk_MK', 'de_AT', 'pt_BR', 'da_DK', 'nn_NO', 'cs_CZ', 'de_LU', 'es_BO', 'sq_AL', 'sk_SK', 'fr_CH', 'de_DE', 'sr_YU', 'br_FR', 'nl_BE', 'sv_FI', 'pl_PL', 'fr_CA', 'fo_FO', @@ -25,12 +27,30 @@ 'eu_ES', 'vi_VN', 'af_ZA', 'nb_NO', 'en_DK', 'tg_TJ', 'en_US', 'es_ES.ISO8859-1', 'fr_FR.ISO8859-15', 'ru_RU.KOI8-R', 'ko_KR.eucKR'] -# Issue #13441: Don't test the hu_HU locale on Solaris to workaround a -# mbstowcs() bug. On Solaris, if the locale is hu_HU (and if the locale -# encoding is not UTF-8), the thousauds separator is b'\xA0' which is decoded -# as U+30000020 instead of U+0020 by mbstowcs(). -if sys.platform != 'sunos5': - candidate_locales.append('hu_HU') +# Issue #13441: Skip some locales (e.g. cs_CZ and hu_HU) on Solaris to +# workaround a mbstowcs() bug. For example, on Solaris, the hu_HU locale uses +# the locale encoding ISO-8859-2, the thousauds separator is b'\xA0' and it is +# decoded as U+30000020 (an invalid character) by mbstowcs(). +if sys.platform == 'sunos5': + old_locale = locale.setlocale(locale.LC_ALL) + try: + locales = [] + for loc in candidate_locales: + try: + locale.setlocale(locale.LC_ALL, loc) + except Error: + continue + encoding = locale.getpreferredencoding(False) + try: + localeconv() + except Exception as err: + print("WARNING: Skip locale %s (encoding %s): [%s] %s" + % (loc, encoding, type(err), err)) + else: + locales.append(loc) + #candidate_locales = locales + finally: + locale.setlocale(locale.LC_ALL, old_locale) # Workaround for MSVC6(debug) crash bug if "MSC v.1200" in sys.version: @@ -93,10 +113,7 @@ setlocale(LC_CTYPE, loc) except Error: continue - try: - formatting = localeconv() - except Exception as err: - self.fail("localeconv() failed with %s locale: %s" % (loc, err)) + formatting = localeconv() for lc in ("decimal_point", "thousands_sep"): self.numeric_tester('localeconv', formatting[lc], lc, loc) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 11:29:13 2011 From: python-checkins at python.org (victor.stinner) Date: Fri, 09 Dec 2011 11:29:13 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2313441=3A_Enable_th?= =?utf8?q?e_workaround_for_Solaris_locale_bug?= Message-ID: http://hg.python.org/cpython/rev/7ffe3d304487 changeset: 73903:7ffe3d304487 user: Victor Stinner date: Fri Dec 09 11:29:44 2011 +0100 summary: Issue #13441: Enable the workaround for Solaris locale bug Skip locales triggering the mbstowcs() bug. I collected the locale list thanks my previous commit: * hu_HU (ISO8859-2): character U+30000020 * de_AT (ISO8859-1): character U+30000076 * cs_CZ (ISO8859-2): character U+30000020 * sk_SK (ISO8859-2): character U+30000020 * pl_PL (ISO8859-2): character U+30000020 * fr_CA (ISO8859-1): character U+30000020 files: Lib/test/test__locale.py | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/Lib/test/test__locale.py b/Lib/test/test__locale.py --- a/Lib/test/test__locale.py +++ b/Lib/test/test__locale.py @@ -48,7 +48,7 @@ % (loc, encoding, type(err), err)) else: locales.append(loc) - #candidate_locales = locales + candidate_locales = locales finally: locale.setlocale(locale.LC_ALL, old_locale) -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 20:19:11 2011 From: python-checkins at python.org (victor.stinner) Date: Fri, 09 Dec 2011 20:19:11 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzU5MDU6?= =?utf8?q?_time=2Estrftime=28=29_is_now_using_the_locale_encoding=2C_inste?= =?utf8?q?ad_of?= Message-ID: http://hg.python.org/cpython/rev/8620e6901e58 changeset: 73904:8620e6901e58 branch: 3.2 parent: 73894:8ed2c4d4df62 user: Victor Stinner date: Fri Dec 09 20:19:24 2011 +0100 summary: Issue #5905: time.strftime() is now using the locale encoding, instead of UTF-8, if the wcsftime() function is not available. files: Misc/NEWS | 3 +++ Modules/timemodule.c | 13 ++++--------- 2 files changed, 7 insertions(+), 9 deletions(-) diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -90,6 +90,9 @@ Library ------- +- Issue #5905: time.strftime() is now using the locale encoding, instead of + UTF-8, if the wcsftime() function is not available. + - Issue #8641: Update IDLE 3 syntax coloring to recognize b".." and not u"..". Patch by Tal Einat. diff --git a/Modules/timemodule.c b/Modules/timemodule.c --- a/Modules/timemodule.c +++ b/Modules/timemodule.c @@ -3,8 +3,6 @@ #include "Python.h" #include "_time.h" -#define TZNAME_ENCODING "utf-8" - #include #ifdef HAVE_SYS_TYPES_H @@ -48,8 +46,6 @@ #if defined(MS_WINDOWS) && !defined(__BORLANDC__) /* Win32 has better clock replacement; we have our own version below. */ #undef HAVE_CLOCK -#undef TZNAME_ENCODING -#define TZNAME_ENCODING "mbcs" #endif /* MS_WINDOWS && !defined(__BORLANDC__) */ #if defined(PYOS_OS2) @@ -502,7 +498,7 @@ fmt = format; #else /* Convert the unicode string to an ascii one */ - format = PyUnicode_AsEncodedString(format_arg, TZNAME_ENCODING, NULL); + format = PyUnicode_EncodeFSDefault(format_arg); if (format == NULL) return NULL; fmt = PyBytes_AS_STRING(format); @@ -546,8 +542,7 @@ #ifdef HAVE_WCSFTIME ret = PyUnicode_FromWideChar(outbuf, buflen); #else - ret = PyUnicode_Decode(outbuf, buflen, - TZNAME_ENCODING, NULL); + ret = PyUnicode_DecodeFSDefaultAndSize(outbuf, buflen); #endif PyMem_Free(outbuf); break; @@ -789,8 +784,8 @@ #endif /* PYOS_OS2 */ #endif PyModule_AddIntConstant(m, "daylight", daylight); - otz0 = PyUnicode_Decode(tzname[0], strlen(tzname[0]), TZNAME_ENCODING, NULL); - otz1 = PyUnicode_Decode(tzname[1], strlen(tzname[1]), TZNAME_ENCODING, NULL); + otz0 = PyUnicode_DecodeFSDefaultAndSize(tzname[0], strlen(tzname[0])); + otz1 = PyUnicode_DecodeFSDefaultAndSize(tzname[1], strlen(tzname[1])); PyModule_AddObject(m, "tzname", Py_BuildValue("(NN)", otz0, otz1)); #else /* !HAVE_TZNAME || __GLIBC__ || __CYGWIN__*/ #ifdef HAVE_STRUCT_TM_TM_ZONE -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 20:19:12 2011 From: python-checkins at python.org (victor.stinner) Date: Fri, 09 Dec 2011 20:19:12 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?b?OiAoTWVyZ2UgMy4yKSBJc3N1ZSAjNTkwNTogdGltZS5zdHJmdGltZSgpIGlzIG5v?= =?utf8?q?w_using_the_locale_encoding=2C?= Message-ID: http://hg.python.org/cpython/rev/bee7694988a4 changeset: 73905:bee7694988a4 parent: 73903:7ffe3d304487 parent: 73904:8620e6901e58 user: Victor Stinner date: Fri Dec 09 20:21:17 2011 +0100 summary: (Merge 3.2) Issue #5905: time.strftime() is now using the locale encoding, instead of UTF-8, if the wcsftime() function is not available. files: Misc/NEWS | 3 +++ Modules/timemodule.c | 15 ++++----------- 2 files changed, 7 insertions(+), 11 deletions(-) diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -406,6 +406,9 @@ Library ------- +- Issue #5905: time.strftime() is now using the locale encoding, instead of + UTF-8, if the wcsftime() function is not available. + - Issue #8641: Update IDLE 3 syntax coloring to recognize b".." and not u"..". Patch by Tal Einat. diff --git a/Modules/timemodule.c b/Modules/timemodule.c --- a/Modules/timemodule.c +++ b/Modules/timemodule.c @@ -30,12 +30,6 @@ #endif /* MS_WINDOWS */ #endif /* !__WATCOMC__ || __QNX__ */ -#if defined(HAVE_MBCS) -# define TZNAME_ENCODING "mbcs" -#else -# define TZNAME_ENCODING "utf-8" -#endif - #if defined(PYOS_OS2) #define INCL_DOS #define INCL_ERRORS @@ -492,7 +486,7 @@ fmt = format; #else /* Convert the unicode string to an ascii one */ - format = PyUnicode_AsEncodedString(format_arg, TZNAME_ENCODING, NULL); + format = PyUnicode_EncodeFSDefault(format_arg); if (format == NULL) return NULL; fmt = PyBytes_AS_STRING(format); @@ -536,8 +530,7 @@ #ifdef HAVE_WCSFTIME ret = PyUnicode_FromWideChar(outbuf, buflen); #else - ret = PyUnicode_Decode(outbuf, buflen, - TZNAME_ENCODING, NULL); + ret = PyUnicode_DecodeFSDefaultAndSize(outbuf, buflen); #endif PyMem_Free(outbuf); break; @@ -769,8 +762,8 @@ #endif /* PYOS_OS2 */ #endif PyModule_AddIntConstant(m, "daylight", daylight); - otz0 = PyUnicode_Decode(tzname[0], strlen(tzname[0]), TZNAME_ENCODING, NULL); - otz1 = PyUnicode_Decode(tzname[1], strlen(tzname[1]), TZNAME_ENCODING, NULL); + otz0 = PyUnicode_DecodeFSDefaultAndSize(tzname[0], strlen(tzname[0])); + otz1 = PyUnicode_DecodeFSDefaultAndSize(tzname[1], strlen(tzname[1])); PyModule_AddObject(m, "tzname", Py_BuildValue("(NN)", otz0, otz1)); #else /* !HAVE_TZNAME || __GLIBC__ || __CYGWIN__*/ #ifdef HAVE_STRUCT_TM_TM_ZONE -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 20:47:39 2011 From: python-checkins at python.org (victor.stinner) Date: Fri, 09 Dec 2011 20:47:39 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Write_tests_for_invalid_cha?= =?utf8?q?racters_=28U+00110000=29?= Message-ID: http://hg.python.org/cpython/rev/bfa9d1ba36ae changeset: 73906:bfa9d1ba36ae user: Victor Stinner date: Fri Dec 09 20:49:49 2011 +0100 summary: Write tests for invalid characters (U+00110000) Test the following functions: * codecs.raw_unicode_escape_decode() * PyUnicode_FromWideChar() * PyUnicode_FromUnicode() * "unicode_internal" and "unicode_escape" decoders files: Lib/test/test_codecs.py | 16 ++++++++++++++++ Modules/_testcapimodule.c | 18 ++++++++++++++++++ 2 files changed, 34 insertions(+), 0 deletions(-) diff --git a/Lib/test/test_codecs.py b/Lib/test/test_codecs.py --- a/Lib/test/test_codecs.py +++ b/Lib/test/test_codecs.py @@ -1034,6 +1034,16 @@ 'deprecated', DeprecationWarning)): self.assertRaises(UnicodeDecodeError, internal.decode, "unicode_internal") + if sys.byteorder == "little": + invalid = b"\x00\x00\x11\x00" + else: + invalid = b"\x00\x11\x00\x00" + with support.check_warnings(): + self.assertRaises(UnicodeDecodeError, + invalid.decode, "unicode_internal") + with support.check_warnings(): + self.assertEqual(invalid.decode("unicode_internal", "replace"), + '\ufffd') @unittest.skipUnless(SIZEOF_WCHAR_T == 4, 'specific to 32-bit wchar_t') def test_decode_error_attributes(self): @@ -1729,6 +1739,12 @@ self.assertEqual(codecs.raw_unicode_escape_decode(r"\u1234"), ("\u1234", 6)) self.assertEqual(codecs.raw_unicode_escape_decode(br"\u1234"), ("\u1234", 6)) + self.assertRaises(UnicodeDecodeError, codecs.unicode_escape_decode, br"\U00110000") + self.assertEqual(codecs.unicode_escape_decode(r"\U00110000", "replace"), ("\ufffd", 10)) + + self.assertRaises(UnicodeDecodeError, codecs.raw_unicode_escape_decode, br"\U00110000") + self.assertEqual(codecs.raw_unicode_escape_decode(r"\U00110000", "replace"), ("\ufffd", 10)) + class SurrogateEscapeTest(unittest.TestCase): def test_utf8(self): diff --git a/Modules/_testcapimodule.c b/Modules/_testcapimodule.c --- a/Modules/_testcapimodule.c +++ b/Modules/_testcapimodule.c @@ -1409,6 +1409,7 @@ #if defined(SIZEOF_WCHAR_T) && (SIZEOF_WCHAR_T == 4) const wchar_t wtext[2] = {(wchar_t)0x10ABCDu}; size_t wtextlen = 1; + const wchar_t invalid[1] = {(wchar_t)0x110000u}; #else const wchar_t wtext[3] = {(wchar_t)0xDBEAu, (wchar_t)0xDFCDu}; size_t wtextlen = 2; @@ -1444,6 +1445,23 @@ Py_DECREF(wide); Py_DECREF(utf8); + +#if defined(SIZEOF_WCHAR_T) && (SIZEOF_WCHAR_T == 4) + wide = PyUnicode_FromWideChar(invalid, 1); + if (wide == NULL) + PyErr_Clear(); + else + return raiseTestError("test_widechar", + "PyUnicode_FromWideChar(L\"\\U00110000\", 1) didn't fail"); + + wide = PyUnicode_FromUnicode(invalid, 1); + if (wide == NULL) + PyErr_Clear(); + else + return raiseTestError("test_widechar", + "PyUnicode_FromUnicode(L\"\\U00110000\", 1) didn't fail"); +#endif + Py_RETURN_NONE; } -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 22:37:18 2011 From: python-checkins at python.org (florent.xicluna) Date: Fri, 09 Dec 2011 22:37:18 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Closes_=232979=3A_add_param?= =?utf8?q?eter_=27use=5Fbuiltin=5Ftypes=27_to_the_SimpleXMLRPCServer=2E?= Message-ID: http://hg.python.org/cpython/rev/b3c1a504ebc1 changeset: 73907:b3c1a504ebc1 user: Florent Xicluna date: Fri Dec 09 22:35:06 2011 +0100 summary: Closes #2979: add parameter 'use_builtin_types' to the SimpleXMLRPCServer. files: Doc/library/xmlrpc.server.rst | 28 ++++++++++++++++-- Lib/test/test_xmlrpc.py | 34 +++++++++++++++++++++++ Lib/xmlrpc/server.py | 25 ++++++++++------ 3 files changed, 73 insertions(+), 14 deletions(-) diff --git a/Doc/library/xmlrpc.server.rst b/Doc/library/xmlrpc.server.rst --- a/Doc/library/xmlrpc.server.rst +++ b/Doc/library/xmlrpc.server.rst @@ -16,7 +16,9 @@ :class:`CGIXMLRPCRequestHandler`. -.. class:: SimpleXMLRPCServer(addr, requestHandler=SimpleXMLRPCRequestHandler, logRequests=True, allow_none=False, encoding=None, bind_and_activate=True) +.. class:: SimpleXMLRPCServer(addr, requestHandler=SimpleXMLRPCRequestHandler,\ + logRequests=True, allow_none=False, encoding=None,\ + bind_and_activate=True, use_builtin_types=False) Create a new server instance. This class provides methods for registration of functions that can be called by the XML-RPC protocol. The *requestHandler* @@ -25,18 +27,31 @@ are passed to the :class:`socketserver.TCPServer` constructor. If *logRequests* is true (the default), requests will be logged; setting this parameter to false will turn off logging. The *allow_none* and *encoding* parameters are passed - on to :mod:`xmlrpc.client` and control the XML-RPC responses that will be returned + on to :mod:`xmlrpc.client` and control the XML-RPC responses that will be returned from the server. The *bind_and_activate* parameter controls whether :meth:`server_bind` and :meth:`server_activate` are called immediately by the constructor; it defaults to true. Setting it to false allows code to manipulate the *allow_reuse_address* class variable before the address is bound. + The *use_builtin_types* parameter is passed to the + :func:`~xmlrpc.client.loads` function and controls which types are processed + when date/times values or binary data are received; it defaults to false. + .. versionchanged:: 3.3 + The *use_builtin_types* flag was added. -.. class:: CGIXMLRPCRequestHandler(allow_none=False, encoding=None) + +.. class:: CGIXMLRPCRequestHandler(allow_none=False, encoding=None,\ + use_builtin_types=False) Create a new instance to handle XML-RPC requests in a CGI environment. The *allow_none* and *encoding* parameters are passed on to :mod:`xmlrpc.client` and control the XML-RPC responses that will be returned from the server. + The *use_builtin_types* parameter is passed to the + :func:`~xmlrpc.client.loads` function and controls which types are processed + when date/times values or binary data are received; it defaults to false. + + .. versionchanged:: 3.3 + The *use_builtin_types* flag was added. .. class:: SimpleXMLRPCRequestHandler() @@ -233,12 +248,17 @@ :class:`DocCGIXMLRPCRequestHandler`. -.. class:: DocXMLRPCServer(addr, requestHandler=DocXMLRPCRequestHandler, logRequests=True, allow_none=False, encoding=None, bind_and_activate=True) +.. class:: DocXMLRPCServer(addr, requestHandler=DocXMLRPCRequestHandler,\ + logRequests=True, allow_none=False, encoding=None,\ + bind_and_activate=True, use_builtin_types=True) Create a new server instance. All parameters have the same meaning as for :class:`SimpleXMLRPCServer`; *requestHandler* defaults to :class:`DocXMLRPCRequestHandler`. + .. versionchanged:: 3.3 + The *use_builtin_types* flag was added. + .. class:: DocCGIXMLRPCRequestHandler() diff --git a/Lib/test/test_xmlrpc.py b/Lib/test/test_xmlrpc.py --- a/Lib/test/test_xmlrpc.py +++ b/Lib/test/test_xmlrpc.py @@ -1023,10 +1023,44 @@ len(content)) +class UseBuiltinTypesTestCase(unittest.TestCase): + + def test_use_builtin_types(self): + # SimpleXMLRPCDispatcher.__init__ accepts use_builtin_types, which + # makes all dispatch of binary data as bytes instances, and all + # dispatch of datetime argument as datetime.datetime instances. + self.log = [] + expected_bytes = b"my dog has fleas" + expected_date = datetime.datetime(2008, 5, 26, 18, 25, 12) + marshaled = xmlrpclib.dumps((expected_bytes, expected_date), 'foobar') + def foobar(*args): + self.log.extend(args) + handler = xmlrpc.server.SimpleXMLRPCDispatcher( + allow_none=True, encoding=None, use_builtin_types=True) + handler.register_function(foobar) + handler._marshaled_dispatch(marshaled) + self.assertEqual(len(self.log), 2) + mybytes, mydate = self.log + self.assertEqual(self.log, [expected_bytes, expected_date]) + self.assertIs(type(mydate), datetime.datetime) + self.assertIs(type(mybytes), bytes) + + def test_cgihandler_has_use_builtin_types_flag(self): + handler = xmlrpc.server.CGIXMLRPCRequestHandler(use_builtin_types=True) + self.assertTrue(handler.use_builtin_types) + + def test_xmlrpcserver_has_use_builtin_types_flag(self): + server = xmlrpc.server.SimpleXMLRPCServer(("localhost", 0), + use_builtin_types=True) + server.server_close() + self.assertTrue(server.use_builtin_types) + + @support.reap_threads def test_main(): xmlrpc_tests = [XMLRPCTestCase, HelperTestCase, DateTimeTestCase, BinaryTestCase, FaultTestCase] + xmlrpc_tests.append(UseBuiltinTypesTestCase) xmlrpc_tests.append(SimpleServerTestCase) xmlrpc_tests.append(KeepaliveServerTestCase1) xmlrpc_tests.append(KeepaliveServerTestCase2) diff --git a/Lib/xmlrpc/server.py b/Lib/xmlrpc/server.py --- a/Lib/xmlrpc/server.py +++ b/Lib/xmlrpc/server.py @@ -160,11 +160,13 @@ can be instanced when used by the MultiPathXMLRPCServer """ - def __init__(self, allow_none=False, encoding=None): + def __init__(self, allow_none=False, encoding=None, + use_builtin_types=False): self.funcs = {} self.instance = None self.allow_none = allow_none self.encoding = encoding or 'utf-8' + self.use_builtin_types = use_builtin_types def register_instance(self, instance, allow_dotted_names=False): """Registers an instance to respond to XML-RPC requests. @@ -245,7 +247,7 @@ """ try: - params, method = loads(data) + params, method = loads(data, use_builtin_types=self.use_builtin_types) # generate response if dispatch_method is not None: @@ -572,10 +574,11 @@ _send_traceback_header = False def __init__(self, addr, requestHandler=SimpleXMLRPCRequestHandler, - logRequests=True, allow_none=False, encoding=None, bind_and_activate=True): + logRequests=True, allow_none=False, encoding=None, + bind_and_activate=True, use_builtin_types=False): self.logRequests = logRequests - SimpleXMLRPCDispatcher.__init__(self, allow_none, encoding) + SimpleXMLRPCDispatcher.__init__(self, allow_none, encoding, use_builtin_types) socketserver.TCPServer.__init__(self, addr, requestHandler, bind_and_activate) # [Bug #1222790] If possible, set close-on-exec flag; if a @@ -595,10 +598,11 @@ Make sure that the requestHandler accepts the paths in question. """ def __init__(self, addr, requestHandler=SimpleXMLRPCRequestHandler, - logRequests=True, allow_none=False, encoding=None, bind_and_activate=True): + logRequests=True, allow_none=False, encoding=None, + bind_and_activate=True, use_builtin_types=False): SimpleXMLRPCServer.__init__(self, addr, requestHandler, logRequests, allow_none, - encoding, bind_and_activate) + encoding, bind_and_activate, use_builtin_types) self.dispatchers = {} self.allow_none = allow_none self.encoding = encoding or 'utf-8' @@ -628,8 +632,8 @@ class CGIXMLRPCRequestHandler(SimpleXMLRPCDispatcher): """Simple handler for XML-RPC data passed through CGI.""" - def __init__(self, allow_none=False, encoding=None): - SimpleXMLRPCDispatcher.__init__(self, allow_none, encoding) + def __init__(self, allow_none=False, encoding=None, use_builtin_types=False): + SimpleXMLRPCDispatcher.__init__(self, allow_none, encoding, use_builtin_types) def handle_xmlrpc(self, request_text): """Handle a single XML-RPC request""" @@ -924,9 +928,10 @@ def __init__(self, addr, requestHandler=DocXMLRPCRequestHandler, logRequests=True, allow_none=False, encoding=None, - bind_and_activate=True): + bind_and_activate=True, use_builtin_types=False): SimpleXMLRPCServer.__init__(self, addr, requestHandler, logRequests, - allow_none, encoding, bind_and_activate) + allow_none, encoding, bind_and_activate, + use_builtin_types) XMLRPCDocGenerator.__init__(self) class DocCGIXMLRPCRequestHandler( CGIXMLRPCRequestHandler, -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 23:16:42 2011 From: python-checkins at python.org (antoine.pitrou) Date: Fri, 09 Dec 2011 23:16:42 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEzNTI4?= =?utf8?q?=3A_rework_the_performance_question_in_the_programming_FAQ?= Message-ID: http://hg.python.org/cpython/rev/eb30f2becb79 changeset: 73908:eb30f2becb79 branch: 3.2 parent: 73904:8620e6901e58 user: Antoine Pitrou date: Fri Dec 09 23:10:31 2011 +0100 summary: Issue #13528: rework the performance question in the programming FAQ files: Doc/faq/programming.rst | 214 ++++++++------------------- 1 files changed, 62 insertions(+), 152 deletions(-) diff --git a/Doc/faq/programming.rst b/Doc/faq/programming.rst --- a/Doc/faq/programming.rst +++ b/Doc/faq/programming.rst @@ -115,159 +115,6 @@ :pep:`8`. -My program is too slow. How do I speed it up? ---------------------------------------------- - -That's a tough one, in general. There are many tricks to speed up Python code; -consider rewriting parts in C as a last resort. - -`Cython `_ and `Pyrex `_ -can compile a slightly modified version of Python code into a C extension, and -can be used on many different platforms. Depending on your code, Cython -may be able to make it significantly faster than when run by the Python -interpreter. - -The rest of this answer will discuss various tricks for squeezing a bit more -speed out of Python code. *Never* apply any optimization tricks unless you know -you need them, after profiling has indicated that a particular function is the -heavily executed hot spot in the code. Optimizations almost always make the -code less clear, and you shouldn't pay the costs of reduced clarity (increased -development time, greater likelihood of bugs) unless the resulting performance -benefit is worth it. - -There is a page on the wiki devoted to `performance tips -`_. - -Guido van Rossum has written up an anecdote related to optimization at -http://www.python.org/doc/essays/list2str.html. - -One thing to notice is that function and (especially) method calls are rather -expensive; if you have designed a purely OO interface with lots of tiny -functions that don't do much more than get or set an instance variable or call -another method, you might consider using a more direct way such as directly -accessing instance variables. Also see the standard module :mod:`profile` which -makes it possible to find out where your program is spending most of its time -(if you have some patience -- the profiling itself can slow your program down by -an order of magnitude). - -Remember that many standard optimization heuristics you may know from other -programming experience may well apply to Python. For example it may be faster -to send output to output devices using larger writes rather than smaller ones in -order to reduce the overhead of kernel system calls. Thus CGI scripts that -write all output in "one shot" may be faster than those that write lots of small -pieces of output. - -Also, be sure to use Python's core features where appropriate. For example, -slicing allows programs to chop up lists and other sequence objects in a single -tick of the interpreter's mainloop using highly optimized C implementations. -Thus to get the same effect as:: - - L2 = [] - for i in range(3): - L2.append(L1[i]) - -it is much shorter and far faster to use :: - - L2 = list(L1[:3]) # "list" is redundant if L1 is a list. - -Note that the functionally-oriented built-in functions such as :func:`map`, -:func:`zip`, and friends can be a convenient accelerator for loops that -perform a single task. For example to pair the elements of two lists -together:: - - >>> list(zip([1, 2, 3], [4, 5, 6])) - [(1, 4), (2, 5), (3, 6)] - -or to compute a number of sines:: - - >>> list(map(math.sin, (1, 2, 3, 4))) - [0.841470984808, 0.909297426826, 0.14112000806, -0.756802495308] - -The operation completes very quickly in such cases. - -Other examples include the ``join()`` and ``split()`` :ref:`methods -of string objects `. - -For example if s1..s7 are large (10K+) strings then -``"".join([s1,s2,s3,s4,s5,s6,s7])`` may be far faster than the more obvious -``s1+s2+s3+s4+s5+s6+s7``, since the "summation" will compute many -subexpressions, whereas ``join()`` does all the copying in one pass. For -manipulating strings, use the ``replace()`` and the ``format()`` :ref:`methods -on string objects `. Use regular expressions only when you're -not dealing with constant string patterns. - -Be sure to use the :meth:`list.sort` built-in method to do sorting, and see the -`sorting mini-HOWTO `_ for examples -of moderately advanced usage. :meth:`list.sort` beats other techniques for -sorting in all but the most extreme circumstances. - -Another common trick is to "push loops into functions or methods." For example -suppose you have a program that runs slowly and you use the profiler to -determine that a Python function ``ff()`` is being called lots of times. If you -notice that ``ff()``:: - - def ff(x): - ... # do something with x computing result... - return result - -tends to be called in loops like:: - - list = map(ff, oldlist) - -or:: - - for x in sequence: - value = ff(x) - ... # do something with value... - -then you can often eliminate function call overhead by rewriting ``ff()`` to:: - - def ffseq(seq): - resultseq = [] - for x in seq: - ... # do something with x computing result... - resultseq.append(result) - return resultseq - -and rewrite the two examples to ``list = ffseq(oldlist)`` and to:: - - for value in ffseq(sequence): - ... # do something with value... - -Single calls to ``ff(x)`` translate to ``ffseq([x])[0]`` with little penalty. -Of course this technique is not always appropriate and there are other variants -which you can figure out. - -You can gain some performance by explicitly storing the results of a function or -method lookup into a local variable. A loop like:: - - for key in token: - dict[key] = dict.get(key, 0) + 1 - -resolves ``dict.get`` every iteration. If the method isn't going to change, a -slightly faster implementation is:: - - dict_get = dict.get # look up the method once - for key in token: - dict[key] = dict_get(key, 0) + 1 - -Default arguments can be used to determine values once, at compile time instead -of at run time. This can only be done for functions or objects which will not -be changed during program execution, such as replacing :: - - def degree_sin(deg): - return math.sin(deg * math.pi / 180.0) - -with :: - - def degree_sin(deg, factor=math.pi/180.0, sin=math.sin): - return sin(deg * factor) - -Because this trick uses default arguments for terms which should not be changed, -it should only be used when you are not concerned with presenting a possibly -confusing API to your users. - - Core Language ============= @@ -938,6 +785,68 @@ See the :ref:`unicode-howto`. +Performance +=========== + +My program is too slow. How do I speed it up? +--------------------------------------------- + +That's a tough one, in general. First, here are a list of things to +remember before diving further: + +* Performance characteristics vary accross Python implementations. This FAQ + focusses on :term:`CPython`. +* Behaviour can vary accross operating systems, especially when talking about + I/O or multi-threading. +* You should always find the hot spots in your program *before* attempting to + optimize any code (see the :mod:`profile` module). +* Writing benchmark scripts will allow you to iterate quickly when searching + for improvements (see the :mod:`timeit` module). +* It is highly recommended to have good code coverage (through unit testing + or any other technique) before potentially introducing regressions hidden + in sophisticated optimizations. + +That being said, there are many tricks to speed up Python code. Here are +some general principles which go a long way towards reaching acceptable +performance levels: + +* Making your algorithms faster (or changing to faster ones) can yield + much larger benefits than trying to sprinkle micro-optimization tricks + all over your code. + +* Use the right data structures. Study documentation for the :ref:`bltin-types` + and the :mod:`collections` module. + +* When the standard library provides a primitive for doing something, it is + likely (although not guaranteed) to be faster than any alternative you + may come up with. This is doubly true for primitives written in C, such + as builtins and some extension types. For example, be sure to use + either the :meth:`list.sort` built-in method or the related :func:`sorted` + function to do sorting (and see the + `sorting mini-HOWTO `_ for examples + of moderately advanced usage). + +* Abstractions tend to create indirections and force the interpreter to work + more. If the levels of indirection outweigh the amount of useful work + done, your program will be slower. You should avoid excessive abstraction, + especially under the form of tiny functions or methods (which are also often + detrimental to readability). + +If you have reached the limit of what pure Python can allow, there are tools +to take you further away. For example, `Cython `_ can +compile a slightly modified version of Python code into a C extension, and +can be used on many different platforms. Cython can take advantage of +compilation (and optional type annotations) to make your code significantly +faster than when interpreted. If you are confident in your C programming +skills, you can also :ref:`write a C extension module ` +yourself. + +.. seealso:: + The wiki page devoted to `performance tips + `_. + +.. _efficient_string_concatenation: + What is the most efficient way to concatenate many strings together? -------------------------------------------------------------------- -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 23:16:43 2011 From: python-checkins at python.org (antoine.pitrou) Date: Fri, 09 Dec 2011 23:16:43 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Issue_=2313528=3A_rework_the_performance_question_in_the_pro?= =?utf8?q?gramming_FAQ?= Message-ID: http://hg.python.org/cpython/rev/9fe28f52eaaa changeset: 73909:9fe28f52eaaa parent: 73907:b3c1a504ebc1 parent: 73908:eb30f2becb79 user: Antoine Pitrou date: Fri Dec 09 23:11:16 2011 +0100 summary: Issue #13528: rework the performance question in the programming FAQ files: Doc/faq/programming.rst | 214 ++++++++------------------- 1 files changed, 62 insertions(+), 152 deletions(-) diff --git a/Doc/faq/programming.rst b/Doc/faq/programming.rst --- a/Doc/faq/programming.rst +++ b/Doc/faq/programming.rst @@ -115,159 +115,6 @@ :pep:`8`. -My program is too slow. How do I speed it up? ---------------------------------------------- - -That's a tough one, in general. There are many tricks to speed up Python code; -consider rewriting parts in C as a last resort. - -`Cython `_ and `Pyrex `_ -can compile a slightly modified version of Python code into a C extension, and -can be used on many different platforms. Depending on your code, Cython -may be able to make it significantly faster than when run by the Python -interpreter. - -The rest of this answer will discuss various tricks for squeezing a bit more -speed out of Python code. *Never* apply any optimization tricks unless you know -you need them, after profiling has indicated that a particular function is the -heavily executed hot spot in the code. Optimizations almost always make the -code less clear, and you shouldn't pay the costs of reduced clarity (increased -development time, greater likelihood of bugs) unless the resulting performance -benefit is worth it. - -There is a page on the wiki devoted to `performance tips -`_. - -Guido van Rossum has written up an anecdote related to optimization at -http://www.python.org/doc/essays/list2str.html. - -One thing to notice is that function and (especially) method calls are rather -expensive; if you have designed a purely OO interface with lots of tiny -functions that don't do much more than get or set an instance variable or call -another method, you might consider using a more direct way such as directly -accessing instance variables. Also see the standard module :mod:`profile` which -makes it possible to find out where your program is spending most of its time -(if you have some patience -- the profiling itself can slow your program down by -an order of magnitude). - -Remember that many standard optimization heuristics you may know from other -programming experience may well apply to Python. For example it may be faster -to send output to output devices using larger writes rather than smaller ones in -order to reduce the overhead of kernel system calls. Thus CGI scripts that -write all output in "one shot" may be faster than those that write lots of small -pieces of output. - -Also, be sure to use Python's core features where appropriate. For example, -slicing allows programs to chop up lists and other sequence objects in a single -tick of the interpreter's mainloop using highly optimized C implementations. -Thus to get the same effect as:: - - L2 = [] - for i in range(3): - L2.append(L1[i]) - -it is much shorter and far faster to use :: - - L2 = list(L1[:3]) # "list" is redundant if L1 is a list. - -Note that the functionally-oriented built-in functions such as :func:`map`, -:func:`zip`, and friends can be a convenient accelerator for loops that -perform a single task. For example to pair the elements of two lists -together:: - - >>> list(zip([1, 2, 3], [4, 5, 6])) - [(1, 4), (2, 5), (3, 6)] - -or to compute a number of sines:: - - >>> list(map(math.sin, (1, 2, 3, 4))) - [0.841470984808, 0.909297426826, 0.14112000806, -0.756802495308] - -The operation completes very quickly in such cases. - -Other examples include the ``join()`` and ``split()`` :ref:`methods -of string objects `. - -For example if s1..s7 are large (10K+) strings then -``"".join([s1,s2,s3,s4,s5,s6,s7])`` may be far faster than the more obvious -``s1+s2+s3+s4+s5+s6+s7``, since the "summation" will compute many -subexpressions, whereas ``join()`` does all the copying in one pass. For -manipulating strings, use the ``replace()`` and the ``format()`` :ref:`methods -on string objects `. Use regular expressions only when you're -not dealing with constant string patterns. - -Be sure to use the :meth:`list.sort` built-in method to do sorting, and see the -`sorting mini-HOWTO `_ for examples -of moderately advanced usage. :meth:`list.sort` beats other techniques for -sorting in all but the most extreme circumstances. - -Another common trick is to "push loops into functions or methods." For example -suppose you have a program that runs slowly and you use the profiler to -determine that a Python function ``ff()`` is being called lots of times. If you -notice that ``ff()``:: - - def ff(x): - ... # do something with x computing result... - return result - -tends to be called in loops like:: - - list = map(ff, oldlist) - -or:: - - for x in sequence: - value = ff(x) - ... # do something with value... - -then you can often eliminate function call overhead by rewriting ``ff()`` to:: - - def ffseq(seq): - resultseq = [] - for x in seq: - ... # do something with x computing result... - resultseq.append(result) - return resultseq - -and rewrite the two examples to ``list = ffseq(oldlist)`` and to:: - - for value in ffseq(sequence): - ... # do something with value... - -Single calls to ``ff(x)`` translate to ``ffseq([x])[0]`` with little penalty. -Of course this technique is not always appropriate and there are other variants -which you can figure out. - -You can gain some performance by explicitly storing the results of a function or -method lookup into a local variable. A loop like:: - - for key in token: - dict[key] = dict.get(key, 0) + 1 - -resolves ``dict.get`` every iteration. If the method isn't going to change, a -slightly faster implementation is:: - - dict_get = dict.get # look up the method once - for key in token: - dict[key] = dict_get(key, 0) + 1 - -Default arguments can be used to determine values once, at compile time instead -of at run time. This can only be done for functions or objects which will not -be changed during program execution, such as replacing :: - - def degree_sin(deg): - return math.sin(deg * math.pi / 180.0) - -with :: - - def degree_sin(deg, factor=math.pi/180.0, sin=math.sin): - return sin(deg * factor) - -Because this trick uses default arguments for terms which should not be changed, -it should only be used when you are not concerned with presenting a possibly -confusing API to your users. - - Core Language ============= @@ -938,6 +785,68 @@ See the :ref:`unicode-howto`. +Performance +=========== + +My program is too slow. How do I speed it up? +--------------------------------------------- + +That's a tough one, in general. First, here are a list of things to +remember before diving further: + +* Performance characteristics vary accross Python implementations. This FAQ + focusses on :term:`CPython`. +* Behaviour can vary accross operating systems, especially when talking about + I/O or multi-threading. +* You should always find the hot spots in your program *before* attempting to + optimize any code (see the :mod:`profile` module). +* Writing benchmark scripts will allow you to iterate quickly when searching + for improvements (see the :mod:`timeit` module). +* It is highly recommended to have good code coverage (through unit testing + or any other technique) before potentially introducing regressions hidden + in sophisticated optimizations. + +That being said, there are many tricks to speed up Python code. Here are +some general principles which go a long way towards reaching acceptable +performance levels: + +* Making your algorithms faster (or changing to faster ones) can yield + much larger benefits than trying to sprinkle micro-optimization tricks + all over your code. + +* Use the right data structures. Study documentation for the :ref:`bltin-types` + and the :mod:`collections` module. + +* When the standard library provides a primitive for doing something, it is + likely (although not guaranteed) to be faster than any alternative you + may come up with. This is doubly true for primitives written in C, such + as builtins and some extension types. For example, be sure to use + either the :meth:`list.sort` built-in method or the related :func:`sorted` + function to do sorting (and see the + `sorting mini-HOWTO `_ for examples + of moderately advanced usage). + +* Abstractions tend to create indirections and force the interpreter to work + more. If the levels of indirection outweigh the amount of useful work + done, your program will be slower. You should avoid excessive abstraction, + especially under the form of tiny functions or methods (which are also often + detrimental to readability). + +If you have reached the limit of what pure Python can allow, there are tools +to take you further away. For example, `Cython `_ can +compile a slightly modified version of Python code into a C extension, and +can be used on many different platforms. Cython can take advantage of +compilation (and optional type annotations) to make your code significantly +faster than when interpreted. If you are confident in your C programming +skills, you can also :ref:`write a C extension module ` +yourself. + +.. seealso:: + The wiki page devoted to `performance tips + `_. + +.. _efficient_string_concatenation: + What is the most efficient way to concatenate many strings together? -------------------------------------------------------------------- -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 23:42:43 2011 From: python-checkins at python.org (florent.xicluna) Date: Fri, 09 Dec 2011 23:42:43 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=282=2E7=29=3A_Fix_docstring_t?= =?utf8?q?ypo=2E?= Message-ID: http://hg.python.org/cpython/rev/78a7d5f8f054 changeset: 73910:78a7d5f8f054 branch: 2.7 parent: 73901:70337a6d5dde user: Florent Xicluna date: Fri Dec 09 23:40:27 2011 +0100 summary: Fix docstring typo. files: Modules/arraymodule.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/Modules/arraymodule.c b/Modules/arraymodule.c --- a/Modules/arraymodule.c +++ b/Modules/arraymodule.c @@ -2050,7 +2050,7 @@ \n\ Return a new array whose items are restricted by typecode, and\n\ initialized from the optional initializer value, which must be a list,\n\ -string. or iterable over elements of the appropriate type.\n\ +string or iterable over elements of the appropriate type.\n\ \n\ Arrays represent basic values and behave very much like lists, except\n\ the type of objects stored in them is constrained.\n\ -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 23:42:44 2011 From: python-checkins at python.org (florent.xicluna) Date: Fri, 09 Dec 2011 23:42:44 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Fix_docstring_t?= =?utf8?q?ypo=2E?= Message-ID: http://hg.python.org/cpython/rev/1e2880250610 changeset: 73911:1e2880250610 branch: 3.2 parent: 73908:eb30f2becb79 user: Florent Xicluna date: Fri Dec 09 23:41:19 2011 +0100 summary: Fix docstring typo. files: Modules/arraymodule.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/Modules/arraymodule.c b/Modules/arraymodule.c --- a/Modules/arraymodule.c +++ b/Modules/arraymodule.c @@ -2543,7 +2543,7 @@ \n\ Return a new array whose items are restricted by typecode, and\n\ initialized from the optional initializer value, which must be a list,\n\ -string. or iterable over elements of the appropriate type.\n\ +string or iterable over elements of the appropriate type.\n\ \n\ Arrays represent basic values and behave very much like lists, except\n\ the type of objects stored in them is constrained.\n\ -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 23:42:44 2011 From: python-checkins at python.org (florent.xicluna) Date: Fri, 09 Dec 2011 23:42:44 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Remove_obsolete?= =?utf8?q?_py3k_comment=2E?= Message-ID: http://hg.python.org/cpython/rev/ee0c9ca09c84 changeset: 73912:ee0c9ca09c84 branch: 3.2 user: Florent Xicluna date: Fri Dec 09 23:41:21 2011 +0100 summary: Remove obsolete py3k comment. files: Python/_warnings.c | 1 - 1 files changed, 0 insertions(+), 1 deletions(-) diff --git a/Python/_warnings.c b/Python/_warnings.c --- a/Python/_warnings.c +++ b/Python/_warnings.c @@ -888,7 +888,6 @@ static PyObject * init_filters(void) { - /* Don't silence DeprecationWarning if -3 was used. */ PyObject *filters = PyList_New(5); unsigned int pos = 0; /* Post-incremented in each use. */ unsigned int x; -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Fri Dec 9 23:42:45 2011 From: python-checkins at python.org (florent.xicluna) Date: Fri, 09 Dec 2011 23:42:45 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_3=2E2?= Message-ID: http://hg.python.org/cpython/rev/b7c5e30582d1 changeset: 73913:b7c5e30582d1 parent: 73909:9fe28f52eaaa parent: 73912:ee0c9ca09c84 user: Florent Xicluna date: Fri Dec 09 23:42:29 2011 +0100 summary: Merge 3.2 files: Modules/arraymodule.c | 2 +- Python/_warnings.c | 1 - 2 files changed, 1 insertions(+), 2 deletions(-) diff --git a/Modules/arraymodule.c b/Modules/arraymodule.c --- a/Modules/arraymodule.c +++ b/Modules/arraymodule.c @@ -2619,7 +2619,7 @@ \n\ Return a new array whose items are restricted by typecode, and\n\ initialized from the optional initializer value, which must be a list,\n\ -string. or iterable over elements of the appropriate type.\n\ +string or iterable over elements of the appropriate type.\n\ \n\ Arrays represent basic values and behave very much like lists, except\n\ the type of objects stored in them is constrained.\n\ diff --git a/Python/_warnings.c b/Python/_warnings.c --- a/Python/_warnings.c +++ b/Python/_warnings.c @@ -895,7 +895,6 @@ static PyObject * init_filters(void) { - /* Don't silence DeprecationWarning if -3 was used. */ PyObject *filters = PyList_New(5); unsigned int pos = 0; /* Post-incremented in each use. */ unsigned int x; -- Repository URL: http://hg.python.org/cpython From solipsis at pitrou.net Sat Dec 10 05:34:30 2011 From: solipsis at pitrou.net (solipsis at pitrou.net) Date: Sat, 10 Dec 2011 05:34:30 +0100 Subject: [Python-checkins] Daily reference leaks (b7c5e30582d1): sum=0 Message-ID: results for b7c5e30582d1 on branch "default" -------------------------------------------- Command line was: ['./python', '-m', 'test.regrtest', '-uall', '-R', '3:3:/home/antoine/cpython/refleaks/reflogwfwWLq', '-x'] From python-checkins at python.org Sat Dec 10 11:08:22 2011 From: python-checkins at python.org (florent.xicluna) Date: Sat, 10 Dec 2011 11:08:22 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Issue_=2313248=3A_turn_3=2E?= =?utf8?q?2=27s_PendingDeprecationWarning_into_3=2E3=27s?= Message-ID: http://hg.python.org/cpython/rev/f82ebf9b3a52 changeset: 73914:f82ebf9b3a52 user: Florent Xicluna date: Sat Dec 10 11:07:42 2011 +0100 summary: Issue #13248: turn 3.2's PendingDeprecationWarning into 3.3's DeprecationWarning (cgi, importlib, nntplib, smtpd). files: Doc/library/nntplib.rst | 3 + Lib/cgi.py | 2 +- Lib/importlib/abc.py | 4 +- Lib/importlib/test/source/test_abc_loader.py | 4 +- Lib/nntplib.py | 4 +- Lib/smtpd.py | 44 +++++----- Lib/test/test_smtpd.py | 44 +++++----- Misc/NEWS | 5 + 8 files changed, 59 insertions(+), 51 deletions(-) diff --git a/Doc/library/nntplib.rst b/Doc/library/nntplib.rst --- a/Doc/library/nntplib.rst +++ b/Doc/library/nntplib.rst @@ -517,6 +517,9 @@ article with message ID *id*. Most of the time, this extension is not enabled by NNTP server administrators. + .. deprecated:: 3.3 + The XPATH extension is not actively used. + .. XXX deprecated: diff --git a/Lib/cgi.py b/Lib/cgi.py --- a/Lib/cgi.py +++ b/Lib/cgi.py @@ -1012,7 +1012,7 @@ def escape(s, quote=None): """Deprecated API.""" warn("cgi.escape is deprecated, use html.escape instead", - PendingDeprecationWarning, stacklevel=2) + DeprecationWarning, stacklevel=2) s = s.replace("&", "&") # Must be done first! s = s.replace("<", "<") s = s.replace(">", ">") diff --git a/Lib/importlib/abc.py b/Lib/importlib/abc.py --- a/Lib/importlib/abc.py +++ b/Lib/importlib/abc.py @@ -195,7 +195,7 @@ "use SourceLoader instead. " "See the importlib documentation on how to be " "compatible with Python 3.1 onwards.", - PendingDeprecationWarning) + DeprecationWarning) path = self.source_path(fullname) if path is None: raise ImportError @@ -234,7 +234,7 @@ "removal in Python 3.4; use SourceLoader instead. " "If Python 3.1 compatibility is required, see the " "latest documentation for PyLoader.", - PendingDeprecationWarning) + DeprecationWarning) source_timestamp = self.source_mtime(fullname) # Try to use bytecode if it is available. bytecode_path = self.bytecode_path(fullname) diff --git a/Lib/importlib/test/source/test_abc_loader.py b/Lib/importlib/test/source/test_abc_loader.py --- a/Lib/importlib/test/source/test_abc_loader.py +++ b/Lib/importlib/test/source/test_abc_loader.py @@ -102,7 +102,7 @@ warnings.simplefilter("always") path = super().get_filename(name) assert len(w) == 1 - assert issubclass(w[0].category, PendingDeprecationWarning) + assert issubclass(w[0].category, DeprecationWarning) return path @@ -198,7 +198,7 @@ warnings.simplefilter("always") code_object = super().get_code(name) assert len(w) == 1 - assert issubclass(w[0].category, PendingDeprecationWarning) + assert issubclass(w[0].category, DeprecationWarning) return code_object class PyLoaderTests(testing_abc.LoaderTests): diff --git a/Lib/nntplib.py b/Lib/nntplib.py --- a/Lib/nntplib.py +++ b/Lib/nntplib.py @@ -828,7 +828,7 @@ - list: list of (name,title) strings""" warnings.warn("The XGTITLE extension is not actively used, " "use descriptions() instead", - PendingDeprecationWarning, 2) + DeprecationWarning, 2) line_pat = re.compile('^([^ \t]+)[ \t]+(.*)$') resp, raw_lines = self._longcmdstring('XGTITLE ' + group, file) lines = [] @@ -846,7 +846,7 @@ path: directory path to article """ warnings.warn("The XPATH extension is not actively used", - PendingDeprecationWarning, 2) + DeprecationWarning, 2) resp = self._shortcmd('XPATH {0}'.format(id)) if not resp.startswith('223'): diff --git a/Lib/smtpd.py b/Lib/smtpd.py --- a/Lib/smtpd.py +++ b/Lib/smtpd.py @@ -142,122 +142,122 @@ @property def __server(self): warn("Access to __server attribute on SMTPChannel is deprecated, " - "use 'smtp_server' instead", PendingDeprecationWarning, 2) + "use 'smtp_server' instead", DeprecationWarning, 2) return self.smtp_server @__server.setter def __server(self, value): warn("Setting __server attribute on SMTPChannel is deprecated, " - "set 'smtp_server' instead", PendingDeprecationWarning, 2) + "set 'smtp_server' instead", DeprecationWarning, 2) self.smtp_server = value @property def __line(self): warn("Access to __line attribute on SMTPChannel is deprecated, " - "use 'received_lines' instead", PendingDeprecationWarning, 2) + "use 'received_lines' instead", DeprecationWarning, 2) return self.received_lines @__line.setter def __line(self, value): warn("Setting __line attribute on SMTPChannel is deprecated, " - "set 'received_lines' instead", PendingDeprecationWarning, 2) + "set 'received_lines' instead", DeprecationWarning, 2) self.received_lines = value @property def __state(self): warn("Access to __state attribute on SMTPChannel is deprecated, " - "use 'smtp_state' instead", PendingDeprecationWarning, 2) + "use 'smtp_state' instead", DeprecationWarning, 2) return self.smtp_state @__state.setter def __state(self, value): warn("Setting __state attribute on SMTPChannel is deprecated, " - "set 'smtp_state' instead", PendingDeprecationWarning, 2) + "set 'smtp_state' instead", DeprecationWarning, 2) self.smtp_state = value @property def __greeting(self): warn("Access to __greeting attribute on SMTPChannel is deprecated, " - "use 'seen_greeting' instead", PendingDeprecationWarning, 2) + "use 'seen_greeting' instead", DeprecationWarning, 2) return self.seen_greeting @__greeting.setter def __greeting(self, value): warn("Setting __greeting attribute on SMTPChannel is deprecated, " - "set 'seen_greeting' instead", PendingDeprecationWarning, 2) + "set 'seen_greeting' instead", DeprecationWarning, 2) self.seen_greeting = value @property def __mailfrom(self): warn("Access to __mailfrom attribute on SMTPChannel is deprecated, " - "use 'mailfrom' instead", PendingDeprecationWarning, 2) + "use 'mailfrom' instead", DeprecationWarning, 2) return self.mailfrom @__mailfrom.setter def __mailfrom(self, value): warn("Setting __mailfrom attribute on SMTPChannel is deprecated, " - "set 'mailfrom' instead", PendingDeprecationWarning, 2) + "set 'mailfrom' instead", DeprecationWarning, 2) self.mailfrom = value @property def __rcpttos(self): warn("Access to __rcpttos attribute on SMTPChannel is deprecated, " - "use 'rcpttos' instead", PendingDeprecationWarning, 2) + "use 'rcpttos' instead", DeprecationWarning, 2) return self.rcpttos @__rcpttos.setter def __rcpttos(self, value): warn("Setting __rcpttos attribute on SMTPChannel is deprecated, " - "set 'rcpttos' instead", PendingDeprecationWarning, 2) + "set 'rcpttos' instead", DeprecationWarning, 2) self.rcpttos = value @property def __data(self): warn("Access to __data attribute on SMTPChannel is deprecated, " - "use 'received_data' instead", PendingDeprecationWarning, 2) + "use 'received_data' instead", DeprecationWarning, 2) return self.received_data @__data.setter def __data(self, value): warn("Setting __data attribute on SMTPChannel is deprecated, " - "set 'received_data' instead", PendingDeprecationWarning, 2) + "set 'received_data' instead", DeprecationWarning, 2) self.received_data = value @property def __fqdn(self): warn("Access to __fqdn attribute on SMTPChannel is deprecated, " - "use 'fqdn' instead", PendingDeprecationWarning, 2) + "use 'fqdn' instead", DeprecationWarning, 2) return self.fqdn @__fqdn.setter def __fqdn(self, value): warn("Setting __fqdn attribute on SMTPChannel is deprecated, " - "set 'fqdn' instead", PendingDeprecationWarning, 2) + "set 'fqdn' instead", DeprecationWarning, 2) self.fqdn = value @property def __peer(self): warn("Access to __peer attribute on SMTPChannel is deprecated, " - "use 'peer' instead", PendingDeprecationWarning, 2) + "use 'peer' instead", DeprecationWarning, 2) return self.peer @__peer.setter def __peer(self, value): warn("Setting __peer attribute on SMTPChannel is deprecated, " - "set 'peer' instead", PendingDeprecationWarning, 2) + "set 'peer' instead", DeprecationWarning, 2) self.peer = value @property def __conn(self): warn("Access to __conn attribute on SMTPChannel is deprecated, " - "use 'conn' instead", PendingDeprecationWarning, 2) + "use 'conn' instead", DeprecationWarning, 2) return self.conn @__conn.setter def __conn(self, value): warn("Setting __conn attribute on SMTPChannel is deprecated, " - "set 'conn' instead", PendingDeprecationWarning, 2) + "set 'conn' instead", DeprecationWarning, 2) self.conn = value @property def __addr(self): warn("Access to __addr attribute on SMTPChannel is deprecated, " - "use 'addr' instead", PendingDeprecationWarning, 2) + "use 'addr' instead", DeprecationWarning, 2) return self.addr @__addr.setter def __addr(self, value): warn("Setting __addr attribute on SMTPChannel is deprecated, " - "set 'addr' instead", PendingDeprecationWarning, 2) + "set 'addr' instead", DeprecationWarning, 2) self.addr = value # Overrides base class for convenience diff --git a/Lib/test/test_smtpd.py b/Lib/test/test_smtpd.py --- a/Lib/test/test_smtpd.py +++ b/Lib/test/test_smtpd.py @@ -239,49 +239,49 @@ self.assertEqual(self.channel.socket.last, b'501 Syntax: RSET\r\n') def test_attribute_deprecations(self): - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__server - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__server = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__line - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__line = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__state - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__state = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__greeting - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__greeting = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__mailfrom - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__mailfrom = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__rcpttos - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__rcpttos = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__data - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__data = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__fqdn - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__fqdn = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__peer - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__peer = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__conn - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__conn = 'spam' - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): spam = self.channel._SMTPChannel__addr - with support.check_warnings(('', PendingDeprecationWarning)): + with support.check_warnings(('', DeprecationWarning)): self.channel._SMTPChannel__addr = 'spam' def test_main(): diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -406,6 +406,11 @@ Library ------- +- Issue #13248: Turn 3.2's PendingDeprecationWarning into 3.3's + DeprecationWarning. It covers 'cgi.escape', 'importlib.abc.PyLoader', + 'importlib.abc.PyPycLoader', 'nntplib.NNTP.xgtitle', 'nntplib.NNTP.xpath', + and private attributes of 'smtpd.SMTPChannel'. + - Issue #5905: time.strftime() is now using the locale encoding, instead of UTF-8, if the wcsftime() function is not available. -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 10 12:52:36 2011 From: python-checkins at python.org (lars.gustaebel) Date: Sat, 10 Dec 2011 12:52:36 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=283=2E2=29=3A_Fix_doc_typo=2E?= Message-ID: http://hg.python.org/cpython/rev/caeec3e3606f changeset: 73915:caeec3e3606f branch: 3.2 parent: 73912:ee0c9ca09c84 user: Lars Gust?bel date: Sat Dec 10 12:45:45 2011 +0100 summary: Fix doc typo. files: Doc/library/tarfile.rst | 4 ++-- 1 files changed, 2 insertions(+), 2 deletions(-) diff --git a/Doc/library/tarfile.rst b/Doc/library/tarfile.rst --- a/Doc/library/tarfile.rst +++ b/Doc/library/tarfile.rst @@ -101,10 +101,10 @@ +-------------+--------------------------------------------+ | ``'w|'`` | Open an uncompressed *stream* for writing. | +-------------+--------------------------------------------+ - | ``'w|gz'`` | Open an gzip compressed *stream* for | + | ``'w|gz'`` | Open a gzip compressed *stream* for | | | writing. | +-------------+--------------------------------------------+ - | ``'w|bz2'`` | Open an bzip2 compressed *stream* for | + | ``'w|bz2'`` | Open a bzip2 compressed *stream* for | | | writing. | +-------------+--------------------------------------------+ -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 10 12:52:37 2011 From: python-checkins at python.org (lars.gustaebel) Date: Sat, 10 Dec 2011 12:52:37 +0100 Subject: [Python-checkins] =?utf8?q?cpython_=28merge_3=2E2_-=3E_default=29?= =?utf8?q?=3A_Merge_with_3=2E2=3A_Fix_doc_typo=2E?= Message-ID: http://hg.python.org/cpython/rev/5da2c242025f changeset: 73916:5da2c242025f parent: 73914:f82ebf9b3a52 parent: 73915:caeec3e3606f user: Lars Gust?bel date: Sat Dec 10 12:48:03 2011 +0100 summary: Merge with 3.2: Fix doc typo. files: Doc/library/tarfile.rst | 4 ++-- 1 files changed, 2 insertions(+), 2 deletions(-) diff --git a/Doc/library/tarfile.rst b/Doc/library/tarfile.rst --- a/Doc/library/tarfile.rst +++ b/Doc/library/tarfile.rst @@ -101,10 +101,10 @@ +-------------+--------------------------------------------+ | ``'w|'`` | Open an uncompressed *stream* for writing. | +-------------+--------------------------------------------+ - | ``'w|gz'`` | Open an gzip compressed *stream* for | + | ``'w|gz'`` | Open a gzip compressed *stream* for | | | writing. | +-------------+--------------------------------------------+ - | ``'w|bz2'`` | Open an bzip2 compressed *stream* for | + | ``'w|bz2'`` | Open a bzip2 compressed *stream* for | | | writing. | +-------------+--------------------------------------------+ -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 10 13:02:46 2011 From: python-checkins at python.org (florent.xicluna) Date: Sat, 10 Dec 2011 13:02:46 +0100 Subject: [Python-checkins] =?utf8?q?cpython=3A_Fix_comment_in_difflib=2E?= Message-ID: http://hg.python.org/cpython/rev/a3309634f38c changeset: 73917:a3309634f38c user: Florent Xicluna date: Sat Dec 10 13:02:17 2011 +0100 summary: Fix comment in difflib. files: Lib/difflib.py | 3 +-- 1 files changed, 1 insertions(+), 2 deletions(-) diff --git a/Lib/difflib.py b/Lib/difflib.py --- a/Lib/difflib.py +++ b/Lib/difflib.py @@ -204,7 +204,7 @@ # returning true iff the element is "junk" -- this has # subtle but helpful effects on the algorithm, which I'll # get around to writing up someday <0.9 wink>. - # DON'T USE! Only __chain_b uses this. Use isbjunk. + # DON'T USE! Only __chain_b uses this. Use "in self.bjunk". # bjunk # the items in b for which isjunk is True. # bpopular @@ -287,7 +287,6 @@ # when self.isjunk is defined, junk elements don't show up in this # map at all, which stops the central find_longest_match method # from starting any matching block at a junk element ... - # also creates the fast isbjunk function ... # b2j also does not contain entries for "popular" elements, meaning # elements that account for more than 1 + 1% of the total elements, and # when the sequence is reasonably large (>= 200 elements); this can -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 10 13:18:23 2011 From: python-checkins at python.org (charles-francois.natali) Date: Sat, 10 Dec 2011 13:18:23 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMi43KTogSXNzdWUgIzEzNDUz?= =?utf8?q?=3A_Catch_EAI=5FFAIL_in_support=2Etransient=5Finternet=2E?= Message-ID: http://hg.python.org/cpython/rev/5ba1a22c8988 changeset: 73918:5ba1a22c8988 branch: 2.7 parent: 73910:78a7d5f8f054 user: Charles-Fran?ois Natali date: Sat Dec 10 13:16:02 2011 +0100 summary: Issue #13453: Catch EAI_FAIL in support.transient_internet. files: Lib/test/test_support.py | 1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/Lib/test/test_support.py b/Lib/test/test_support.py --- a/Lib/test/test_support.py +++ b/Lib/test/test_support.py @@ -764,6 +764,7 @@ ] default_gai_errnos = [ ('EAI_AGAIN', -3), + ('EAI_FAIL', -4), ('EAI_NONAME', -2), ('EAI_NODATA', -5), ] -- Repository URL: http://hg.python.org/cpython From python-checkins at python.org Sat Dec 10 13:18:44 2011 From: python-checkins at python.org (charles-francois.natali) Date: Sat, 10 Dec 2011 13:18:44 +0100 Subject: [Python-checkins] =?utf8?b?Y3B5dGhvbiAoMy4yKTogSXNzdWUgIzEzNDUz?= =?utf8?q?=3A_Catch_EAI=5FFAIL_in_support=2Etransient=5Finternet=2E?= Message-ID:

	mnemonic	vim-style
first message	f	h
previous message	p	k
next message	n	j
last message	l	l
focus textarea	r	i
unfocus textarea	Esc	Esc
shortcuts help	?	?