Minor changes: updated version numbers, added punctuation.
The current text seems to take into account Guido's latest comments.
Thoughts before asking for acceptance?
PEP: 467
Title: Minor API improvements for binary sequences
Version: $Revision$
Last-Modified: $Date$
Author: Nick Coghlan <ncoghlan(a)gmail.com>
Status: Draft
Type: Standards Track
Content-Type: text/x-rst
Created: 2014-03-30
Python-Version: 3.5
Post-History: 2014-03-30, 2014-08-15, 2014-08-16
Abstract
========
During the initial development of the Python 3 language specification,
the core ``bytes`` type for arbitrary binary data started as the mutable
type that is now referred to as ``bytearray``. Other aspects of
operating in the binary domain in Python have also evolved over the
course of the Python 3 series.
This PEP proposes four small adjustments to the APIs of the ``bytes``,
``bytearray`` and ``memoryview`` types to make it easier to operate
entirely in the binary domain:
* Deprecate passing single integer values to ``bytes`` and ``bytearray``
* Add ``bytes.zeros`` and ``bytearray.zeros`` alternative constructors
* Add ``bytes.byte`` and ``bytearray.byte`` alternative constructors
* Add ``bytes.iterbytes``, ``bytearray.iterbytes`` and
``memoryview.iterbytes`` alternative iterators
Proposals
=========
Deprecation of current "zero-initialised sequence" behaviour
------------------------------------------------------------
Currently, the ``bytes`` and ``bytearray`` constructors accept an
integer argument and interpret it as meaning to create a
zero-initialised sequence of the given size::
>>> bytes(3)
b'\x00\x00\x00'
>>> bytearray(3)
bytearray(b'\x00\x00\x00')
This PEP proposes to deprecate that behaviour in Python 3.6, and remove
it entirely in Python 3.7.
No other changes are proposed to the existing constructors.
Addition of explicit "zero-initialised sequence" constructors
-------------------------------------------------------------
To replace the deprecated behaviour, this PEP proposes the addition of
an explicit ``zeros`` alternative constructor as a class method on both
``bytes`` and ``bytearray``::
>>> bytes.zeros(3)
b'\x00\x00\x00'
>>> bytearray.zeros(3)
bytearray(b'\x00\x00\x00')
It will behave just as the current constructors behave when passed a
single integer.
The specific choice of ``zeros`` as the alternative constructor name is
taken from the corresponding initialisation function in NumPy (although,
as these are 1-dimensional sequence types rather than N-dimensional
matrices, the constructors take a length as input rather than a shape
tuple).
Addition of explicit "single byte" constructors
-----------------------------------------------
As binary counterparts to the text ``chr`` function, this PEP proposes
the addition of an explicit ``byte`` alternative constructor as a class
method on both ``bytes`` and ``bytearray``::
>>> bytes.byte(3)
b'\x03'
>>> bytearray.byte(3)
bytearray(b'\x03')
These methods will only accept integers in the range 0 to 255 (inclusive)::
>>> bytes.byte(512)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
ValueError: bytes must be in range(0, 256)
>>> bytes.byte(1.0)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
TypeError: 'float' object cannot be interpreted as an integer
The documentation of the ``ord`` builtin will be updated to explicitly
note that ``bytes.byte`` is the inverse operation for binary data, while
``chr`` is the inverse operation for text data.
Behaviourally, ``bytes.byte(x)`` will be equivalent to the current
``bytes([x])`` (and similarly for ``bytearray``). The new spelling is
expected to be easier to discover and easier to read (especially when
used in conjunction with indexing operations on binary sequence types).
As a separate method, the new spelling will also work better with higher
order functions like ``map``.
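As a rough sketch of the intended behaviour (``bytes.byte`` does not
exist yet, so the ``bytes_byte`` helper below is a hypothetical
stand-in spelled with today's API), the new constructor would compose
naturally with ``map``::

    def bytes_byte(x):
        # Hypothetical stand-in for the proposed bytes.byte()
        return bytes([x])

    print(bytes_byte(3))                  # b'\x03'
    print(list(map(bytes_byte, b'abc')))  # [b'a', b'b', b'c']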
Addition of optimised iterator methods that produce ``bytes`` objects
---------------------------------------------------------------------
This PEP proposes that ``bytes``, ``bytearray`` and ``memoryview`` gain
an optimised ``iterbytes`` method that produces length 1 ``bytes``
objects rather than integers::
for x in data.iterbytes():
    # x is a length 1 ``bytes`` object, rather than an integer
    ...
The method can be used with arbitrary buffer exporting objects by
wrapping them in a ``memoryview`` instance first::
for x in memoryview(data).iterbytes():
    # x is a length 1 ``bytes`` object, rather than an integer
    ...
For ``memoryview``, the semantics of ``iterbytes()`` are defined such that::
memview.tobytes() == b''.join(memview.iterbytes())
This allows the raw bytes of the memory view to be iterated over without
needing to make a copy, regardless of the defined shape and format.
The main advantage this method offers over the ``map(bytes.byte, data)``
approach is that it is guaranteed *not* to fail midstream with a
``ValueError`` or ``TypeError``. By contrast, when using the ``map``
based approach, the type and value of the individual items in the
iterable are only checked as they are retrieved and passed through the
``bytes.byte`` constructor.
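As ``iterbytes`` is only proposed here, a minimal pure-Python sketch of
the intended behaviour (assuming the ``join`` based semantics above, and
making a small copy per item rather than being optimised) could be
written today as::

    def iterbytes(data):
        # Rough emulation of the proposed iterbytes(): yield length 1
        # bytes objects from any buffer-exporting object
        view = memoryview(data).cast('B')   # flatten to unsigned bytes
        for i in range(len(view)):
            yield view[i:i + 1].tobytes()

    assert b''.join(iterbytes(b'abc')) == b'abc'
    assert list(iterbytes(bytearray(b'\x00\x01'))) == [b'\x00', b'\x01']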
Design discussion
=================
Why not rely on sequence repetition to create zero-initialised sequences?
-------------------------------------------------------------------------
Zero-initialised sequences can be created via sequence repetition::
>>> b'\x00' * 3
b'\x00\x00\x00'
>>> bytearray(b'\x00') * 3
bytearray(b'\x00\x00\x00')
However, this was also the case when the ``bytearray`` type was
originally designed, and the decision was made to add explicit support
for it in the type constructor. The immutable ``bytes`` type then
inherited that feature when it was introduced in PEP 3137.
This PEP isn't revisiting that original design decision, just changing
the spelling as users sometimes find the current behaviour of the binary
sequence constructors surprising. In particular, there's a reasonable
case to be made that ``bytes(x)`` (where ``x`` is an integer) should
behave like the ``bytes.byte(x)`` proposal in this PEP. Providing both
behaviours as separate class methods avoids that ambiguity.
References
==========
.. [1] Initial March 2014 discussion thread on python-ideas
(https://mail.python.org/pipermail/python-ideas/2014-March/027295.html)
.. [2] Guido's initial feedback in that thread
(https://mail.python.org/pipermail/python-ideas/2014-March/027376.html)
.. [3] Issue proposing moving zero-initialised sequences to a dedicated API
(http://bugs.python.org/issue20895)
.. [4] Issue proposing to use calloc() for zero-initialised binary sequences
(http://bugs.python.org/issue21644)
.. [5] August 2014 discussion thread on python-dev
(https://mail.python.org/pipermail/python-ideas/2014-March/027295.html)
Copyright
=========
This document has been placed in the public domain.
Hi all,
I noticed __qualname__ is exposed by locals() while defining a class. This
is handy, but I'm not sure about its status: is it standard or just an
artifact of the current implementation? (Btw, the pycodestyle linter,
formerly pep8, rejects its usage.) I was unable to find any reference to
this behavior in PEP 3155 or in the language reference.
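For concreteness, a minimal example of the behaviour I mean (the class
name is arbitrary):

    class Widget:
        # During execution of the class body, CPython exposes
        # __qualname__ in the class namespace, so both of these work:
        print(__qualname__)                   # Widget
        print('__qualname__' in locals())     # True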
Thank you in advance
--
Carlos
Hi,
I see many PEPs accepted for Python 3.6, or still in draft status, but
only a few final PEPs. What is happening?
Reminder: the deadline for new features in Python 3.6 is 2016-09-12,
only 2 months away, and these 2 months are summer in the northern
hemisphere, which means holidays for many of us...
Python 3.6 schedule and What's New in Python 3.6 list some PEPs:
https://www.python.org/dev/peps/pep-0494/
https://docs.python.org/dev/whatsnew/3.6.html
"PEP 499 -- python -m foo should bind sys.modules['foo'] in addition
to sys.modules['__main__']"
https://www.python.org/dev/peps/pep-0499/
=> draft
"PEP 498 -- Literal String Interpolation"
https://www.python.org/dev/peps/pep-0498/
=> accepted -- it's merged in Python 3.6; shouldn't the status be
updated to Final?
"PEP 495 -- Local Time Disambiguation"
https://www.python.org/dev/peps/pep-0495/
=> accepted
Alexander Belopolsky asked for a review of the implementation:
https://mail.python.org/pipermail/python-dev/2016-June/145450.html
"PEP 447 -- Add __getdescriptor__ method to metaclass"
https://www.python.org/dev/peps/pep-0447/
=> draft
"PEP 487 -- Simpler customisation of class creation"
https://www.python.org/dev/peps/pep-0487/
=> draft
"PEP 520 -- Preserving Class Attribute Definition Order"
https://www.python.org/dev/peps/pep-0520/
=> accepted -- what is the status of its implementation?
"PEP 519 -- Adding a file system path protocol"
https://www.python.org/dev/peps/pep-0519/
=> accepted
"PEP 467 -- Minor API improvements for binary sequences"
https://www.python.org/dev/peps/pep-0467
=> draft -- I saw recently some discussions around this PEP (on python-ideas?)
It looks like os.fspath() exists, so the PEP is implemented. Its
status should be Final, but the PEP should also be mentioned in What's
New in Python 3.6 please.
I also see some discussions for even more compact dict implementation.
I wrote 3 PEPs, but I didn't have time recently to work on them (to
make progress on the implementation of FAT Python):
"PEP 509 -- Add a private version to dict"
https://www.python.org/dev/peps/pep-0509/
=> draft
Pyjion, Cython, and Yury Selivanov are interested in using this feature,
but last time I asked Guido, he didn't seem convinced by the
advantages of the PEP.
"PEP 510 -- Specialize functions with guards"
https://www.python.org/dev/peps/pep-0510/
"PEP 511 -- API for code transformers"
https://www.python.org/dev/peps/pep-0511/
These two PEPs are directly related to my FAT Python work. I was asked
to prove that FAT Python makes CPython faster. Sadly, I failed to
prove that. Moreover, it took me almost 2 months (and I'm not done
yet!) to get stable benchmark results on Python. I want to make sure
that my changes don't make Python slower (don't introduce Python
regressions), but the CPython benchmark suite is unstable; some benchmarks
are very unstable. To get more information, follow the
speed(a)python.org mailing list ;-)
I probably forgot some PEPs, there are so many PEPs in the draft state :-(
Victor
Hi,
as you probably already know, today the PyPI index page (
https://pypi.python.org/pypi?%3Aaction=index) was deprecated and ceased to
be.
Among other things it affected PyCharm IDE that relied on that page to
enable packaging related features from the IDE. As a result users of
PyCharm can no longer install/update PyPI packages from PyCharm.
Here is an issue about that in our tracker:
https://youtrack.jetbrains.com/issue/PY-20081
Given that there are several hundred thousand PyCharm users in the
world -- all 3 editions (Professional, Community, and Educational) are
affected -- this can lead to a storm of negative feedback as people
start to face the denial of service.
The deprecation of the index was totally unexpected for us and we weren't
prepared for that. Maybe we missed some announcement.
We would be very happy if the functionality of the index were restored
at least for a short period of time: please give us a couple of weeks.
That will allow us to implement a workaround and provide the fix for
the several latest major versions of PyCharm.
Does anybody know who is responsible for that decision and whom to contact
about it? Please help.
Best regards,
Dmitry Trofimov
PyCharm Team Lead
JetBrains
http://www.jetbrains.com
The Drive To Develop
I was using Py_LIMITED_API under 3.5 with PY_SSIZE_T_CLEAN set; this
causes some functions that are not in the limited API to be used, and the
resulting extension segfaults on Linux. Is that right?
Thanks,
Daniel
I'm in the process of trying to disentangle
http://bugs.python.org/issue27137, which points out some of the
behavioural differences that arise when falling back from the original
C implementation of functools.partial to the pure Python emulation
that uses a closure.
That issue was opened due to a few things that work with the C
implementation but fail with the Python implementation:
- the C version can be pickled (and hence used with multiprocessing)
- the C version can be subclassed
- the C version can be used in "isinstance" checks
- the C version behaves as a static method, the Python version as a
normal instance method (see the sketch below)
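To make that last point concrete, here is a rough sketch (not taken
from the issue itself; partial_closure is just a simplified stand-in
for the closure-based fallback):

    import functools

    def add(x, y):
        return x + y

    def partial_closure(func, *args):
        # Simplified stand-in for a closure-based pure Python partial()
        def inner(*more):
            return func(*args, *more)
        return inner

    class UsesCPartial:
        # The C functools.partial object is not a descriptor, so as a
        # class attribute it behaves like a static method: no binding.
        add_one = functools.partial(add, 1)

    class UsesClosure:
        # A plain function *is* a descriptor, so it gets bound on
        # attribute access and the instance becomes an extra argument.
        add_one = partial_closure(add, 1)

    print(UsesCPartial().add_one(2))   # 3
    try:
        UsesClosure().add_one(2)
    except TypeError as exc:
        print(exc)   # add() takes 2 positional arguments but 3 were given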
While I'm planning to accept the patch that converts the pure Python
version to a full class that matches the semantics of the C version in
these areas as well as in its core behaviour, that last case is one
where the pure Python version merely exhibits different behaviour from
the C version, rather than failing outright.
Given that the issues that arose in this case weren't at all obvious
up front, what do folks think of the idea of updating PEP 399 to
explicitly prohibit class/function mismatches between accelerator
modules and their pure Python counterparts?
The rationale for making such a change is that when it comes to true
drop-in API compatibility, we have reasonable evidence that "they're
both callables" isn't sufficient once the complexities of real world
applications enter the picture.
Regards,
Nick.
--
Nick Coghlan | ncoghlan(a)gmail.com | Brisbane, Australia
On behalf of the Python development community and the Python 3.6 release
team, I'm happy to announce the availability of Python 3.6.0a3.
3.6.0a3 is the third of four planned alpha releases of Python 3.6,
the next major release of Python. During the alpha phase, Python 3.6
remains under heavy development: additional features will be added
and existing features may be modified or deleted. Please keep in mind
that this is a preview release and its use is not recommended for
production environments.
You can find Python 3.6.0a3 here:
https://www.python.org/downloads/release/python-360a3/
The next release of Python 3.6 will be 3.6.0a4, currently scheduled for
2016-08-15.
--Ned
--
Ned Deily
nad(a)python.org -- []
Hi,
I am looking into how the Python compiler generates basic blocks during the CFG generation process, and my expectations from CFG theory seem to be at odds with how the Python compiler actually generates its CFG. Take the following code snippet for example:
def median(pool):
    copy = sorted(pool)
    size = len(copy)
    if size % 2 == 1:
        return copy[(size - 1) // 2]
    else:
        return (copy[size // 2 - 1] + copy[size // 2]) / 2
From my understanding of basic blocks in compilers, the above code snippet should have at least 3 basic blocks as follows:
1. Block 1 - all instructions up to and including those for the if test.
2. Block 2 - all instructions for the if body, i.e. the first return statement.
3. Block 3 - instructions for the else block i.e. the second return statement.
My understanding of the section on Control Flow Graphs in the “Design of the CPython Compiler” also alludes to this -
>> As an example, consider an ‘if’ statement with an ‘else’ block. The guard on the ‘if’ is a basic block which is pointed to by the basic block containing the code leading to the ‘if’ statement. The ‘if’ statement block contains jumps (which are exit points) to the true body of the ‘if’ and the ‘else’ body (which may be NULL), each of which are their own basic blocks. Both of those blocks in turn point to the basic block representing the code following the entire ‘if’ statement.
The CPython compiler, however, seems to generate 2 basic blocks for the above snippet -
1. Block 1 - all instructions up to and including the if statement and the body of the if statement (the first return statement in this case)
2. Block 2 - instructions for the else block (the second return statement)
Is there any reason for this or have I somehow missed something in the CFG generation process?
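For reference, here is a small way to observe what the compiler
actually emitted (just the standard dis module; the disassembly shows a
single conditional jump for the test and two separate return paths):

    import dis

    def median(pool):
        copy = sorted(pool)
        size = len(copy)
        if size % 2 == 1:
            return copy[(size - 1) // 2]
        else:
            return (copy[size // 2 - 1] + copy[size // 2]) / 2

    # The disassembly shows the conditional jump for the `if` test and
    # the two RETURN_VALUE paths as laid out by the compiler.
    dis.dis(median)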
Regards,
Obi