Mailman 3 September 2014 - Python-ideas

The stdlib++ user experience (Was: Introduce `start=1` argument to `math.factorial`)
by Paul Moore Sept. 19, 2014

On 18 September 2014 18:01, Andrew Barnert <abarnert(a)yahoo.com.dmarc.invalid> wrote:
> On Sep 17, 2014, at 23:15, Nick Coghlan <ncoghlan(a)gmail.com> wrote:
>
>> However, now that CPython ships with pip by default, we may want to
>> consider providing more explicit pointers to such "If you want more
>> advanced functionality than the standard library provides" libraries.
>
> I love this idea, but there's one big potential problem, and one smaller one.
>
> Many of the most popular and useful packages require C extensions. In itself,
> that doesn't have to be a problem; if you provide wheels for the official 3.4+ Win32,
> Win64, and Mac64 CPython builds, it can still be as simple as `pip install spam`
> for most users, including the ones with the least ability to figure it out for themselves.

OK, the key thing to look at here is the user experience for someone who has Python installed, and has a job to do, but needs to branch out into external packages because the stdlib doesn't provide enough functionality.

To make this example concrete, I'll focus on a specific use case, which I believe is relatively common, although I can't back this up with hard data. Assume:

* A user who is comfortable with Python, or with scripting languages in general
* No licensing or connectivity issues to worry about
* An existing manual process that the user wants to automate

In my line of work, this constitutes the vast bulk of Python use - informal, simple automation scripts.

So I'm writing this script, and I discover I need to do something that the stdlib doesn't cover, but I feel like it should be available "out there", and it's sufficiently fiddly that I'd prefer not to write it myself. Examples I've come across in the past:

* A console progress bar
* Scraping some data off a web page
* Writing data into an Excel spreadsheet with formatting
* Querying an Oracle database

Every time an issue like this comes up, I know that I'm looking to do "pip install XXX".
It's working out what XXX is that's the problem.

So I go and ask Google. A quick check on the progress bar case gets me to a StackOverflow article that offers me a lot of "write it yourself" solutions, and pointers to a couple of libraries. Further down there are a few pointers to python-progressbar, which was mentioned in the StackOverflow article, which in turn leads me to the PyPI page for it. The latest version (2.3-dev) is not hosted on PyPI, so I hit all the fun of --allow-external. Ironically, "pip install tqdm" gives me what I want instantly. But it never came up via Google.

The rest of the cases are similar: lots of Google searching, often combined with evaluating multiple options, followed by more or less pain installing the software. Things that aren't Python 3 or Windows compatible suck me into the "shall I patch it and submit a PR" minefield.

For the last case (an Oracle driver), where I need a C extension and access to external libraries, ironically it's pretty easy. There's no real competition to cx_Oracle, and the PyPI page has what I need, although they ship wininst exes rather than wheels, which means I need to do a download, then a wheel convert, then a pip install. So it's not ideal, but doable.

From this example, I'd like to see the following improvements to the process:

1. Somewhere I can go to find useful modules, that's better than Google.
2. Someone else choosing the "best option" - I don't want to evaluate 3 different progressbar modules, I just want to write "57% complete" and a few dots!
3. C extensions aren't a huge problem to me on Windows, although I'm looking forward to the day when everyone distributes wheels (wheel convert is good enough for now though). [1]
4. Much more community pressure for projects to host their code on PyPI. Some projects have genuine issues with hosting on PyPI, and there are changes being looked at to support them, but for most projects it seems to just be history and inertia.

[1] A Linux/OS X user might have more issues with C extensions.

Maybe this can't be solved in any meaningful sense, and maybe it's not something the "Python project" should take responsibility for, but without any doubt, it's the single most significant improvement that could be made to my experience with PyPI.

Paul.

PS I should also note that even in its current state, PyPI is streets ahead of the 3rd party module story I've experienced for any other language - C/C++, Lua, Powershell, and Java are all far worse. Perl/CPAN may be as good or better; it's so long since I used Perl that I don't really know these days.


Stop displaying elements of bytes objects as printable ASCII characters in CPython 3
by Chris Lasher Sept. 19, 2014

Why did the CPython core developers decide to force the display of ASCII characters in the printable representation of bytes objects in CPython 3? For example:

>>> import struct
>>> # In go bytes for four floats:
>>> my_packed_bytes = struct.pack('ffff', 3.544294848931151e-12, 1.853266900760489e+25, 1.6215185358725202e-19, 0.9742483496665955)
>>> # And out comes a speciously human-readable representation of those bytes
>>> my_packed_bytes
b'Why, Guido? Why?'
>>>
>>> # But it's just an illusion; it's truly bytes underneath!
>>> a_reasonable_representation = bytes((0x57, 0x68, 0x79, 0x2c, 0x20, 0x47, 0x75, 0x69, 0x64, 0x6f, 0x3f, 0x20, 0x57, 0x68, 0x79, 0x3f))
>>> my_packed_bytes == a_reasonable_representation
True
>>>
>>> this_also_seems_reasonable = b'\x57\x68\x79\x2c\x20\x47\x75\x69\x64\x6f\x3f\x20\x57\x68\x79\x3f'
>>> my_packed_bytes == this_also_seems_reasonable
True

I understand bytes literals were brought in to Python 3 to aid the transition from Python 2 to Python 3 [1], but this did not imply that `repr()` on a bytes object ought to display bytes mapping to ASCII characters as ASCII characters. I have not yet found a PEP describing why this decision was made. I am now seeking to put forth a PEP to change the printable representation of bytes to be simple, consistent, and easy to understand.

The current behavior, printing elements of bytes that map to printable ASCII characters as those characters, seems to violate multiple tenets of the Zen of Python [2]:

* "Explicit is better than implicit." This display happens without the user's explicit request.
* "Simple is better than complex." The printable representation of bytes is complex, surprising, and unintuitive: Elements of bytes shall be displayed as their hexadecimal value, unless such a value maps to a printable ASCII character, in which case, the character shall be displayed instead of the hexadecimal value. The underlying values of each element, however, are always integers. The printable representation of an element of a bytes object is always an integer representation. The simple thing is to show the hex value for every byte, unconditionally.
* "Special cases aren't special enough to break the rules." Implicit decoding of bytes to ASCII characters comes in handy only some of the time.
* "In the face of ambiguity, refuse the temptation to guess." Python is guessing that I want to see some bytes as ASCII characters. In the example above, though, what I gave Python was bytes from four floating point numbers.
* "There should be one-- and preferably only one --obvious way to do it." `bytes.decode('ascii', errors='backslashreplace')` already provides users the means to display ASCII characters among bytes, as a real string.

To be fair, there are two tenets of the Zen of Python that support the implicit display of ASCII characters in bytes:

* "Readability counts."
* "Although practicality beats purity."

In counterargument, though, I would say that the extra readability and practicality are boosted only in special cases (which are not special enough).

Much ado was (and continues to be) raised over Python 3 enforcing distinction between (Unicode) strings and bytes. A lot of this resentment comes from Python programmers who do not yet appreciate the difference between bytes and text†, or from those who remain apathetic and prefer Python 2's it-works-'til-it-doesn't strings. This implicit displaying of ASCII characters in bytes ends up conflating the two data types even deeper in novice programmers' minds. In the example above, `my_packed_bytes` looks like a string. It reads like a string. But it is not a string. The ASCII characters are a lie, as evidenced when trying to access elements of a bytes instance:

>>> b'Why, Guido? Why?'[0]
87
>>> # Oh, perhaps you were expecting b'W'?

I find this behavior harmful to Python 3 advocacy, and novices and those accustomed to Python 2 find this yet another deterrent in the way of Python 3 adoption.
I would like to gauge the feasibility of a PEP to change the printable representation of bytes in CPython 3 to display all elements by their hexadecimal values, and only by their hexadecimal values.

Thanks,
Chris L.

† I write this as someone who, himself, neither appreciated nor understood the difference between bytes, strings, and Unicode. I have Ned Batchelder [3] and his illuminating "Pragmatic Unicode" presentation [4] to thank for getting me on the right path.

[1]: http://legacy.python.org/dev/peps/pep-3112/#rationale
[2]: http://legacy.python.org/dev/peps/pep-0020/
[3]: http://nedbatchelder.com/
[4]: http://nedbatchelder.com/text/unipain.html
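
For readers who want to see the two displays side by side, here is a minimal sketch of the all-hex form the post argues for, written as an ordinary helper rather than a change to `repr()`. The name `hex_repr` is hypothetical and not an existing API, and the final decoding step assumes Python 3.5 or later, where the `backslashreplace` handler is accepted for decoding.

```python
def hex_repr(data):
    """Return a bytes-literal-style string with every byte as a \\xNN escape."""
    return "b'" + "".join("\\x{:02x}".format(byte) for byte in data) + "'"


data = b"Why, Guido? Why?"

# The built-in repr shows printable ASCII characters as characters.
builtin_form = repr(data)

# The hypothetical helper shows every byte as hex, unconditionally.
hex_form = hex_repr(data)

# Indexing a bytes object yields an int, while slicing yields bytes.
first_element = data[0]
first_slice = data[0:1]

# Explicit decoding to text (assumes Python 3.5+ for backslashreplace on decode).
decoded = b"Why\xff".decode("ascii", errors="backslashreplace")
```

The helper's output is still a valid bytes literal, so evaluating it gives back an object equal to the original; only the display changes.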


Make `float('inf') //1 == float('inf')`
by Ram Rachum Sept. 17, 2014

Please see this discussion on python-list: https://groups.google.com/forum/#!topic/comp.lang.python/maDZoc-n4bA

Currently `float('inf') // 1` evaluates to NaN. I think that this is really weird. If I understand correctly, it's to maintain the invariant `div*y + mod == x`.

The question is, do we really care more about maintaining this invariant than about providing a mathematically reasonable value for floor division?

Thanks,
Ram.
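
The behaviour under discussion can be reproduced in a few lines. The sketch below first records what CPython returns, then shows one possible shape for the requested behaviour as a plain function. `floor_div` is a hypothetical helper for illustration only, not a proposed implementation.

```python
import math

inf = float("inf")

# Current behaviour: floor division of infinity gives NaN, not infinity.
current = inf // 1

# divmod reports NaN for both the quotient and the remainder.
quotient, remainder = divmod(inf, 1)


def floor_div(x, y):
    """Hypothetical floor division that keeps infinities (illustration only)."""
    if math.isinf(x) and not math.isinf(y) and y != 0:
        # True division already yields a correctly signed infinity.
        return x / y
    return x // y
```

With this helper, `floor_div(inf, 1)` is `inf`, while ordinary finite operands still go through `//` unchanged. The price is that `div*y + mod == x` can no longer hold for infinite `x`, since the remainder stays NaN.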


Re: [Python-ideas] float comparison in doctest
by Kevin Davies Sept. 17, 2014

It seems that this didn't reach the list directly (see https://mail.python.org/pipermail/python-ideas/2014-August/028956.html), so I'm resending:

Erik Bray (the author of the +FLOAT_CMP extension in Astropy), Bruce Leban, and I had a short off-thread email discussion. Here are the points:

- [Bruce]: ALMOST_EQUAL is the best flag name.
- [Erik]: If there's agreement on this, Erik will develop a patch as soon as he can.
- [Erik]: There's no way to adjust the tolerance because there seems to be …
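
As a rough illustration of what such a flag could do, the sketch below registers an option flag named `ALMOST_EQUAL` and uses it in a custom `doctest.OutputChecker`. This is not Erik's patch or Astropy's +FLOAT_CMP implementation. The class name `AlmostEqualChecker`, the number pattern, and the fixed tolerance are all assumptions made for the example.

```python
import doctest
import math
import re

# Hypothetical flag, named after the suggestion in the thread; doctest does
# not define it, so it is registered here purely for illustration.
ALMOST_EQUAL = doctest.register_optionflag("ALMOST_EQUAL")

# Matches plain decimal numbers such as 3, 0.25, or 1.5e-7.
_NUMBER = re.compile(r"[-+]?(?:\d+\.\d*|\.\d+|\d+)(?:[eE][-+]?\d+)?")


class AlmostEqualChecker(doctest.OutputChecker):
    """Accept output whose numbers differ from the expected ones only slightly."""

    def check_output(self, want, got, optionflags):
        # Exact (or otherwise already accepted) matches pass as usual.
        if super().check_output(want, got, optionflags):
            return True
        if not optionflags & ALMOST_EQUAL:
            return False
        # The non-numeric text must match once every number is masked out.
        if _NUMBER.sub("#", want).split() != _NUMBER.sub("#", got).split():
            return False
        wanted = [float(n) for n in _NUMBER.findall(want)]
        actual = [float(n) for n in _NUMBER.findall(got)]
        # Fixed tolerance: an assumption of this sketch, not a thread decision.
        return all(
            math.isclose(w, g, rel_tol=1e-9, abs_tol=1e-12)
            for w, g in zip(wanted, actual)
        )
```

A checker like this can be handed to `doctest.DocTestRunner(checker=AlmostEqualChecker(), optionflags=ALMOST_EQUAL)`. Whether the tolerance should be adjustable is exactly the open point raised above.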


Fwd: Yielding from the command line
by Martin Teichmann Sept. 17, 2014

Hi Andrew, Hi List,

> [ some discussion about calling yield from from the command line skipped ]
>
> I would love to see this. I'm not sure if I'd love it in practice or not, but until
> someone implements it and I can play with it I'm not sure how I'd become sure.
>
> So... You just volunteered, right? Go build it and put it on PyPI, I want it and
> I'll be your best friend forever and ever no takebacks if you do it. :)

Well, so I did, I wrote an IPython extension that …


Special-case 3.x 'print x' SyntaxError
by Terry Reedy Sept. 15, 2014

One of the problems with new Python programmers using 3.x is that they first read 'print x' in 2.x based material, try 'print x' in 3.x, get "SyntaxError: invalid syntax" (note the uninformative redundant message), and go "huh?" or worse. Would it be possible to detect this particular error and print a more useful message? I am thinking of something like

SyntaxError: calling the 'print' function requires ()s, as in "print(x)"

or maybe

SyntaxError: did you mean "print(...)"…
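
The idea can be tried out at user level, without touching the parser. The sketch below compiles a line and swaps in the friendlier message when the source looks like a Python 2 print statement. The function name `explain_syntax_error` and the regular expression are hypothetical and only a heuristic; a real fix would live in the compiler.

```python
import re

# Heuristic for a Python 2 style print statement such as "print x".
# The pattern is an assumption of this sketch, not CPython's detection logic.
_PRINT_STATEMENT = re.compile(r"^\s*print\s+[^\s(=]")

_PRINT_HINT = (
    "SyntaxError: calling the 'print' function requires ()s, "
    'as in "print(x)"'
)


def explain_syntax_error(source):
    """Compile one line; return a message for a syntax error, else None."""
    try:
        compile(source, "<sketch>", "exec")
    except SyntaxError as exc:
        if _PRINT_STATEMENT.match(source):
            return _PRINT_HINT
        # Any other syntax error keeps the interpreter's own message.
        return "SyntaxError: {}".format(exc.msg)
    return None
```

For example, `explain_syntax_error("print x")` returns the hint text, while `explain_syntax_error("print(x)")` returns `None` because the line compiles.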


Bring namedtuple's __str__ and __repr__ behavior to regular classes
by John Wong Sept. 15, 2014

Hi,

>>> from collections import namedtuple
>>> A = namedtuple("A", ["foo"])
>>> print(A(foo=1))
A(foo=1)
>>> str(A(foo=1))
'A(foo=1)'
>>> repr(A(foo=1))
'A(foo=1)'

The relevant code is https://hg.python.org/cpython/file/2.7/Lib/collections.py#l356

I propose we bring the behavior to regular classes. Instead of

>>> class A(object):
...     def __init__(self):
...         self.foo = 1
...
>>> repr(A())
'<__main__.A object …
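
One way to get this behaviour today, without changing `object`, is a small mixin. The sketch below is only an illustration of the requested output format. `ReprMixin` is a hypothetical name, and building the repr from `vars(self)` is an assumption that ignores `__slots__` and properties.

```python
class ReprMixin(object):
    """Hypothetical mixin giving instances a namedtuple-style repr."""

    def __repr__(self):
        # Sort attribute names so the output is deterministic.
        fields = ", ".join(
            "{}={!r}".format(name, value)
            for name, value in sorted(vars(self).items())
        )
        return "{}({})".format(type(self).__name__, fields)


class A(ReprMixin):
    def __init__(self):
        self.foo = 1
```

Because `str()` falls back to `__repr__` when no `__str__` is defined, both `str(A())` and `repr(A())` give `'A(foo=1)'`, matching the namedtuple output shown above.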


Re: [Python-ideas] Yielding from the command line
by Martin Teichmann Sept. 14, 2014

Hi everyone,

since there seemed to be some interest in my idea of an asyncio-enabled command line, I just sat down and wrote it.

I submitted the parts that would need to go into CPython as Issue 22412 to the Python bug tracker.

I added a simple command line interpreter, based on code.InteractiveConsole, which will allow for uses like

>>> from asyncio import sleep
>>> yield from sleep(10)

The following code is mostly a copy of InteractiveConsole, with the appropriate yield froms stuck in (and comments removed. Yeah!)

Greetings

Martin

Code follows:

# NOTE: asyncio.input is not part of the standard library; it is presumably
# provided by the patch submitted as Issue 22412.
from asyncio import get_event_loop, coroutine, input
from code import InteractiveConsole
import sys


class AsyncConsole(InteractiveConsole):
    def __init__(self, locals=None, filename="<console>"):
        super().__init__(locals, filename)
        # Compiler flag that presumably comes with the same patch; it lets
        # "yield from" appear at the top level of the compiled source.
        self.compile.compiler.flags |= 0x1000

    @coroutine
    def runsource(self, source, filename="<input>", symbol="single"):
        try:
            code = self.compile(source, filename, symbol)
        except (OverflowError, SyntaxError, ValueError):
            self.showsyntaxerror(filename)
            return False
        if code is None:
            return True
        yield from self.runcode(code)
        return False

    @coroutine
    def runcode(self, code):
        try:
            yield from eval(code, self.locals)
        except SystemExit:
            raise
        except:
            self.showtraceback()

    @coroutine
    def push(self, line):
        self.buffer.append(line)
        source = "\n".join(self.buffer)
        more = yield from self.runsource(source, self.filename)
        if not more:
            self.resetbuffer()
        return more

    @coroutine
    def interact(self, banner=None):
        try:
            sys.ps1
        except AttributeError:
            sys.ps1 = ">>> "
        try:
            sys.ps2
        except AttributeError:
            sys.ps2 = "... "
        cprt = 'Type "help", "copyright", "credits" or "license" for more information.'
        if banner is None:
            self.write("Python %s on %s\n%s\n(%s)\n" %
                       (sys.version, sys.platform, cprt,
                        self.__class__.__name__))
        elif banner:
            self.write("%s\n" % str(banner))
        more = 0
        while 1:
            try:
                if more:
                    prompt = sys.ps2
                else:
                    prompt = sys.ps1
                try:
                    line = yield from input(prompt)
                except EOFError:
                    self.write("\n")
                    break
                else:
                    more = yield from self.push(line)
            except KeyboardInterrupt:
                self.write("\nKeyboardInterrupt\n")
                self.resetbuffer()
                more = 0
            except SystemExit:
                return


if __name__ == "__main__":
    console = AsyncConsole()
    get_event_loop().run_until_complete(console.interact())
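
The code above depends on the unmerged patch, so it cannot be run on a stock interpreter. For comparison, here is a minimal sketch of the same idea using a different mechanism that exists in Python 3.8 and later: the `ast.PyCF_ALLOW_TOP_LEVEL_AWAIT` compile flag, with `await` in place of `yield from`. This is not Martin's patch. The helper name `run_async_source` is hypothetical, and the sketch runs one snippet rather than a full interactive console.

```python
import ast
import asyncio
import inspect


def run_async_source(source, namespace):
    """Run source that may use top-level await (sketch, Python 3.8+ only)."""
    code = compile(
        source, "<sketch>", "exec", flags=ast.PyCF_ALLOW_TOP_LEVEL_AWAIT
    )
    # With the flag set, code containing await evaluates to a coroutine
    # object instead of running immediately.
    result = eval(code, namespace)
    if inspect.iscoroutine(result):
        asyncio.run(result)
    return namespace


namespace = {"asyncio": asyncio}
run_async_source("await asyncio.sleep(0)\nx = 41 + 1", namespace)
```

After the call, `namespace["x"]` holds the value assigned by the snippet. Since Python 3.8, running `python -m asyncio` gives an interactive prompt that accepts top-level `await` directly.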


Re: [Python-ideas] Yielding from the command line
by Martin Teichmann Sept. 13, 2014

Hi Terry, Hi List,

> I presume full behavior requires the call to root.mainloop(). This has two
> problems for continued interaction. First, the call blocks until the window
> is closed, making further entry impossible through normal means. If that
> were solved with a 'noblock' option, there would still be the problem of
> getting shell input to a callback that could, on demand, execute to code to
> modify the tk app. The solution would have to be different for the …


Yielding from the command line
by Martin Teichmann Sept. 11, 2014

Hi List,

I'm currently trying to convince my company that asyncio is a great thing. After a lot of critique, the newest thing is, people complain: I cannot test my code on the command line! And indeed they are right, a simple

a = yield from some_coroutine()

is not possible on the command line, and doesn't make sense. Wait a minute, really? Well, it could make sense, in an asyncio-based command line. I am thinking about a python interpreter whose internal loop is something like

@…
