From starsareblueandfaraway at gmail.com Thu May 5 16:37:04 2011 From: starsareblueandfaraway at gmail.com (Roy Hyunjin Han) Date: Thu, 5 May 2011 10:37:04 -0400 Subject: [Python-ideas] [Python-Dev] What if replacing items in a dictionary returns the new dictionary? In-Reply-To: References: <20110429143406.GA441@iskra.aviel.ru>

Message-ID: >> 2011/4/29 Roy Hyunjin Han : >> It would be convenient if replacing items in a dictionary returns the >> new dictionary, in a manner analogous to str.replace(). What do you >> think? >> >> # Current behavior >> x = {'key1': 1} >> x.update(key1=3) == None >> x == {'key1': 3} # Original variable has changed >> >> # Possible behavior >> x = {'key1': 1} >> x.replace(key1=3) == {'key1': 3} >> x == {'key1': 1} # Original variable is unchanged >> > 2011/5/5 Giuseppe Ottaviano : > In general nothing stops you to use a proxy object that returns itself > after each method call, something like > > class using(object): > def __init__(self, obj): > self._wrappee = obj > > def unwrap(self): > return self._wrappee > > def __getattr__(self, attr): > def wrapper(*args, **kwargs): > getattr(self._wrappee, attr)(*args, **kwargs) > return self > return wrapper > > > d = dict() > print using(d).update(dict(a=1)).update(dict(b=2)).unwrap() > # prints {'a': 1, 'b': 2} > l = list() > print using(l).append(1).append(2).unwrap() > # prints [1, 2] Cool! I never thought of that. That's a great snippet. I'll forward this to the python-ideas list. I don't think the python-dev people want this discussion to continue on their mailing list. From starsareblueandfaraway at gmail.com Thu May 5 16:42:57 2011 From: starsareblueandfaraway at gmail.com (Roy Hyunjin Han) Date: Thu, 5 May 2011 10:42:57 -0400 Subject: [Python-ideas] [Python-Dev] What if replacing items in a dictionary returns the new dictionary? In-Reply-To: References: <20110429143406.GA441@iskra.aviel.ru>

Message-ID: >> ? ?# Possible behavior >> ? ?x = {'key1': 1} >> ? ?x.replace(key1=3) == {'key1': 3} >> ? ?x == {'key1': 1} # Original variable is unchanged >> > 2011/5/5 Giuseppe Ottaviano : > class using(object): > ? ?def __init__(self, obj): > ? ? ? ?self._wrappee = obj > > ? ?def unwrap(self): > ? ? ? ?return self._wrappee > > ? ?def __getattr__(self, attr): > ? ? ? ?def wrapper(*args, **kwargs): > ? ? ? ? ? ?getattr(self._wrappee, attr)(*args, **kwargs) > ? ? ? ? ? ?return self > ? ? ? ?return wrapper The only thing I would add is obj.copy(), to ensure that the original dictionary is unchanged. class using(object): def __init__(self, obj): self._wrappee = obj.copy() From starsareblueandfaraway at gmail.com Thu May 5 17:19:16 2011 From: starsareblueandfaraway at gmail.com (Roy Hyunjin Han) Date: Thu, 5 May 2011 11:19:16 -0400 Subject: [Python-ideas] [Python-Dev] What if replacing items in a dictionary returns the new dictionary? In-Reply-To: References: <20110429143406.GA441@iskra.aviel.ru>

Message-ID: 2011/5/5 Giuseppe Ottaviano : >> The only thing I would add is obj.copy(), to ensure that the original >> dictionary is unchanged. >> >> class using(object): >> ? ?def __init__(self, obj): >> ? ? ? ?self._wrappee = obj.copy() > > My example was just a proof of concept, there are many other things > that may need to be taken care of (for example, non-callable > attributes). > BTW, the copy should be done outside. If the object is copied, I'd say > "using" is a poor choice of name for the proxy. You're right, I would need to do more work to get it to mimic the underlying object. I think I will stick with Oleg's suggestion to subclass dict for now; it's great for unit tests. Thanks for the idea, though. class ReplaceableDict(dict): def replace(self, **kwargs): 'Works for replacing string-based keys' return dict(self.items() + kwargs.items()) From moloney at ohsu.edu Thu May 5 23:41:06 2011 From: moloney at ohsu.edu (Brendan Moloney) Date: Thu, 5 May 2011 14:41:06 -0700 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> Message-ID: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> Hello, I posted this on python-dev, but was told that this is the more appropriate list. Currently if I do: $ import pkg Then all of the public subpackages/submodules are not automatically pulled into the 'pkg' namespace. I can do: $ from pkg import * To get all of the public subpackages/submodules, but that dumps them all into the current namespace. Why not allow: $ import pkg.* This would allow easier interactive use (by eliminating the need to import individual subpackages/submodules) while keeping the 'pkg' namespace around. Thanks, Brendan Moloney From benjamin at python.org Fri May 6 00:00:35 2011 From: benjamin at python.org (Benjamin Peterson) Date: Thu, 5 May 2011 22:00:35 +0000 (UTC) Subject: [Python-ideas] Allow 'import star' with namespaces References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> Message-ID: Brendan Moloney writes: > This would allow easier interactive use (by eliminating the need to import individual > subpackages/submodules) while keeping the 'pkg' namespace around. import * is generally frowned upon, so encouraging its use by extending it is not a good idea. From moloney at ohsu.edu Fri May 6 00:24:16 2011 From: moloney at ohsu.edu (Brendan Moloney) Date: Thu, 5 May 2011 15:24:16 -0700 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu>, Message-ID: <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> Benjamin Peterson [benjamin at python.org] wrote: > import * is generally frowned upon, so encouraging its use by extending it is > not a good idea. Well it is frowned upon precisely because it pollutes the current namespace. This change would eliminate that issue. From dag.odenhall at gmail.com Fri May 6 09:20:26 2011 From: dag.odenhall at gmail.com (dag.odenhall at gmail.com) Date: Fri, 6 May 2011 09:20:26 +0200 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> Message-ID: On 6 May 2011 00:24, Brendan Moloney wrote: > Benjamin Peterson [benjamin at python.org] wrote: >> import * is generally frowned upon, so encouraging its use by extending it is >> not a good idea. > > Well it is frowned upon precisely because it pollutes the current namespace. This change would eliminate that issue. I like this idea, except it's inconsistent with from-import-star, the latter which does *not* get you sub-packages or modules. From g.brandl at gmx.net Fri May 6 09:44:02 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Fri, 06 May 2011 09:44:02 +0200 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> Message-ID: On 06.05.2011 09:20, dag.odenhall at gmail.com wrote: > On 6 May 2011 00:24, Brendan Moloney wrote: >> Benjamin Peterson [benjamin at python.org] wrote: >>> import * is generally frowned upon, so encouraging its use by extending it is >>> not a good idea. >> >> Well it is frowned upon precisely because it pollutes the current namespace. This change would eliminate that issue. > > I like this idea, except it's inconsistent with from-import-star, the > latter which does *not* get you sub-packages or modules. And that's for a reason: it's not easy (I think it's even impossible, because for example individual submodules can change __path__) to determine all importable submodules of a package. So ``import pkg.*`` would not have any behavior other than ``import pkg``. Georg From matt at whoosh.ca Fri May 6 19:51:24 2011 From: matt at whoosh.ca (Matt Chaput) Date: Fri, 06 May 2011 13:51:24 -0400 Subject: [Python-ideas] 1_000_000 Message-ID: <4DC4351C.2000109@whoosh.ca> Not sure if this has been proposed before: A syntax change to allow underscores as thousands separators in literal numbers to improve readability, e.g.: for i in range(1, 1_000_000): pass I believe D allows this and while it's a small thing it really is much more readable. Worth a PEP? Thanks, Matt From janssen at parc.com Fri May 6 21:11:59 2011 From: janssen at parc.com (Bill Janssen) Date: Fri, 6 May 2011 12:11:59 PDT Subject: [Python-ideas] thoughts on regular expression improvements Message-ID: <98999.1304709119@parc.com> I've been doing a lot of RE hacking lately, and some possible improvements suggest themselves. 1. Multiple occurrences of a named group Right now, you can compose RE's with x = re.compile("...") y = re.compile("..." + x.pattern + "...") But if x contains named groups, you run into trouble if you have something like z = re.compile("..." + x.pattern + "..." + x.pattern + "...") which can easily happen if x could occur at various places in z. The issue is that a named group is only allowed once, which isn't a bad error-prevention mechanism, but it would be nice if it could occur more than once (in alternative subexpressions), perhaps enabled by a another RE flag. 2. Easier composition. Writing y = re.compile("..." + x.pattern + "...") seems a tad groty, to use a term from my childhood, and affords the RE engine no purchase on the composition, which can be an issue if the flags for x are different from the flags for y. If the first argument to re.compile could be a tuple or list, you could write y = re.compile(["...", x, "..."]) and the engine could see that "..." is a string, and that x is a RE, and could inspect x as necessary. 3. Edit distances. The RE engine TRE (http://laurikari.net/tre/about/) supports fuzzy matching of strings, using edit distances. One can write an expression like "(total){~2}" which would any string that's "total" with no more than two edit errors. You can also specify insertions, deletions, and substitution limits separately with "+", "-", and "#". That would be nice to have... Bill From moloney at ohsu.edu Fri May 6 21:49:08 2011 From: moloney at ohsu.edu (Brendan Moloney) Date: Fri, 6 May 2011 12:49:08 -0700 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> , Message-ID: <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> dag.odenhall at gmail.com wrote: > I like this idea, except it's inconsistent with from-import-star, the > latter which does *not* get you sub-packages or modules. Georg Brandl [g.brandl at gmx.net] wrote: > And that's for a reason: it's not easy (I think it's even impossible, because > for example individual submodules can change __path__) to determine all > importable submodules of a package. > So ``import pkg.*`` would not have any behavior other than ``import pkg``. When I said all _public_ sub-packages and modules I was referring to those listed in the __all__ attribute of 'pkg'. Thus it would behave in the exact same way as from-import-star except you don't pollute the current namespace. Brendan From dirkjan at ochtman.nl Fri May 6 21:58:36 2011 From: dirkjan at ochtman.nl (Dirkjan Ochtman) Date: Fri, 6 May 2011 21:58:36 +0200 Subject: [Python-ideas] thoughts on regular expression improvements In-Reply-To: <98999.1304709119@parc.com> References: <98999.1304709119@parc.com> Message-ID: On Fri, May 6, 2011 at 21:11, Bill Janssen wrote: > I've been doing a lot of RE hacking lately, and some possible > improvements suggest themselves. Have you looked at the regex module? Cheers, Dirkjan From ethan at stoneleaf.us Fri May 6 22:12:00 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 06 May 2011 13:12:00 -0700 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> , <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> Message-ID: <4DC45610.3040803@stoneleaf.us> Brendan Moloney wrote: > dag.odenhall at gmail.com wrote: >> I like this idea, except it's inconsistent with from-import-star, the >> latter which does *not* get you sub-packages or modules. > > Georg Brandl [g.brandl at gmx.net] wrote: >> And that's for a reason: it's not easy (I think it's even impossible, because >> for example individual submodules can change __path__) to determine all >> importable submodules of a package. > >> So ``import pkg.*`` would not have any behavior other than ``import pkg``. > > When I said all _public_ sub-packages and modules I was referring to those > listed in the __all__ attribute of 'pkg'. Thus it would behave in the exact > same way as from-import-star except you don't pollute the current namespace. I'm not catching the vision -- could you put together a short example that would illustrate? ~Ethan~ From janssen at parc.com Fri May 6 22:28:12 2011 From: janssen at parc.com (Bill Janssen) Date: Fri, 6 May 2011 13:28:12 PDT Subject: [Python-ideas] thoughts on regular expression improvements In-Reply-To: References: <98999.1304709119@parc.com> Message-ID: <641.1304713692@parc.com> Dirkjan Ochtman wrote: > On Fri, May 6, 2011 at 21:11, Bill Janssen wrote: > > I've been doing a lot of RE hacking lately, and some possible > > improvements suggest themselves. > > Have you looked at the regex module? >From Python 1.4? Not in a long time... Bill From janssen at parc.com Fri May 6 22:32:18 2011 From: janssen at parc.com (Bill Janssen) Date: Fri, 6 May 2011 13:32:18 PDT Subject: [Python-ideas] thoughts on regular expression improvements In-Reply-To: References: <98999.1304709119@parc.com> Message-ID: <818.1304713938@parc.com> Dirkjan Ochtman wrote: > On Fri, May 6, 2011 at 21:11, Bill Janssen wrote: > > I've been doing a lot of RE hacking lately, and some possible > > improvements suggest themselves. > > Have you looked at the regex module? Ah, you mean the PyPI "regex". Looks like it has "branch reset", which might support my #1? Using the same group name multiple times? I don't see fuzzy matches, or support for composition, though. Bill From jsbueno at python.org.br Fri May 6 22:42:53 2011 From: jsbueno at python.org.br (Joao S. O. Bueno) Date: Fri, 6 May 2011 17:42:53 -0300 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: <4DC45610.3040803@stoneleaf.us> References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us> Message-ID: On Fri, May 6, 2011 at 5:12 PM, Ethan Furman wrote: > Brendan Moloney wrote: >> >> dag.odenhall at gmail.com wrote: >>> >>> I like this idea, except it's inconsistent with from-import-star, the >>> latter which does *not* get you sub-packages or modules. >> >> Georg Brandl [g.brandl at gmx.net] wrote: >>> >>> And that's for a reason: it's not easy (I think it's even impossible, >>> because >>> for example individual submodules can change __path__) to determine all >>> importable submodules of a package. >> >>> So ``import pkg.*`` would not have any behavior other than ``import >>> pkg``. >> >> When I said all _public_ sub-packages and modules I was referring to those > >> listed in the ?__all__ attribute of 'pkg'. ?Thus it would behave in the >> exact >> same way as from-import-star except you don't pollute the current >> namespace. > > > I'm not catching the vision -- could you put together a short example that > would illustrate? The idea is to be able to do operate witha single import when submodules would have to be implicited imported - like xml.etree.ElementTree : [gwidion at powerpuff ~]$ python Python 2.6.1 (r261:67515, Apr 12 2009, 04:14:16) [GCC 4.3.2] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import xml >>> xml.etree Traceback (most recent call last): File "", line 1, in AttributeError: 'module' object has no attribute 'etree' >>> import xml.etree >>> xml.etree.ElementTree Traceback (most recent call last): File "", line 1, in AttributeError: 'module' object has no attribute 'ElementTree' >>> import xml.etree.ElementTree >>> xml.etree.ElementTree > > ~Ethan~ > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > From moloney at ohsu.edu Fri May 6 22:50:14 2011 From: moloney at ohsu.edu (Brendan Moloney) Date: Fri, 6 May 2011 13:50:14 -0700 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: <4DC45610.3040803@stoneleaf.us> References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> , <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu>, <4DC45610.3040803@stoneleaf.us> Message-ID: <5E25C96030E66B44B9CFAA95D3DE5919351310A7B5@EX-MB08.ohsu.edu> Ethan Furman [ethan at stoneleaf.us] wrote: > I'm not catching the vision -- could you put together a short example > that would illustrate? The motivation is really just for interactive usage (much like the current from-import-star). If 'pkg' contains a number of sub-packages/modules that take a while to import, it makes sense to not automatically import them into the 'pkg' namespace (in the pkg.__init__ module). Putting the sub-package/module names into the __all__ list gives interactive users the ability to import everything in one go using from-import-star. Unfortunately the from-import-star usage pollutes the current namespace, and thus its use is discouraged. So really the vision is that developers can make their packages convenient for interactive use (by setting the __all__ attribute) without requiring users to use a discouraged language feature or making regular import of the package slow. Brendan From ericsnowcurrently at gmail.com Fri May 6 22:52:09 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Fri, 6 May 2011 14:52:09 -0600 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: <4DC45610.3040803@stoneleaf.us> References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us> Message-ID: On Fri, May 6, 2011 at 2:12 PM, Ethan Furman wrote: > Brendan Moloney wrote: > >> dag.odenhall at gmail.com wrote: >> >>> I like this idea, except it's inconsistent with from-import-star, the >>> latter which does *not* get you sub-packages or modules. >>> >> >> Georg Brandl [g.brandl at gmx.net] wrote: >> >>> And that's for a reason: it's not easy (I think it's even impossible, >>> because >>> for example individual submodules can change __path__) to determine all >>> importable submodules of a package. >>> >> >> So ``import pkg.*`` would not have any behavior other than ``import >>> pkg``. >>> >> >> When I said all _public_ sub-packages and modules I was referring to those >> > > listed in the __all__ attribute of 'pkg'. Thus it would behave in the > exact > > same way as from-import-star except you don't pollute the current > namespace. > > > I'm not catching the vision -- could you put together a short example that > would illustrate? > > He's saying that the package would be imported like normal. Then all "public" sub-modules of the package would automatically imported and bound to the namespace of the object that resulted from the import of the package. The trickery is that __all__ in the __init__.py would change meaning somewhat, and, do you bind the submodules into the package's module object or something else? If you have a list of the submodules you want imported then you can already accomplish this: import parent for mod in parent.__all_submodules__: __import__("parent.{}".format(mod)) Of course, this does not bind the submodules to the namespace of the package module, but I suppose you could try that with one more step. I am not sure of the specific import mechanism with regards to name binding, but that would seem to be a conflict with the way imported names for submodules are bound. -eric ~Ethan~ > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dag.odenhall at gmail.com Fri May 6 22:59:05 2011 From: dag.odenhall at gmail.com (dag.odenhall at gmail.com) Date: Fri, 6 May 2011 22:59:05 +0200 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: <4DC45610.3040803@stoneleaf.us> References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us> Message-ID: On 6 May 2011 22:12, Ethan Furman wrote: > Brendan Moloney wrote: >> >> dag.odenhall at gmail.com wrote: >>> >>> I like this idea, except it's inconsistent with from-import-star, the >>> latter which does *not* get you sub-packages or modules. >> >> Georg Brandl [g.brandl at gmx.net] wrote: >>> >>> And that's for a reason: it's not easy (I think it's even impossible, >>> because >>> for example individual submodules can change __path__) to determine all >>> importable submodules of a package. >> >>> So ``import pkg.*`` would not have any behavior other than ``import >>> pkg``. >> >> When I said all _public_ sub-packages and modules I was referring to those > >> listed in the ?__all__ attribute of 'pkg'. ?Thus it would behave in the >> exact >> same way as from-import-star except you don't pollute the current >> namespace. If you're going to require listing in __all__ anyway, you might as well use what already works: import the modules in the package, and you can then import the package and access the modules as attributes: pkg/__init__.py: from . import mod script.py: import pkg pkg.mod #=> pkg/mod.py From dag.odenhall at gmail.com Fri May 6 23:06:18 2011 From: dag.odenhall at gmail.com (dag.odenhall at gmail.com) Date: Fri, 6 May 2011 23:06:18 +0200 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC4351C.2000109@whoosh.ca> References: <4DC4351C.2000109@whoosh.ca> Message-ID: On 6 May 2011 19:51, Matt Chaput wrote: > Not sure if this has been proposed before: A syntax change to allow > underscores as thousands separators in literal numbers to improve > readability, e.g.: > > ?for i in range(1, 1_000_000): > ? ?pass > > I believe D allows this and while it's a small thing it really is much more > readable. Ruby too. You could also use e-notation[1]: 1e6, in your example. In many situations it's even more readable because you don't need to "count the zeros". This is already supported in Python. [1] http://en.wikipedia.org/wiki/Scientific_notation#E_notation From nadeem.vawda at gmail.com Fri May 6 23:23:05 2011 From: nadeem.vawda at gmail.com (Nadeem Vawda) Date: Fri, 6 May 2011 23:23:05 +0200 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> Message-ID: On Fri, May 6, 2011 at 11:06 PM, dag.odenhall at gmail.com wrote: > You could also use e-notation[1]: 1e6, in your example. 1e6 is a float, though. If you use it in that example, range() complains that its arguments must be integers. From solipsis at pitrou.net Fri May 6 23:24:07 2011 From: solipsis at pitrou.net (Antoine Pitrou) Date: Fri, 6 May 2011 23:24:07 +0200 Subject: [Python-ideas] 1_000_000 References: <4DC4351C.2000109@whoosh.ca> Message-ID: <20110506232407.2bd211a1@pitrou.net> On Fri, 6 May 2011 23:06:18 +0200 "dag.odenhall at gmail.com" wrote: > On 6 May 2011 19:51, Matt Chaput wrote: > > Not sure if this has been proposed before: A syntax change to allow > > underscores as thousands separators in literal numbers to improve > > readability, e.g.: > > > > ?for i in range(1, 1_000_000): > > ? ?pass > > > > I believe D allows this and while it's a small thing it really is much more > > readable. > > Ruby too. > > You could also use e-notation[1]: 1e6, in your example. In many > situations it's even more readable because you don't need to "count > the zeros". This is already supported in Python. Yes, but it gives a float, not an integer: >>> for i in range(0, 1e6): pass ... Traceback (most recent call last): File "", line 1, in TypeError: 'float' object cannot be interpreted as an integer Regards Antoine. From kirubakaran at gmail.com Fri May 6 23:25:56 2011 From: kirubakaran at gmail.com (Kirubakaran) Date: Fri, 6 May 2011 14:25:56 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: <20110506232407.2bd211a1@pitrou.net> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net> Message-ID: How about range(10**60) ? - Kirubakaran. On Fri, May 6, 2011 at 2:24 PM, Antoine Pitrou wrote: > On Fri, 6 May 2011 23:06:18 +0200 > "dag.odenhall at gmail.com" > wrote: > > On 6 May 2011 19:51, Matt Chaput < > matt-KKMwxO2wslj3fQ9qLvQP4Q at public.gmane.org> wrote: > > > Not sure if this has been proposed before: A syntax change to allow > > > underscores as thousands separators in literal numbers to improve > > > readability, e.g.: > > > > > > for i in range(1, 1_000_000): > > > pass > > > > > > I believe D allows this and while it's a small thing it really is much > more > > > readable. > > > > Ruby too. > > > > You could also use e-notation[1]: 1e6, in your example. In many > > situations it's even more readable because you don't need to "count > > the zeros". This is already supported in Python. > > Yes, but it gives a float, not an integer: > > >>> for i in range(0, 1e6): pass > ... > Traceback (most recent call last): > File "", line 1, in > TypeError: 'float' object cannot be interpreted as an integer > > > Regards > > Antoine. > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kirubakaran at gmail.com Fri May 6 23:26:14 2011 From: kirubakaran at gmail.com (Kirubakaran) Date: Fri, 6 May 2011 14:26:14 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net> Message-ID: (fixed typo) How about range(10**6) ? - Kirubakaran. On Fri, May 6, 2011 at 2:25 PM, Kirubakaran wrote: > How about range(10**60) ? > > - Kirubakaran. > > > On Fri, May 6, 2011 at 2:24 PM, Antoine Pitrou wrote: > >> On Fri, 6 May 2011 23:06:18 +0200 >> "dag.odenhall at gmail.com" >> wrote: >> > On 6 May 2011 19:51, Matt Chaput < >> matt-KKMwxO2wslj3fQ9qLvQP4Q at public.gmane.org> wrote: >> > > Not sure if this has been proposed before: A syntax change to allow >> > > underscores as thousands separators in literal numbers to improve >> > > readability, e.g.: >> > > >> > > for i in range(1, 1_000_000): >> > > pass >> > > >> > > I believe D allows this and while it's a small thing it really is much >> more >> > > readable. >> > >> > Ruby too. >> > >> > You could also use e-notation[1]: 1e6, in your example. In many >> > situations it's even more readable because you don't need to "count >> > the zeros". This is already supported in Python. >> >> Yes, but it gives a float, not an integer: >> >> >>> for i in range(0, 1e6): pass >> ... >> Traceback (most recent call last): >> File "", line 1, in >> TypeError: 'float' object cannot be interpreted as an integer >> >> >> Regards >> >> Antoine. >> >> >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matt at whoosh.ca Fri May 6 23:36:47 2011 From: matt at whoosh.ca (Matt Chaput) Date: Fri, 06 May 2011 17:36:47 -0400 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

Message-ID: <4DC469EF.5000408@whoosh.ca> On 06/05/2011 5:26 PM, Kirubakaran wrote: > (fixed typo) > How about range(10**6) ? Both 1e6 (if it worked in the example) and 10**6 both require a bit of work (at least for my non-mathematician brain) to decode as "1 million", whereas with 1_000_000 you're not so much counting the zeros in your head as counting the *groups* of zeros visually. For me it's much more readable at a glance. Also, obviously the 10**6 trick doesn't work so well if the example is: for i in range(47_284_345): pass Matt From kirubakaran at gmail.com Fri May 6 23:37:10 2011 From: kirubakaran at gmail.com (Kirubakaran) Date: Fri, 6 May 2011 14:37:10 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

Message-ID: Ah, thanks. Sorry, I don't know how I failed to see that. On Fri, May 6, 2011 at 2:30 PM, Andre Roberge wrote: > I believe that the original suggestion was meant to be more general than > the specific suggestions for powers of 10. For example, consider the > following hypothetical: > > for i in range(1, 1_111_111_111, 1024): > pass > > where the _ really helps in figuring out the size. > > Andr? > > > On Fri, May 6, 2011 at 6:26 PM, Kirubakaran wrote: > >> (fixed typo) >> How about range(10**6) ? >> >> - Kirubakaran. >> >> >> On Fri, May 6, 2011 at 2:25 PM, Kirubakaran wrote: >> >>> How about range(10**60) ? >>> >>> - Kirubakaran. >>> >>> >>> On Fri, May 6, 2011 at 2:24 PM, Antoine Pitrou wrote: >>> >>>> On Fri, 6 May 2011 23:06:18 +0200 >>>> "dag.odenhall at gmail.com" >>>> wrote: >>>> > On 6 May 2011 19:51, Matt Chaput < >>>> matt-KKMwxO2wslj3fQ9qLvQP4Q at public.gmane.org> wrote: >>>> > > Not sure if this has been proposed before: A syntax change to allow >>>> > > underscores as thousands separators in literal numbers to improve >>>> > > readability, e.g.: >>>> > > >>>> > > for i in range(1, 1_000_000): >>>> > > pass >>>> > > >>>> > > I believe D allows this and while it's a small thing it really is >>>> much more >>>> > > readable. >>>> > >>>> > Ruby too. >>>> > >>>> > You could also use e-notation[1]: 1e6, in your example. In many >>>> > situations it's even more readable because you don't need to "count >>>> > the zeros". This is already supported in Python. >>>> >>>> Yes, but it gives a float, not an integer: >>>> >>>> >>> for i in range(0, 1e6): pass >>>> ... >>>> Traceback (most recent call last): >>>> File "", line 1, in >>>> TypeError: 'float' object cannot be interpreted as an integer >>>> >>>> >>>> Regards >>>> >>>> Antoine. >>>> >>>> >>>> _______________________________________________ >>>> Python-ideas mailing list >>>> Python-ideas at python.org >>>> http://mail.python.org/mailman/listinfo/python-ideas >>>> >>> >>> >> >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From bruce at leapyear.org Fri May 6 23:38:19 2011 From: bruce at leapyear.org (Bruce Leban) Date: Fri, 6 May 2011 14:38:19 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

Message-ID: None of these answers address the original suggestion. Matt didn't say that he only wanted this for numbers of the form 10^N; he just gave that as an example. Consider these examples instead: - 1_234_000 - 9.876_543_210 - 0xFEFF_0042 I'm not advocating this change (nor against it); I just think the discussion should be focused on the actual idea. I do have a question: Is _ just ignored in numbers or are there more complex rules? - 1_2345_6789 (can I use groups of other sizes instead?) - 1_2_3_4_5 (ditto) - 1_234_6789 (do all the groups need to be the same size?) - 1_ (must the _ only be in between 2 digits?) - 1__234 (what about multiple _s?) - 9.876_543_210 (can it be used to the right of the decimal point?) - 0xFEFF_0042 (can it be used in hex, octal or binary numbers?) - int('123_456') (do other functions accept this syntax too?) --- Bruce Puzzazz newsletter: http://j.mp/puzzazz-news-2011-04 including April Fools! Blog post: http://www.vroospeak.com Ironically, a glaring Google grammatical error On Fri, May 6, 2011 at 2:26 PM, Kirubakaran wrote: > (fixed typo) > How about range(10**6) ? > > - Kirubakaran. > > > On Fri, May 6, 2011 at 2:25 PM, Kirubakaran wrote: > >> How about range(10**60) ? >> >> - Kirubakaran. >> >> >> On Fri, May 6, 2011 at 2:24 PM, Antoine Pitrou wrote: >> >>> On Fri, 6 May 2011 23:06:18 +0200 >>> "dag.odenhall at gmail.com" >>> wrote: >>> > On 6 May 2011 19:51, Matt Chaput < >>> matt-KKMwxO2wslj3fQ9qLvQP4Q at public.gmane.org> wrote: >>> > > Not sure if this has been proposed before: A syntax change to allow >>> > > underscores as thousands separators in literal numbers to improve >>> > > readability, e.g.: >>> > > >>> > > for i in range(1, 1_000_000): >>> > > pass >>> > > >>> > > I believe D allows this and while it's a small thing it really is >>> much more >>> > > readable. >>> > >>> > Ruby too. >>> > >>> > You could also use e-notation[1]: 1e6, in your example. In many >>> > situations it's even more readable because you don't need to "count >>> > the zeros". This is already supported in Python. >>> >>> Yes, but it gives a float, not an integer: >>> >>> >>> for i in range(0, 1e6): pass >>> ... >>> Traceback (most recent call last): >>> File "", line 1, in >>> TypeError: 'float' object cannot be interpreted as an integer >>> >>> >>> Regards >>> >>> Antoine. >>> >>> >>> _______________________________________________ >>> Python-ideas mailing list >>> Python-ideas at python.org >>> http://mail.python.org/mailman/listinfo/python-ideas >>> >> >> > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.f.moore at gmail.com Sat May 7 00:04:43 2011 From: p.f.moore at gmail.com (Paul Moore) Date: Fri, 6 May 2011 23:04:43 +0100 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us> Message-ID: On 6 May 2011 21:52, Eric Snow wrote: > He's saying that the package would be imported like normal. ?Then all > "public" sub-modules of the package would automatically imported and bound > to the namespace of the object that resulted from the import of the package. There is no means of determining what submodules of a package exist. Check PEP 302 for details - finders find modules ant they can do so any way they like - there's nothing in the protocol to enumerate subpackages, so you can't do it (if faced with a general PEP 302 finder). Paul. From ethan at stoneleaf.us Sat May 7 00:40:06 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 06 May 2011 15:40:06 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

Message-ID: <4DC478C6.3010801@stoneleaf.us> Bruce Leban wrote: > None of these answers address the original suggestion. Matt didn't say > that he only wanted this for numbers of the form 10^N; he just gave that > as an example. > > Consider these examples instead: > > * 1_234_000 > * 9.876_543_210 > * 0xFEFF_0042 > > I'm not advocating this change (nor against it); I just think the > discussion should be focused on the actual idea. I do have a question: > > Is _ just ignored in numbers or are there more complex rules? > > * 1_2345_6789 (can I use groups of other sizes instead?) > * 1_2_3_4_5 (ditto) > * 1_234_6789 (do all the groups need to be the same size?) > * 1_ (must the _ only be in between 2 digits?) > * 1__234 (what about multiple _s?) > * 9.876_543_210 (can it be used to the right of the decimal point?) > * 0xFEFF_0042 (can it be used in hex, octal or binary numbers?) > * int('123_456') (do other functions accept this syntax too?) I would say it's ignored. Have the rule be something like number_string.replace('_',''). The only wrinkle is that currently '_1' is usable name, and that should probably be disallowed if the above change took place. I'm +1 on the idea. ~Ethan~ From alexander.belopolsky at gmail.com Sat May 7 00:42:59 2011 From: alexander.belopolsky at gmail.com (Alexander Belopolsky) Date: Fri, 6 May 2011 18:42:59 -0400 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC478C6.3010801@stoneleaf.us> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> Message-ID: On Fri, May 6, 2011 at 6:40 PM, Ethan Furman wrote: .. > The only wrinkle is that currently '_1' is usable name, and that should > probably be disallowed if the above change took place. -1_000 if _1 becomes invalid as an identifier. +0 otherwise. From fdrake at acm.org Sat May 7 00:45:23 2011 From: fdrake at acm.org (Fred Drake) Date: Fri, 6 May 2011 18:45:23 -0400 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC478C6.3010801@stoneleaf.us> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> Message-ID: On Fri, May 6, 2011 at 6:40 PM, Ethan Furman wrote: > The only wrinkle is that currently '_1' is usable name, and that should > probably be disallowed if the above change took place. Why? I've never seen a leading thousands separator in practice. For example, ,123,456 isn't generally accepted usage, so why should _123_456 be considered acceptable? (I'm not taking a position on the proposal here; just commenting on the problem of breaking code by making _1 a number instead of an identifier.) -Fred -- Fred L. Drake, Jr.? ? "Give me the luxuries of life and I will willingly do without the necessities." ?? --Frank Lloyd Wright From ethan at stoneleaf.us Sat May 7 00:58:50 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 06 May 2011 15:58:50 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> Message-ID: <4DC47D2A.9090808@stoneleaf.us> Alexander Belopolsky wrote: > On Fri, May 6, 2011 at 6:40 PM, Ethan Furman wrote: > .. >> The only wrinkle is that currently '_1' is usable name, and that should >> probably be disallowed if the above change took place. > > -1_000 if _1 becomes invalid as an identifier. > > +0 otherwise. So you use _8127 style names for your objects* then? ~Ethan~ *Okay, avoiding the word 'variables' can make for some slightly odd sounding sentences! ;) From ethan at stoneleaf.us Sat May 7 01:02:08 2011 From: ethan at stoneleaf.us (Ethan Furman) Date: Fri, 06 May 2011 16:02:08 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> Message-ID: <4DC47DF0.1020001@stoneleaf.us> Fred Drake wrote: > On Fri, May 6, 2011 at 6:40 PM, Ethan Furman wrote: >> The only wrinkle is that currently '_1' is usable name, and that should >> probably be disallowed if the above change took place. > > Why? I've never seen a leading thousands separator in practice. For example, > > ,123,456 > > isn't generally accepted usage, so why should > > _123_456 > > be considered acceptable? > > (I'm not taking a position on the proposal here; just commenting on the problem > of breaking code by making _1 a number instead of an identifier.) I see it as a readability issue -- if you have 1_024 and _1025 (etc, etc), where one is a number and the other a name, confusion can easily result. ~Ethan~ From fdrake at acm.org Sat May 7 00:59:02 2011 From: fdrake at acm.org (Fred Drake) Date: Fri, 6 May 2011 18:59:02 -0400 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC47D2A.9090808@stoneleaf.us> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> <4DC47D2A.9090808@stoneleaf.us> Message-ID: On Fri, May 6, 2011 at 6:58 PM, Ethan Furman wrote: > So you use _8127 style names for your objects* then? Code generators often use such names, though. Since _1234 is currently a legal identifier, you'd be breaking backward compatibility. I understand the motivation for a thousands separator, at least (though I'll admit, I don't find it compelling; *all* big numbers in code are too magical). -Fred -- Fred L. Drake, Jr.? ? "Give me the luxuries of life and I will willingly do without the necessities." ?? --Frank Lloyd Wright From cs at zip.com.au Sat May 7 00:51:38 2011 From: cs at zip.com.au (Cameron Simpson) Date: Sat, 7 May 2011 08:51:38 +1000 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC478C6.3010801@stoneleaf.us> References: <4DC478C6.3010801@stoneleaf.us> Message-ID: <20110506225138.GA2323@cskk.homeip.net> On 06May2011 15:40, Ethan Furman wrote: | Bruce Leban wrote: | >Is _ just ignored in numbers or are there more complex rules? | > | > * 1_2345_6789 (can I use groups of other sizes instead?) | > * 1_2_3_4_5 (ditto) | > * 1_234_6789 (do all the groups need to be the same size?) | > * 1_ (must the _ only be in between 2 digits?) | > * 1__234 (what about multiple _s?) | > * 9.876_543_210 (can it be used to the right of the decimal point?) | > * 0xFEFF_0042 (can it be used in hex, octal or binary numbers?) | > * int('123_456') (do other functions accept this syntax too?) | | I would say it's ignored. Have the rule be something like | number_string.replace('_',''). | | The only wrinkle is that currently '_1' is usable name, and that | should probably be disallowed if the above change took place. | | I'm +1 on the idea. Personally I'm be for ignoring the _ also, save that I would forbid it at the start or end, so no _1 or 1_. And I would permit it in hex code etc. I'm +0.5, myself. Cheers, -- Cameron Simpson DoD#743 http://www.cskk.ezoshosting.com/cs/ A strong conviction that something must be done is the parent of many bad measures. - Daniel Webster From python at mrabarnett.plus.com Sat May 7 01:41:33 2011 From: python at mrabarnett.plus.com (MRAB) Date: Sat, 07 May 2011 00:41:33 +0100 Subject: [Python-ideas] 1_000_000 In-Reply-To: <20110506225138.GA2323@cskk.homeip.net> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> Message-ID: <4DC4872D.60004@mrabarnett.plus.com> On 06/05/2011 23:51, Cameron Simpson wrote: > On 06May2011 15:40, Ethan Furman wrote: > | Bruce Leban wrote: > |>Is _ just ignored in numbers or are there more complex rules? > |> > |> * 1_2345_6789 (can I use groups of other sizes instead?) > |> * 1_2_3_4_5 (ditto) > |> * 1_234_6789 (do all the groups need to be the same size?) > |> * 1_ (must the _ only be in between 2 digits?) > |> * 1__234 (what about multiple _s?) > |> * 9.876_543_210 (can it be used to the right of the decimal point?) > |> * 0xFEFF_0042 (can it be used in hex, octal or binary numbers?) > |> * int('123_456') (do other functions accept this syntax too?) > | > | I would say it's ignored. Have the rule be something like > | number_string.replace('_',''). > | > | The only wrinkle is that currently '_1' is usable name, and that > | should probably be disallowed if the above change took place. > | > | I'm +1 on the idea. > > Personally I'm be for ignoring the _ also, save that I would forbid it > at the start or end, so no _1 or 1_. > > And I would permit it in hex code etc. > > I'm +0.5, myself. > As far as I remember, Ada also permits it, but has the rule that it can occur only between digits. If we follow that, then: 1_2345_6789 => Yes 1_2_3_4_5 => Yes 1_234_6789 => Yes 1_ => No _1 => No 1__234 => No 9.876_543_210 => Yes 9._876_543_210 => No 9_.876_543_210 => No 0xFEFF_0042 => Yes int('123_456') => Yes From bruce at leapyear.org Sat May 7 01:44:21 2011 From: bruce at leapyear.org (Bruce Leban) Date: Fri, 6 May 2011 16:44:21 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC4872D.60004@mrabarnett.plus.com> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> Message-ID: I'm opposed to changing int so that int('123_456') ignores the _ as that will change the behavior of existing code and could break apps. Alternatively, if you want to change int how about int('123_456', separator='_') ignores the _. That would also admit int('123,456', separator=',') --- Bruce * * On Fri, May 6, 2011 at 4:41 PM, MRAB wrote: > On 06/05/2011 23:51, Cameron Simpson wrote: > >> On 06May2011 15:40, Ethan Furman wrote: >> | Bruce Leban wrote: >> |>Is _ just ignored in numbers or are there more complex rules? >> |> >> |> * 1_2345_6789 (can I use groups of other sizes instead?) >> |> * 1_2_3_4_5 (ditto) >> |> * 1_234_6789 (do all the groups need to be the same size?) >> |> * 1_ (must the _ only be in between 2 digits?) >> |> * 1__234 (what about multiple _s?) >> |> * 9.876_543_210 (can it be used to the right of the decimal >> point?) >> |> * 0xFEFF_0042 (can it be used in hex, octal or binary numbers?) >> |> * int('123_456') (do other functions accept this syntax too?) >> | >> | I would say it's ignored. Have the rule be something like >> | number_string.replace('_',''). >> | >> | The only wrinkle is that currently '_1' is usable name, and that >> | should probably be disallowed if the above change took place. >> | >> | I'm +1 on the idea. >> >> Personally I'm be for ignoring the _ also, save that I would forbid it >> at the start or end, so no _1 or 1_. >> >> And I would permit it in hex code etc. >> >> I'm +0.5, myself. >> >> As far as I remember, Ada also permits it, but has the rule that it can > occur only between digits. If we follow that, then: > > 1_2345_6789 => Yes > 1_2_3_4_5 => Yes > 1_234_6789 => Yes > 1_ => No > _1 => No > 1__234 => No > 9.876_543_210 => Yes > 9._876_543_210 => No > 9_.876_543_210 => No > 0xFEFF_0042 => Yes > int('123_456') => Yes > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ironfroggy at gmail.com Sat May 7 01:55:11 2011 From: ironfroggy at gmail.com (Calvin Spealman) Date: Fri, 6 May 2011 19:55:11 -0400 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> Message-ID: On Fri, May 6, 2011 at 7:44 PM, Bruce Leban wrote: > I'm opposed to changing int so that int('123_456') ignores the _ as that > will change the behavior of existing code and could break apps. > Alternatively, if you want to change int how about int('123_456', > separator='_') ignores the _. That would also admit int('123,456', > separator=',') > --- Bruce > > > On Fri, May 6, 2011 at 4:41 PM, MRAB wrote: >> >> On 06/05/2011 23:51, Cameron Simpson wrote: >>> >>> On 06May2011 15:40, Ethan Furman ?wrote: >>> | Bruce Leban wrote: >>> |>Is _ just ignored in numbers or are there more complex rules? >>> |> >>> |> ? ? * 1_2345_6789 ?(can I use groups of other sizes instead?) >>> |> ? ? * 1_2_3_4_5 ?(ditto) >>> |> ? ? * 1_234_6789 ?(do all the groups need to be the same size?) >>> |> ? ? * 1_ ? (must the _ only be in between 2 digits?) >>> |> ? ? * 1__234 ? (what about multiple _s?) >>> |> ? ? * 9.876_543_210 ? (can it be used to the right of the decimal >>> point?) >>> |> ? ? * 0xFEFF_0042 ? (can it be used in hex, octal or binary numbers?) >>> |> ? ? * int('123_456') ? (do other functions accept this syntax too?) >>> | >>> | I would say it's ignored. ?Have the rule be something like >>> | number_string.replace('_',''). >>> | >>> | The only wrinkle is that currently '_1' is usable name, and that >>> | should probably be disallowed if the above change took place. >>> | >>> | I'm +1 on the idea. >>> >>> Personally I'm be for ignoring the _ also, save that I would forbid it >>> at the start or end, so no _1 or 1_. >>> >>> And I would permit it in hex code etc. >>> >>> I'm +0.5, myself. >>> >> As far as I remember, Ada also permits it, but has the rule that it can >> occur only between digits. If we follow that, then: >> >> ? ?1_2345_6789 => Yes >> ? ?1_2_3_4_5 => Yes >> ? ?1_234_6789 => Yes >> ? ?1_ => No >> ? ?_1 => No >> ? ?1__234 => No >> ? ?9.876_543_210 => Yes >> ? ?9._876_543_210 => No >> ? ?9_.876_543_210 => No >> ? ?0xFEFF_0042 => Yes >> ? ?int('123_456') => Yes >> _______________________________________________ >> Python-ideas mailing list >> Python-ideas at python.org >> http://mail.python.org/mailman/listinfo/python-ideas > > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > > I am +0 on the whole idea, but +0.5 if is not an underscore, which I think is ugly. Would it conflict with any other syntax rules if numbers allowed a space separator? for i in range(1 111 111): foo(i) It looks cleaner and in a fixed-font should be just as obvious about separator placement. -- Read my blog! I depend on your acceptance of my opinion! I am interesting! http://techblog.ironfroggy.com/ Follow me if you're into that sort of thing: http://www.twitter.com/ironfroggy From greg.ewing at canterbury.ac.nz Sat May 7 01:56:04 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 11:56:04 +1200 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC4351C.2000109@whoosh.ca> References: <4DC4351C.2000109@whoosh.ca> Message-ID: <4DC48A94.2030608@canterbury.ac.nz> Matt Chaput wrote: > Not sure if this has been proposed before: A syntax change to allow > underscores as thousands separators in literal numbers to improve > readability, It has, but it received a rather lukewarm response last time. An alternative would be to allow spaces. -- Greg From pjenvey at underboss.org Sat May 7 01:59:35 2011 From: pjenvey at underboss.org (Philip Jenvey) Date: Fri, 6 May 2011 16:59:35 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC4872D.60004@mrabarnett.plus.com> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> Message-ID: On May 6, 2011, at 4:41 PM, MRAB wrote: > On 06/05/2011 23:51, Cameron Simpson wrote: >> On 06May2011 15:40, Ethan Furman wrote: >> | Bruce Leban wrote: >> |>Is _ just ignored in numbers or are there more complex rules? >> |> >> |> * 1_2345_6789 (can I use groups of other sizes instead?) >> |> * 1_2_3_4_5 (ditto) >> |> * 1_234_6789 (do all the groups need to be the same size?) >> |> * 1_ (must the _ only be in between 2 digits?) >> |> * 1__234 (what about multiple _s?) >> |> * 9.876_543_210 (can it be used to the right of the decimal point?) >> |> * 0xFEFF_0042 (can it be used in hex, octal or binary numbers?) >> |> * int('123_456') (do other functions accept this syntax too?) >> | >> | I would say it's ignored. Have the rule be something like >> | number_string.replace('_',''). >> | >> | The only wrinkle is that currently '_1' is usable name, and that >> | should probably be disallowed if the above change took place. >> | >> | I'm +1 on the idea. >> >> Personally I'm be for ignoring the _ also, save that I would forbid it >> at the start or end, so no _1 or 1_. >> >> And I would permit it in hex code etc. >> >> I'm +0.5, myself. >> > As far as I remember, Ada also permits it, but has the rule that it can > occur only between digits. If we follow that, then: > > 1_2345_6789 => Yes > 1_2_3_4_5 => Yes > 1_234_6789 => Yes > 1_ => No > _1 => No > 1__234 => No > 9.876_543_210 => Yes > 9._876_543_210 => No > 9_.876_543_210 => No > 0xFEFF_0042 => Yes > int('123_456') => Yes Java 7 also adds this feature. Its rules: You can place underscores only between digits; you cannot place underscores in the following places: ? At the beginning or end of a number ? Adjacent to a decimal point in a floating point literal ? Prior to an F or L suffix ? In positions where a string of digits is expected The following examples demonstrate valid and invalid underscore placements in numeric literals: float pi1 = 3_.1415F; // Invalid; cannot put underscores adjacent to a decimal point float pi2 = 3._1415F; // Invalid; cannot put underscores adjacent to a decimal point long socialSecurityNumber1 = 999_99_9999_L; // Invalid; cannot put underscores prior to an L suffix int x1 = _52; // This is an identifier, not a numeric literal int x2 = 5_2; // OK (decimal literal) int x3 = 52_; // Invalid; cannot put underscores at the end of a literal int x4 = 5_______2; // OK (decimal literal) int x5 = 0_x52; // Invalid; cannot put underscores in the 0x radix prefix int x6 = 0x_52; // Invalid; cannot put underscores at the beginning of a number int x7 = 0x5_2; // OK (hexadecimal literal) int x8 = 0x52_; // Invalid; cannot put underscores at the end of a number int x9 = 0_52; // OK (octal literal) int x10 = 05_2; // OK (octal literal) int x11 = 052_; // Invalid; cannot put underscores at the end of a number (From http://download.oracle.com/javase/tutorial/java/nutsandbolts/datatypes.html ) -- Philip Jenvey From dholth at gmail.com Sat May 7 02:16:21 2011 From: dholth at gmail.com (Daniel Holth) Date: Fri, 6 May 2011 20:16:21 -0400 Subject: [Python-ideas] AttributeError: __exit__ Message-ID: I just learned about Python internals from The ZODB transaction module. In Python < 2.7, the module works as a transaction manager. More or less: manager = Foo() __exit__ = manager.__exit__ __enter__ = manager.__enter__ After Python 2.7, it doesn't work. import transaction with transaction: pass >>> AttributeError: __exit__ It should be obvious to even the most casual observer that the exception is because, after Python 2.7, the with: statement has its own opcode that bypasses transaction.__getattribute__('__exit__') -> transaction.__dict__['__exit__']. Instead, CPython calls special_lookup(), looks for __exit__ on the module type, not the instance, doesn't find it, and raises the AttributeError. Instead, import sys sys.__exit__ >>> AttributeError: 'module' object has no attribute '__exit__' The interpreter should at least explain the AttributeError in the same way as it does when the user triggers it directly. -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Sat May 7 02:38:05 2011 From: guido at python.org (Guido van Rossum) Date: Fri, 6 May 2011 17:38:05 -0700 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us>

Message-ID: The point is that the pkg should use __all__ to declare what submodules exist. That's what it was invented for! On May 6, 2011 3:05 PM, "Paul Moore" wrote: > On 6 May 2011 21:52, Eric Snow wrote: >> He's saying that the package would be imported like normal. Then all >> "public" sub-modules of the package would automatically imported and bound >> to the namespace of the object that resulted from the import of the package. > > There is no means of determining what submodules of a package exist. > Check PEP 302 for details - finders find modules ant they can do so > any way they like - there's nothing in the protocol to enumerate > subpackages, so you can't do it (if faced with a general PEP 302 > finder). > > Paul. > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas -------------- next part -------------- An HTML attachment was scrubbed... URL: From guido at python.org Sat May 7 02:41:33 2011 From: guido at python.org (Guido van Rossum) Date: Fri, 6 May 2011 17:41:33 -0700 Subject: [Python-ideas] AttributeError: __exit__ In-Reply-To: References: Message-ID: Please file a bug. On May 6, 2011 5:17 PM, "Daniel Holth" wrote: > > I just learned about Python internals from The ZODB transaction module. In Python < 2.7, the module works as a transaction manager. More or less: > > manager = Foo() > __exit__ = manager.__exit__ > __enter__ = manager.__enter__ > > After Python 2.7, it doesn't work. > > import transaction > with transaction: pass > >>> AttributeError: __exit__ > > It should be obvious to even the most casual observer that the exception is because, after Python 2.7, the with: statement has its own opcode that bypasses transaction.__getattribute__('__exit__') -> transaction.__dict__['__exit__']. Instead, CPython calls special_lookup(), looks for __exit__ on the module type, not the instance, doesn't find it, and raises the AttributeError. > > Instead, > > import sys > sys.__exit__ > >>> AttributeError: 'module' object has no attribute '__exit__' > > The interpreter should at least explain the AttributeError in the same way as it does when the user triggers it directly. > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben+python at benfinney.id.au Sat May 7 02:44:09 2011 From: ben+python at benfinney.id.au (Ben Finney) Date: Sat, 07 May 2011 10:44:09 +1000 Subject: [Python-ideas] 1 246 358 (was: 1_000_000) References: <4DC4351C.2000109@whoosh.ca> <4DC48A94.2030608@canterbury.ac.nz> Message-ID: <87pqnvmjae.fsf_-_@benfinney.id.au> Greg Ewing writes: > An alternative would be to allow spaces. I would prefer to allow space between digits in a numeric literal. 1 2345 6789 1 2 3 4 5 6789 1 234 6789 1 234 567 89 9.876 543 210 0xFEFF 0042 This nicely parallels the fact that space can separate chunks of a string literal. But that still leaves the following inconsistency: int('1 234 567') That will currently raise a ValueError. Should it continue to do so under this proposal? -- \ ?You say ?Carmina?, and I say ?Burana?, You say ?Fortuna?, and | `\ I say ?cantata?, Carmina, Burana, Fortuna, cantata, Let's Carl | _o__) the whole thing Orff.? ?anonymous | Ben Finney From guido at python.org Sat May 7 02:54:15 2011 From: guido at python.org (Guido van Rossum) Date: Fri, 6 May 2011 17:54:15 -0700 Subject: [Python-ideas] 1 246 358 (was: 1_000_000) In-Reply-To: <87pqnvmjae.fsf_-_@benfinney.id.au> References: <4DC4351C.2000109@whoosh.ca> <4DC48A94.2030608@canterbury.ac.nz> <87pqnvmjae.fsf_-_@benfinney.id.au> Message-ID: Too ambiguous, too hard to parse. I like the _ proposal. On May 6, 2011 5:45 PM, "Ben Finney" wrote: > Greg Ewing writes: > >> An alternative would be to allow spaces. > > I would prefer to allow space between digits in a numeric literal. > > 1 2345 6789 > 1 2 3 4 5 6789 > 1 234 6789 > 1 234 567 89 > 9.876 543 210 > 0xFEFF 0042 > > This nicely parallels the fact that space can separate chunks of a > string literal. > > But that still leaves the following inconsistency: > > int('1 234 567') > > That will currently raise a ValueError. Should it continue to do so > under this proposal? > > -- > \ ?You say ?Carmina?, and I say ?Burana?, You say ?Fortuna?, and | > `\ I say ?cantata?, Carmina, Burana, Fortuna, cantata, Let's Carl | > _o__) the whole thing Orff.? ?anonymous | > Ben Finney > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas -------------- next part -------------- An HTML attachment was scrubbed... URL: From python at mrabarnett.plus.com Sat May 7 02:55:52 2011 From: python at mrabarnett.plus.com (MRAB) Date: Sat, 07 May 2011 01:55:52 +0100 Subject: [Python-ideas] 1 246 358 In-Reply-To: <87pqnvmjae.fsf_-_@benfinney.id.au> References: <4DC4351C.2000109@whoosh.ca> <4DC48A94.2030608@canterbury.ac.nz> <87pqnvmjae.fsf_-_@benfinney.id.au> Message-ID: <4DC49898.4070900@mrabarnett.plus.com> On 07/05/2011 01:44, Ben Finney wrote: > Greg Ewing writes: > >> An alternative would be to allow spaces. > > I would prefer to allow space between digits in a numeric literal. > > 1 2345 6789 > 1 2 3 4 5 6789 > 1 234 6789 > 1 234 567 89 > 9.876 543 210 > 0xFEFF 0042 > > This nicely parallels the fact that space can separate chunks of a > string literal. > > But that still leaves the following inconsistency: > > int('1 234 567') > > That will currently raise a ValueError. Should it continue to do so > under this proposal? > I prefer there not to be whitespace inside tokens. String literals are an exception, they are explicitly delimited. From steve at pearwood.info Sat May 7 04:00:11 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 07 May 2011 12:00:11 +1000 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

Message-ID: <4DC4A7AB.8000803@pearwood.info> Bruce Leban wrote: > Consider these examples instead: > > - 1_234_000 > - 9.876_543_210 > - 0xFEFF_0042 > > I'm not advocating this change (nor against it); I just think the discussion > should be focused on the actual idea. I do have a question: > > Is _ just ignored in numbers or are there more complex rules? > > - 1_2345_6789 (can I use groups of other sizes instead?) > - 1_2_3_4_5 (ditto) > - 1_234_6789 (do all the groups need to be the same size?) +1 on all of these. I don't particularly like the look of _ as a number separator, but it's hard to think of any alternatives other than space, and some separator is better than long sequences of digits. I'm -0.5 on spaces even though it looks MUCH better, because it's too easy to leave the commas out in lists etc: L = [1, 2, 3, 4 5, 6, 7, 8, 9, 10] # oops, wanted 4 & 5 not 45 (Admittedly if the items where strings, the same failure mode applies.) > - 1_ (must the _ only be in between 2 digits?) > - 1__234 (what about multiple _s?) -1 on allowing either _1 or 1_ as numbers. -0 on allowing doubled underscores. > - 9.876_543_210 (can it be used to the right of the decimal point?) > - 0xFEFF_0042 (can it be used in hex, octal or binary numbers?) +1 on these two. > - int('123_456') (do other functions accept this syntax too?) That's a tricky one... I'd say No, but I'm not entirely sure. It's easy enough to say: int('123_456'.replace('_', '')) albeit a tad verbose. Also easy to say: int('123' '456') which is less verbose. And it will change the behaviour of the int function. So I don't think we need to support separators inside strings. We can always change our mind later and add it in, but it's much harder to take it out later. -- Steven From steve at pearwood.info Sat May 7 04:00:43 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 07 May 2011 12:00:43 +1000 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC47DF0.1020001@stoneleaf.us> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> <4DC47DF0.1020001@stoneleaf.us> Message-ID: <4DC4A7CB.7030100@pearwood.info> Ethan Furman wrote: > I see it as a readability issue -- if you have 1_024 and _1025 (etc, > etc), where one is a number and the other a name, confusion can easily > result. I don't think there will be *that* much confusion though. _1025 can occur on the LHS of an assignment, 1_024 cannot. And we already distinguish between x1234 and 0x1234 without much confusion. -- Steven From guido at python.org Sat May 7 05:45:18 2011 From: guido at python.org (Guido van Rossum) Date: Fri, 6 May 2011 20:45:18 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC4A7AB.8000803@pearwood.info> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC4A7AB.8000803@pearwood.info> Message-ID: On Fri, May 6, 2011 at 7:00 PM, Steven D'Aprano wrote: > Bruce Leban wrote: > >> Consider these examples instead: >> >> ? - 1_234_000 >> ? - 9.876_543_210 >> ? - 0xFEFF_0042 >> >> I'm not advocating this change (nor against it); I just think the >> discussion >> should be focused on the actual idea. I do have a question: >> >> Is _ just ignored in numbers or are there more complex rules? >> >> ? - 1_2345_6789 ?(can I use groups of other sizes instead?) >> ? - 1_2_3_4_5 ?(ditto) >> ? - 1_234_6789 ?(do all the groups need to be the same size?) > > +1 on all of these. I don't particularly like the look of _ as a number > separator, but it's hard to think of any alternatives other than space, and > some separator is better than long sequences of digits. > > I'm -0.5 on spaces even though it looks MUCH better, because it's too easy > to leave the commas out in lists etc: > > L = [1, 2, 3, 4 5, 6, 7, 8, 9, 10] ?# oops, wanted 4 & 5 not 45 > > (Admittedly if the items where strings, the same failure mode applies.) And it does sometimes bite. So let's not do more of that. (In retrospect 'xxx' + 'yyy' would have been good enough.) >> ? - 1_ ? (must the _ only be in between 2 digits?) >> ? - 1__234 ? (what about multiple _s?) > > -1 on allowing either _1 or 1_ as numbers. > > -0 on allowing doubled underscores. > > >> ? - 9.876_543_210 ? (can it be used to the right of the decimal point?) >> ? - 0xFEFF_0042 ? (can it be used in hex, octal or binary numbers?) > > +1 on these two. Steven channels me well so far. Fine points about _ in floats: IMO the _ should be allowed to appear between any two digits, or between the last digit and the 'e' in the exponent, or between the 'e' and a following digit. But not adjacent to the '.' or to the '+' or '-' in the exponent. So 3.141_593 yes, 3_.14 no. Fine points about _ in bin/oct/hex literals: 0x_dead_beef yes, 0_xdeadbeef no. (The overall rule seems to be that it must be internal to alphanumeric strings, except that leading 0x, 0o or 0b must not be separated -- somehow I find 0_x_dead_beef would be a disservice to human readers.) >> ? - int('123_456') ? (do other functions accept this syntax too?) > > That's a tricky one... I'd say No, but I'm not entirely sure. It's easy > enough to say: > > int('123_456'.replace('_', '')) > > albeit a tad verbose. Also easy to say: > > int('123' '456') > > which is less verbose. But that's not how it'll be used. The argument will be provided by the user of the code. > And it will change the behaviour of the int function. > So I don't think we need to support separators inside strings. I think it's fine, the same reason why we want to write 1_234_567 in code sometimes applies to input or command line arguments too, and I see little harm. > We can always change our mind later and add it in, but it's much harder to > take it out later. It seems entirely harmless here. Also for float(). It would also be nice to have an easy way to emit _ in suitable places. Maybe this could be added to the .format() language for numbers? It would be nice if you could tell it to emit an _ every N positions. -- --Guido van Rossum (python.org/~guido) From cs at zip.com.au Sat May 7 06:29:11 2011 From: cs at zip.com.au (Cameron Simpson) Date: Sat, 7 May 2011 14:29:11 +1000 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: Message-ID: <20110507042911.GA14472@cskk.homeip.net> On 06May2011 19:55, Calvin Spealman wrote: | I am +0 on the whole idea, but +0.5 if is not an underscore, which I | think is ugly. I think the underscore is one of the better choices: - it is very visible, unlike a dot or comma - it is "low" or "flat", not intruding into the glyph space of the digits, leaving things easy to read - it is already widely used (perl (sorry), Ada (where I first encountered it now that someone ele has mentioned it, etc) i.e. it is a pre-existing idom with successful use | Would it conflict with any other syntax rules if | numbers allowed a space separator? | | for i in range(1 111 111): | foo(i) | | It looks cleaner and in a fixed-font should be just as obvious about | separator placement. I'm very -1 on this one. Like another recent proposal it take a common typing error and turns it into legal syntax. Code that once would fail to compile because the author dropped a comma between values now runs, with silent breakage (the new stuff isn't even the wrong type!) Cheers, -- Cameron Simpson DoD#743 http://www.cskk.ezoshosting.com/cs/ It's there as a sop to former Ada programmers. :-) - Larry Wall regarding 10_000_000 in <11556 at jpl-devvax.JPL.NASA.GOV> From cs at zip.com.au Sat May 7 06:30:09 2011 From: cs at zip.com.au (Cameron Simpson) Date: Sat, 7 May 2011 14:30:09 +1000 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC4872D.60004@mrabarnett.plus.com> References: <4DC4872D.60004@mrabarnett.plus.com> Message-ID: <20110507043009.GA15772@cskk.homeip.net> On 07May2011 00:41, MRAB wrote: | As far as I remember, Ada also permits it, That's where I first encountered it myself. | but has the rule that it can | occur only between digits. If we follow that, then: | | 1_2345_6789 => Yes | 1_2_3_4_5 => Yes | 1_234_6789 => Yes | 1_ => No | _1 => No | 1__234 => No | 9.876_543_210 => Yes | 9._876_543_210 => No | 9_.876_543_210 => No | 0xFEFF_0042 => Yes | int('123_456') => Yes +1 to this. Cheers, -- Cameron Simpson DoD#743 http://www.cskk.ezoshosting.com/cs/ It is impossible to travel faster than light, and certainly not desirable as ones hat keeps blowing off. - Woody Allen From ben+python at benfinney.id.au Sat May 7 07:03:42 2011 From: ben+python at benfinney.id.au (Ben Finney) Date: Sat, 07 May 2011 15:03:42 +1000 Subject: [Python-ideas] 1 246 358 References: <4DC4351C.2000109@whoosh.ca> <4DC48A94.2030608@canterbury.ac.nz> <87pqnvmjae.fsf_-_@benfinney.id.au> <4DC49898.4070900@mrabarnett.plus.com> Message-ID: <87hb97m79t.fsf@benfinney.id.au> MRAB writes: > On 07/05/2011 01:44, Ben Finney wrote: > > I would prefer to allow space between digits in a numeric literal. [?] > > This nicely parallels the fact that space can separate chunks of a > > string literal. > I prefer there not to be whitespace inside tokens. String literals are > an exception, they are explicitly delimited. That's a good justification for the special case. Okay, I withdraw my proposal. -- \ ?Facts are stubborn things; and whatever may be our wishes, our | `\ inclinations, or the dictates of our passion, they cannot alter | _o__) the state of facts and evidence.? ?John Adams, 1770-12-04 | Ben Finney From lac at openend.se Sat May 7 07:05:37 2011 From: lac at openend.se (Laura Creighton) Date: Sat, 07 May 2011 07:05:37 +0200 Subject: [Python-ideas] 1_000_000 In-Reply-To: Message from MRAB of "Sat, 07 May 2011 00:41:33 BST." <4DC4872D.60004@mrabarnett.plus.com> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> Message-ID: <201105070505.p4755b2E014146@theraft.openend.se> If you disallow variable names of the form _ you will break a huge amount of my automatically generated code. Admittedly, it wouldn't be hard to change things so that the generated variables are now X instead, but that happens to be the way I have written it now. Laura From greg.ewing at canterbury.ac.nz Sat May 7 09:29:47 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 19:29:47 +1200 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC47D2A.9090808@stoneleaf.us> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> <4DC47D2A.9090808@stoneleaf.us> Message-ID: <4DC4F4EB.9080007@canterbury.ac.nz> Ethan Furman wrote: > So you use _8127 style names for your objects* then? I can easily imagine a code generator producing names like that to reduce the chance of collision with a user's names. -- Greg From greg.ewing at canterbury.ac.nz Sat May 7 09:36:07 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 19:36:07 +1200 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> <4DC47D2A.9090808@stoneleaf.us> Message-ID: <4DC4F667.5000104@canterbury.ac.nz> Fred Drake wrote: > I understand the motivation for a thousands separator, at least (though > I'll admit, I don't find it compelling; *all* big numbers in code are > too magical). Bigness is a relative concept. Avogadro's number is fairly big in absolute terms, but you can hold that many molecules in your hand quite easily. Although writing it as 6_020_000_000_000_000_000_000_000_000 probably wouldn't be very helpful. -- Greg From greg.ewing at canterbury.ac.nz Sat May 7 09:41:43 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 19:41:43 +1200 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC47DF0.1020001@stoneleaf.us> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC478C6.3010801@stoneleaf.us> <4DC47DF0.1020001@stoneleaf.us> Message-ID: <4DC4F7B7.3090204@canterbury.ac.nz> Ethan Furman wrote: > I see it as a readability issue -- if you have 1_024 and _1025 (etc, > etc), where one is a number and the other a name, confusion can easily > result. But probably not much worse than the confusion you can get today between 1234e6 and _1234e6, or O000001 and 0000001. There will always be ways of creating confusing-looking code if you put your mind to it. :-) -- Greg From greg.ewing at canterbury.ac.nz Sat May 7 09:46:57 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 19:46:57 +1200 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> Message-ID: <4DC4F8F1.4090904@canterbury.ac.nz> Bruce Leban wrote: > I'm opposed to changing int so that int('123_456') ignores the _ as that > will change the behavior of existing code and could break apps. But int('123_456', 0) should perhaps work? (On the grounds that it parses numbers using the same syntax as Python source.) -- Greg From greg.ewing at canterbury.ac.nz Sat May 7 09:51:35 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 19:51:35 +1200 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> Message-ID: <4DC4FA07.2000506@canterbury.ac.nz> Philip Jenvey wrote: > int x4 = 5_______2; // OK (decimal literal) Hmmm, that one looks really weird -- maybe it should be disallowed as well? -- Greg From steve at pearwood.info Sat May 7 10:18:22 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Sat, 07 May 2011 18:18:22 +1000 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC4FA07.2000506@canterbury.ac.nz> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> <4DC4FA07.2000506@canterbury.ac.nz> Message-ID: <4DC5004E.30308@pearwood.info> Greg Ewing wrote: > Philip Jenvey wrote: > >> int x4 = 5_______2; // OK (decimal literal) > > Hmmm, that one looks really weird -- maybe it should be > disallowed as well? I don't think we need disallow it merely over an aesthetic judgement (although it does look weird *grins*). There is precedence with separators in collections: >>> t = (1,,,,2) File "", line 1 t = (1,,,,2) ^ SyntaxError: invalid syntax Like consecutive commas, consecutive underscores are likely to indicate a typo rather than a deliberate decision. So I'm +1 on strictly enforcing a single underscore between digits. -- Steven From greg.ewing at canterbury.ac.nz Sat May 7 10:27:14 2011 From: greg.ewing at canterbury.ac.nz (Greg Ewing) Date: Sat, 07 May 2011 20:27:14 +1200 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC5004E.30308@pearwood.info> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> <4DC4FA07.2000506@canterbury.ac.nz> <4DC5004E.30308@pearwood.info> Message-ID: <4DC50262.2080300@canterbury.ac.nz> Steven D'Aprano wrote: > Like consecutive commas, consecutive underscores are likely to indicate > a typo rather than a deliberate decision. Well, yes, that's really the rationale I had in mind. Although it would provide an amusingly funky way of introducing dividing line comments into your code: class A: ... ... ... 0____________________________________0 class B: ... ... ... You could even decorate it with scissors for a bit more panache: 0_____8<0_____8<0_____8<0_____8<0_____0 -- Greg From p.f.moore at gmail.com Sat May 7 10:58:56 2011 From: p.f.moore at gmail.com (Paul Moore) Date: Sat, 7 May 2011 09:58:56 +0100 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us>

Message-ID: On 7 May 2011 01:38, Guido van Rossum wrote: > The point is that the pkg should use __all__ to declare what submodules > exist. That's what it was invented for! Hmm, OK. I missed that. But how would that work? p1/__init__.py: __all__ = ['p2', 'foo'] def foo(): print "p1.foo" p1/p2/__init__.py: __all__ = ['foo'] def foo(): print "p1.foo" If I import p1, p1.__all__ shows me that p2 and foo are public. p1.foo exists and I can tell it's not a module. p1.p2 doesn't exist in the p1 namespace at the moment, so how do I tell that I need to import it? Just assume all nonexistent names are subpackages, and import them? That doesn't seem like a very robust approach. A proof of concept in the form of a Python implementation (as a function) would help me understand, I guess. (But I still doubt that even if it's implementable, the feature is much practical use...) Paul. From dirkjan at ochtman.nl Sat May 7 14:16:48 2011 From: dirkjan at ochtman.nl (Dirkjan Ochtman) Date: Sat, 7 May 2011 14:16:48 +0200 Subject: [Python-ideas] thoughts on regular expression improvements In-Reply-To: <818.1304713938@parc.com> References: <98999.1304709119@parc.com> <818.1304713938@parc.com> Message-ID: On Fri, May 6, 2011 at 22:32, Bill Janssen wrote: > Ah, you mean the PyPI "regex". ?Looks like it has "branch reset", which > might support my #1? ?Using the same group name multiple times? > > I don't see fuzzy matches, or support for composition, though. I might've been more specific: I think MRAB is working on regex as a playground for new regex-module things (and potentially a replacement for stdlib re), so it might be a good place to implement these kinds of things or discuss them. Cheers, Dirkjan From guido at python.org Sat May 7 16:41:55 2011 From: guido at python.org (Guido van Rossum) Date: Sat, 7 May 2011 07:41:55 -0700 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us>

Message-ID: On Sat, May 7, 2011 at 1:58 AM, Paul Moore wrote: > On 7 May 2011 01:38, Guido van Rossum wrote: >> The point is that the pkg should use __all__ to declare what submodules >> exist. That's what it was invented for! > > Hmm, OK. I missed that. But how would that work? > > p1/__init__.py: > > __all__ = ['p2', 'foo'] > def foo(): print "p1.foo" > > p1/p2/__init__.py: > > __all__ = ['foo'] > def foo(): print "p1.foo" > > If I import p1, p1.__all__ shows me that p2 and foo are public. p1.foo > exists and I can tell it's not a module. p1.p2 doesn't exist in the p1 > namespace at the moment, so how do I tell that I need to import it? > Just assume all nonexistent names are subpackages, and import them? > That doesn't seem like a very robust approach. Do whatever "from pkg import *" does today. Though the recursive application is new. I think (if we do this) it should be recursive. The implementation is straightforward, though the consequences may not be (think cyclic imports). > A proof of concept in the form of a Python implementation (as a > function) would help me understand, I guess. (But I still doubt that > even if it's implementable, the feature is much practical use...) It deviates from "import what you use" for sure. OTOH it is a better alternative to "from pkg import *" because it does not pollute the namespace. I believe Java users are used to this. -- --Guido van Rossum (python.org/~guido) From python at mrabarnett.plus.com Sat May 7 17:32:53 2011 From: python at mrabarnett.plus.com (MRAB) Date: Sat, 07 May 2011 16:32:53 +0100 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC4F8F1.4090904@canterbury.ac.nz> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> <4DC4F8F1.4090904@canterbury.ac.nz> Message-ID: <4DC56625.6010704@mrabarnett.plus.com> On 07/05/2011 08:46, Greg Ewing wrote: > Bruce Leban wrote: >> I'm opposed to changing int so that int('123_456') ignores the _ as >> that will change the behavior of existing code and could break apps. > > But int('123_456', 0) should perhaps work? (On the grounds that > it parses numbers using the same syntax as Python source.) > There's also the argument that if you forbid it then the programmer may have to write: int(string.replace("_", "")) in order to let the user include underscores, which would make it too permissive. If the user entered "_10", the above code would accept it. From g.brandl at gmx.net Sat May 7 18:11:21 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Sat, 07 May 2011 18:11:21 +0200 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC50262.2080300@canterbury.ac.nz> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> <4DC4FA07.2000506@canterbury.ac.nz> <4DC5004E.30308@pearwood.info> <4DC50262.2080300@canterbury.ac.nz> Message-ID: On 07.05.2011 10:27, Greg Ewing wrote: > Steven D'Aprano wrote: > >> Like consecutive commas, consecutive underscores are likely to indicate >> a typo rather than a deliberate decision. > > Well, yes, that's really the rationale I had in mind. > > Although it would provide an amusingly funky way of > introducing dividing line comments into your code: > > class A: > ... > ... > ... > > 0____________________________________0 +1__________________________________________________________0! Georg From g.brandl at gmx.net Sat May 7 18:12:06 2011 From: g.brandl at gmx.net (Georg Brandl) Date: Sat, 07 May 2011 18:12:06 +0200 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> , <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> Message-ID: On 06.05.2011 21:49, Brendan Moloney wrote: > dag.odenhall at gmail.com wrote: >> I like this idea, except it's inconsistent with from-import-star, the >> latter which does *not* get you sub-packages or modules. > > Georg Brandl [g.brandl at gmx.net] wrote: >> And that's for a reason: it's not easy (I think it's even impossible, >> because for example individual submodules can change __path__) to determine >> all importable submodules of a package. > >> So ``import pkg.*`` would not have any behavior other than ``import pkg``. > > When I said all _public_ sub-packages and modules I was referring to those > listed in the __all__ attribute of 'pkg'. Thus it would behave in the exact > same way as from-import-star except you don't pollute the current namespace. Right -- I forgot about __all__. Georg From dholth at gmail.com Sat May 7 19:15:07 2011 From: dholth at gmail.com (Daniel Holth) Date: Sat, 7 May 2011 13:15:07 -0400 Subject: [Python-ideas] AttributeError: __exit__ In-Reply-To: References:

Message-ID: OK. I will reopen the related bug that was immediately closed with a suggestion to check with the python-ideas mailing list. -------------- next part -------------- An HTML attachment was scrubbed... URL: From dholth at gmail.com Sat May 7 19:49:58 2011 From: dholth at gmail.com (Daniel Holth) Date: Sat, 7 May 2011 13:49:58 -0400 Subject: [Python-ideas] proposal: module-level __init__ Message-ID: __all__ is very useful when doing import *, which is frowned upon. As an alternative, allow modules to contain a function called __init__ that defines that module's exported symbols by way of the global statement. By importing modules that are used, but not intended to be exported, inside the __init__ function, programmers avoid cases such as the unintentional 'somemodule.sys' (referring to a module by its non-canonical name) that makes it harder to refactor larger projects. Before: __all__ = ['a', 'b'] import sys def a(): pass def b(): pass def c(): pass After: def __init__(): global a, b import sys def a(): pass def b(): pass def c(): pass __init__() -------------- next part -------------- An HTML attachment was scrubbed... URL: From mal at egenix.com Sat May 7 19:56:50 2011 From: mal at egenix.com (M.-A. Lemburg) Date: Sat, 07 May 2011 19:56:50 +0200 Subject: [Python-ideas] proposal: module-level __init__ In-Reply-To: References: Message-ID: <4DC587E2.3040301@egenix.com> Daniel Holth wrote: > __all__ is very useful when doing import *, which is frowned upon. As an > alternative, allow modules to contain a function called __init__ that > defines that module's exported symbols by way of the global statement. By > importing modules that are used, but not intended to be exported, inside the > __init__ function, programmers avoid cases such as the unintentional > 'somemodule.sys' (referring to a module by its non-canonical name) that > makes it harder to refactor larger projects. > > Before: > > __all__ = ['a', 'b'] > import sys > def a(): pass > def b(): pass > def c(): pass > > After: > > def __init__(): > global a, b > import sys > def a(): pass > def b(): pass > def c(): pass > > __init__() This is already possible and used in modules where you don't want to clutter up the global namespace. Where's the novelty ? -- Marc-Andre Lemburg eGenix.com Professional Python Services directly from the Source (#1, May 07 2011) >>> Python/Zope Consulting and Support ... http://www.egenix.com/ >>> mxODBC.Zope.Database.Adapter ... http://zope.egenix.com/ >>> mxODBC, mxDateTime, mxTextTools ... http://python.egenix.com/ ________________________________________________________________________ 2011-06-20: EuroPython 2011, Florence, Italy 44 days to go ::: Try our new mxODBC.Connect Python Database Interface for free ! :::: eGenix.com Software, Skills and Services GmbH Pastor-Loeh-Str.48 D-40764 Langenfeld, Germany. CEO Dipl.-Math. Marc-Andre Lemburg Registered at Amtsgericht Duesseldorf: HRB 46611 http://www.egenix.com/company/contact/ From fdrake at acm.org Sat May 7 21:26:22 2011 From: fdrake at acm.org (Fred Drake) Date: Sat, 7 May 2011 15:26:22 -0400 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC50262.2080300@canterbury.ac.nz> References: <4DC478C6.3010801@stoneleaf.us> <20110506225138.GA2323@cskk.homeip.net> <4DC4872D.60004@mrabarnett.plus.com> <4DC4FA07.2000506@canterbury.ac.nz> <4DC5004E.30308@pearwood.info> <4DC50262.2080300@canterbury.ac.nz> Message-ID: On Sat, May 7, 2011 at 4:27 AM, Greg Ewing wrote: > You could even decorate it with scissors for a bit > more panache: > > 0_____8<0_____8<0_____8<0_____8<0_____0 Heh. Thanks for the swell tip, Martha Stewart! -Fred -- Fred L. Drake, Jr.? ? "Give me the luxuries of life and I will willingly do without the necessities." ?? --Frank Lloyd Wright From eric at trueblade.com Sat May 7 21:51:36 2011 From: eric at trueblade.com (Eric Smith) Date: Sat, 07 May 2011 15:51:36 -0400 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC4A7AB.8000803@pearwood.info> Message-ID: <4DC5A2C8.5080305@trueblade.com> On 05/06/2011 11:45 PM, Guido van Rossum wrote: > It would also be nice to have an easy way to emit _ in suitable > places. Maybe this could be added to the .format() language for > numbers? It would be nice if you could tell it to emit an _ every N > positions. We already support commas (PEP 378). Adding underscores in the same way would be easy. However, you can't specify N, it's always 3. Eric. From guido at python.org Sat May 7 23:06:12 2011 From: guido at python.org (Guido van Rossum) Date: Sat, 7 May 2011 14:06:12 -0700 Subject: [Python-ideas] 1_000_000 In-Reply-To: <4DC5A2C8.5080305@trueblade.com> References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC4A7AB.8000803@pearwood.info> <4DC5A2C8.5080305@trueblade.com> Message-ID: On Sat, May 7, 2011 at 12:51 PM, Eric Smith wrote: > On 05/06/2011 11:45 PM, Guido van Rossum wrote: > >> It would also be nice to have an easy way to emit _ in suitable >> places. Maybe this could be added to the .format() language for >> numbers? It would be nice if you could tell it to emit an _ every N >> positions. > > We already support commas (PEP 378). Adding underscores in the same way > would be easy. However, you can't specify N, it's always 3. Which would suck for non-decimal formats. :-( Also there seem to be some countries where the conventions for formatting currency uses groupings other than 1000. E.g. http://www.ozgrid.com/forum/showthread.php?t=10226 (though specifying N wouldn't be enough there). -- --Guido van Rossum (python.org/~guido) From jeanpierreda at gmail.com Sun May 8 00:38:47 2011 From: jeanpierreda at gmail.com (Devin Jeanpierre) Date: Sat, 7 May 2011 18:38:47 -0400 Subject: [Python-ideas] 1_000_000 In-Reply-To: References: <4DC4351C.2000109@whoosh.ca> <20110506232407.2bd211a1@pitrou.net>

<4DC4A7AB.8000803@pearwood.info> <4DC5A2C8.5080305@trueblade.com> Message-ID: >> On 05/06/2011 11:45 PM, Guido van Rossum wrote: > Which would suck for non-decimal formats. :-( Also there seem to be > some countries where the conventions for formatting currency uses > groupings other than 1000. E.g. > http://www.ozgrid.com/forum/showthread.php?t=10226 (though specifying > N wouldn't be enough there). Wouldn't something like that be the job of locale.currency()? Devin Jeanpierre From jeanpierreda at gmail.com Sun May 8 02:57:29 2011 From: jeanpierreda at gmail.com (Devin Jeanpierre) Date: Sat, 7 May 2011 20:57:29 -0400 Subject: [Python-ideas] Rename python.exe to python3.exe on Windows Message-ID: Hello, On most *nix systems, Python 3.x is available as the python3 executable, and Python 2.x as the 'python' executable. This lets both exist side-by-side and be usable from the command-line. The alternative (used by Arch), is to name Python 2.x 'python2', and 3.x 'python'. The Windows distribution of Python does neither, it names them both 'python.exe', meaning that you can't install and use both at once. Moreover, if you install Python 2.7 and then Python 3.2, the default handler for .py files is set to Python 3.2, and changing it to 2.7 is difficult because of a quirk in Eexplorer that forces you to choose between two non-distinguishable "python.exe"s. This is made much more difficult if in fact you installed five or so different Python versions. Also any automated tests using something like Cram that use python3 will not work, and any batch scripts that use python.exe will work differently depending on the host system. (It wouldn't be awful to get python-X.Y.exe executables, either). The downside of this is that any code that tries to use C:\Python3Y\python.exe breaks. Such code is probably broken anyway, there are multiple Ys around, and Python can be installed in My Documents or wherever. PEP 397 should relieve the issues with opening .py files, making some of this unnecessary with that change, as well. I'm guessing that it would also be appropriate to rename pythonw.exe to python3w.exe. I doubt that particular change matters at all, it's solely to do with opening .pyw files, and that should be handled by PEP 397. I'd appreciate any thoughts or comments you might have. Thanks for your time, Devin Jeanpierre From ben+python at benfinney.id.au Sun May 8 03:21:52 2011 From: ben+python at benfinney.id.au (Ben Finney) Date: Sun, 08 May 2011 11:21:52 +1000 Subject: [Python-ideas] Rename python.exe to python3.exe on Windows References: Message-ID: <87zkmykmvj.fsf@benfinney.id.au> Devin Jeanpierre writes: > On most *nix systems, Python 3.x is available as the python3 > executable, and Python 2.x as the 'python' executable. This lets both > exist side-by-side and be usable from the command-line. More importantly, it ensures that programs written for older Python 2.x will continue to run with the default ?python?. If the default ?python? were Python 3.x, programs expecting Python 2.x would most likely break due to backward incompatibility. So it's best if the ?python? program invokes only Python 2.x. -- \ ?To label any subject unsuitable for comedy is to admit | `\ defeat.? ?Peter Sellers | _o__) | Ben Finney From steve at pearwood.info Sun May 8 04:28:07 2011 From: steve at pearwood.info (Steven D'Aprano) Date: Sun, 08 May 2011 12:28:07 +1000 Subject: [Python-ideas] Rename python.exe to python3.exe on Windows In-Reply-To: <87zkmykmvj.fsf@benfinney.id.au> References: <87zkmykmvj.fsf@benfinney.id.au> Message-ID: <4DC5FFB7.6050605@pearwood.info> Ben Finney wrote: > If the default ?python? were Python 3.x, programs expecting Python 2.x > would most likely break due to backward incompatibility. So it's best if > the ?python? program invokes only Python 2.x. The first sentence is true. The second is a value judgement, not a statement of fact, and the people behind Arch Linux disagree with you. http://www.archlinux.org/news/python-is-now-python-3/ I say, good on 'em. I wish I could find the quote somebody made about Arch being the distro that makes Gentoo seem cautious and conservative... something about Arch moving forward so the Gentoo folks know which mistakes not to make? -- Steven From stephen at xemacs.org Mon May 9 12:39:17 2011 From: stephen at xemacs.org (Stephen J. Turnbull) Date: Mon, 09 May 2011 19:39:17 +0900 Subject: [Python-ideas] Rename python.exe to python3.exe on Windows In-Reply-To: <4DC5FFB7.6050605@pearwood.info> References: <87zkmykmvj.fsf@benfinney.id.au> <4DC5FFB7.6050605@pearwood.info> Message-ID: <87pqnsdup6.fsf@uwakimon.sk.tsukuba.ac.jp> Steven D'Aprano writes: > I wish I could find the quote somebody made about Arch being the distro > that makes Gentoo seem cautious and conservative... something about Arch > moving forward so the Gentoo folks know which mistakes not to make? The only thing history teaches us is that nobody learns from others' history: $ python Python 3.1.3 (r313:86834, Feb 22 2011, 18:52:21) [GCC 4.3.5] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> $ There are a couple of ebuilds that break because of this. From ncoghlan at gmail.com Mon May 9 16:04:16 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 10 May 2011 00:04:16 +1000 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us> Message-ID: On Sat, May 7, 2011 at 6:52 AM, Eric Snow wrote: > If you have a list of the submodules you want imported then you can already > accomplish this: > import parent > for mod in parent.__all_submodules__: > ? ? __import__("parent.{}".format(mod)) > Of course, this does not bind the submodules to the namespace of the package > module It actually does, as binding the submodule name in the parent package namespace is part of the responsibility of __import__(): >>> import logging >>> logging.handlers Traceback (most recent call last): File "", line 1, in AttributeError: 'module' object has no attribute 'handlers' >>> __import__("logging.handlers") >>> logging.handlers This is one of the reasons circular imports are such a pain - we pre-bind them in sys.modules, and remove them again if the import fails, but we don't currently do that in the parent package namespace, so circular imports sometimes work and sometime break depending not only on which names are accessed but also *how* they're accessed (e.g. in a/b/c.py, "import a.b.c" will work, "import a.b.c; c = a.b.c" will fail with AttributeError and "from a.b import c" will fail with ImportError). >?I am not sure > of the specific import mechanism with regards to name binding, but that > would seem to be a conflict with the way imported names for submodules are > bound. Nope, it's basically the same as what happens automatically when the modules are imported normally. Indeed, as near as I can tell, this request amounts to asking for syntactic sugar that does something roughly along the lines of: def _subnames(pkg_name, subnames): for subname in subnames: yield ".".join(pkg_name, subname) def import_all(pkg): try: pkg_all = pkg.__all__ except AttributeError: pass else: names = list(_subnames(pkg.__name__, pkg_all)) for name in names: mod = importlib.import_module(name) try: mod_all = mod.__all__ except AttributeError: pass else: names.extend(_subnames(mod.__name__, mod_all) I can see a case being made to provide that as a function in pkgutil (or perhaps importlib itself), but I don't see any reason to give it dedicated syntax. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From grosser.meister.morti at gmx.net Mon May 9 18:43:13 2011 From: grosser.meister.morti at gmx.net (=?ISO-8859-1?Q?Mathias_Panzenb=F6ck?=) Date: Mon, 09 May 2011 18:43:13 +0200 Subject: [Python-ideas] Rename python.exe to python3.exe on Windows In-Reply-To: <87pqnsdup6.fsf@uwakimon.sk.tsukuba.ac.jp> References: <87zkmykmvj.fsf@benfinney.id.au> <4DC5FFB7.6050605@pearwood.info> <87pqnsdup6.fsf@uwakimon.sk.tsukuba.ac.jp> Message-ID: <4DC819A1.3000508@gmx.net> I would say in every Python installation there should be a binary with the version number attached. I think in most (all?) Linux distributions this is already the case. E.g. there is python2.7 and python3.2. There is also python2, that links to some python2.x, and python3 that links to some python3.x, and then there is python, that links to any of the above. Under Linux/Mac OS X we already add a line like this to our scripts: #!/usr/bin/env python Or better: #!/usr/bin/env python3 I say it should be documented that the first is deprecated and the latter form shall be used. Using "#!/usr/bin/env python" should mean "this script is written so that it can be run in *any* python version", which is pretty unrealistic. "#!/usr/bin/env python3" should mean "this script is written so that it can be run in any python 3.x version" and so on. Of course there are scripts that do not use this right. They should be considered as broken and be fixed. (Maybe print deprecation warnings if possible?) Now on Windows there is no #! mechanism. I think it would be worthwhile to fix this and implement a python-dispatcher for Windows. This would then parse the #!-line, drop the "/usr/bin/env" part (if it exists) and lookup the right Python binary form a registry variable. I don't know if there are any registry variables set in a Windows Python installation that let you find the binary of a certain version, but I think it would be a good thing. This way correct scripts would just work under Unix (Linux, Mac, BSD) and Windows. And under Windows you would not have any problems with file type associations. *.py and *.pyw files just have to be associated with the dispatcher. It should not matter if the dispatcher is from a Python 2.x or Python 3.x installation. -panzi On 05/09/2011 12:39 PM, Stephen J. Turnbull wrote: > Steven D'Aprano writes: > > > I wish I could find the quote somebody made about Arch being the distro > > that makes Gentoo seem cautious and conservative... something about Arch > > moving forward so the Gentoo folks know which mistakes not to make? > > The only thing history teaches us is that nobody learns from > others' history: > > $ python > Python 3.1.3 (r313:86834, Feb 22 2011, 18:52:21) > [GCC 4.3.5] on linux2 > Type "help", "copyright", "credits" or "license" for more information. >>>> > $ > > There are a couple of ebuilds that break because of this. From ncoghlan at gmail.com Mon May 9 18:55:32 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 10 May 2011 02:55:32 +1000 Subject: [Python-ideas] Rename python.exe to python3.exe on Windows In-Reply-To: <4DC819A1.3000508@gmx.net> References: <87zkmykmvj.fsf@benfinney.id.au> <4DC5FFB7.6050605@pearwood.info> <87pqnsdup6.fsf@uwakimon.sk.tsukuba.ac.jp> <4DC819A1.3000508@gmx.net> Message-ID: On Tue, May 10, 2011 at 2:43 AM, Mathias Panzenb?ck wrote: > Now on Windows there is no #! mechanism. I think it would be worthwhile to > fix this and implement a python-dispatcher for Windows. This would then > parse the #!-line, drop the "/usr/bin/env" part (if it exists) and lookup > the right Python binary form a registry variable. I don't know if there are > any registry variables set in a Windows Python installation that let you > find the binary of a certain version, but I think it would be a good thing. > > This way correct scripts would just work under Unix (Linux, Mac, BSD) and > Windows. And under Windows you would not have any problems with file type > associations. *.py and *.pyw files just have to be associated with the > dispatcher. It should not matter if the dispatcher is from a Python 2.x or > Python 3.x installation. Since this came up not all that long ago, I'll point people to PEP 394 (for the current draft recommendation regarding symlinks on *nix systems) and PEP 397 (for proposed Windows launcher semantics). Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From guido at python.org Mon May 9 19:02:59 2011 From: guido at python.org (Guido van Rossum) Date: Mon, 9 May 2011 10:02:59 -0700 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us> Message-ID: On Mon, May 9, 2011 at 7:04 AM, Nick Coghlan wrote: > This is one of the reasons circular imports are such a pain - we > pre-bind them in sys.modules, and remove them again if the import > fails, but we don't currently do that in the parent package namespace, > so circular imports sometimes work and sometime break depending not > only on which names are accessed but also *how* they're accessed (e.g. > in a/b/c.py, "import a.b.c" will work, "import a.b.c; c = a.b.c" will > fail with AttributeError and "from a.b import c" will fail with > ImportError). Maybe that's something we could strive to fix? > I can see a case being made to provide that as a function in pkgutil > (or perhaps importlib itself), but I don't see any reason to give it > dedicated syntax. +1 -- --Guido van Rossum (python.org/~guido) From ncoghlan at gmail.com Mon May 9 19:16:52 2011 From: ncoghlan at gmail.com (Nick Coghlan) Date: Tue, 10 May 2011 03:16:52 +1000 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us>

Message-ID: On Tue, May 10, 2011 at 3:02 AM, Guido van Rossum wrote: > On Mon, May 9, 2011 at 7:04 AM, Nick Coghlan wrote: >> This is one of the reasons circular imports are such a pain - we >> pre-bind them in sys.modules, and remove them again if the import >> fails, but we don't currently do that in the parent package namespace, >> so circular imports sometimes work and sometime break depending not >> only on which names are accessed but also *how* they're accessed (e.g. >> in a/b/c.py, "import a.b.c" will work, "import a.b.c; c = a.b.c" will >> fail with AttributeError and "from a.b import c" will fail with >> ImportError). > > Maybe that's something we could strive to fix? The relevant bug is still open: http://bugs.python.org/issue992389 My recollection is that the division of responsibility between the core import code and PEP 302 loaders gets a little confused on this point (although I don't recall if that's a real confusion or just an artefact of the structure of the legacy import code). It will hopefully be a little easier to fix once importlib takes over from import.c and the pre-PEP 302 legacy stuff goes away. Cheers, Nick. -- Nick Coghlan?? |?? ncoghlan at gmail.com?? |?? Brisbane, Australia From ericsnowcurrently at gmail.com Mon May 9 20:55:20 2011 From: ericsnowcurrently at gmail.com (Eric Snow) Date: Mon, 9 May 2011 12:55:20 -0600 Subject: [Python-ideas] Allow 'import star' with namespaces In-Reply-To: References: <5E25C96030E66B44B9CFAA95D3DE5919351310A7AE@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7AF@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B2@EX-MB08.ohsu.edu> <5E25C96030E66B44B9CFAA95D3DE5919351310A7B4@EX-MB08.ohsu.edu> <4DC45610.3040803@stoneleaf.us> Message-ID: On Mon, May 9, 2011 at 8:04 AM, Nick Coghlan wrote: > On Sat, May 7, 2011 at 6:52 AM, Eric Snow > wrote: > > > If you have a list of the submodules you want imported then you can > already > > accomplish this: > > import parent > > for mod in parent.__all_submodules__: > > __import__("parent.{}".format(mod)) > > > Of course, this does not bind the submodules to the namespace of the > package > > module > > It actually does, as binding the submodule name in the parent package > namespace is part of the responsibility of __import__(): > > >>> import logging > >>> logging.handlers > Traceback (most recent call last): > File "", line 1, in > AttributeError: 'module' object has no attribute 'handlers' > >>> __import__("logging.handlers") > > >>> logging.handlers > > > Well, dang it. Not sure how I missed this before: $ python3 >>> import temp >>> dir(temp) ['__builtins__', '__cached__', '__doc__', '__file__', '__name__', '__package__', '__path__'] $ python3 >>> import temp.mod >>> dir(temp) ['__builtins__', '__cached__', '__doc__', '__file__', '__name__', '__package__', '__path__', 'mod'] So the sub-module name binding mechanism is simply to bind the package module and then bind the submodules to it. However, "import temp.mod as something_else" and "from temp import mod" don't do this, which makes sense. This is one of the reasons circular imports are such a pain - we > pre-bind them in sys.modules, and remove them again if the import > fails, but we don't currently do that in the parent package namespace, > so circular imports sometimes work and sometime break depending not > only on which names are accessed but also *how* they're accessed (e.g. > in a/b/c.py, "import a.b.c" will work, "import a.b.c; c = a.b.c" will > fail with AttributeError and "from a.b import c" will fail with > ImportError). > > > I am not sure > > of the specific import mechanism with regards to name binding, but that > > would seem to be a conflict with the way imported names for submodules > are > > bound. > > Nope, it's basically the same as what happens automatically when the > modules are imported normally. Indeed, as near as I can tell, this > request amounts to asking for syntactic sugar that does something > roughly along the lines of: > > def _subnames(pkg_name, subnames): > for subname in subnames: > yield ".".join(pkg_name, subname) > > def import_all(pkg): > try: > pkg_all = pkg.__all__ > except AttributeError: > pass > else: > names = list(_subnames(pkg.__name__, pkg_all)) > for name in names: > mod = importlib.import_module(name) > try: > mod_all = mod.__all__ > except AttributeError: > pass > else: > names.extend(_subnames(mod.__name__, mod_all) > > This works as long as __all__ only contains submodule names, right? > I can see a case being made to provide that as a function in pkgutil > (or perhaps importlib itself), but I don't see any reason to give it > dedicated syntax. > > +1 -eric > Cheers, > Nick. > > -- > Nick Coghlan | ncoghlan at gmail.com | Brisbane, Australia > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mwm at mired.org Tue May 10 16:47:54 2011 From: mwm at mired.org (Mike Meyer) Date: Tue, 10 May 2011 10:47:54 -0400 Subject: [Python-ideas] Minor tweak to PEP 8? Message-ID: <20110510104754.4689cc5e@bhuda.mired.org> PEP eight has an interesting omission in the "Code Layout" section. It doesn't say how to indent continuation lines when code is wrapped to comply with the line length limits. It has examples, but no textual guides. Which means you can do a rock-stupid word warp (with no indentation on the continuation lines), point at the resulting mess, and say "See? If we follow this part of the PEP, we get really ugly code!". Mail doing just that is what prompted this suggestion. I therefore propose adding a sentence or two to this section, something along the lines of: The continuation line(s) should be indented to reflect the structure of the statement being continued. This should be at least one space beyond the first open parenthesis that is not closed on the continued line, if present. Nothing hard and fast, just a requirement to use good sense and the minimal indent resulting from doing so. http://www.mired.org/ Independent Software developer/SCM consultant, email for more information. O< ascii ribbon campaign - stop html mail - www.asciiribbon.org From mikegraham at gmail.com Tue May 10 17:51:33 2011 From: mikegraham at gmail.com (Mike Graham) Date: Tue, 10 May 2011 11:51:33 -0400 Subject: [Python-ideas] Minor tweak to PEP 8? In-Reply-To: <20110510104754.4689cc5e@bhuda.mired.org> References: <20110510104754.4689cc5e@bhuda.mired.org> Message-ID: On Tue, May 10, 2011 at 10:47 AM, Mike Meyer wrote: > PEP eight has an interesting omission in the "Code Layout" section. It > doesn't say how to indent continuation lines when code is wrapped to > comply with the line length limits. It has examples, but no textual > guides. Which means you can do a rock-stupid word warp (with no > indentation on the continuation lines), point at the resulting mess, > and say "See? If we follow this part of the PEP, we get really ugly > code!". Mail doing just that is what prompted this suggestion. > > I therefore propose adding a sentence or two to this section, > something along the lines of: > > ? ?The continuation line(s) should be indented to reflect the > ? ?structure of the statement being continued. This should be at > ? ?least one space beyond the first open parenthesis that is not > ? ?closed on the continued line, if present. > > Nothing hard and fast, just a requirement to use good sense and the > minimal indent resulting from doing so. > > ? ? Message-ID: <87y62ejl2j.fsf@benfinney.id.au> Mike Graham writes: > For this actual rule, I am -1, as I think this is too limiting. And often results in hideous code :-) I'm ?1 also. Please don't make the indentation of continuation lines dependent on the content of the opening line. > Sometimes the indentation is too far and the best style is > > self.other_thing.some_long_method_name( > foo, > barMightBeSortOfLongNaturally, > baz........ I assume you meant a four-column (not three-column) additional indent. +1 if so, this matches the indentation style I advocate for continuation lines. -- \ ?I believe in making the world safe for our children, but not | `\ our children's children, because I don't think children should | _o__) be having sex.? ?Jack Handey | Ben Finney From sklass at pointcircle.com Wed May 11 04:59:16 2011 From: sklass at pointcircle.com (Steven Klass) Date: Tue, 10 May 2011 19:59:16 -0700 Subject: [Python-ideas] Minor tweak to PEP 8? In-Reply-To: <87y62ejl2j.fsf@benfinney.id.au> References: <20110510104754.4689cc5e@bhuda.mired.org> <87y62ejl2j.fsf@benfinney.id.au> Message-ID: +1 for this method self.some_insane_long_method_which_should_have_originator_shot( True, None, keyword = foobar) :-) On Tue, May 10, 2011 at 2:35 PM, Ben Finney wrote: > Mike Graham writes: > > > For this actual rule, I am -1, as I think this is too limiting. > > And often results in hideous code :-) > > I'm ?1 also. Please don't make the indentation of continuation lines > dependent on the content of the opening line. > > > Sometimes the indentation is too far and the best style is > > > > self.other_thing.some_long_method_name( > > foo, > > barMightBeSortOfLongNaturally, > > baz........ > > I assume you meant a four-column (not three-column) additional indent. > > +1 if so, this matches the indentation style I advocate for continuation > lines. > > -- > \ ?I believe in making the world safe for our children, but not | > `\ our children's children, because I don't think children should | > _o__) be having sex.? ?Jack Handey | > Ben Finney > > _______________________________________________ > Python-ideas mailing list > Python-ideas at python.org > http://mail.python.org/mailman/listinfo/python-ideas > -- Steven M. Klass ? 1 (480) 225-1112 ? sklass at pointcircle.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From cmjohnson.mailinglist at gmail.com Wed May 11 05:19:29 2011 From: cmjohnson.mailinglist at gmail.com (Carl M. Johnson) Date: Tue, 10 May 2011 17:19:29 -1000 Subject: [Python-ideas] Minor tweak to PEP 8? In-Reply-To: References: <20110510104754.4689cc5e@bhuda.mired.org> <87y62ejl2j.fsf@benfinney.id.au> Message-ID: Can we all at least agree that continuation lines should always be at least one space more indented than the parent line? So, for example, this would be right out: for item in items: modified_item = self.frobincation_with_spengulizer( item, True, False, spam=None) The arguments should at least line up with the o in modified, if not the f. -------------- next part -------------- An HTML attachment was scrubbed... URL: