I would like to solicit this group's thoughts on how to reconcile the Set abstract base class with the API for built-in set objects (see http://bugs.python.org/issue8743 ). I've been thinking about this issue for a good while and the RightThingToDo(tm) isn't clear.
Here's the situation:
Binary operators for the built-in set object restrict their "other" argument to instances of set, frozenset, or one of their subclasses. Otherwise, they return NotImplemented. This design was intentional (it was part of the original pure-Python version, it is unit-tested behavior, and it is a documented restriction). It allows other classes to "see" the NotImplemented and have a chance to take over using __ror__, __rand__, etc. Also, by not accepting any iterable, it prevents little coding atrocities or possible mistakes like "s | 'abc'". This is a break with what is done for lists (Guido has previously lamented that list.__add__ accepting any iterable is one of his "regrets"). This design has been in place for several years and so far everyone has been happy with it (no bug reports, feature requests, or discussions on the newsgroup, etc). If someone needs to process a non-set iterable, the named set methods (like intersection, update, etc) all accept any iterable and provide an immediate, usable alternative.
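To make the restriction concrete, here is a small sketch of the behavior described above (Python 3 syntax):

```python
s = {1, 2, 3}

# The binary operator rejects non-set operands by returning NotImplemented,
# which surfaces as a TypeError when no reflected method takes over:
assert s.__or__('abc') is NotImplemented

# The named methods accept any iterable instead:
assert s.union('abc') == {1, 2, 3, 'a', 'b', 'c'}
```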
In contrast, the Set and MutableSet abstract base classes in Lib/_abcoll.py take a different approach. They specify that something claiming to be set-like will accept any-iterable for a binary operator (IOW, the builtin set object does not comply). The provided mixins (such as __or__, __and__, etc) are implemented that way and it works fine. Also, the Set and MutableSet API do not provide named methods such as update, intersection, difference, etc. They aren't really needed because the operator methods already provide the functionality and because it keeps the Set API to a reasonable minimum.
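For illustration, here is a minimal sketch of the mixin behavior (ListSet is a hypothetical class; the ABC lives in collections/_abcoll in 2.x and collections.abc in later 3.x):

```python
from collections.abc import Set   # collections.Set in the 2.x / 3.1 era

class ListSet(Set):
    """A hypothetical Set ABC implementation backed by a list."""
    def __init__(self, iterable=()):
        self._items = []
        for v in iterable:
            if v not in self._items:
                self._items.append(v)
    def __contains__(self, v):
        return v in self._items
    def __iter__(self):
        return iter(self._items)
    def __len__(self):
        return len(self._items)

# The inherited __and__ mixin accepts *any* iterable, including a string:
assert sorted(ListSet('abc') & 'bcd') == ['b', 'c']
```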
All of this is well and good, but the two don't interoperate. You can't get an instance of the Set ABC to work with a regular set, nor do regular sets comply with the ABC. These are problems because they defeat some of the design goals for ABCs.
We have a few options:
1. Liberalize setobject.c binary operator methods to accept anything registered to the Set ABC and add a backwards incompatible restriction to the Set ABC binary operator methods to only accept Set ABC instances (they currently accept any iterable).
This approach has a backwards incompatible tightening of the Set ABC, but that will probably affect very few people. It also has the disadvantage of not providing a straightforward way to handle general iterable arguments (either the implementer needs to write named binary methods like update, difference, etc for that purpose or the user will need to cast the iterable to a set before operating on it). The positive side of this option is that it keeps the current advantages of the setobject API and its NotImplemented return value.
1a. Liberalize setobject.c binary operator methods, restrict SetABC methods, and add named methods (like difference, update, etc) that accept any iterable.
2. We could liberalize builtin set objects to accept any iterable as an "other" argument to a binary set operator. This choice is not entirely backwards compatible because it would break code depending on being able to run __ror__, __rand__, etc after a NotImplemented value is returned. That being said, I think it unlikely that such code exists. The real disadvantage is that it replicates the problems with list.__add__, and Guido has said before that he doesn't want to do that again.
I was leaning towards #1 or #1a and the guys on IRC thought #2 would be better. Now I'm not sure and would like additional input so I can get this bug closed for 3.2. Any thoughts on the subject would be appreciated.
Thanks,
Raymond
P.S. I also encountered a small difficulty in implementing #2 that would still need to be resolved if that option is chosen.
http://bugs.python.org/issue9675
Long story short: Python 2.7 backported Capsule support and
(incorrectly, in my opinion) marked CObject as deprecated.
All C modules in the stdlib were updated to Capsule (with a CObject
compatibility layer), except BSDDB, because this change was done late in
the cycle, the proposed patch was buggy (though fixable), and a
pronouncement was made that CObject was not actually deprecated.
But in the Python 2.7 release, CObject is marked as deprecated (argh!),
so when executing Python with -We (mark warnings as errors), bsddb fails.
Since I think that adopting Capsule in BSDDB for 2.7.1 would break the
API compatibility (maybe the CObject proxy would solve this), and since
a previous pronouncement was made about CObject not being deprecated in 2.7.x,
I would like comments.
The full history and links to previous pronouncements are at
http://bugs.python.org/issue9675
My proposal: CObject should not be marked as deprecated in 2.7.1.
Thanks for your time and attention.
- --
Jesus Cea Avion _/_/ _/_/_/ _/_/_/
jcea(a)jcea.es - http://www.jcea.es/ _/_/ _/_/ _/_/ _/_/ _/_/
jabber / xmpp:jcea@jabber.org _/_/ _/_/ _/_/_/_/_/
. _/_/ _/_/ _/_/ _/_/ _/_/
"Things are not so easy" _/_/ _/_/ _/_/ _/_/ _/_/ _/_/
"My name is Dump, Core Dump" _/_/_/ _/_/_/ _/_/ _/_/
"El amor es poner tu felicidad en la felicidad de otro" - Leibniz
I would like to recommend that the Python core developers start using
a code review tool such as Rietveld or Reviewboard. I don't really
care which tool we use (I'm sure there are plenty of pros and cons to
each) but I do think we should get out of the stone age and start
using a tool for the majority of our code reviews.
While I would personally love to see Rietveld declared the official
core Python code review tool, I realize that since I wrote it as a Google
engineer and it is running on Google infrastructure (App Engine), I
can't be fully objective about the tool choice -- even though it is
open source, has several non-Googler maintainers, and can be run
outside App Engine as well.
But I do think that using a specialized code review tool rather than
unstructured email plus a general-purpose issue tracker can hugely
improve developer performance and also increase community
participation. (A code review tool makes it much more convenient for a
senior reviewer to impart their wisdom to a junior developer without
appearing judgmental or overbearing.)
See also this buzz thread:
http://www.google.com/buzz/115212051037621986145/At6Rj82Kret/When-will-the-…
--
--Guido van Rossum (python.org/~guido)
Hello everyone.
I see several problems with the two hex-conversion function pairs that
Python offers:
1. binascii.hexlify and binascii.unhexlify
2. bytes.fromhex and bytes.hex
Problem #1:
bytes.hex is not implemented, although it was specified in PEP 358.
This means there is no symmetrical function to accompany bytes.fromhex.
Problem #2:
Both pairs perform the same function, although the Zen of Python suggests
that "There should be one-- and preferably only one --obvious way to do it."
I do not understand why PEP 358 specified the bytes function pair although
it mentioned the binascii pair...
Problem #3:
bytes.fromhex may receive spaces in the input string, although
binascii.unhexlify may not.
I see no good reason for these two functions to have different features.
Problem #4:
binascii.unhexlify may receive both input types: strings or bytes, whereas
bytes.fromhex raises an exception when given a bytes parameter.
Again there is no reason for these functions to be different.
Problem #5:
binascii.hexlify returns a bytes type - although ideally, converting to
hex should always return string types and converting from hex should
always return bytes.
IMO there is no meaning of bytes as an output of hexlify, since the
output is a representation of other bytes.
This is also the suggested behavior of bytes.hex in PEP 358.
Problems #4 and #5 call for a decision about the input and output of the
functions being discussed:
Option A : Strict input and output
unhexlify (and bytes.fromhex) may only receive strings and may only
return bytes
hexlify (and bytes.hex) may only receive bytes and may only return
strings
Option B : Robust input and strict output
unhexlify (and bytes.fromhex) may receive bytes or strings and may only
return bytes
hexlify (and bytes.hex) may receive bytes or strings and may only return
strings
Of course we may also consider a third option, which would allow the
return type of all functions to be robust (perhaps specified in a
keyword argument), but as I wrote in the description of problem #5, I
see no sense in that.
Note that PEP 3137 describes: "... the more strict definitions of encoding
and decoding in
Python 3000: encoding always takes a Unicode string and returns a bytes
sequence, and decoding
always takes a bytes sequence and returns a Unicode string." - suggesting
option A.
To repeat problems #4 and #5, the current behavior does not match any
option:
* The return type of binascii.hexlify should be string, and this is not the
current behavior.
As for the input:
* Option A is not the current behavior because binascii.unhexlify may
receive both input types.
* Option B is not the current behavior because bytes.fromhex does not allow
bytes as input.
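The asymmetries in problems #3-#5 are easy to verify directly (Python 3 syntax; at the time of writing, bytes.hex did not exist yet):

```python
import binascii

assert binascii.hexlify(b'\x01\xff') == b'01ff'   # bytes out, not str (problem #5)
assert bytes.fromhex('01 ff') == b'\x01\xff'      # spaces tolerated (problem #3)
try:
    binascii.unhexlify('01 ff')                   # but spaces are rejected here
except binascii.Error:
    pass
else:
    raise AssertionError('expected binascii.Error')
```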
To fix these issues, three changes should be applied:
1. Deprecate bytes.fromhex. This fixes the following problems:
#4 (go with option B and remove the function that does not allow bytes
input)
#2 (the binascii functions will be the only way to "do it")
#1 (bytes.hex should not be implemented)
2. In order to keep the functionality that bytes.fromhex has over unhexlify,
the latter function should be able to handle spaces in its input (fix #3)
3. binascii.hexlify should return string as its return type (fix #5)
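The proposed behavior could be sketched as thin wrappers over binascii (hypothetical helper names, not the actual proposal):

```python
import binascii

def hexlify_str(data):
    """Hypothetical wrapper: hex-encode bytes, returning a str (fix #5)."""
    return binascii.hexlify(data).decode('ascii')

def unhexlify_lenient(s):
    """Hypothetical wrapper: accept str or bytes and skip spaces (fixes #3/#4)."""
    if isinstance(s, bytes):
        s = s.decode('ascii')
    return binascii.unhexlify(s.replace(' ', ''))

assert hexlify_str(b'\x01\xff') == '01ff'
assert unhexlify_lenient('01 ff') == b'\x01\xff'
```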
Amaury just filed issue #10000 yesterday; as counting started
with 1000, we are now into 9000 roundup issues.
I have become quite fond of roundup over the years, and would
like to thank Ka-Ping Yee, Richard Jones, and Erik Forsberg
for getting us here.
There are many contributions to this infrastructure, both
from individuals and software projects, but I'd like to single
out two of them which I appreciate very much:
the folks at Upfront Hosting have helped a lot to keep the system
running, and the PostgreSQL database has really validated its
own claim of being the world's most advanced open source
database.
Kind regards,
Martin
On Wed, Sep 22, 2010 at 10:38 PM, Brett Cannon <brett(a)python.org> wrote:
> the first thing on the agenda is a complete rewrite of the developer
> docs and moving them into the Doc/ directory
I'd like to know why you think moving the developer docs into the
CPython tree makes sense.
My own thought here is that they're not specific to the version of
Python, though some of the documentation deals with the group of
specific branches being maintained. For me, keeping them in a
separate space (like www.python.org/dev/) makes sense.
-Fred
--
Fred L. Drake, Jr. <fdrake at acm.org>
"A storm broke loose in my mind." --Albert Einstein
I'm rather sad to have been sacked, but such is life. I won't be doing
any more work on the bug tracker for obvious reasons, but hope that you
who have managed to keep your voluntary jobs manage to keep Python going.
Kindest regards.
Mark Lawrence.
Hi all --
I looked through the bug tracker, but I didn't see this listed. I
was trying to use the bz2 codec, but it seems like it's not very
useful in the current form (and I'm not sure if it's getting added
back to py3k, so maybe this is a moot point). It looks like the codec
writes every piece of data fed to it as a separate compressed block.
This results in compressed files which are significantly larger than
the uncompressed files, if you're writing a lot of small bursts of
data. It also leads to interesting oddities like this:
import codecs

with codecs.open('text.bz2', 'w', 'bz2') as f:
    for x in xrange(20):
        f.write('This is data %i\n' % x)

with codecs.open('text.bz2', 'r', 'bz2') as f:
    print f.read()
This prints "This is data 0" and exits, because the codec won't read
beyond the first compressed block.
My question is, is this known, intended behavior? Should I open a bug
report? Is it going away in py3k, so there's no real point in fixing
it?
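For what it's worth, going through the bz2 module directly avoids the per-write compressed blocks, since the whole stream becomes a single compressed member (a workaround sketch, shown in Python 3 syntax; the file path is illustrative):

```python
import bz2
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), 'text.bz2')

# All writes go through one compressor, producing one compressed stream:
with bz2.BZ2File(path, 'w') as f:
    for x in range(20):
        f.write(('This is data %i\n' % x).encode('ascii'))

# Reading back returns every line, not just the first block:
with bz2.BZ2File(path, 'r') as f:
    lines = f.read().decode('ascii').splitlines()

assert lines[0] == 'This is data 0'
assert lines[-1] == 'This is data 19'
```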
-- Chris
I see that Atlassian have just taken over BitBucket, the Mercurial
hosting company. IIRC Atlassian offered to host our issue tracking on
JIRA, but in the end we decided to eat our own dog food and went with
roundup.
I'm wondering if they'd be similarly interested in supporting our Hg
server. Or is self-hosting the only acceptable solution? From recent
mail it looks like we may be up and running on Hg fairly soon.
regards
Steve
--
Steve Holden +1 571 484 6266 +1 800 494 3119
DjangoCon US September 7-9, 2010 http://djangocon.us/
See Python Video! http://python.mirocommunity.org/
Holden Web LLC http://www.holdenweb.com/