<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">On Tue, Mar 4, 2014 at 5:23 AM, Stephen J. Turnbull <span dir="ltr"><<a href="mailto:stephen@xemacs.org" target="_blank">stephen@xemacs.org</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="">>>>>> Guido van Rossum writes:<br>
<br>
> Given that the claim "Python 2 doesn't support Unicode filenames"<br>
> is factually incorrect (in Python 2.7, most filesystem calls in<br>
> fact do support Unicode, at least on some platforms),<br>
<br>
</div>I don't understand what "support Unicode" means. Just that<br>
<br>
with open(u"\u4e00", "w") as f: f.write("works!\n")<br>
<br>
does what is expected[1] if the user knows what he is doing (ie, has<br>
set PYTHONIOENCODING to a Unicode UTF or one of the Asian encodings)?<br><div class=""></div></blockquote><div><br></div><div>That's all I'm asking for, since that's what most functions in 2.7 already do.<br>
</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="">
> I think individual functions in the os module that are found<br>
> lacking should be considered bugs, and if someone goes through<br>
> the effort to supply an otherwise acceptable fix, we shouldn't<br>
> reject it on the basis that we don't want to consider supporting<br>
> Unicode filenames.<br>
<br>
</div>As above, "acceptable fix" means take whatever the current value is<br>
for file system name encoding, and use that to encode and decode<br>
unicode objects to/from str, or raise a UnicodeError if it doesn't<br>
work?<br></blockquote><div><br></div><div>The same thing that is done for other functions that take filenames. <br><br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
I think it's important to define this somewhat carefully, because this<br>
is an area that has a strong tendency to "mission creep". Given that<br>
builtin open "works" by the above definition, I guess it's reasonable<br>
to accept such patches.<br></blockquote><div><br></div><div>Right, the interpretation given to Unicode filenames by builtin open() should be propagated to other functions (I actually suspect that os.statvfs(), which apparently doesn't, is in the minority here). AFAIK that's also roughly what happens in Python 3.<br>
<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
Footnotes:<br>
[1] It writes the line "works!\n" to a file whose name consists of the<br>
single Chinese character for "one".<br>
<br>
<br>
</blockquote></div><br><br clear="all"><br>-- <br>--Guido van Rossum (<a href="http://python.org/~guido">python.org/~guido</a>)
</div></div>