[Distutils] The Wheel specification and Unicode filenames
Daniel Holth
dholth at gmail.com
Thu Feb 21 16:22:10 CET 2013
On Thu, Feb 21, 2013 at 10:13 AM, Vinay Sajip <vinay_sajip at yahoo.co.uk>wrote:
> The Wheel specification talks about supporting Unicode in the filename of
> wheel
> files, but is mute on the subject of the names of the entries in the
> archive.
>
> It would be good to have clarity on this point. The Python docs for 2.x
> and 3.x
> tell us:
>
> There is no official file name encoding for ZIP files. If you have
> unicode
> file names, you must convert them to byte strings in your desired
> encoding
> before passing them to write(). WinZip interprets all file names as
> encoded
> in CP437, also known as DOS Latin.
>
> The "your desired encoding" is, I think, too loose for wheel files, as we
> want
> interoperability between implementations. We should mandate CP437 encoding
> if we
> want the files to be examinable on Windows in e.g. WinZip or 7-Zip. On
> Linux,
> file-roller seems to be unable to display Unicode, whether you use CP437
> for the
> filenames or whether you use utf-8.
>
I feign ignorance of any coding that is not utf-8.
http://hg.python.org/cpython/file/d49685548a7a/Lib/zipfile.py#l404
http://hg.python.org/cpython/file/d49685548a7a/Lib/zipfile.py#l1000
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/distutils-sig/attachments/20130221/b48ce4c3/attachment.html>
More information about the Distutils-SIG
mailing list