[issue8784] tarfile/Windows: Don't use mbcs as the default encoding
Marc-Andre Lemburg
report at bugs.python.org
Thu Jun 10 01:09:12 CEST 2010
Marc-Andre Lemburg <mal at egenix.com> added the comment:
STINNER Victor wrote:
>
> STINNER Victor <victor.stinner at haypocalc.com> added the comment:
>
> I created a TAR archive with the 7-zip archiver of file with diacritics in their name (eg. "é" and "à"). Then I opened the archive with WinRAR: the file names were not displayed correctly :-/
>
> 7-zip encodes "à" (U+00e0) as 0x85 (1 byte), and "é" (U+00e9) as 0x82 (1 byte). I don't know this encoding.
That's an old DOS code paged used in Europe: CP850
http://en.wikipedia.org/wiki/Code_page_850
----------
nosy: +lemburg
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue8784>
_______________________________________
More information about the Python-bugs-list
mailing list