[issue10778] decoding_fgets() (tokenizer.c) decodes the filename from the wrong encoding
STINNER Victor
report at bugs.python.org
Mon Dec 27 02:57:03 CET 2010
New submission from STINNER Victor <victor.stinner at haypocalc.com>:
decoding_fgets() decodes the input filename from UTF-8 whereas the filename is encoded to the filesystem encoding. PyUnicode_DecodeFSDefault() should be used.
decoding_fgets() raises a SyntaxError("Non-UTF-8 code starting with '\xHH' in file xxx on line xxx, but no encoding declared; ...").
indenterror() (inconsistent use of tabs and spaces in indentation) and
----------
components: Interpreter Core, Unicode
messages: 124693
nosy: haypo
priority: normal
severity: normal
status: open
title: decoding_fgets() (tokenizer.c) decodes the filename from the wrong encoding
versions: Python 3.2
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue10778>
_______________________________________
More information about the Python-bugs-list
mailing list