[issue10778] decoding_fgets() (tokenizer.c) decodes the filename from the wrong encoding

STINNER Victor report at bugs.python.org
Mon Dec 27 02:57:03 CET 2010


New submission from STINNER Victor <victor.stinner at haypocalc.com>:

decoding_fgets() decodes the input filename from UTF-8, whereas the filename is encoded with the filesystem encoding. PyUnicode_DecodeFSDefault() should be used instead.
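The mismatch can be illustrated at the Python level. This is only a sketch, not the tokenizer.c code: the filename, the Latin-1 filesystem encoding, and both helper functions are assumptions made up for the illustration.

```python
# Hypothetical filename with a non-ASCII character, as the bytes a
# filesystem using Latin-1 (an assumed encoding) would hand back.
RAW_FILENAME = "caf\u00e9.py".encode("latin-1")  # b'caf\xe9.py'


def decode_as_utf8(raw):
    """Mimic the reported behaviour: always assume UTF-8.

    Returns None when the bytes are not valid UTF-8.
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return None


def decode_as_filesystem(raw, fs_encoding="latin-1"):
    """Decode with the encoding the bytes were actually written in.

    fs_encoding is passed explicitly here so the example does not
    depend on the locale of the machine running it.
    """
    return raw.decode(fs_encoding)


utf8_result = decode_as_utf8(RAW_FILENAME)
fs_result = decode_as_filesystem(RAW_FILENAME)
```

Here the byte 0xE9 is not valid UTF-8 on its own, so the UTF-8 path fails while the filesystem-encoding path recovers the name. In real code the encoding would come from the interpreter (for example sys.getfilesystemencoding()) rather than being hard-coded.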

decoding_fgets() raises a SyntaxError("Non-UTF-8 code starting with '\xHH' in file xxx on line xxx, but no encoding declared; ...").

indenterror() (inconsistent use of tabs and spaces in indentation) and

----------
components: Interpreter Core, Unicode
messages: 124693
nosy: haypo
priority: normal
severity: normal
status: open
title: decoding_fgets() (tokenizer.c) decodes the filename from the wrong encoding
versions: Python 3.2

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue10778>
_______________________________________

