[New-bugs-announce] [issue10370] py3 readlines() reports wrong offset for UnicodeDecodeError
report at bugs.python.org
Tue Nov 9 02:05:15 CET 2010
New submission from Brian Warner <warner at users.sourceforge.net>:
I noticed that the UnicodeDecodeError exception produced by trying to do open(fn).readlines() (i.e. using the default ASCII encoding) on a file that's actually UTF-8 reports the wrong offset for the first undecodable character. From what I can tell, it reports (offset%4096) instead of the actual offset.
I've attached a test case. It emits "all good" when run against py2.x (well, after converting the print() expressions back into statements), but reports an error at offset 4096 (reported as "0") on py3.1.2 and py3.2a3. I'm running on a Debian (sid) x86 box.
The misreported offset does not occur with read(), just with readlines().
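For readers without access to the attachment, here is a minimal sketch of the kind of comparison described above. It is not the attached test.py: the helper name reported_offsets, the 64-byte line layout, and the explicit encoding="ascii" argument (used so the result does not depend on the locale's default encoding) are all assumptions made for illustration. It writes ASCII lines followed by one UTF-8 encoded character at a known byte offset, then records the `start` attribute of the UnicodeDecodeError raised by read() and by readlines().

```python
import os
import tempfile


def reported_offsets(bad_offset=4096):
    """Return the UnicodeDecodeError.start seen by read() and by readlines().

    Hypothetical helper: the file holds ASCII lines up to bad_offset,
    then one two-byte UTF-8 character (U+00E9) and a final newline.
    """
    line = b"a" * 63 + b"\n"  # 64-byte ASCII lines
    filler = (line * (bad_offset // len(line) + 1))[:bad_offset]
    data = filler + "\u00e9".encode("utf-8") + b"\n"

    fd, path = tempfile.mkstemp()
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(data)

        results = {}
        for method in ("read", "readlines"):
            # encoding="ascii" is forced here; the report relied on the default.
            with open(path, encoding="ascii") as f:
                try:
                    getattr(f, method)()
                except UnicodeDecodeError as exc:
                    results[method] = exc.start
        return results
    finally:
        os.remove(path)


if __name__ == "__main__":
    offsets = reported_offsets(4096)
    print("actual offset: 4096")
    print("read() reported:", offsets["read"])
    print("readlines() reported:", offsets["readlines"])
```

If the two reported values differ, or the readlines() value differs from the actual offset, the interpreter being tested shows the behaviour described in this report. The numbers printed may vary with the Python version and its internal buffer size, so no expected output is given here.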
title: py3 readlines() reports wrong offset for UnicodeDecodeError
versions: Python 3.1, Python 3.2
Added file: http://bugs.python.org/file19552/test.py
Python tracker <report at bugs.python.org>