[New-bugs-announce] [issue36975] csv: undocumented UnicodeDecodeError on malformed file

alter-bug-tracer report at bugs.python.org
Mon May 20 14:13:43 EDT 2019


New submission from alter-bug-tracer <emanuel.vintila at abugtracer.ro>:

UnicodeDecodeError is thrown instead of csv.Error when parsing malformed inputs.
Examples:
1. file0
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd5 in position 0: invalid continuation byte
Traceback (most recent call last):
  File "csv_parser.py", line 6, in <module>
    for row in reader:
  File "/usr/local/lib/python3.8/csv.py", line 111, in __next__
    row = next(self.reader)
  File "/usr/local/lib/python3.8/codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
2. file1
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 51: invalid start byte
Traceback (most recent call last):
  File "csv_parser.py", line 6, in <module>
    for row in reader:
  File "/usr/local/lib/python3.8/csv.py", line 110, in __next__
    self.fieldnames
  File "/usr/local/lib/python3.8/csv.py", line 97, in fieldnames
    self._fieldnames = next(self.reader)
  File "/usr/local/lib/python3.8/codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)

(file0, file1 and csv_parser.py attached)

----------
files: csv.zip
messages: 342939
nosy: alter-bug-tracer
priority: normal
severity: normal
status: open
title: csv: undocumented UnicodeDecodeError on malformed file
type: behavior
versions: Python 3.8
Added file: https://bugs.python.org/file48339/csv.zip

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue36975>
_______________________________________


More information about the New-bugs-announce mailing list