[issue14251] [PATCH]HTMLParser decode issue

Ezio Melotti report at bugs.python.org
Sun Mar 11 03:32:21 CET 2012


Ezio Melotti <ezio.melotti at gmail.com> added the comment:

Can you provide a minimal example to reproduce this error?

On Python 2 it's always better to decode the HTML first and then pass unicode to the parser.  Even though the Python 2 parser accepts byte strings too, there are a few corner cases where it fails.

On Python 3 the parser only accepts unicode, and it should work fine with it (especially if you have an updated clone of cpython).  Can you show what failure you get with Python 3?  Also, can you reproduce the error if you use strict=False?
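
For illustration, here is a minimal sketch of the decode-first approach on Python 3.  It is not taken from the report: the TextCollector class, the extract_text helper, the sample markup and the UTF-8 encoding are all assumptions, and a real page may declare a different charset.

```python
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Hypothetical parser that gathers the text found between tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Called with str (unicode) data only, because feed() received str.
        if data.strip():
            self.chunks.append(data)


def extract_text(raw_bytes, encoding="utf-8"):
    # Decode first; the encoding is an assumption and should really come
    # from the HTTP headers or the document's own charset declaration.
    text = raw_bytes.decode(encoding, errors="replace")
    parser = TextCollector()
    parser.feed(text)
    parser.close()
    return parser.chunks


# Sample input made up for this sketch: bytes as they might arrive
# from a socket or a file opened in binary mode.
raw = "<html><body><p>Caf\u00e9</p></body></html>".encode("utf-8")
print(extract_text(raw))
```

Decoding with errors="replace" keeps malformed input from raising UnicodeDecodeError before the parser ever sees it; whether that is preferable to failing loudly depends on the application.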

----------
assignee:  -> ezio.melotti
nosy: +ezio.melotti
type: crash -> behavior

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue14251>
_______________________________________

