[issue20288] HTMLParse handing of non-numeric charrefs broken

Ezio Melotti report at bugs.python.org
Fri Jan 17 19:35:24 CET 2014


Ezio Melotti added the comment:

Thanks for the report, this is indeed a bug.
This behavior was covered by a test (see Lib/test/test_htmlparser.py:164), but _run_check feeds the chars one by one to the parser, and in that case it works correctly.  While feeding the parser a whole chunk I was able to reproduce the bug.  This should be fixed, and the behavior of _run_check should probably be changed too -- maybe it could test both the char-by-char and the regular feeding.

----------
nosy: +r.david.murray
stage:  -> needs patch
versions: +Python 2.7, Python 3.3, Python 3.4

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue20288>
_______________________________________


More information about the Python-bugs-list mailing list