[issue12008] HtmlParser non-strict goes wrong with unquoted attributes
svilen dobrev
report at bugs.python.org
Thu May 5 12:47:28 CEST 2011
New submission from svilen dobrev <az at svilendobrev.com>:
nonstrict mode seems to eat too much into data and gets past endpos of the chunk processed, and parser gets confused and treats any subsequent stuff as data. i didn't think out how to fix the regexp as such, but instead limited its span to :endpos so it doesnot eat too much.
seems to happen with unquoted attributes.
----------
files: html.parser.diff
keywords: patch
messages: 135182
nosy: svilend
priority: normal
severity: normal
status: open
title: HtmlParser non-strict goes wrong with unquoted attributes
Added file: http://bugs.python.org/file21893/html.parser.diff
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue12008>
_______________________________________
More information about the Python-bugs-list
mailing list