[issue12008] HtmlParser non-strict goes wrong with unquoted attributes

svilen dobrev report at bugs.python.org
Thu May 5 12:47:28 CEST 2011


New submission from svilen dobrev <az at svilendobrev.com>:

nonstrict mode seems to eat too much into data and gets past endpos of the chunk processed, and parser gets confused and treats any subsequent stuff as data. i didn't think out how to fix the regexp as such, but instead limited its span to :endpos so it doesnot eat too much. 
seems to happen with unquoted attributes.

----------
files: html.parser.diff
keywords: patch
messages: 135182
nosy: svilend
priority: normal
severity: normal
status: open
title: HtmlParser non-strict goes wrong with unquoted attributes
Added file: http://bugs.python.org/file21893/html.parser.diff

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue12008>
_______________________________________


More information about the Python-bugs-list mailing list