[New-bugs-announce] [issue12008] HtmlParser non-strict goes wrong with unquoted attributes
report at bugs.python.org
Thu May 5 12:47:28 CEST 2011
New submission from svilen dobrev <az at svilendobrev.com>:
Non-strict mode seems to eat too far into the data and runs past the endpos of the chunk being processed; the parser then gets confused and treats everything that follows as data. I did not work out how to fix the regexp itself, but instead limited its span to :endpos so that it does not eat too much.
This seems to happen with unquoted attributes.
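A minimal sketch of the effect described above, under stated assumptions: the `ATTR` regexp, the `parse_attr` helper and the sample markup are hypothetical illustrations, not the pattern used by html.parser and not the contents of the attached diff. It only shows how passing `endpos` to a compiled pattern's `match()` keeps an over-greedy unquoted-value match inside the start tag.

```python
import re

# Hypothetical, simplified attribute pattern (NOT the html.parser regexp).
# The unquoted-value branch is deliberately too greedy: it also accepts '>'.
ATTR = re.compile(r'\s*([a-zA-Z_][-.:a-zA-Z0-9_]*)\s*=\s*([^\s]*)')

def parse_attr(rawdata, pos, endpos=None):
    """Match one attribute at pos; if endpos is given, never look past it."""
    if endpos is None:
        m = ATTR.match(rawdata, pos)
    else:
        # Pattern.match(string, pos, endpos) treats the string as if it
        # ended at endpos, so the match cannot run into the following data.
        m = ATTR.match(rawdata, pos, endpos)
    return m.group(1), m.group(2), m.end()

rawdata = '<a href=foo>text</a>'
pos = 2                       # just after '<a'
endpos = rawdata.index('>')   # end of the start tag

unbounded = parse_attr(rawdata, pos)          # value swallows '>text</a>'
bounded = parse_attr(rawdata, pos, endpos)    # value stops at 'foo'
print(unbounded)
print(bounded)
```

Bounding the span works around the symptom without changing the pattern; tightening the unquoted-value character class would be the alternative that the report leaves open.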
title: HtmlParser non-strict goes wrong with unquoted attributes
Added file: http://bugs.python.org/file21893/html.parser.diff
Python tracker <report at bugs.python.org>