[issue10599] sgmllib.parse_endtag() is not respecting quoted text

Michael Brooks report at bugs.python.org
Wed Dec 1 22:21:49 CET 2010


New submission from Michael Brooks <firealwaysworks at gmail.com>:

The attached example is a very simple use of sgmllib that tries to parse:
<input value="><a href=http://bug>link</a>">

The bug is that sgmllib parses the <a href=http://bug> inside the quoted attribute value as if it were a real tag. Browsers, on the other hand, treat the entire quoted string as the input's value.
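
To illustrate the difference, here is a minimal sketch of the two ways a parser can look for the end of a tag. It is not sgmllib's code: find_tag_end is a hypothetical helper written only for this illustration, and it simplifies by treating any quote character inside a tag as the start of a quoted value.

```python
def find_tag_end(text, start=0):
    """Return the index of the '>' that closes the tag opening at `start`.

    Hypothetical helper, not part of sgmllib. A '>' inside a single- or
    double-quoted attribute value is skipped. Returns -1 if the tag is
    never closed.
    """
    quote = None  # the quote character we are currently inside, if any
    for i in range(start, len(text)):
        ch = text[i]
        if quote:
            if ch == quote:
                quote = None      # closing quote: back to plain tag text
        elif ch in ('"', "'"):
            quote = ch            # opening quote: ignore '>' until it closes
        elif ch == '>':
            return i
    return -1


html = '<input value="><a href=http://bug>link</a>">'
naive_end = html.find('>')        # stops at the '>' inside the quotes
quoted_end = find_tag_end(html)   # stops at the '>' that closes <input>

print(html[:naive_end + 1])       # what a quote-unaware scan takes as the tag
print(html[:quoted_end + 1])      # what a quote-aware scan takes as the tag
```

The quote-unaware scan cuts the tag off right after value=", which leaves the <a href=http://bug> exposed to be parsed as a tag of its own. The quote-aware scan consumes the whole input element, which matches what browsers do.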

Also keep in mind that escaping of quote marks in HTML does not work the way it does in Python: \" is not an escaped literal ". Thus <input value="\"><a href=http://bug>link</a>"> is still quoted text, and the href should not be parsed.

Thank you

----------
components: None
files: sgmllib_bug.py
messages: 123016
nosy: Michael.Brooks
priority: normal
severity: normal
status: open
title: sgmllib.parse_endtag() is not respecting quoted text
type: behavior
versions: Python 2.6
Added file: http://bugs.python.org/file19895/sgmllib_bug.py

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue10599>
_______________________________________

