[Patches] [ python-Patches-670664 ] HTMLParser.py - more robust <SCRIPT> parsing

SourceForge.net noreply@sourceforge.net
Fri, 24 Jan 2003 19:58:04 -0800


Patches item #670664, was opened at 2003-01-19 14:07
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=670664&group_id=5470

Category: Library (Lib)
Group: Python 2.3
Status: Open
Resolution: None
Priority: 5
Submitted By: j paulson (fantoozler)
Assigned to: Nobody/Anonymous (nobody)
Summary: HTMLParser.py - more robust <SCRIPT> parsing

Initial Comment:
http://www.ebay.com contains a script element of the form

<SCRIPT>
...
   vbscript += "</SCR"+"IPT> \n";
...
</SCRIPT>

which is not enclosed in "<!-- ... -->" comments.  The parser 
choked on that line, indicating it was a mal-formed end tag.

The changes are:

  interesting_cdata is now a dict mapping start tag to
    an re matching the end tag, a "<--" or \Z

  HTMLParser.set_cdata_mode takes an extra argument, 
    the start tag


----------------------------------------------------------------------

>Comment By: j paulson (fantoozler)
Date: 2003-01-25 03:58

Message:
Logged In: YES 
user_id=690612

Found regression test, used it, found error, fixed it.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=305470&aid=670664&group_id=5470