[ python-Bugs-975556 ] HTMLParser lukewarm on bogus bare attribute chars
SourceForge.net
noreply at sourceforge.net
Fri Jun 18 15:33:18 EDT 2004
Bugs item #975556, was opened at 2004-06-18 14:33
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=975556&group_id=5470
Category: Python Library
Group: Python 2.3
Status: Open
Resolution: None
Priority: 5
Submitted By: Mike Coleman (mkc)
Assigned to: Nobody/Anonymous (nobody)
Summary: HTMLParser lukewarm on bogus bare attribute chars
Initial Comment:
I tripped over the same problem mentioned in bug
#921657 (HTMLParser.py), except that my bogus attribute
char is '|' instead of '@'.
May I suggest that HTMLParser either require strict
compliance with the HTML spec, or alternatively that it
accept everything reasonable? The latter approach
would be much more useful, and it would also be
valuable to have this decision documented.
In particular, 'attrfind' needs to be changed to accept
(following the '=\s*') something like the subpattern
given for 'locatestarttagend' (see the "bare value" line).
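
A minimal sketch of the mismatch described above, for illustration only. The patterns are not copied from HTMLParser.py: the names `strict_attrfind` and `lenient_attrfind`, the helper functions, and the restrictive character class are all assumptions made for this example. The lenient bare-value subpattern (anything up to a quote, '>' or whitespace) is one plausible reading of the "bare value" idea in the report.

```python
import re

# Hypothetical patterns for illustration; not the actual HTMLParser.py source.
NAME = r"[a-zA-Z_][-.:a-zA-Z0-9_]*"
QUOTED = r'''(?:'[^']*'|"[^"]*")'''

# Assumed "strict" bare value: a fixed whitelist of characters (no '|' or '@').
STRICT_BARE = r"[-a-zA-Z0-9./,:;+*%?!&$()_#=~]+"
# Assumed "lenient" bare value: anything up to a quote, '>' or whitespace.
LENIENT_BARE = r'''[^'">\s]+'''


def build_attrfind(bare):
    """Compile an attribute matcher using the given bare-value subpattern."""
    return re.compile(
        r"\s*(" + NAME + r")(?:\s*=\s*(" + QUOTED + "|" + bare + r"))?"
    )


strict_attrfind = build_attrfind(STRICT_BARE)
lenient_attrfind = build_attrfind(LENIENT_BARE)


def attr_value(pattern, text):
    """Return the raw attribute value matched at the start of text, or None."""
    m = pattern.match(text)
    return m.group(2) if m else None


sample = " width=50|60"
print(attr_value(strict_attrfind, sample))   # value is cut short at the '|'
print(attr_value(lenient_attrfind, sample))  # whole bare value is captured
```

With the whitelist, matching stops at the first character outside the set, so a tag-end locator that accepts the full bare value and an attribute finder that does not will disagree about where the attribute ends. Using the same bare-value subpattern in both places avoids that disagreement.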
----------------------------------------------------------------------