SGMLParser bug? can't parse <br/>

Tung Wai Yip tungwaiyip at yahoo.com
Thu May 15 13:56:04 EDT 2003


I try to use sgmllib.SGMLParser to parse the following HTML

----------------
<html>
<body>
  <br/>
</body>
</html>
----------------

The output is rather messed up.

start tag: <html>
data: '\r\n'
start tag: <body>
data: '\r\n  '
start tag: <br>
data: '>\r\n<'            <-- mess up
end tag: </br>
data: 'body>\r\n'         <-- mess up
end tag: </html>
data: '\r\n'

No problem if I use <br> instead of <br/>.

Is there any place to report Python library bugs?

Wai Yip Tung





More information about the Python-list mailing list