[issue15423] Minidom tries to parse tags inside of <script> tag

Ezio Melotti report at bugs.python.org
Mon Jul 23 00:51:17 CEST 2012


Ezio Melotti <ezio.melotti at gmail.com> added the comment:

> Do you mean as a parser with minidom.parseString() or standalone?

Standalone.

> If the second, then the result will be a sequence of handler calls instead of a DOM. This is not bad, but an entirely different story.

Indeed.  If you want a tree you could try BeautifulSoup or lxml; there's no tool in the stdlib that specifically parses an HTML document and builds a tree.
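To illustrate the "sequence of handler calls" point, here is a minimal sketch. It assumes the standalone parser in question is the stdlib's HTMLParser (html.parser in Python 3); the EventCollector class, the collect_events() helper and the SAMPLE string are hypothetical names made up for this example. It only records the calls and does not build a tree.

```python
from html.parser import HTMLParser


class EventCollector(HTMLParser):
    """Hypothetical helper: records handler calls instead of building a DOM."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))

    def handle_data(self, data):
        # Text inside <script> arrives here as plain data; it may be
        # delivered in more than one chunk.
        self.events.append(("data", data))


def collect_events(html):
    """Feed the markup to the parser and return the recorded events."""
    parser = EventCollector()
    parser.feed(html)
    parser.close()
    return parser.events


# Illustrative input: the <b> inside the script is script text, not markup.
SAMPLE = '<body><script>document.write("<b>hi</b>");</script><p>text</p></body>'

for event in collect_events(SAMPLE):
    print(event)
```

In this sketch the <b> element inside the script should never show up as a start tag; it is reported through handle_data() along with the rest of the script text. Turning these events into a tree would be up to the caller.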

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue15423>
_______________________________________

