[Python-Dev] Fixing the XML batteries

Paul Moore p.f.moore at gmail.com
Fri Dec 9 19:24:41 CET 2011


On 9 December 2011 18:15, Bill Janssen <janssen at parc.com> wrote:
> I use ElementTree for parsing valid XML, but minidom for producing it.
>
> I think another thing that might go into "refreshing the batteries" is a
> feature comparison of BeautifulSoup and HTML5lib against the stdlib
> competition, to see what needs to be added/revised.  Having to switch to
> an outside package for parsing possibly invalid HTML is a pain.

For what little use I make of XML/HTML parsing, I use lxml, simply
because it has a parser that covers the sort of HTML I have to deal
with in real life. As I have lxml installed, I use it for any XML
parsing tasks, just because I'm used to it.

Paul


More information about the Python-Dev mailing list