[Python-Dev] Fixing the XML batteries
Paul Moore
p.f.moore at gmail.com
Fri Dec 9 19:24:41 CET 2011
On 9 December 2011 18:15, Bill Janssen <janssen at parc.com> wrote:
> I use ElementTree for parsing valid XML, but minidom for producing it.
>
> I think another thing that might go into "refreshing the batteries" is a
> feature comparison of BeautifulSoup and HTML5lib against the stdlib
> competition, to see what needs to be added/revised. Having to switch to
> an outside package for parsing possibly invalid HTML is a pain.
For what little use I make of XML/HTML parsing, I use lxml, simply
because it has a parser that covers the sort of HTML I have to deal
with in real life. As I have lxml installed, I use it for any XML
parsing tasks, just because I'm used to it.
Paul
More information about the Python-Dev
mailing list