python xml DOM? pulldom? SAX?
googlenews at tooper.org
Mon Aug 29 18:38:07 CEST 2005
I'd advocate using SAX, as DOM-based methods imply loading the
complete XML content into memory, whereas SAX handles elements on the fly
as they are parsed. The SAX approach should therefore be faster and use
less memory, especially on large files...
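
For what it's worth, here is a rough sketch of what the SAX route could
look like with the standard library's xml.sax module. The element names
(page, title, revision, text) are assumed from the task described in this
thread; the PageTextHandler class, the combine_pages helper, the <pages>
root element and the sample data are all made up for illustration, so
adapt them to the real file.

```python
import io
import xml.sax


class PageTextHandler(xml.sax.ContentHandler):
    """Collect title and revision text for each <page> element."""

    def __init__(self):
        super().__init__()
        self.path = []    # stack of currently open element names
        self.pages = []   # one (title, text) tuple per finished page
        self.title = []
        self.text = []

    def startElement(self, name, attrs):
        self.path.append(name)
        if name == "page":
            # Start fresh buffers for every new page.
            self.title, self.text = [], []

    def characters(self, content):
        # characters() may be called several times for one text node,
        # so the pieces are buffered and joined when the page ends.
        if self.path[-2:] == ["page", "title"]:
            self.title.append(content)
        elif self.path[-3:] == ["page", "revision", "text"]:
            self.text.append(content)

    def endElement(self, name):
        self.path.pop()
        if name == "page":
            self.pages.append(("".join(self.title), "".join(self.text)))


def combine_pages(source):
    """Parse a filename or binary stream, return (title, text) tuples."""
    handler = PageTextHandler()
    xml.sax.parse(source, handler)
    return handler.pages


# Hypothetical sample input, only to show the handler at work.
SAMPLE = b"""<pages>
  <page><title>Foo</title><revision><text>Hello</text></revision></page>
  <page><title>Bar</title><revision><text>World</text></revision></page>
</pages>"""

pages = combine_pages(io.BytesIO(SAMPLE))
for title, text in pages:
    print(title, text)
```

This version still keeps every page in a list, which is fine for a demo;
on a really big file you would rather write each page out in endElement
so that only the current page is held in memory.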
By the way, if your goal is just to "combine the text out of page:title
and page:revision:text for every single page element", maybe you should
also consider an XSLT filter.