> Another point:
> sax is painful to use compared to full lxml (dom)
> But then sax is the only choice when files cross a certain size
> That's why the above question

No matter what the choice of XML parser, I suspect you'll want to convert the data to some other form for processing.
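
For what it's worth, here is a rough sketch of what "convert to some other form" could look like without going all the way to SAX. It uses the standard library's `xml.etree.ElementTree.iterparse` (lxml offers a similar `iterparse`), which parses incrementally but still hands you small element objects. The `record` tag name, the sample document, and the `iter_records` helper are all made up for illustration; your real file layout will differ.

```python
import io
import xml.etree.ElementTree as ET


def iter_records(source, tag="record"):
    """Yield one plain dict per <tag> element without keeping the whole tree.

    `tag` is a hypothetical element name; adjust it to the real document.
    """
    # Request "start" events as well, so the first event hands us the root.
    context = ET.iterparse(source, events=("start", "end"))
    _, root = next(context)
    for event, elem in context:
        if event == "end" and elem.tag == tag:
            # Flatten attributes and child text into an ordinary dict.
            row = dict(elem.attrib)
            for child in elem:
                row[child.tag] = child.text
            yield row
            # Discard what has been processed so memory use stays bounded.
            root.clear()


# Tiny in-memory stand-in for a large file (invented sample data).
sample = io.BytesIO(
    b"<data>"
    b"<record id='1'><name>a</name></record>"
    b"<record id='2'><name>b</name></record>"
    b"</data>"
)
rows = list(iter_records(sample))
```

In practice you would pass a file name or an open file instead of the `BytesIO` object and consume the generator lazily rather than calling `list()` on it, so that only one record is held at a time.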