[XML-SIG] building XML docs using ?
Joe Murray
jmurray@agyinc.com
Mon, 14 May 2001 11:22:05 -0700
Dear All,
I am converting many large "legacy" text files to XML. Some of the
original text files are upwards of 100 MB. What is the most efficient,
using the speed/memory metrics, way to convert these text files to XML?
Currently, I parse through the text files and create a DOM Document
representation. However, the time and memory expenditure for conversion
is huge, using either xml.dom.minidom or xml.dom. Here's an example of
what I do:
----------
# import stuff
from xml.dom.minidom import Document
# create doc and documentElement node
doc = Document()
docelement = doc.appendChild(...)
f = open(...)
..
while 1:
# get data from file
line = f.readline()
if not line:
break
line = line.strip()
data = line.split(...)
# create a new element node using data from file
node = doc.createElement(...)
node.setAttribute(...)
node.appendChild(...)
docelement.appendChild(node)
...
----------
Should I forgo the ease of using the DOM objects by simply generating
outputting "hand-generated" markup? I was doing this previously, it's
efficient, but definitely not as nice/clean as it could be...
So basically, is there a lightweight XML module which provides for (as a
graphics programmer would say) "immediate mode" output, with as nice an
interface as the DOM modules? Oh, and BTW, can XML solve all my
problems??? ;-)
Thanks much,
joe
--
Joseph Murray
Bioinformatics Specialist, AGY Therapeutics
290 Utah Avenue, South San Francisco, CA 94080
(650) 228-1146