[XML-SIG] Pyxie

Ken MacLeod ken@bitsko.slc.ut.us
30 Jun 2000 07:21:27 -0500


Paul Prescod <paul@prescod.net> writes:

> [from my vantage point the Internet is doing strange things right now
> but I'll give it a try anyhow]
> 
> > >You just said that Pyxie can work directly from the output of pyexat.
> > 
> > Yes. Internally, in order to avoid the unnecessary overhead
> > of forking a subprocess, Pyxie uses Pyexpat to parse
> > XML and create a PYX stream.
> 
> This is the "double parsing" I mentioned. If Pyxie is parsing a one
> gigabyte document (as an extreme example) it needs 1 gigabyte of
> extra disk space for its tempfile. Fredrick's pull parsing technique
> can eliminate this and eliminate the need to use pyx
> internally. With pulldom, I can parse a gigabyte document with 0
> bytes free disk space and as little as 1K of RAM (above and beyond
> that required by Python+modules).

Sean, is "create a PYX stream" correct?

I read between the lines there and assumed Pyxie used pyexpat to parse
the XML and "create PYX [events]", so no subprocess was used.  In
which case the double parse isn't happening.

  -- Ken