30 Jun 2000 07:21:27 -0500
Paul Prescod <firstname.lastname@example.org> writes:
> [from my vantage point the Internet is doing strange things right now
> but I'll give it a try anyhow]
> > >You just said that Pyxie can work directly from the output of pyexat.
> > Yes. Internally, in order to avoid the unnecessary overhead
> > of forking a subprocess, Pyxie uses Pyexpat to parse
> > XML and create a PYX stream.
> This is the "double parsing" I mentioned. If Pyxie is parsing a one
> gigabyte document (as an extreme example) it needs 1 gigabyte of
> extra disk space for its tempfile. Fredrick's pull parsing technique
> can eliminate this and eliminate the need to use pyx
> internally. With pulldom, I can parse a gigabyte document with 0
> bytes free disk space and as little as 1K of RAM (above and beyond
> that required by Python+modules).
Sean, is "create a PYX stream" correct?
I read between the lines there and assumed Pyxie used pyexpat to parse
the XML and "create PYX [events]", so no subprocess was used. In
which case the double parse isn't happening.