Ka-Ping Yee kryee@wheat.uwaterloo.ca
Tue, 11 Feb 1997 16:23:49 -0500

When you mentioned your desire to create a more general and robust
implementation of SGMLParser, i thought, "It might be good to write
the lexical analyzer in C for speed, or even the whole parser as a
compiled module."

Fred Drake wrote:
> Another possibility would be to write another SGMLParser
> implementation around nsgmls output; this keeps the document-type
> stuff similar to what we have now while allowing substantial SGML to
> be used as needed.

That's kind of what i had in mind when i mentioned nsgmls output.
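
To make the nsgmls-output route a bit more concrete, here is a rough
sketch of a reader for the line-oriented text that nsgmls writes,
dispatching to SGMLParser-style callbacks.  The names ESISReader,
Recorder, unescape, handle_starttag, handle_endtag and handle_data are
made up for illustration and are not an existing module.  It assumes
the nsgmls output has already been captured as lines of text, and it
handles only a few of the output commands: "A" (attribute), "("
(start of element), ")" (end of element), "-" (data) and "C"
(document was conforming).

```python
# Sketch only: every class, function and method name here is hypothetical.

OCTAL_DIGITS = "01234567"


def unescape(text):
    """Decode the \\n, \\\\ and \\ooo (octal) escapes used in nsgmls output."""
    out = []
    i = 0
    n = len(text)
    while i < n:
        ch = text[i]
        nxt = text[i + 1] if i + 1 < n else ""
        digits = text[i + 1:i + 4]
        if ch != "\\":
            out.append(ch)
            i += 1
        elif nxt == "n":                      # record end
            out.append("\n")
            i += 2
        elif nxt == "\\":                     # literal backslash
            out.append("\\")
            i += 2
        elif len(digits) == 3 and all(d in OCTAL_DIGITS for d in digits):
            out.append(chr(int(digits, 8)))   # character given by octal code
            i += 4
        else:                                 # other escapes are left as-is
            out.append(ch)
            i += 1
    return "".join(out)


class ESISReader:
    """Turn nsgmls output lines into start tag / end tag / data callbacks."""

    def __init__(self):
        self.pending_attrs = []   # attributes seen since the last start tag
        self.conforming = False   # set when a "C" line is seen

    def feed(self, lines):
        for line in lines:
            line = line.rstrip("\r\n")
            if not line:
                continue
            code, rest = line[0], line[1:]
            if code == "A":
                # "Aname IMPLIED" or "Aname TYPE value"; attribute lines
                # come before the "(" line of the element they belong to.
                fields = rest.split(" ", 2)
                value = unescape(fields[2]) if len(fields) == 3 else None
                self.pending_attrs.append((fields[0].lower(), value))
            elif code == "(":
                attrs, self.pending_attrs = self.pending_attrs, []
                # Lower-casing names is just a choice made in this sketch.
                self.handle_starttag(rest.lower(), attrs)
            elif code == ")":
                self.handle_endtag(rest.lower())
            elif code == "-":
                self.handle_data(unescape(rest))
            elif code == "C":
                self.conforming = True
            # Every other command is ignored in this sketch.

    # Back-ends override these, much as they would with SGMLParser.
    def handle_starttag(self, tag, attrs):
        pass

    def handle_endtag(self, tag):
        pass

    def handle_data(self, data):
        pass


class Recorder(ESISReader):
    """Toy back-end that just records the events it receives."""

    def __init__(self):
        ESISReader.__init__(self)
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag, attrs))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))

    def handle_data(self, data):
        self.events.append(("data", data))
```

A back-end would subclass ESISReader and be fed the lines read from
nsgmls, for instance from a pipe.  Because the document-type handling
stays in the handler methods, the existing way of writing back-ends
would not have to change much.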

But putting together these two ideas, why not a Python interface
to sgmls itself?  James Clark has an API for SP, and as far as i
know the source code is freely available.  Import that as a module,
and you have a super-fast, robust, well-tested SGML parser to play
with from your Python script.  (The back-ends for various output
formats could then be Python scripts.)

Just a thought...


DOC-SIG  - SIG for the Python Documentation Project

send messages to: doc-sig@python.org
administrivia to: doc-sig-request@python.org