[Web-SIG] Are both htmllib and HTMLParser needed?
fdrake at gmail.com
Wed Feb 20 16:43:26 CET 2008
On Feb 20, 2008 9:35 AM, Guido van Rossum <guido at python.org> wrote:
> ISTR that HTMLParser was the preferred one. It is certainly newer, and
> doesn't carry the baggage of sgmllib which I would discard together
> with htmllib). Maybe Fred Drake remembers (he's listed as the
> co-author on the initial checkin message).
I was thinking I'd said something on the stdlib-sig list, but I can't
find it in the archive, so I must be having a senior moment (brought
on early by kids).
I'd be in favor of keeping only HTMLParser, with a compliant module
name ("htmlparser" doesn't seem unreasonable). The code was
originally derived from htmllib for the Grail webbrowser, mostly to
make things like attribute handling less painful.
Merging _markupbase into HTMLParser to create htmlparser would be
pretty mechanical. Removing sgmllib and htmllib does not depend on
that, and can be done at any time if there's agreement.
Fred L. Drake, Jr. <fdrake at gmail.com>
"Chaos is the score upon which reality is written." --Henry Miller
More information about the Web-SIG