[Python-Dev] htmllib vs. HTMLParser

Guido van Rossum guido at python.org
Mon Oct 27 14:08:48 EST 2003

> On Mon, Oct 27, 2003 at 08:52:53AM -0800, Guido van Rossum wrote:
> > I'm unclear on what you plan to do -- repeal sgmllib an rewrite
> > htmllib to use HTMLParser internally for a backwards compatible
> > interface?
> Correct; that's what your initial checkin message for HTMLParser.py suggests
> doing, and if I'm touching htmllib.py to add the HTML 4.01 stuff, I may as
> well make the other change, too.  
> > I'm okay with deprecating sgmllib faster than htmllib.
> sgmllib gets deprecated; htmllib never gets deprecated.  HTMLParser is a
> barebones HTML parser that provides no default handlers (handle_head,
> handle_title, etc.), and htmllib extends it, adding default handlers for the
> various things in HTML 4.01.

OK, got it.  Sounds good to me!

--Guido van Rossum (home page: http://www.python.org/~guido/)

More information about the Python-Dev mailing list