[Python-Dev] htmllib vs. HTMLParser
Guido van Rossum
guido at python.org
Mon Oct 27 14:08:48 EST 2003
> On Mon, Oct 27, 2003 at 08:52:53AM -0800, Guido van Rossum wrote:
> > I'm unclear on what you plan to do -- repeal sgmllib an rewrite
> > htmllib to use HTMLParser internally for a backwards compatible
> > interface?
>
> Correct; that's what your initial checkin message for HTMLParser.py suggests
> doing, and if I'm touching htmllib.py to add the HTML 4.01 stuff, I may as
> well make the other change, too.
>
> > I'm okay with deprecating sgmllib faster than htmllib.
>
> sgmllib gets deprecated; htmllib never gets deprecated. HTMLParser is a
> barebones HTML parser that provides no default handlers (handle_head,
> handle_title, etc.), and htmllib extends it, adding default handlers for the
> various things in HTML 4.01.
OK, got it. Sounds good to me!
--Guido van Rossum (home page: http://www.python.org/~guido/)
More information about the Python-Dev
mailing list