[Python-Dev] 2.7 Release? 2.7 == last of the 2.x line?

James Y Knight foom at fuhm.net
Tue Nov 3 06:29:15 CET 2009


On Nov 3, 2009, at 12:06 AM, Guido van Rossum wrote:
> Though I imagine what
> it really needs is a "quirks mode" parser that is compatible with the
> HTML dialect accepted by, say, IE6. Maybe a summer of code project?

Already exists: html5lib.
http://code.google.com/p/html5lib/

Or if you want a faster (yet I think less exact) HTML parser,  
libxml2's HTML parser, via lxml:
http://codespeak.net/lxml/parsing.html#parsing-html

James


More information about the Python-Dev mailing list