Parsing complex web pages safely with htmllib.HTMLParser

montanaro at tttech.com
Thu Jan 24 18:33:10 EST 2002


    >> I'm not sure how XHTML will solve the problem.  Instead of broken
    >> HTML we'll have to contend with broken XHTML.

    amk> Mozilla doesn't accept invalid XHTML, and neither does IE.  For
    amk> example, when I point either Mozilla or IE 6 at
    amk> http://www.w3schools.com/xml/note_error.xml, I get this page:
    ...

That's encouraging.  Regarding Opera's mystic message, I suspect it may be a
case of them not quite being there with a new parsing engine.  I reported a
table display problem to them yesterday and got back the response that they
are working on a new display engine.  Hopefully the same is true for the
parsing side of the equation.
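
For what it's worth, the strictness amk describes is easy to see from Python
too. The sketch below is only an illustration, not anything from the browsers
themselves: it assumes a current Python 3, where htmllib no longer exists, so
it uses html.parser.HTMLParser as the lenient parser and
xml.etree.ElementTree as the strict XML parser. The BROKEN sample, the
TagCollector class and the is_well_formed helper are all made up for the
example.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical sample: the <b> and <i> elements are closed in the wrong order.
BROKEN = "<html><body><p><b><i>oops</b></i></p></body></html>"


class TagCollector(HTMLParser):
    """Records start tags as they arrive; html.parser does not check nesting."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def is_well_formed(text):
    """Return True if an XML parser accepts the text, False otherwise."""
    try:
        ET.fromstring(text)
    except ET.ParseError:
        return False
    return True


# The lenient HTML parser reports every tag without complaint.
collector = TagCollector()
collector.feed(BROKEN)
collector.close()
print(collector.tags)

# The XML parser refuses the same input outright.
print(is_well_formed(BROKEN))
```

The HTML parser happily hands over the tags in whatever order they appear,
while the XML parser stops at the first mismatched close tag, which is the
behavior the XHTML-capable browsers are showing.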

[ot: Opera is the first browser I ever liked well enough to pay for.]

-- 
Skip Montanaro (skip at pobox.com - http://www.mojam.com/)




More information about the Python-list mailing list