Parsing complex web pages safely with htmllib.HTMLParser
montanaro at tttech.com
Thu Jan 24 18:33:10 EST 2002
>> I'm not sure how XHTML will solve the problem. Instead of broken
>> HTML we'll have to contend with broken XHTML.
amk> Mozilla doesn't accept invalid XHTML, and neither does IE. For
amk> example, when I point either Mozilla or IE 6 at
amk> http://www.w3schools.com/xml/note_error.xml, I get this page:
...
That's encouraging. Regarding Opera's mysterious message, I suspect their new
parsing engine just isn't quite finished yet. I reported a table display
problem to them yesterday and got back the response that they are working on
a new display engine. Hopefully the same is true for the parsing side of the
equation.
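For anyone who wants to see the kind of strictness amk describes from the
Python side, here is a rough sketch using the standard library's expat
binding. The helper name is_well_formed and the two sample documents are
made up for illustration (they are not the contents of the URL above), and
this only checks well-formedness, not validity against a DTD.

```python
import xml.parsers.expat


def is_well_formed(text):
    """Return (True, None) if text parses as XML, else (False, message).

    Hypothetical helper: it only checks well-formedness, the same class of
    error a strict XML parser refuses to recover from.
    """
    parser = xml.parsers.expat.ParserCreate()
    try:
        # The second argument marks this as the final chunk of input.
        parser.Parse(text, True)
    except xml.parsers.expat.ExpatError as exc:
        return False, str(exc)
    return True, None


# Made-up sample documents; the second one has a mismatched closing tag.
good = "<note><to>A</to><from>B</from></note>"
bad = "<note><to>A</to><from>B</Ffrom></note>"

print(is_well_formed(good))
print(is_well_formed(bad))
```

Unlike a forgiving HTML parser, expat stops at the first error instead of
guessing what the author meant.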
[ot: Opera is the first browser I ever liked well enough to pay for.]
--
Skip Montanaro (skip at pobox.com - http://www.mojam.com/)
More information about the Python-list mailing list