[Tutor] screen scraping web-based email (Alan Gauld)

Alan Gauld alan.gauld at btinternet.com
Thu Apr 19 16:10:35 CEST 2007


"Kent Johnson" <kent37 at tds.net> wrote in

> FWIW most real-world HTML parsers (including Beautiful Soup) seem to 
> be
> based directly on SMTPlib, not htmllib or HTMLParser.

Yes, I noticed that, although htmllib is itself based on sgmllib...
And it has a better event based parsing model but unfortunately
it doesn't throw errors when you get an http error back,
which makes HTMLParser much more user friendly for
beginners...

Alan G. 




More information about the Tutor mailing list