[Tutor] screen scraping web-based email (Alan Gauld)

Alan Gauld alan.gauld at btinternet.com
Thu Apr 19 16:10:35 CEST 2007

"Kent Johnson" <kent37 at tds.net> wrote in

> FWIW most real-world HTML parsers (including Beautiful Soup) seem to
> be based directly on sgmllib, not htmllib or HTMLParser.

Yes, I noticed that, although htmllib is itself based on sgmllib...
And it has a better event-based parsing model, but unfortunately
it doesn't throw errors when you get an HTTP error back,
which makes HTMLParser much more user-friendly for
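
For anyone unfamiliar with the event-based style being discussed, here is a
rough sketch of how a handler-driven parser is typically used. It assumes
Python 3, where the HTMLParser class lives in the html.parser module (sgmllib
and htmllib are not available there). The names LinkCollector and
collect_links are made up for illustration and are not from the thread.

```python
from html.parser import HTMLParser  # Python 3 home of the HTMLParser class


class LinkCollector(HTMLParser):
    """Hypothetical example class: gathers href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Called once per opening tag. Tag and attribute names arrive
        # lower-cased; attrs is a list of (name, value) pairs.
        if tag == "a":
            for name, value in attrs:
                if name == "href":
                    self.links.append(value)


def collect_links(html_text):
    """Hypothetical helper: feed a string of HTML and return the hrefs found."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links
```

The parser only sees the text you feed it, so fetching the page and checking
the response status would have to happen separately before calling
collect_links().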

Alan G. 

