[Web-SIG] Extracting web data

James Y Knight foom at fuhm.net
Tue Feb 22 02:27:55 CET 2011


On Feb 21, 2011, at 7:07 PM, James Mills wrote:
> You might want to look into using either
> the lxml or BeautifulSoup modules.

For parsing arbitrary HTML found in the wild, which is often malformed, the html5lib module works much better than either of those.
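
A minimal sketch of what that can look like, assuming the goal is to pull link targets out of sloppy markup. The function name extract_links and the sample markup are made up for illustration, and the fallback to the standard library's html.parser is only an assumption added so the sketch still runs where html5lib is not installed:

```python
from html.parser import HTMLParser

try:
    import html5lib  # third-party package: pip install html5lib
except ImportError:
    html5lib = None  # keep the sketch runnable without it


class _LinkCollector(HTMLParser):
    """Stdlib fallback: records the href of every <a> start tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self.links.append(href)


def extract_links(markup):
    """Return the href of every <a> element, in document order."""
    if html5lib is not None:
        # namespaceHTMLElements=False yields plain tag names such as "a"
        # instead of "{http://www.w3.org/1999/xhtml}a".
        root = html5lib.parse(markup, treebuilder="etree",
                              namespaceHTMLElements=False)
        return [a.get("href") for a in root.iter("a")
                if a.get("href") is not None]
    collector = _LinkCollector()
    collector.feed(markup)
    collector.close()
    return collector.links


# Deliberately sloppy input: unclosed <li> tags and an unquoted attribute.
messy = '<ul><li><a href="/one">first</a><li><a href=/two>second</ul>'
print(extract_links(messy))
```

With html5lib the result is a regular ElementTree element, so the usual find/iter calls work on it even when the input never closes its tags.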



