[Web-SIG] Extracting web data

James Y Knight foom at fuhm.net
Tue Feb 22 02:27:55 CET 2011

Previous message: [Web-SIG] Extracting web data
Next message: [Web-SIG] Extracting web data
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Feb 21, 2011, at 7:07 PM, James Mills wrote:
> You might want to look into using either
> the lxml or BeautifulSoup modules.

For parsing random HTML, the html5lib module works much better than either of those.

Previous message: [Web-SIG] Extracting web data
Next message: [Web-SIG] Extracting web data
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

More information about the Web-SIG mailing list