[BangPypers] HTML Parsing in python

Puneet Aggarwal look4puneet at gmail.com
Thu Sep 10 16:14:02 CEST 2009


Thanks, all, for the suggestions. I think I will start with BeautifulSoup
(3.0.7a) and experiment with the other suggested libraries if it does not
meet my requirements or if I run into issues with it.
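
As a baseline for comparing the libraries, here is a minimal sketch using
the standard library's HTMLParser, one of the options mentioned in the
quoted question below. It assumes Python 3, where the module is named
html.parser (in Python 2 it is the HTMLParser module). LinkCollector and
extract_links are hypothetical names chosen for illustration, not something
from this thread.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Tag and attribute names arrive lowercased; attrs is a list of
        # (name, value) pairs, and value may be None for bare attributes.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html):
    """Return the href values found in an HTML string, in document order."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links
```

Calling extract_links() on a page's source returns its link targets. This
event-based style is lower level than the tree-based APIs of BeautifulSoup
or lxml, so it is mainly useful as a dependency-free starting point.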

On Thu, Sep 10, 2009 at 7:07 PM, Baishampayan Ghose <b.ghose at gmail.com> wrote:

> > Can anyone suggest a good library for HTML parsing in Python?
> > I googled and found a few libraries: BeautifulSoup, HTMLParser,
> > SGMLParser, etc.
> >
> > Which one would you recommend, based on your experience?
>
> BeautifulSoup was OK, but now it's broken. Use lxml, it's very good.
>
> http://codespeak.net/lxml/
>
> Regards,
> BG
>
>
> --
> Baishampayan Ghose
> b.ghose at gmail.com