lxml is *much* faster than BeautifulSoup (as I recall, a speaker at 2009 PyCon suggested it was 20 to 30 times faster) and should be used unless there is a compelling reason not to. Also, I'm not sure how much BeautifulSoup is being maintained at this point.<div>
<br></div><div>- Shailen Tuli<br><br><div class="gmail_quote">On Thu, Dec 30, 2010 at 10:22 PM, Charles Merriam <span dir="ltr"><<a href="mailto:charles.merriam@gmail.com">charles.merriam@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">This shows up on the mailing list every now and then.<br>
<br>
lxml is faster, more tolerant, etc., than Beautful Soup and the built in ones..<br>
<br>
<br>
Enjoy<br>
<div class="im"><br>
On Thu, Dec 30, 2010 at 7:46 PM, Bill Janssen <<a href="mailto:janssen@parc.com">janssen@parc.com</a>> wrote:<br>
><br>
> BeautifulSoup does xhtml, too.<br>
><br>
> Bill<br>
><br>
> Tony Cappellini <<a href="mailto:cappy2112@gmail.com">cappy2112@gmail.com</a>> wrote:<br>
><br>
> > What's the best module/package for parsing xhtml?<br>
> > HTMLParser is built in, but is there another package which is more<br>
> > like urlib2 or Beautiful Soup- but handles xhtml?<br>
> ><br>
> > thanks<br>
> > _______________________________________________<br>
> > Baypiggies mailing list<br>
> > <a href="mailto:Baypiggies@python.org">Baypiggies@python.org</a><br>
> > To change your subscription options or unsubscribe:<br>
> > <a href="http://mail.python.org/mailman/listinfo/baypiggies" target="_blank">http://mail.python.org/mailman/listinfo/baypiggies</a><br>
> _______________________________________________<br>
> Baypiggies mailing list<br>
</div>> Baypiggies@python.orgis shows<br>
<div><div></div><div class="h5">> To change your subscription options or unsubscribe:<br>
> <a href="http://mail.python.org/mailman/listinfo/baypiggies" target="_blank">http://mail.python.org/mailman/listinfo/baypiggies</a><br>
_______________________________________________<br>
Baypiggies mailing list<br>
<a href="mailto:Baypiggies@python.org">Baypiggies@python.org</a><br>
To change your subscription options or unsubscribe:<br>
<a href="http://mail.python.org/mailman/listinfo/baypiggies" target="_blank">http://mail.python.org/mailman/listinfo/baypiggies</a><br>
</div></div></blockquote></div><br></div>