Any equivalent to Ruby's 'hpricot' html/xpath/css selector package?

Stefan Behnel stefan_ml at
Tue Dec 30 14:28:37 CET 2008

Kenneth McDonald wrote:
> Ruby has a package called 'hpricot' which can perform limited xpath
> queries, and CSS selector queries. However, what makes it really useful
> is that it does a good job of handling the "broken" html that is so
> commonly found on the web. Does Python have anything similar, i.e.
> something that will not only do XPath queries, but will do so on
> imperfect HTML?

lxml.html is your friend.


More information about the Python-list mailing list