Any equivalent to Ruby's 'hpricot' html/xpath/css selector package?
stefan_ml at behnel.de
Tue Dec 30 14:28:37 CET 2008
Kenneth McDonald wrote:
> Ruby has a package called 'hpricot' which can perform limited xpath
> queries, and CSS selector queries. However, what makes it really useful
> is that it does a good job of handling the "broken" html that is so
> commonly found on the web. Does Python have anything similar, i.e.
> something that will not only do XPath queries, but will do so on
> imperfect HTML?
lxml.html is your friend.
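
To make that concrete, here is a minimal sketch of what this looks like in practice, assuming lxml is installed. The markup and the example.com URL are made up for illustration; they are not from the original question. The snippet parses HTML with unclosed tags and then runs XPath queries against the repaired tree.

```python
from lxml import html

# Hypothetical "broken" markup: the <p> and <li> tags are never closed,
# and there is no surrounding <html> or <body>.
broken = """
<div id="content">
  <p>First paragraph
  <p>Second paragraph with a <a href="http://example.com/">link</a>
  <ul><li>one<li>two</ul>
</div>
"""

# The parser repairs the markup while building the tree.
doc = html.fromstring(broken)

# XPath queries then work on the repaired tree.
links = doc.xpath("//a/@href")
items = [li.text_content().strip() for li in doc.xpath("//ul/li")]

print(links)
print(items)
```

Elements also have a cssselect() method for CSS selector queries; in recent lxml releases that relies on the separately installed cssselect package, so the sketch above sticks to XPath.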