Looking for a specific html parser

Davor Cengija dcengija_remove_ at inet.hr
Tue Mar 18 09:07:47 CET 2003

Rene Pijlman wrote:

> Davor Cengija:
>>I need to pull out some html elements with its subelements from an html
>>document. Is there something already available?
> http://www.python.org/doc/current/lib/module-HTMLParser.html

Yes, I am aware of HTMLParser (I thought I mentioned it in my post, but I 
guess I deleted that sentence). I was hoping something like the described 
parser is already available on the net. Basically, I need a DOM like parser 
for HTML, with xpath capabilities. xml.dom might help me, but before that I 
obviously need some kind of html-tidy. Is the famous Tidy from w3c ported 
to python? I found mxTidy which is basically a wrapper around Tidy, and not 
a native implementation.

Davor Cengija, dcengija_remove_ at inet.hr

More information about the Python-list mailing list