Help with parsing web page
guettli at thomas-guettler.de
Tue Jun 15 15:56:43 CEST 2004
Am Mon, 14 Jun 2004 17:48:33 +0100 schrieb RiGGa:
> I want to parse a web page in Python and have it write certain values out to
> a mysql database. I really dont know where to start with parsing the html
> code ( I can work out the database part ). I have had a look at htmllib
> but I need more info. Can anyone point me in the right direction , a
> tutorial or something would be great.
Since HTML can be broken in several ways, I would
pipe the HTML thru tidy first. You can use the "-asxml"
option, and then parse the xml.
More information about the Python-list