Parsing an HTML file
ptmcg at users.sourceforge.net
Wed Dec 17 21:53:56 CET 2003
"CodeGuru73" <eddiembabaali at yahoo.com> wrote in message
news:5e290f27.0312170808.4590723e at posting.google.com...
> I am trying to find the best way to parse a bunch of html files. They
> are all simillar in structure and I need to get them into a database.
> Their relevant structure is:
> <address> authors </address>
> <div> Main html content</div>
> I basically need to get the values between <h1></h1>,
> <address></address> and <div></div>
> I am able to read the the files into an array.
Check out this simple XML parsing code:
More information about the Python-list