Parsing an HTML file

CodeGuru73 eddiembabaali at yahoo.com
Wed Dec 17 17:08:41 CET 2003


I am trying to find the best way to parse a bunch of HTML files. They
are all similar in structure, and I need to get them into a database.
Their relevant structure is:
<html><head></head>
<body>
<h1>title</h1>
<address> authors </address>
<div> Main html content</div>

I basically need to get the values between <h1></h1>,
<address></address> and <div></div>.

I am able to read the files into an array.
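
For illustration, here is a minimal sketch of one way the extraction
could look, using the standard library's html.parser module (Python 3).
The names SectionExtractor and extract_fields are made up for this
example. It assumes each file has a single <h1>, <address> and top-level
<div>, and it keeps only the text of each element, so any markup nested
inside the <div> is dropped.

```python
from html.parser import HTMLParser


class SectionExtractor(HTMLParser):
    """Collect the text inside <h1>, <address> and the outermost <div>.

    Hypothetical helper for illustration; not part of any library.
    """

    WANTED = ("h1", "address", "div")

    def __init__(self):
        super().__init__()
        self.fields = {tag: "" for tag in self.WANTED}
        self._current = None  # tag currently being captured, if any
        self._depth = 0       # nesting depth of same-named tags

    def handle_starttag(self, tag, attrs):
        if self._current is None and tag in self.WANTED:
            # Entering one of the elements we care about.
            self._current = tag
            self._depth = 1
        elif tag == self._current:
            # Same tag nested inside itself, e.g. a <div> within the <div>.
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == self._current:
            self._depth -= 1
            if self._depth == 0:
                self._current = None

    def handle_data(self, data):
        # Text is kept only while inside a wanted element.
        if self._current is not None:
            self.fields[self._current] += data


def extract_fields(html_text):
    """Return a dict mapping 'h1', 'address' and 'div' to stripped text."""
    parser = SectionExtractor()
    parser.feed(html_text)
    parser.close()
    return {tag: text.strip() for tag, text in parser.fields.items()}
```

Calling extract_fields() on the contents of each file gives a small dict
per file, which could then be passed to whatever database layer is in
use. If the markup inside the <div> needs to be preserved rather than
just its text, the handlers would have to rebuild the tags as well.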




More information about the Python-list mailing list