newb: BeautifulSoup
TheFlyingDutchman
zzbbaadd at aol.com
Fri Sep 21 00:22:23 EDT 2007
On Sep 20, 8:04 pm, crybaby <joemystery... at gmail.com> wrote:
> I need to traverse an HTML page with a big table that has many rows and
> columns. For example, how do I go to the 35th td tag and use a regex to
> retrieve the content? After that is done, how do I move down 15 td tags
> from the 35th tag (35+15) and use a regex to retrieve the content?
Convert the file to XHTML (well-formed XML) if it isn't already; then
you can use software written to process XML files:
http://pyxml.sourceforge.net/topics/
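As a rough sketch of that approach: the example below uses the standard
library's xml.etree.ElementTree rather than PyXML, and it assumes the page
is already well-formed XHTML. The helper names (td_texts, extract), the
sample document, and the \d+ pattern are made up for illustration; swap in
your own file and regex.

```python
import re
import xml.etree.ElementTree as ET


def td_texts(xhtml_source):
    """Return the text of every <td> in document order.

    Assumes xhtml_source is well-formed XML; ET.fromstring raises
    ParseError otherwise.
    """
    root = ET.fromstring(xhtml_source)
    cells = []
    for elem in root.iter():
        if not isinstance(elem.tag, str):
            continue  # skip anything that is not a normal element
        # XHTML tags carry a namespace prefix like "{http://...}td";
        # drop it so that plain and namespaced documents both work.
        local_name = elem.tag.rsplit("}", 1)[-1]
        if local_name == "td":
            cells.append("".join(elem.itertext()).strip())
    return cells


def extract(cells, position, pattern):
    """Apply a regex to the cell at a 1-based position.

    Returns the matched text, or None if the pattern does not match.
    """
    match = re.search(pattern, cells[position - 1])
    return match.group(0) if match else None


# Hypothetical sample page: 6 rows of 10 cells, "item 1" .. "item 60".
rows = "".join(
    "<tr>" + "".join("<td>item %d</td>" % (r * 10 + c + 1) for c in range(10)) + "</tr>"
    for r in range(6)
)
doc = (
    '<html xmlns="http://www.w3.org/1999/xhtml"><body><table>'
    + rows
    + "</table></body></html>"
)

cells = td_texts(doc)
first = extract(cells, 35, r"\d+")        # the 35th td
second = extract(cells, 35 + 15, r"\d+")  # 15 td tags further down
print(first, second)  # prints: 35 50
```

Collecting all the td cells into one list first means "the 35th tag" and
"15 tags after that" become plain list indexing. Note that a nested table
would contribute its td cells to the same list, so the counting may need
adjusting for pages laid out that way.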