newb: BeautifulSoup

TheFlyingDutchman zzbbaadd at aol.com
Fri Sep 21 00:22:23 EDT 2007


On Sep 20, 8:04 pm, crybaby <joemystery... at gmail.com> wrote:
> I need to traverse an HTML page with a big table that has many rows and
> columns.  For example, how do I go to the 35th td tag and use a regex to
> retrieve the content?  After that is done, how do I move down 15 td tags
> from the 35th tag (35+15) and use a regex to retrieve the content?

Convert the file to XHTML (well-formed XML) if it isn't already; then
you can use software written to process XML files:

http://pyxml.sourceforge.net/topics/
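
A rough sketch of that idea, assuming the page is already well-formed
XHTML. It uses the standard-library xml.etree.ElementTree module rather
than PyXML (a substitution chosen for illustration), and the sample table,
the nth_td_text helper, and the price regex are all hypothetical:

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample data: a well-formed table with 6 rows of 10 cells.
rows = []
for r in range(6):
    tds = "".join(
        "<td>item {0}: ${0}.00</td>".format(r * 10 + c + 1) for c in range(10)
    )
    rows.append("<tr>" + tds + "</tr>")
XHTML = "<html><body><table>" + "".join(rows) + "</table></body></html>"


def nth_td_text(root, n):
    """Return the text of the n-th <td> (1-based) in document order.

    Assumes the tags carry no XML namespace. Real XHTML usually declares
    one, in which case the tag name would be
    "{http://www.w3.org/1999/xhtml}td" instead of "td".
    """
    tds = list(root.iter("td"))
    return "".join(tds[n - 1].itertext())


root = ET.fromstring(XHTML)
price = re.compile(r"\$(\d+\.\d{2})")  # hypothetical pattern

# Visit the 35th cell, then the cell 15 further on (35 + 15 = 50).
for position in (35, 35 + 15):
    text = nth_td_text(root, position)
    match = price.search(text)
    print(position, match.group(1) if match else None)
```

Counting cells in document order means cells of any nested table are
counted too, so the positions may need adjusting for such pages.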
