Regular Expression help for parsing html tables

Stefan Behnel stefan.behnel-n05pAM at web.de
Sat Oct 28 18:15:22 EDT 2006


Hi Steve,

steve551979 at hotmail.com wrote:
> I am having some difficulty creating a regular expression for the
> following string situation in html. I want to find a table that has
> specific text in it and then extract the html just for that immediate
> table.

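Is there any reason why you can't use a real HTML parser and API (e.g. the one
provided by lxml)? Nested tables are hard to match reliably with a regular
expression, whereas a parser gives you a tree in which you can find the text
and then pick the table that immediately encloses it.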

http://codespeak.net/lxml/
http://codespeak.net/lxml/api.html#parsers
http://codespeak.net/lxml/api.html#trees-and-documents
http://effbot.org/zone/element-index.htm
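
To illustrate the parser-based approach, here is a minimal sketch. It
deliberately uses the standard library's html.parser as a dependency-free
stand-in, so it does not show lxml's API. The names TableFinder and
find_tables and the sample markup are made up for this example, and the code
assumes the whole document is passed in one string with properly closed
<table> tags.

```python
from html.parser import HTMLParser


class TableFinder(HTMLParser):
    """Collect the source of each table whose own text contains `needle`.

    Text inside a nested table counts only for that nested table, so the
    result is the table immediately surrounding the text.
    """

    def __init__(self, source, needle):
        super().__init__()
        self.source = source
        self.needle = needle
        # Absolute index at which each line starts, to convert getpos().
        self.line_starts = [0] + [
            i + 1 for i, ch in enumerate(source) if ch == "\n"
        ]
        self.stack = []    # one (start_index, text_chunks) per open table
        self.matches = []  # raw HTML of every matching table

    def _index(self):
        # getpos() gives a 1-based line and a 0-based column.
        line, col = self.getpos()
        return self.line_starts[line - 1] + col

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.stack.append((self._index(), []))

    def handle_data(self, data):
        # Credit the text to the innermost open table only.
        if self.stack:
            self.stack[-1][1].append(data)

    def handle_endtag(self, tag):
        if tag == "table" and self.stack:
            start, chunks = self.stack.pop()
            if self.needle in "".join(chunks):
                # Extend the slice to the ">" that ends the closing tag.
                end = self.source.index(">", self._index()) + 1
                self.matches.append(self.source[start:end])


def find_tables(source, needle):
    """Return the raw HTML of each innermost table containing `needle`."""
    finder = TableFinder(source, needle)
    finder.feed(source)
    finder.close()
    return finder.matches


# Hypothetical sample: the wanted text sits in a table nested in another.
sample = """<table id="outer"><tr><td>
<table id="inner"><tr><td>Total: 42</td></tr></table>
</td></tr></table>
<table id="other"><tr><td>nothing here</td></tr></table>"""

result = find_tables(sample, "Total")
print(result)
```

Keeping a stack of open tables is what makes the nesting case easy: the text
is attributed to the table on top of the stack, and the outer table is never
reported for text that belongs to an inner one.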

Stefan


