Help with regular expressions

dmbkiwi dmbkiwi at yahoo.com
Tue Aug 26 11:00:48 CEST 2003


On Tue, 26 Aug 2003 08:47:33 +0000, Sybren Stuvel wrote:

> dmbkiwi enlightened us with:
>> A couple of other people have contributed code to this project,
>> particularly relating to the parsing of the websites.
>> Unfortunately, it is not parsing one particular part of the website
>> properly.  This is because it is expecting the data to be in a
>> certain form, and occasionally it is in a different form.
>> Unfortunately this causes the entire script to fail to run.
> 
> You seem to expect old HTML. Why not use XHTML only ('tidy' can
> convert between them) and use a regular XML parser? Much, much, much
> easier! And you won't have to be afraid of messing up your regular
> expressions ;-)
> 
> Sybren

XML would be nice, but unfortunately I have no choice as to the markup
language used by the site.  It's a website on the World Wide Web, not a
site over which I have any control.  My regular expressions are at the
mercy of the developers of that site.
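
To make the failure concrete, here is a minimal sketch of what I think is
going on.  The pattern, the function name extract_temp and the HTML
snippets are all made up for illustration (they are not from the actual
script or the actual site).  The point is only that a failed match gives
back None, and calling .group() on None is the sort of thing that stops
the whole script.

```python
import re

# Hypothetical pattern and markup, invented for illustration only;
# the real site's HTML is not shown in this thread.
TEMP_RE = re.compile(r"<td>Temp:\s*(\d+)</td>")

def extract_temp(html):
    """Return the temperature as an int, or None if the markup is not in the expected form."""
    match = TEMP_RE.search(html)
    if match is None:
        # Without this check, match.group(1) would raise AttributeError
        # and take the rest of the script down with it.
        return None
    return int(match.group(1))

print(extract_temp("<td>Temp: 21</td>"))           # expected form
print(extract_temp("<td>Temperature: 21 C</td>"))  # unexpected form
```

Checking for None at least lets the script carry on and skip the field it
could not parse, but it does not recover the data when the form changes.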

Any other suggestions?

Matt



