Help with regular expressions
dmbkiwi
dmbkiwi at yahoo.com
Tue Aug 26 05:00:48 EDT 2003
On Tue, 26 Aug 2003 08:47:33 +0000, Sybren Stuvel wrote:
> dmbkiwi enlightened us with:
>> A couple of other people have contributed code to this project,
>> particularly relating to the parsing of the websites.
>> Unfortunately, it is not parsing one particular part of the website
>> properly. This is because it is expecting the data to be in a
>> certain form, and occasionally it is in a different form.
>> Unfortunately this causes the entire script to fail to run.
>
> You seem to expect old HTML. Why not use XHTML only ('tidy' can
> convert between them) and use a regular XML parser? Much, much, much
> easier! And you won't have to be afraid of messing up your regular
> expressions ;-)
>
> Sybren
XML would be nice, but unfortunately I have no choice as to the markup
language used by the site. It's a website on the World Wide Web, not a
site over which I have any control. My regular expressions are at the
mercy of the developers of that site.
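
To make the failure mode concrete, here is a minimal sketch of the kind of
guard that would at least stop one unexpected form from killing the whole
script. The pattern, the sample markup and the function name are all made up
for illustration; they are assumptions, not the project's actual code or the
real site's HTML.

```python
import re

# Hypothetical pattern: the real site's markup is not shown in this thread.
TEMP_RE = re.compile(r"<b>Temperature:</b>\s*(-?\d+)", re.IGNORECASE)


def extract_temperature(html):
    """Return the temperature as an int, or None if the markup
    is not in the form the pattern expects."""
    match = TEMP_RE.search(html)
    if match is None:
        # Unexpected form: report "no value" instead of raising
        # AttributeError on match.group() and aborting the script.
        return None
    return int(match.group(1))


if __name__ == "__main__":
    print(extract_temperature("<b>Temperature:</b> 21"))  # expected form
    print(extract_temperature("<td>Temp: 21</td>"))       # different form
```

That only keeps the script alive when the data changes shape; it does not
recover the value from the other form.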
Any other suggestions?
Matt