parsing "&A" in a string..
bruce
bedouglas at earthlink.net
Sun Aug 31 16:36:48 EDT 2008
Hi Fredrik,
Thanks for the reply. But since I don't have control of the initial text, is
there something in Python that will strip/replace this?
Or are you saying I should do a search/replace on the "&" char with
"&amp;" prior to parsing?
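A minimal sketch of the search/replace approach asked about here, assuming the input can be fixed up as a plain string before it reaches the parser. The helper name escape_bare_ampersands and the pattern are illustrative, not part of any library: the regex rewrites an "&" as "&amp;" only when it does not already start something that looks like an entity or character reference.

```python
import re

# Match "&" unless it already begins a named entity (&name;),
# a decimal reference (&#38;) or a hex reference (&#x26;).
BARE_AMP = re.compile(
    r"&(?!(?:[A-Za-z][A-Za-z0-9]*|#[0-9]+|#[xX][0-9A-Fa-f]+);)"
)


def escape_bare_ampersands(text):
    """Hypothetical helper: escape ampersands that are not part of a reference."""
    return BARE_AMP.sub("&amp;", text)


print(escape_bare_ampersands("....(A&E)"))
print(escape_bare_ampersands("already escaped: A&amp;E"))
```

Note that a bare ampersand that happens to be followed by letters and a semicolon (for example "A&E;") still looks like an entity reference to this pattern and is left alone, so the sketch only helps with input like the "(A&E)" case above.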
-----Original Message-----
From: python-list-bounces+bedouglas=earthlink.net at python.org
[mailto:python-list-bounces+bedouglas=earthlink.net at python.org]On Behalf
Of Fredrik Lundh
Sent: Sunday, August 31, 2008 1:10 PM
To: python-list at python.org
Subject: Re: parsing "&A" in a string..
bruce wrote:
> a pretty simple question, i'm guessing.
>
> i have a text/html string that looks like:
> ....(A&E)
>
> the issue i have is that when i parse it using xpath/node/toString,
> i get the following
>
> ...(A&E;).
that's because your parser is interpreting the &E part as an entity
reference, and the serializer is then adding the missing semicolon.
bare ampersands must be written as "&amp;" in the file.
</F>
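To illustrate the escaping rule, here is a small sketch using the standard library's xml.etree.ElementTree as a stand-in; the thread does not say which xpath-based parser was actually used, so its exact behaviour may differ. A strict XML parser rejects the bare ampersand outright, while a serializer writes it out as "&amp;" and the escaped form parses back to the original text.

```python
import xml.etree.ElementTree as ET

# A strict XML parser refuses markup that contains a bare "&".
try:
    ET.fromstring("<p>(A&E)</p>")
    bare_ok = True
except ET.ParseError:
    bare_ok = False

# Setting the text programmatically and serializing shows the
# escaped form that belongs in the file.
elem = ET.Element("p")
elem.text = "(A&E)"
serialized = ET.tostring(elem, encoding="unicode")

# Parsing the escaped form gives back the original text.
roundtrip = ET.fromstring(serialized).text

print(bare_ok, serialized, roundtrip)
```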
--
http://mail.python.org/mailman/listinfo/python-list