[Expat-discuss] Accented characters

Mark 11mjazbdg02 at sneakemail.com
Mon Oct 29 13:16:38 CET 2007


Hi Omar,

AFAIK Expat will always return character data in UTF-8 encoding no matter the character set of the original document.  Look at iconv() to convert back to ISO-8859-1.


---- Original Message ----
Hi everybody.
I have a problem with accented characters: I obtain "realtÃ" when parsing the
content "realtá", for example.
I also noted that if I try to parse a content like this: "Showroom Devon &
Devon", I obtain as output "Devonoom Devon" (the text after & overrides the
previous one).
I use Expat 2.0.1 on a Gentoo distro, but it gives me the same errors on an
embedded ARM system too.
I create the parser and then set the encoding this way:

p = XML_ParserCreate(NULL);
XML_SetEncoding(p, (const XML_Char)"iso-8859-1");

Has anyone a solution? Thanks in advance.



More information about the Expat-discuss mailing list