[Expat-discuss] utf-8 encoding
Aruna.Bhaskara at wellsfargo.com
Aruna.Bhaskara at wellsfargo.com
Tue Feb 24 19:36:06 EST 2004
Hi All
I am trying to use utf-8 encoding my input file has some multibyte
character like below . If I parse it through expat and print the output to a
file I see two bytes
Shouldn't it be single byte or since its utf-8 encoding it represent as two
bytes and the progreammer has to take care of interpreting the 2 bytes.
If I use the xerces parser I see one byte being returned. Let me know what I
am doing wrong.
(single byte) Ñ = Ã'(two bytes)
Thanks
Aruna
More information about the Expat-discuss
mailing list