[Expat-discuss] utf-8 encoding

Aruna.Bhaskara at wellsfargo.com Aruna.Bhaskara at wellsfargo.com
Tue Feb 24 19:36:06 EST 2004


Hi All

 I  am trying to use utf-8 encoding my input file has some multibyte
character like below . If I parse it through expat and print the output to a
file I see two bytes

Shouldn't it be single byte or since its utf-8 encoding it represent as  two
bytes and the progreammer has to take care of interpreting the 2 bytes.

If I use the xerces parser I see one byte being returned. Let me know what I
am doing wrong.

 

 

(single byte) Ñ = Ã'(two bytes)

 

Thanks

Aruna

 

 



More information about the Expat-discuss mailing list