[XML-SIG] Character encodings and expat

Frank J.S. Chen frank63@ms5.hinet.net
Sat, 28 Oct 2000 23:46:50 -0000


> > 
> > > That's only Shift-JIS and EUC-JP, though.  Is there any concerted
> > > effort afoot to make a more complete set?  At the very least,
> > > ISO 2022-JP, Big5, VISCII, GB-2312 and EUC-KR should be implemented.
> > 
> 
> Sure would be nice... the only problem I see is that the 
> different codecs for the Asian scripts will most probably
> behave differently, e.g. there are many issues with private
> code point areas in Unicode and the various Asian encodings.

For now, all CJK Unicode characters reside in Basic Multilingual
Plane(Plane 0).
It seems no need to consider surrogate area or private use area right now.
What we need is indeed a transcoding interface to convert different locales
to UTF-8/UTF-16 and then back.