[XML-SIG] Re: Issues with Unicode type
Daniel Veillard
veillard@redhat.com
Mon, 23 Sep 2002 17:59:26 -0400
On Mon, Sep 23, 2002 at 03:58:11PM -0600, Uche Ogbuji wrote:
> > > Can you confirm that this is what Red Hat does by default, as mentioned
> > > by Uche, and do you know the motivations (and possible downsides) for
> > > this decision?
> >
> > By default Red Hat compiles python with unicode support in UTF-16.
> > I'm not in charge of this, I assume it's the default compilation option.
>
> Not from what we found. Jeremy was the one who encountered this, not me, but
> I'm pretty sure he said he found that starting with RH 7.3, Red Hat started
> building Python 2.x with UTF-32 and wchar_t support.
Hmm, here is what I get on 2 recent versions :-)
paphio:~ -> python2.2
Python 2.2 (#1, Apr 12 2002, 15:29:57)
[GCC 2.96 20000731 (Red Hat Linux 7.2 2.96-109)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> c = u"\u10800"
>>> len(c)
2
>>>
gnome:~ -> python
Python 2.2.1 (#1, Aug 30 2002, 12:15:30)
[GCC 3.2 20020822 (Red Hat Linux Rawhide 3.2-4)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> c = u"\u10800"
>>> len(c)
2
>>>
Looks like UTF-16 to me!
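One caveat about the test above: in a Python string literal, \u takes exactly
four hex digits, so u"\u10800" is parsed as U+1080 followed by the character
"0", and len() gives 2 on any build. A minimal sketch of a check that does
tell the two builds apart, assuming sys.maxunicode is available and using the
eight-digit \U escape (the helper name describe_unicode_build is made up for
illustration, and the snippet is written for a current Python 3 interpreter):

```python
import sys

def describe_unicode_build():
    # Hypothetical helper: sys.maxunicode is 0xFFFF on a narrow (UTF-16)
    # build and 0x10FFFF on a wide (UCS-4) build. Recent Python 3 releases
    # always report 0x10FFFF.
    if sys.maxunicode == 0xFFFF:
        return "narrow (UTF-16 code units)"
    return "wide (full code point range)"

# \U takes eight hex digits, so this really is the single code point U+10800.
astral = u"\U00010800"
# \u takes only four hex digits: this is U+1080 followed by the digit "0".
four_digit = u"\u10800"

print(describe_unicode_build())
print(len(astral))      # 2 on a narrow build, 1 on a wide build
print(len(four_digit))  # 2 on every build
```

With this check, a length of 2 for the \U literal (or sys.maxunicode equal to
0xFFFF) would point to UTF-16 storage; the four-digit literal does not say
anything either way.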
> > IMHO it's a wrong assumption to think that UTF-16 is a good cut, because
> > you end up with a variable-length encoding anyway, and UCS-4 would
> > seriously bloat the app, I'm afraid.
>
> Just as a side observation: Guido called this FUD. I'm not so sure.
It's just my opinion, but I and others in the GNOME and KDE projects
all went with UTF-8 without any prior coordination; it was just natural
to us (okay, this also keeps strings 0-terminated, which is crucial).
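To put rough numbers on the size and 0-termination points, here is a small
sketch using Python's built-in codecs. The sample string is arbitrary, and the
snippet assumes a current Python 3 interpreter; it only illustrates the
trade-off and is not a measurement of any real application:

```python
# Compare the encoded size of a short ASCII string in the three encodings.
text = u"libxml"
utf8 = text.encode("utf-8")
utf16 = text.encode("utf-16-le")   # explicit byte order, so no BOM is added
utf32 = text.encode("utf-32-le")

print(len(utf8), len(utf16), len(utf32))   # 6 12 24

# UTF-8 never produces a zero byte for non-NUL characters, so C-style
# 0-terminated strings keep working; UTF-16 and UTF-32 embed zero bytes.
print(b"\x00" in utf8)    # False
print(b"\x00" in utf16)   # True

# A character outside the BMP needs a surrogate pair in UTF-16, so UTF-16
# is a variable-length encoding too.
astral = u"\U00010800"
print(len(astral.encode("utf-8")), len(astral.encode("utf-16-le")))   # 4 4
```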
Daniel
--
Daniel Veillard | Red Hat Network https://rhn.redhat.com/
veillard@redhat.com | libxml GNOME XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/