[XML-SIG] Re: Issues with Unicode type

Daniel Veillard veillard@redhat.com
Mon, 23 Sep 2002 17:59:26 -0400


On Mon, Sep 23, 2002 at 03:58:11PM -0600, Uche Ogbuji wrote:
> > > Can you confirm that this is what Red Hat does by default, as mentioned by
> > > Uche, and do you know the motivations (and possible downsides) for this
> > > decision?
> > 
> >   By default Red Hat compiles python with unicode support in UTF-16.
> > I'm not in charge of this, I assume it's the default compilation option.
> 
> Not from what we found.  Jeremy was the one who encountered this, not me, but 
> I'm pretty sure he said he found that starting with RH 7.3, Red Hat started 
> building Python 2.x with UTF-32 and wchar_t support.

  Hmm, here is what I get on two recent versions :-)

paphio:~ -> python2.2
Python 2.2 (#1, Apr 12 2002, 15:29:57) 
[GCC 2.96 20000731 (Red Hat Linux 7.2 2.96-109)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> c = u"\u10800"
>>> len(c)    
2
>>> 

gnome:~ -> python
Python 2.2.1 (#1, Aug 30 2002, 12:15:30) 
[GCC 3.2 20020822 (Red Hat Linux Rawhide 3.2-4)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> c = u"\u10800"
>>> len(c)    
2
>>> 

  Looks like UTF-16 to me!
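
  One caveat about this kind of probe, offered as a sketch rather than a
definitive check: the \u escape takes exactly four hex digits, so u"\u10800"
is U+1080 followed by the character "0", and its length is 2 on any build.
A code point above U+FFFF needs the eight-digit \U form. The snippet below
assumes a current Python 3 interpreter, where len() counts code points;
utf16_code_units is a hypothetical helper name used only for illustration.
On a Python 2 build, sys.maxunicode (65535 for a 16-bit build, 1114111 for a
32-bit one) answers the question directly.

```python
import sys


def utf16_code_units(text):
    """Return how many 16-bit code units the text needs in UTF-16."""
    return len(text.encode("utf-16-le")) // 2


# Four-digit escape: parsed as U+1080 followed by the literal character "0".
four_digit = "\u10800"

# Eight-digit escape: a single code point outside the Basic Multilingual Plane.
astral = "\U00010800"

print([hex(ord(ch)) for ch in four_digit])
print(len(four_digit), utf16_code_units(four_digit))
print(len(astral), utf16_code_units(astral))

# Largest code point the interpreter can represent in a string.
print(hex(sys.maxunicode))
```

  The point of the design is that only a character needing a surrogate pair
can tell a 16-bit build from a 32-bit one; two ordinary characters cannot.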

> > IMHO it's a wrong assumption to think that UTF-16 is a good cut, because
> > you end up with a variable-length encoding anyway, and UCS-4 would seriously
> > bloat the app, I'm afraid.
> 
> Just as a side observation: Guido called this FUD.  I'm not so sure.

  It's just my opinion, but I and others in the GNOME and KDE projects
all went with UTF-8 without any prior coordination; it was just natural
to us (okay, it also keeps strings 0-terminated, which is crucial).
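
  To put rough numbers on the size and 0-termination points, here is a small
sketch (Python 3 assumed; encoded_sizes and has_embedded_nul are hypothetical
helper names, not part of any library). It compares the byte counts of the
same text in the three encodings and checks whether the encoded bytes contain
a 0 byte, which is what breaks C-style string handling.

```python
ENCODINGS = ("utf-8", "utf-16-le", "utf-32-le")


def encoded_sizes(text):
    """Map each encoding name to the number of bytes the text occupies in it."""
    return {enc: len(text.encode(enc)) for enc in ENCODINGS}


def has_embedded_nul(text, encoding):
    """True if the encoded form contains a 0 byte before any terminator."""
    return b"\x00" in text.encode(encoding)


# ASCII text, Latin text with one accent, and one code point above U+FFFF.
samples = ["libxml", "h\u00e9llo", "\U00010800"]

for text in samples:
    print(ascii(text), encoded_sizes(text))
    for enc in ENCODINGS:
        print("   ", enc, "embedded NUL:", has_embedded_nul(text, enc))
```

  For mostly-ASCII data such as XML markup, the fixed-width forms pay two or
four bytes per character, while the character above U+FFFF takes four bytes
in all three encodings.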

Daniel

-- 
Daniel Veillard      | Red Hat Network https://rhn.redhat.com/
veillard@redhat.com  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/