[Python-Dev] Unicode support in getargs.c

Jack Jansen Jack.Jansen@cwi.nl
Fri, 4 Jan 2002 00:09:45 +0100 (CET)


I'm going to jump out of this discussion for a while. Martin and Mark have 
a completely different view on Unicode than I do, apparently, and I think 
I should first try and see if I can use the current implementation.

For the record: my view of Unicode is really "ascii done right", i.e. a 
datatype that allows you to get richer characters than what 1960s ascii 
gives you. For this it should be as backward-compatible as possible, i.e. 
if some API expects a unicode filename and I pass "a.out" it should 
interpret it as u"a.out". All the converting to different charsets is 
icing on the cake, the number one priority should be that unicode is as 
compatible as possible with the 8-bit convention used on the platform 
(whatever it may be). No, make that the number 2 priority: the number one 
pritority is compatibility with 7-bit ascii. Using Python StringObjects as 
binary buffers is also far less common than using StringObjects to store 
plain old strings, so if either of these uses bites the other it's the 
binary buffer that needs to suffer. UnicodeObjects and StringObjects 
should behave pretty orthogonal to how FloatObjects and IntObjects behave.

 -- --
Jack Jansen             | ++++ stop the execution of Mumia Abu-Jamal ++++
Jack.Jansen@cwi.nl      | ++++ if you agree copy these lines to your sig ++++
http://www.cwi.nl/~jack | see http://www.xs4all.nl/~tank/spg-l/sigaction.htm