[I18n-sig] Unicode surrogates: just say no!

Paul Prescod paulp@ActiveState.com
Wed, 27 Jun 2001 16:40:36 -0700

"Machin, John" wrote:
> IMO, once you say that a "valid surrogate pair" is a "single
> character" in a narrow implementation, people will want to do
> the indexing / slicing / dicing thing as well. ord() is just the
> thin end of the wedge.

I'll see your puritanism and raise: unichr(bignum) and \Ubignum are the
thin edge of the wedge. :)

I would still prefer to abolish the notion of surrogates from anything
except codecs.

Or at least abolish them now and see if anyone screams. We should do the
simplest thing possible and see what happens.
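
For anyone following along without the UTF-16 arithmetic in their head,
here is a rough sketch of what "keeping surrogates inside the codecs"
would amount to: the pairing and unpairing is plain integer math that a
codec can do at the boundary, so nothing else needs to know about it.
The helper names to_surrogate_pair and from_surrogate_pair are made up
for illustration and are not part of any Python API. The sketch assumes
a modern Python 3, where chr() accepts any code point (the role
unichr() plays in this thread).

```python
def to_surrogate_pair(code_point):
    """Split a code point in U+10000..U+10FFFF into (high, low) surrogates."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("not a supplementary code point: %#x" % code_point)
    offset = code_point - 0x10000          # 20 significant bits remain
    high = 0xD800 + (offset >> 10)         # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)        # bottom 10 bits
    return high, low


def from_surrogate_pair(high, low):
    """Combine a valid (high, low) surrogate pair back into one code point."""
    if not (0xD800 <= high <= 0xDBFF and 0xDC00 <= low <= 0xDFFF):
        raise ValueError("not a valid surrogate pair")
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


# U+1D11E (MUSICAL SYMBOL G CLEF) lies outside the 16-bit range.
high, low = to_surrogate_pair(0x1D11E)

# The UTF-16 codec produces the same two code units when encoding,
# which is the only place the pair has to exist.
encoded = chr(0x1D11E).encode("utf-16-be")
```

Under that reading, application code only ever sees the single code
point 0x1D11E; the two 16-bit units show up in the encoded bytes and
nowhere else.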

