[Python-Dev] UCS2/UCS4 default

"Martin v. Löwis" martin at v.loewis.de
Thu Jul 3 19:36:03 CEST 2008


> Please remember that lone surrogate pair code points are perfectly
> valid Unicode code points, nevertheless. Just as a lone combining
> code point is valid on its own.

Actually, I think they aren't (not any more than an invalid code point
or an unassigned code point). They are reserved for use in UTF-16 only.

I would have to look up the exact Unicode terminology, but "valid"
is probably not a predicate that they would use.
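
As a rough illustration of the "reserved for UTF-16" point, here is a
minimal sketch. It assumes a Python 3 interpreter (which postdates this
thread), and the variable names are only illustrative. It shows that
the character database puts a lone surrogate in its own category, that
the strict UTF-8 codec refuses to encode it, and that UTF-16 uses a
pair of surrogates to represent a supplementary-plane character.

```python
import unicodedata

# A single high surrogate, not followed by a low surrogate.
lone = "\ud800"

# The Unicode character database reports the general category
# of this code point; surrogates have a category of their own.
print(unicodedata.category(lone))

# The strict UTF-8 codec rejects a lone surrogate.
try:
    lone.encode("utf-8")
    encodable = True
except UnicodeEncodeError:
    encodable = False
print(encodable)

# U+10000 lies outside the BMP, so UTF-16 encodes it as a
# high surrogate followed by a low surrogate.
pair = "\U00010000".encode("utf-16-be")
print(pair.hex())
```

Whether "valid" is the right word for such a code point is still a
terminology question; the sketch only shows how one implementation
treats it.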

Regards,
Martin


