[Python-Dev] UCS2/UCS4 default
"Martin v. Löwis"
martin at v.loewis.de
Thu Jul 3 19:36:03 CEST 2008
> Please remember that lone surrogate pair code points are perfectly
> valid Unicode code points, nevertheless. Just as a lone combining
> code point is valid on its own.
Actually, I think they aren't (not any more than an invalid codepoint,
or an unassigned codepoint). They are reserved for UTF-16 only.
I would have to look up the exact Unicode terminology, but "valid"
is probably not a predicate that they would use.
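A minimal sketch of the distinction, assuming a Python 3 interpreter and its standard unicodedata module (the helper names is_surrogate and can_encode_utf8 are hypothetical, chosen only for illustration). It suggests that a lone combining code point can be encoded to UTF-8, while a lone surrogate is rejected by the strict UTF-8 codec:

```python
import unicodedata

# Code points U+D800..U+DFFF are set aside for UTF-16 surrogates.
SURROGATE_FIRST = 0xD800
SURROGATE_LAST = 0xDFFF


def is_surrogate(code_point: int) -> bool:
    """Return True if the code point lies in the surrogate range."""
    return SURROGATE_FIRST <= code_point <= SURROGATE_LAST


def can_encode_utf8(text: str) -> bool:
    """Return True if the strict UTF-8 codec accepts the string."""
    try:
        text.encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True


lone_combining = "\u0301"   # COMBINING ACUTE ACCENT, with no base character
lone_surrogate = "\ud800"   # a high surrogate with no matching low surrogate

for label, ch in (("combining", lone_combining), ("surrogate", lone_surrogate)):
    print(
        label,
        hex(ord(ch)),
        unicodedata.category(ch),   # general category: Mn vs. Cs
        is_surrogate(ord(ch)),
        can_encode_utf8(ch),
    )
```

This only shows how one implementation treats the two cases; it does not settle what terminology the Unicode Standard itself uses.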
Regards,
Martin