Nicholas Bastin wrote: > Well, this is a completely separate issue/problem. The internal > representation is UTF-16, and should be stated as such. If the > built-in methods actually don't work with surrogate pairs, then that > should be fixed. Yes to the former, no to the latter. PEP 261 specifies what should and shouldn't work. Regards, Martin