"Fred L. Drake, Jr." <fdrake@acm.org> writes: > I'm sure a small C extension could provide the needed helpers quite > efficiently. Even with a UCS-4 version of Python, a Unicode literal > containing a surrogate pair (explicitly, using two \u sequences) will > exhibit the behavior that Eric wants to see suppressed. Of course, producing such a literal is an application error. Regards, Martin