[I18n-sig] How does Python Unicode treat surrogates?

Fredrik Lundh fredrik@pythonware.com
Mon, 25 Jun 2001 22:39:14 +0200


I wrote:
> SRE and the unicode databases (me again) should also work
> pretty much out of the box.

a 32-bit version SRE works as expected, at least:

>>> a = array.array("i", map(ord, "hello"))
>>> m = sre.search("l+", a)
>>> m
<SRE_Match object at 008CECA8>
>>> m.group(0)
array('i', [108, 108])

the DLL size is identical, and the performance is roughly the
same.

Cheers /F