[issue9804] ascii() does not always join surrogate pairs

STINNER Victor report at bugs.python.org
Thu Sep 9 02:12:53 CEST 2010


STINNER Victor <victor.stinner at haypocalc.com> added the comment:

> >>> s = "'\0\"\n\r\t abcd\x85é\U00012fff\U0001D121xxx\uD800."
> (...)
> (I think I've included everything:
> - normal chars
> - control chars
> - one-byte non-ASCII
> - two-byte non-ASCII (and lone surrogate)
> - printable and non-printable surrogate pairs
> - single and double quotes)

Maybe also add a lone surrogate followed directly by a surrogate pair, e.g.
'\uD800\U0001D121'.
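
Something like the following could check that case. It is only a sketch, assuming a CPython 3.3 or later build (where str is no longer stored as UTF-16); the variable names are illustrative and not taken from the patch. On a narrow build the string is three UTF-16 code units (D800 D834 DD21), so the lone D800 must not be joined with the D834 that follows it.

```python
import ast

# A lone high surrogate directly followed by a non-BMP character
# (which a narrow build would store as the surrogate pair D834 DD21).
s = '\uD800\U0001D121'

result = ascii(s)

# The lone surrogate should be escaped on its own as \ud800, and the
# non-BMP character as a single \U escape, not as two \u escapes.
expected = "'\\ud800\\U0001d121'"
print(result == expected)

# The escaped form should evaluate back to the original string.
print(ast.literal_eval(result) == s)
```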

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue9804>
_______________________________________

