[issue3297] Python interpreter uses Unicode surrogate pairs only before the pyc is created
report at bugs.python.org
Sat Jul 12 02:56:43 CEST 2008
Adam Olsen <rhamph at gmail.com> added the comment:
No, the configure options are wrong - we do use UTF-16 and UTF-32.
Although modern UCS-4 has been restricted down to the range of UTF-32
(it used to be larger!), UCS-2 still doesn't support the supplementary
planes (i.e., no surrogates).
If it really were UCS-2, the repr wouldn't be u'\U00010123' on Windows.
It'd be a pair of ill-formed code units instead.
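To make the surrogate point concrete, here is a minimal sketch of how UTF-16 splits U+10123 into two code units. It is written for Python 3 rather than the 2.x builds discussed here, and the helper name utf16_surrogates is hypothetical, used only for illustration.

```python
import struct

def utf16_surrogates(code_point):
    """Split a supplementary-plane code point into a UTF-16 surrogate pair.

    Hypothetical helper for illustration; not part of CPython.
    """
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("not a supplementary-plane code point")
    offset = code_point - 0x10000          # 20-bit value to split
    high = 0xD800 + (offset >> 10)         # top 10 bits -> high surrogate
    low = 0xDC00 + (offset & 0x3FF)        # bottom 10 bits -> low surrogate
    return high, low

high, low = utf16_surrogates(0x10123)

# Cross-check against the built-in UTF-16 codec (big-endian, no BOM).
encoded = "\U00010123".encode("utf-16-be")
units = struct.unpack(">2H", encoded)

print(hex(high), hex(low))   # 0xd800 0xdd23
print(units == (high, low))  # True
```

The two units 0xD800 and 0xDD23 only mean U+10123 when read as UTF-16; under strict UCS-2 they would be two values with no defined character meaning.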
Python tracker <report at bugs.python.org>