[issue8024] upgrade to Unicode 5.2
Marc-Andre Lemburg
report at bugs.python.org
Fri Mar 19 10:25:37 CET 2010
Marc-Andre Lemburg <mal at egenix.com> added the comment:
Florent Xicluna wrote:
>
> Florent Xicluna <florent.xicluna at gmail.com> added the comment:
>
> The bug was a side-effect of the update. Code point "\uAAAA" is now assigned to a printable character:
>
> AAAA;TAI VIET LETTER LOW VO;Lo;0;L;;;;;N;;;;;
>
> And test_bigmem relies on this code point being non-printable.
> I changed it for a char in the Low surrogates range, which is guaranteed not printable. See attached patch.
That's better.
You wrote about '\üaaa' (3 'a's) in your previous post
on the ticket and I didn't understand why that would change with the
patch, since it's basically a SyntaxError which doesn't have anything
to do with the Unicode types or database.
> The regression test suite passes flawlessly.
>
> I will do further tests before merging back in 3.x
Please also check what happened to all those code points that were
removed by the patch in unicodetype_db.h.
Thanks.
----------
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue8024>
_______________________________________
More information about the Python-bugs-list
mailing list