[issue8024] upgrade to Unicode 5.2

Fri Mar 19 10:25:37 CET 2010

Marc-Andre Lemburg <mal at egenix.com> added the comment:

Florent Xicluna wrote:
> 
> Florent Xicluna <florent.xicluna at gmail.com> added the comment:
> 
> The bug was a side-effect of the update. Code point "\uAAAA" is now assigned to a printable character:
> 
>   AAAA;TAI VIET LETTER LOW VO;Lo;0;L;;;;;N;;;;;
> 
> And test_bigmem relies on this code point being non-printable.
> I changed it for a char in the Low surrogates range, which is guaranteed not printable. See attached patch.

That's better.

You wrote about '\üaaa' (3 'a's) in your previous post
on the ticket and I didn't understand why that would change with the
patch, since it's basically a SyntaxError which doesn't have anything
to do with the Unicode types or database.

> The regression test suite passes flawlessly.
> 
> I will do further tests before merging back in 3.x

Please also check what happened to all those code points that were
removed by the patch in unicodetype_db.h.

Thanks.

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue8024>
_______________________________________