[Python-Dev] Draft Guide for code migration and modernation
M.-A. Lemburg
mal@lemburg.com
Tue, 04 Jun 2002 16:17:26 +0200
Guido van Rossum wrote:
>>> c in string.whitespace --> c.isspace()
>>
>>This changes the meaning slightly for unicode characters, because
>>chr(i).isspace() != unichr(i).isspace()
>>for i in { 0x1c, 0x1d, 0x1e, 0x1f, 0x85, 0xa0 }
>
>
> That's unfortunate, because I'd like unicode to be an extension of
> ASCII also in this kind of functionality. What are these and why are
> they considered spaces? Would it hurt to make them spaces in ASCII
> too?
From the Unicode database:
001C;<control>;Cc;0;B;;;;;N;FILE SEPARATOR;;;;
001D;<control>;Cc;0;B;;;;;N;GROUP SEPARATOR;;;;
001E;<control>;Cc;0;B;;;;;N;RECORD SEPARATOR;;;;
001F;<control>;Cc;0;S;;;;;N;UNIT SEPARATOR;;;;
0085;<control>;Cc;0;B;;;;;N;NEXT LINE;;;;
00A0;NO-BREAK SPACE;Zs;0;CS;<noBreak> 0020;;;;N;NON-BREAKING SPACE;;;;
--
Marc-Andre Lemburg
CEO eGenix.com Software GmbH
______________________________________________________________________
Company & Consulting: http://www.egenix.com/
Python Software: http://www.egenix.com/files/python/
Meet us at EuroPython 2002: http://www.europython.org/