[Python-Dev] Regular expressions, Unicode etc.

"Martin v. Löwis" martin at v.loewis.de
Fri Aug 10 00:59:44 CEST 2007


Nick Maclaren schrieb:
>> The relevance is that your specification of "printing character"
>> as "isprint returns true" is nearly useless, as it only applies
>> to byte-oriented characters.
> 
> Eh?  That's ALL I used it to specify!  I used a Unicode-based
> specification for Unicode.

Your specification was "For Unicode, whatever people agree!"

I would not call that "Unicode-based".

>> Unicode-isalnum is defined as isalpha|isdecimal|isdigit|isnumeric.
>> isalpha means categories Ll, Lu, Lt, Lo, Lm. isdecimal means
>> character has the decimal property. isigit means the character has
>> the digit property. isnumeric means the character has the numeric
>> property.
> 
> I sincerely hope it isn't!

Please read the code.

>> It was a proposal for a definition. English is not my native
>> language, and "printing character" means nothing to me. So
>> I kindly asked for a definition, and suggested one possibility.
>> I would not have guessed that you consider white-space characters
>> as "printing", as they don't actually print anything.
> 
> Ah.  It's not an ordinary English term.  It's a computer language
> one, so I assumed that you would know it.

I know the term "printable character", which is what I read
in definitions of the isprint() routine. "printing character"
I never heard before.

Regards,
Martin



More information about the Python-Dev mailing list