Glyphs and graphemes [was Re: Cult-like behaviour]

Ethan Furman ethan at stoneleaf.us
Mon Jul 16 17:02:43 EDT 2018


On 07/16/2018 01:15 PM, Chris Angelico wrote:
> On Tue, Jul 17, 2018 at 4:55 AM, Steven D'Aprano wrote:

>> There is nothing special about diacritics such that we ought to treat
>> some combinations like "Ch" (two code points = one character) as "fixed
>> width" while others like "â" (two code points = one character) as
>> "variable width".
>
> When you reverse a word, do you treat "ch" and "sh" as one character
> or two? I'm of the opinion that they're single characters, and thus
> this should be "dalokosh":

Depends on the language:  in Spanish, "ch" is it's own letter (at least it was when I grew up), so any word containing 
it should still contain it when reversed:  "chica" would be "acich".

--
~Ethan~


More information about the Python-list mailing list