Grapheme clusters, a.k.a.real characters
Rhodri James
rhodri at kynesim.co.uk
Tue Jul 18 11:37:37 EDT 2017
On 18/07/17 15:10, Rustom Mody wrote:
> On Monday, July 17, 2017 at 10:14:00 PM UTC+5:30, Rhodri James wrote:
>> On 17/07/17 05:10, Rustom Mody wrote:
>>> Hint1: Ask your grandmother whether unicode's notion of character makes sense.
>>> Ask 10 gmas from 10 language-L's
>>> Hint2: When in doubt gma usually is right
>>
>> "For every complex problem there is an answer that is clear, simple and
>> wrong." (H.L. Mencken).
>
> Great men galore with great quotes galore²
[snip]
>> Unfortunately grandmothers outside their areas of expertise are particularly prone to finding those answers.
>
> Gma for the purposes of this discussion can be defined:
>
> - A (not necessarily) elderly person who
> - Is fairly intelligent
> - Not necessarily highly educated
> - Generally interested in life and people
> - [But not usually] in technical arcana
That last one is the killer. Using clear and simple terminology is
usually adequate when you aren't discussing technical arcana.
Unfortunately we are discussing technical arcana, and that's when you
trip over the fact that your clear, simple terminology is wrong. It's
an instance of Weizenbaum's joke that you quoted, just replacing
streetlights with grandmas.
(For the record, one of my grandmothers would have been baffled by this
conversation, and the other one would have had definite opinions on
whether accents were distinct characters or not, followed by a
digression into whether "ŵ" and "ŷ" should be suppressed vigorously :-)
--
Rhodri James *-* Kynesim Ltd
More information about the Python-list
mailing list