[Python-3000] Support for PEP 3131

Jim Jewett jimjjewett at gmail.com
Fri May 25 20:49:55 CEST 2007


On 5/25/07, Adam Olsen <rhamph at gmail.com> wrote:
> On 5/25/07, Jim Jewett <jimjjewett at gmail.com> wrote:
> > On 5/25/07, Adam Olsen <rhamph at gmail.com> wrote:
> > > If we allowed an underscore as a mixed-script separator
> > > (allowing "def get_原料(self):"), does this let us get away
> > > with otherwise banning mixed-scripts?

...

> Indeed, the whole-script confusables do create significant
> holes, but I think the best solution is still to ban mixed-scripts
> and accept that it's only a "75% solution".  Using an "I'm
> expecting cyrillic" flag makes it harder for those who need
> cyrillic AND still leaves them vulnerable to the same problem
> we're trying to protect ourselves from.

hmm... I had thought they should either not include the confusable
letters, or use different fonts -- whatever they normally do.

But I suppose using an _ separator could still be a useful crutch.
Whether it is useful enough ... I'll let others chime in.
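
For concreteness, here is a rough sketch of what "ban mixed scripts,
but allow _ as a separator" might look like. Everything in it is an
assumption for illustration: the helper names are made up, and the
script of a character is only approximated by the first word of its
Unicode name, since the stdlib unicodedata module does not expose the
real Script property.

```python
import unicodedata

def rough_script(ch):
    """Approximate a character's script by the first word of its Unicode name.

    This is a crude stand-in for the real Unicode Script property.
    """
    if ch.isdigit():
        return None  # treat digits as script-neutral
    return unicodedata.name(ch).split()[0]

def is_single_script(segment):
    """True if every non-neutral character in the segment shares one script."""
    scripts = {rough_script(ch) for ch in segment} - {None}
    return len(scripts) <= 1

def identifier_allowed(name):
    """Apply the single-script rule to each underscore-separated segment."""
    return all(is_single_script(seg) for seg in name.split("_"))

print(identifier_allowed("get_原料"))       # scripts separated by "_"
print(identifier_allowed("get原料"))        # Latin and CJK in one segment
print(identifier_allowed("\u0440aypal"))    # Cyrillic "р" followed by Latin
print(identifier_allowed("\u0440\u0430"))   # all-Cyrillic, still passes
```

The last call shows the hole mentioned above: a whole-script confusable
is single-script by construction, so a check like this cannot catch it.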

> A more extreme solution would be to introduce a symbol type that
> converts whole-script confusables to a canonical
> form

The Unicode Consortium recommends against this.  I'm not sure if it is
just a presentation issue, or concerns about compatibility; the
"confusables" lists are explicitly allowed to change.
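
To illustrate the compatibility worry, here is a minimal sketch of such
a canonicalizing lookup. The table is hypothetical and hand-picked for
this example; it is not the Unicode confusables data, and
canonical_form is a made-up name.

```python
# Hypothetical, hand-picked mapping for illustration only;
# this is NOT the Unicode confusables data.
CONFUSABLE_TO_CANONICAL = {
    "\u0430": "a",  # CYRILLIC SMALL LETTER A
    "\u0435": "e",  # CYRILLIC SMALL LETTER IE
    "\u043e": "o",  # CYRILLIC SMALL LETTER O
    "\u0440": "p",  # CYRILLIC SMALL LETTER ER
    "\u0441": "c",  # CYRILLIC SMALL LETTER ES
}

def canonical_form(name, table=CONFUSABLE_TO_CANONICAL):
    """Replace each character found in the table with its canonical look-alike."""
    return "".join(table.get(ch, ch) for ch in name)

cyrillic_name = "\u0441\u043e\u0440\u0435"  # renders much like Latin "cope"
print(canonical_form(cyrillic_name) == canonical_form("cope"))

# If a later table revision dropped one entry, the same two names
# would stop comparing equal.
revised = dict(CONFUSABLE_TO_CANONICAL)
del revised["\u0441"]
print(canonical_form(cyrillic_name, revised) == canonical_form("cope", revised))
```

If the table is allowed to change between releases, two names that
resolve to the same symbol today might not tomorrow, or the reverse.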

-jJ

