[Python-3000] string module trimming

Jeffrey Yasskin jyasskin at gmail.com
Fri Apr 20 17:19:08 CEST 2007


On 4/19/07, Jim Jewett <jimjjewett at gmail.com> wrote:
> > "Jeffrey Yasskin" <jyasskin at gmail.com> wrote:
> > > ... Are you saying that in your app, just because I've set
> > > the en_US locale, I won't be able to type "????"?  Or that those
> > > characters won't be recognized as letters?
>
> The latter.  Some applications may reject them for that reason; for
> example some domain registrars have policies to prevent domain name
> spoofing with similar-looking characters.  One way to do that is to
> say that a character used in a domain name (under that registrar) is
> limited to those letters used by the appropriate national language.

Filtering to a certain language's alphabet is neither necessary nor
sufficient for a domain registrar. They're going to implement their
own system paying attention to http://www.unicode.org/reports/tr36/
(see particularly section 2.3 for why even a restriction to one script
doesn't keep them safe), so they don't justify keeping an attractive
nuisance in the standard library.

-- 
Namasté,
Jeffrey Yasskin


More information about the Python-3000 mailing list