[Python-3000] Pre-PEP: Easy Text File Decoding

Marcin 'Qrczak' Kowalczyk qrczak at knm.org.pl
Tue Sep 12 01:20:28 CEST 2006


"Paul Moore" <p.f.moore at gmail.com> writes:

> Of course, I'm in the useful position of having an OS default
> character set which contains ASCII as a subset. I don't know what
> issues someone with Greek/Russian/Japanese or whatever as an OS
> default would have (one thought - if your default character set
> doesn't contain ASCII as a subset, how do you deal with the hosts
> file? OTOH, I had a real struggle to find an example of an encoding
> which didn't have ASCII as a subset!)

AFAIK the only encoding which might be used today which is not based
on ASCII is EBCDIC. Perl supports it (and it supports Unicode at the
same time, via UTF-EBCDIC).

Other than that, there are some Japanese encodings with a confusion
between \ and the Yen sign, otherwise being ASCII. They are used today.

There used to be national ASCII variants with accented letters instead
of [\]^{|}~. I don't think they are still used today.

-- 
   __("<         Marcin Kowalczyk
   \__/       qrczak at knm.org.pl
    ^^     http://qrnik.knm.org.pl/~qrczak/


More information about the Python-3000 mailing list