On Tue, Apr 25, 2017 at 10:04 AM, Chris Barker <chris.barker@noaa.gov> wrote:
>
> On Tue, Apr 25, 2017 at 9:57 AM, Ambrose LI <ambrose.li@gmail.com> wrote:
>>
>> 2017-04-25 12:34 GMT-04:00 Chris Barker <chris.barker@noaa.gov>:
>> > I am totally euro-centric,
>
>> But Shift-JIS is not one-byte; it's two-byte (unless you allow only
>> half-width characters and nothing else). :-)
>
> bad example then -- are there other non-euro-centric one byte per char encodings worth worrying about? I have no clue :-)

I've run into Windows-1251 in files (seismic and well log data from Russian wells). Treating them as latin-1 does not make for a happy time. Both encodings also technically derive from ASCII in the lower half, but most of the actual language is written with the high-bit characters.
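
To make the failure mode concrete, here is a minimal sketch, assuming Python 3. The sample word is made up for illustration and does not come from any real well log file. It encodes a Russian word as Windows-1251 (Python's "cp1251" codec), then decodes the same bytes as latin-1 and as cp1251.

```python
# Illustrative sample text only (Russian for "well"); not taken from real data.
text = "Скважина"

# Windows-1251 is a one-byte-per-character encoding, so the lengths match.
raw = text.encode("cp1251")
assert len(raw) == len(text)

# Decoding as latin-1 never raises, because every byte value is a valid
# latin-1 character. It silently yields accented Latin letters instead.
wrong = raw.decode("latin-1")

# Decoding with the codec the file was actually written in round-trips.
right = raw.decode("cp1251")

# Count the bytes that fall in the high (non-ASCII) half of the code page.
high = sum(b >= 0x80 for b in raw)

print(repr(wrong))
print(repr(right))
print(high, "of", len(raw), "bytes have the high bit set")
```

Since the latin-1 decode succeeds without an exception, nothing flags the problem until someone looks at the strings.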

--
Robert Kern