[Numpy-discussion] proposal: smaller representation of string arrays

Chris Barker chris.barker at noaa.gov
Tue Apr 25 13:04:53 EDT 2017


On Tue, Apr 25, 2017 at 9:57 AM, Ambrose LI <ambrose.li at gmail.com> wrote:

> 2017-04-25 12:34 GMT-04:00 Chris Barker <chris.barker at noaa.gov>:
> > I am totally euro-centric,
>


> But Shift-JIS is not one-byte; it's two-byte (unless you allow only
> half-width characters and nothing else). :-)


bad example then -- are their other non-euro-centric one byte per char
encodings worth worrying about? I have no clue :-)


> This I don't understand. As far as I can tell non-Western-European
> filenames are not unusual. If filenames are a reason, even if you're
> euro-centric (think Eastern Europe, say) I don't see how latin1 is a
> good choice.
>

right -- this is the age of Unicode -- Unicode is the correct choice.

But many of us have data in old files that are not proper Unicode -- and
that includes filenames.

-CHB

-- 

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/numpy-discussion/attachments/20170425/3b827e51/attachment-0001.html>


More information about the NumPy-Discussion mailing list