An assessment of the Unicode standard
John Nagle
nagle at animats.com
Sun Aug 30 01:14:55 EDT 2009
r wrote:
> I was reading the thread here...
> http://groups.google.com/group/comp.lang.python/browse_thread/thread/db90a9629b92aab0/b0385050b4c6c84e?hl=en&lnk=raot#b0385050b4c6c84e
>
> and it raised some fundamental philosophical questions
Rant ignored.
Actually, Python 3.x finally seems to have character sets right.
There's "bytes" for uninterpreted binary data, Unicode for text, and
proper ASCII, 0..127. Within Python, we finally got rid of
"upper code pages".
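
A minimal sketch of that split, assuming Python 3. The sample string
and the helper name is_plain_ascii are made up for illustration; the
rest is built-in str/bytes behavior.

```python
# Python 3: str is Unicode text, bytes is uninterpreted binary data.
# The sample text is illustrative (written with escapes so this file stays ASCII).
text = "na\u00efve caf\u00e9"      # 10 Unicode characters
raw = text.encode("utf-8")         # 12 bytes; each accented letter takes two

# The two types do not mix implicitly; concatenating them is an error.
try:
    text + raw
    mixed_ok = True
except TypeError:
    mixed_ok = False


def is_plain_ascii(data: bytes) -> bool:
    """Hypothetical helper: True if every byte is in the range 0..127."""
    try:
        data.decode("ascii")       # the ascii codec rejects bytes >= 128
    except UnicodeDecodeError:
        return False
    return True


print(len(text), len(raw), mixed_ok)
print(is_plain_ascii(b"hello"), is_plain_ascii(raw))
```

Bytes above 127 only become text once you name an encoding in
decode(); nothing guesses a code page for you.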
(I wish the HTML standards people would do the same. HTML 5
should have been ASCII only (with the "&" escapes if desired)
or Unicode. No "Latin-1", no upper code pages, no JIS, etc.)
>
> [nested thoughts]
> A few months ago i was watching some tear-jerking documentary called
> something like "Save the Languages" or "The dying languages" blah!
It may be a bit much that Unicode supports Cretan Linear B.
John Nagle