An assessment of the Unicode standard
nagle at animats.com
Sun Aug 30 07:14:55 CEST 2009
> I was reading the thread here...
> and it raised some fundamental philophosical questions
Actually, Python 3.x seems finally to have character sets right.
There's "bytes", for uninterpreted binary data, Unicode, and
proper ASCII, 0..127. Within Python, we finally got rid of
"upper code pages".
(I wish the HTML standards people would do the same. HTML 5
should have been ASCII only (with the "&" escapes if desired)
or Unicode. No "Latin-1", no upper code pages, no JIS, etc.)
> [nested thoughts]
> A few months ago i was watching some tear-jerking documentary called
> something like "Save the Languages" or "The dying languages" blah!
It may be a bit much that Unicode supports Cretan Linear B.
More information about the Python-list