
I have posted the Unicode (and other) encoding subdocument of
interscript v1.0a9 to the Web. At

    http://www.xenon.triode.net.au/~skaller/unicode/index.html

you will find documented Python source for conversion of various
character sets, including ISO-8859-X, Wansung, Johab, GB, Big5,
Shift-JIS, and various Microsoft/IBM proprietary code pages.

The document contains a character-by-character description of the
Unicode subset of ISO-10646, routines for converting UCS-2, UCS-2LE,
UCS-4 and UCS-4LE to UTF-8 (the native encoding used by interscript),
and compact tables of all the glyphs (for Unicode and each of the
foreign character sets).

If you're interested in internationalisation, you may find this web
useful. You can browse online, or download it as a tarball (over HTTP).
You will need a browser with appropriate fonts to see Chinese, Japanese,
Korean, Arabic and other glyphs.

[Note: the table of contents uses a folding tree under MSIE 4-5, and it
takes some time for my ECMAScript to process the table. It's not broken,
just slow!]

John Max Skaller                    ph:61-2-96600850
mailto:skaller@maxtal.com.au        10/1 Toxteth Rd
http://www.maxtal.com.au/~skaller   Glebe 2037 NSW AUSTRALIA

<P><A HREF="http://www.xenon.triode.net.au/~skaller/unicode/index.html">
unicode data tables</A> - documented Python source for conversion of
various character sets, including ISO-8859-X, Wansung, Johab, GB, Big5,
Shift-JIS, and various Microsoft/IBM proprietary code pages. (25-Jul-99)

--
----------- comp.lang.python.announce (moderated) ----------
Article Submission Address:  python-announce@python.org
Python Language Home Page:   http://www.python.org/
Python Quick Help Index:     http://www.python.org/Help.html
------------------------------------------------------------