[Python-3000] Help on text editors

David Hopwood david.nospam.hopwood at blueyonder.co.uk
Sat Sep 9 17:04:44 CEST 2006


Michael Urman wrote:
> On 9/7/06, Jeff Wilcox <jeff at soft.fujitsu.com> wrote:
> 
>>>From: "Paul Prescod" <paul at prescod.net>
>>>1. On US English Windows, Notepad defaults to an encoding called "ANSI".
>>>"ANSI" is not a real encoding at all (and certainly not one from the
>>
>>On Japanese Windows 2000, Notepad defaults to ANSI as it does in the English
>>version.  It actually writes Shift JIS though.
> 
> ANSI is not an encoding; it is a collective name for various multibyte
> encodings, each corresponding to a particular default language of the
> machine. Thus ANSI corresponds to cp1252 on English and cp932 on
> Japanese machines.
> 
> As for whether cp932 is the same as Shift JIS, David and I seem to
> disagree. While I lack hard data, the string '\\' round trips through
> either on my box.

You may have an implementation that uses Cp932 or similar, but calls it
"Shift-JIS". <http://en.wikipedia.org/wiki/Shift_jis> agrees with me, FWIW.

Here is a pretty complete mapping table for Shift-JIS + common extensions
(as opposed to Cp932):

<http://wakaba-web.hp.infoseek.co.jp/table/sjis-0208-1997-std.txt>

although there is quite a bit of variation in mappings:

<http://www.haible.de/bruno/charsets/conversion-tables/Shift_JIS.html>

-- 
David Hopwood <david.nospam.hopwood at blueyonder.co.uk>




More information about the Python-3000 mailing list