[Python-Dev] Some thoughts on the codecs...

M.-A. Lemburg mal@lemburg.com
Tue, 16 Nov 1999 12:33:07 +0100


Jack Jansen wrote:
> 
> > I would propose to only add some very basic encodings to
> > the standard distribution, e.g. the ones mentioned under
> > Standard Codecs in the proposal:
> >
> >   'utf-8':            8-bit variable length encoding
> >   'utf-16':           16-bit variable length encoding (litte/big endian)
> >   'utf-16-le':                utf-16 but explicitly little endian
> >   'utf-16-be':                utf-16 but explicitly big endian
> >   'ascii':            7-bit ASCII codepage
> >   'latin-1':          Latin-1 codepage
> >   'html-entities':    Latin-1 + HTML entities;
> >                       see htmlentitydefs.py from the standard Pythin Lib
> >   'jis' (a popular version XXX):
> >                       Japanese character encoding
> >   'unicode-escape':   See Unicode Constructors for a definition
> >   'native':           Dump of the Internal Format used by Python
> 
> I would suggest adding the Dos, Windows and Macintosh standard 8-bit charsets
> (their equivalents of latin-1) too, as documents in these encoding are pretty
> ubiquitous. But maybe these should only be added on the respective platforms.

Good idea. What code pages would that be ?

-- 
Marc-Andre Lemburg
______________________________________________________________________
Y2000:                                                    45 days left
Business:                                      http://www.lemburg.com/
Python Pages:                           http://www.lemburg.com/python/