[Python-Dev] Re: [I18n-sig] Planned updates for cjkcodecs before
2.4a1
Hye-Shik Chang
perky at i18n.org
Wed Jun 16 07:35:18 EDT 2004
On Wed, Jun 16, 2004 at 08:16:52PM +0900, Hye-Shik Chang wrote:
> On Wed, Jun 16, 2004 at 11:33:59AM +0200, M.-A. Lemburg wrote:
> > Hye-Shik Chang wrote:
> [snip]
> > >2. Merge two or three simliar C codecs into one. We have one C
> > > codec for every each python codecs currently. I have got an
> > > idea to merge them into several similar groups and many common
> > > part of .so binaries will be saved:
> >
> > +1, but why not put all Japanese codecs into one module and
> > dito for the Korean and Chinese ones ?
> >
> > Note that todays OS linkers will only mmap those pieces
> > of code into the process memory that are actually needed
> > by the application, so even though the size of the modules
> > increases, the application process memory foot-print is
> > likely not to increase.
>
> Okay. But how about embedded, freezed environments or statically
> compiled into python by uncommenting from Modules/Setup? If somebody
> need to support only legacy Japanese encodings, he will want to
> include a legacy mapping(70K) but will not want JIS X 0213(85K) and
> KS X 1001, GB2312 mappings(200K, for iso-2022-jp-2). And he may
> want to save spaces by just erasing files. In fact, I don't know
> how real Japanese developers use but just guessed it. :)
>
Aah. While I'm taking shower, I found that a problem on iso-2022-jp-2
can be resolved by make codecs to load mapping tables on demand.
(they're loading mappings in init function currently.) I agree in
incorporating all CJK codecs to each per-language codec collection
modules. Thanks for the comments!
Hye-Shik
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 187 bytes
Desc: not available
Url : http://mail.python.org/pipermail/python-dev/attachments/20040616/7d4bf748/attachment.bin
More information about the Python-Dev
mailing list