[issue2857] Add "java modified utf-8" codec

Serhiy Storchaka report at bugs.python.org
Tue Apr 24 14:07:37 CEST 2012


Serhiy Storchaka <storchaka at gmail.com> added the comment:

> Serhiy: your functions do not constitute a Python codec. For example, there is no support for error handlers in them.

Yes, it is not a codec in Python library terminology. It's just a pair
of functions, the COder and DECoder, which is enough for the task of
hacking Java serialized data. I don't think that such a specific task
justifies changing the interpreter core.

However, translators that convert non-BMP characters to surrogate
pairs and back would be useful in the standard library. They are needed
for working with non-standard encodings (CESU-8, MUTF-8, cp65001, some
Tk/IDLE issues). This is a fairly common task.
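
To illustrate what such translators might look like, here is a minimal
sketch. The names to_surrogates and from_surrogates are hypothetical,
not an existing standard library API, and the code assumes a Python 3
build where str holds full code points (3.3+ or a wide build).

```python
def to_surrogates(text):
    """Replace each non-BMP character with a UTF-16 surrogate pair."""
    out = []
    for ch in text:
        cp = ord(ch)
        if cp > 0xFFFF:
            cp -= 0x10000
            out.append(chr(0xD800 + (cp >> 10)))    # high (lead) surrogate
            out.append(chr(0xDC00 + (cp & 0x3FF)))  # low (trail) surrogate
        else:
            out.append(ch)
    return "".join(out)


def from_surrogates(text):
    """Join each well-formed surrogate pair back into one character.

    Lone surrogates are passed through unchanged (an assumption of this
    sketch; a real codec would consult an error handler here).
    """
    out = []
    i = 0
    while i < len(text):
        hi = ord(text[i])
        if 0xD800 <= hi <= 0xDBFF and i + 1 < len(text):
            lo = ord(text[i + 1])
            if 0xDC00 <= lo <= 0xDFFF:
                cp = 0x10000 + ((hi - 0xD800) << 10) + (lo - 0xDC00)
                out.append(chr(cp))
                i += 2
                continue
        out.append(text[i])
        i += 1
    return "".join(out)
```

With these, CESU-8 output can be approximated as
to_surrogates(s).encode("utf-8", "surrogatepass"), and decoding as
from_surrogates(b.decode("utf-8", "surrogatepass")). MUTF-8 would
additionally need U+0000 encoded as the two bytes C0 80, which this
sketch does not handle.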

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue2857>
_______________________________________

