[issue2857] Add "java modified utf-8" codec
Serhiy Storchaka
report at bugs.python.org
Tue Apr 24 14:07:37 CEST 2012
Serhiy Storchaka <storchaka at gmail.com> added the comment:
> Serhiy: your functions do not constitute a Python codec. For example, there is no support for error handlers in them.
Yes, it is not a codec in Python library terminology. It's just a pair
of functions, the COder and the DECoder, which is enough for the task of
hacking Java serialized data. I don't think such a specific task
justifies changing the interpreter core.
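For illustration only, such a pair might look like the sketch below. This
is not the code attached to the issue: the names mutf8_encode and
mutf8_decode are hypothetical, and it assumes Python 3.4 or later, where
the 'surrogatepass' handler also works with the UTF-16 codecs. Like the
functions discussed above, it takes no error handler argument.

```python
def mutf8_encode(s):
    """Encode str to Java modified UTF-8 (hypothetical helper, a sketch)."""
    # Go through UTF-16 so that non-BMP characters become surrogate pairs.
    units = s.encode('utf-16-be', 'surrogatepass')
    out = bytearray()
    for i in range(0, len(units), 2):
        unit = (units[i] << 8) | units[i + 1]
        if unit == 0:
            # MUTF-8 writes U+0000 as the overlong two-byte form C0 80.
            out += b'\xc0\x80'
        else:
            # Each UTF-16 code unit, surrogates included, is encoded
            # on its own as a 1- to 3-byte UTF-8 sequence.
            out += chr(unit).encode('utf-8', 'surrogatepass')
    return bytes(out)


def mutf8_decode(b):
    """Decode Java modified UTF-8 to str (hypothetical helper, a sketch)."""
    # In well-formed MUTF-8 the byte C0 only occurs as the start of the
    # NUL encoding, so a plain bytes.replace is sufficient here.
    s = b.replace(b'\xc0\x80', b'\x00').decode('utf-8', 'surrogatepass')
    # Round-trip through UTF-16 to join surrogate pairs back into
    # non-BMP characters; an unpaired surrogate raises UnicodeDecodeError.
    return s.encode('utf-16-be', 'surrogatepass').decode('utf-16-be')
```

With these definitions, mutf8_decode(mutf8_encode(s)) returns s for any
string that contains no lone surrogates.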
However, translators that convert non-BMP characters to surrogate
pairs and back would be useful in the standard library. They are needed
for working with non-standard encodings (CESU-8, MUTF-8, cp65001, some
Tk/IDLE issues). This is a fairly common task.
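A minimal sketch of what such translators could look like follows. The
names to_surrogate_pairs and from_surrogate_pairs are made up for this
example and are not an existing or proposed stdlib API. It assumes
Python 3.3 or later, where str always holds whole code points.

```python
import re

_NON_BMP = re.compile('[\U00010000-\U0010ffff]')
_SURROGATE_PAIR = re.compile('[\ud800-\udbff][\udc00-\udfff]')


def to_surrogate_pairs(s):
    """Replace each non-BMP character with a UTF-16 surrogate pair."""
    def split(match):
        offset = ord(match.group()) - 0x10000
        high = 0xD800 + (offset >> 10)    # top 10 bits
        low = 0xDC00 + (offset & 0x3FF)   # bottom 10 bits
        return chr(high) + chr(low)
    return _NON_BMP.sub(split, s)


def from_surrogate_pairs(s):
    """Join each high/low surrogate pair into one non-BMP character."""
    def join(match):
        high, low = match.group()
        return chr(0x10000 + ((ord(high) - 0xD800) << 10)
                   + (ord(low) - 0xDC00))
    return _SURROGATE_PAIR.sub(join, s)
```

Combined with the existing 'surrogatepass' error handler, this is enough
to produce CESU-8, for example:
to_surrogate_pairs(s).encode('utf-8', 'surrogatepass'). Unpaired
surrogates are left untouched by both functions.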
----------
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue2857>
_______________________________________