[New-bugs-announce] [issue2857] add codec for java modified utf-8

paul rubin report at bugs.python.org
Thu May 15 05:08:40 CEST 2008

New submission from paul rubin <phr at users.sourceforge.net>:

For object serialization and some other purposes, Java encodes Unicode
strings with a modified version of UTF-8:


It is used in Lucene index files among other places.

It would be useful if Python had a codec for this, maybe called "UTF-8J"
or something like that.
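
A rough sketch of what such a codec would need to do, written for
Python 3 rather than 2.5. It assumes the two well-known differences
from standard UTF-8: U+0000 is written as the two bytes C0 80, and
characters above U+FFFF are written as a surrogate pair with each
surrogate encoded as three bytes. The function names are hypothetical,
and this is not a registered codec:

```python
def encode_modified_utf8(text):
    """Encode a str using Java-style modified UTF-8 (illustrative sketch)."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp == 0:
            # NUL becomes an overlong two-byte form, so the output
            # never contains a zero byte.
            out += b"\xc0\x80"
        elif cp < 0x10000:
            # BMP characters (including lone surrogates) use plain UTF-8.
            out += ch.encode("utf-8", "surrogatepass")
        else:
            # Supplementary characters are split into a UTF-16 surrogate
            # pair, and each half is encoded as a three-byte sequence.
            cp -= 0x10000
            high = 0xD800 + (cp >> 10)
            low = 0xDC00 + (cp & 0x3FF)
            for surrogate in (high, low):
                out += chr(surrogate).encode("utf-8", "surrogatepass")
    return bytes(out)


def decode_modified_utf8(data):
    """Decode Java-style modified UTF-8 back to a str (illustrative sketch)."""
    # Restore NUL, then let the decoder accept encoded surrogates.
    text = data.replace(b"\xc0\x80", b"\x00").decode("utf-8", "surrogatepass")
    # A round trip through UTF-16 joins surrogate pairs into one character.
    return text.encode("utf-16", "surrogatepass").decode("utf-16", "surrogatepass")
```

A real codec would also need error handling for malformed input and
registration through the codecs module, which this sketch leaves out.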

components: Library (Lib)
messages: 66843
nosy: phr
severity: normal
status: open
title: add codec for java modified utf-8
versions: Python 2.5

Tracker <report at bugs.python.org>
