[Python-Dev] PEP 383 update: utf8b is now the error handler

"Martin v. Löwis" martin at v.loewis.de
Sun May 3 19:39:41 CEST 2009


> If the error handler is supposed to be used for codecs other than utf-8,
> perhaps it should be renamed something more generic, e.g. "surrogate-escape"?

Perhaps. However, utf-8b doesn't really have anything to do with utf-8 -
it's an algorithm based on 16-bit or 32-bit code points.
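
As a rough illustration of the handler being independent of utf-8, the
sketch below runs the same undecodable byte through two different codecs.
It assumes a released Python 3 (3.1 or later), where this error handler is
registered as "surrogateescape" rather than "utf8b"; the sample bytes are
made up for the example.

```python
# Assumption: the handler is spelled "surrogateescape" (its name in
# released Python 3), not "utf8b" as in this thread.
HANDLER = "surrogateescape"

# Made-up input: 0xFF is invalid in both ASCII and UTF-8.
raw = b"abc\xff"

for codec in ("ascii", "utf-8"):
    text = raw.decode(codec, HANDLER)
    # The undecodable byte 0xFF is mapped to the lone surrogate U+DCFF,
    # regardless of which codec was asked to decode it.
    assert text == "abc\udcff"
    # Encoding with the same codec and handler restores the original bytes.
    assert text.encode(codec, HANDLER) == raw
```

Both codecs yield the same escaped code point, which is why the mapping
can be described purely in terms of code points.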

> Also, if utf8-b is not provided as a codec, will there be an easy way for user
> code to use the same encoding as the IO layer does? 

s.encode(os.getfilesystemencoding(), "utf8b") will do just that (in
fact, that's exactly what the IO layer does).

Regards,
Martin

