[Python-Dev] PEP 383 update: utf8b is now the error handler

Gregory P. Smith greg at krypto.org
Sun May 3 21:20:07 CEST 2009


On Sun, May 3, 2009 at 10:39 AM, "Martin v. Löwis" <martin at v.loewis.de>wrote:

> > If the error handler is supposed to be used for codecs other than utf-8,
> > perhaps it should renamed something more generic, e.g.
> "surrogate-escape"?
>
> Perhaps. However, utf-8b doesn't really have to do anything with utf-8 -
> it's an algorithm based on 16-bit or 32-bit code points.


To me that lack of relationship with utf8 suggests that it should not be
called utf8b...  But I don't have any good suggestions.


>
> > Also, if utf8-b is not provided as a codec, will there be an easy way for
> user
> > code to use the same encoding as the IO layer does?
>
> s.encode(os.getfilesystemencoding(), "utf8b") will do just that (in
> fact, that's exactly what the IO layer does).
>
> Regards,
> Martin
> _______________________________________________
> Python-Dev mailing list
> Python-Dev at python.org
> http://mail.python.org/mailman/listinfo/python-dev
> Unsubscribe:
> http://mail.python.org/mailman/options/python-dev/greg%40krypto.org
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-dev/attachments/20090503/259ceead/attachment-0001.htm>


More information about the Python-Dev mailing list